evaluation
Evaluation is the process of assessing agent systems. It requires approaches that account for dynamic decision-making and non-deterministic behavior.
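One common way to account for non-deterministic behavior is to run the same task several times and report a pass rate rather than a single pass/fail result. The following is a minimal sketch of that idea, not the skill's own implementation; `pass_rate` and `flaky_agent` are hypothetical names introduced here for illustration, and the 70% success probability is an arbitrary assumption.

```python
import random
from typing import Callable


def pass_rate(run_agent: Callable[[random.Random], bool],
              trials: int = 20,
              seed: int = 0) -> float:
    """Run a non-deterministic agent `trials` times and return the fraction of successes."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    rng = random.Random(seed)  # seeded so the evaluation itself is reproducible
    successes = sum(1 for _ in range(trials) if run_agent(rng))
    return successes / trials


def flaky_agent(rng: random.Random) -> bool:
    # Hypothetical stand-in for a real agent run: succeeds roughly 70% of the time.
    return rng.random() < 0.7


if __name__ == "__main__":
    rate = pass_rate(flaky_agent, trials=200)
    print(f"pass rate over 200 trials: {rate:.2f}")
```

Reporting a rate over many trials makes it easier to tell a real regression from ordinary run-to-run variance.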
ralph-loop is an autonomous task loop that keeps Claude working on a task until it reaches a defined completion state.
chatgpt-bridge is a skill that generates machine-readable JSON output, following specific rules and protocols for efficient data exchange and processing.
prompt-optimizer is a skill that uses ultrathink principles to optimize and refine prompts, making them inevitable and effective for AI agents and applications.
semantik-plugin-development is a skill that allows developers to create plugins for Semantik, a self-hosted semantic search engine, to extend its capabilities for document ingestion, embedding, and other tasks.
full_auth is a skill that allows AI agents to perform phased implementation plans, executing all phases in sequence until completion, with explicit user permission.
chitti-mcp is a skill that provides decision management capabilities using Chitti's MCP tools, allowing for recording, recalling, and auditing decisions.
git-sync is a skill that provides Git synchronization for roo-extensions, utilizing submodule synchronization and conservative merges.
zerg-testing is a centralized testing framework that uses Make targets for managing AI agents, providing core commands for unit tests, E2E tests, and debugging.
Hotkey is a feature that allows developers to assign custom keyboard shortcuts to specific actions in their AI agents, using constants and registrations.
pr-driver is a skill that automates quality checks for AI agents using bash scripts and JSON validation, ensuring tasks meet specific criteria.
code-style is a set of guidelines for formatting and naming Python code, including the use of Ruff, snake_case, and type hints to ensure consistency and readability.