agentic-engineering — agentic-engineering install agentic-engineering, everything-claude-code, official, agentic-engineering install, ide skills, eval-first execution loop, cost-aware model routing, decompose work AI agent, agentic-engineering for Claude, MCP agentic engineering, Claude Code

Verified
v1.0.0
GitHub

About this Skill

Ideal for Advanced AI Agents like AutoGPT and LangChain requiring eval-first execution and cost-aware model routing for complex task management. agentic-engineering is an AI agent skill for engineering workflows that uses an eval-first execution loop, decomposes work into agent-sized units, and routes model tiers by task complexity to enforce quality and risk controls.

Features

Implements the Eval-First Loop for defining capability and regression evals
Decomposes engineering work into agent-sized units for execution
Routes model tiers based on task complexity for cost-aware operations
Defines completion criteria before agent execution begins
Measures performance with evals and regression checks
Operates with Claude and MCP for AI agent workflows

# Core Topics

affaan-m affaan-m
[62.0k]
[7678]
Updated: 3/6/2026

Agent Capability Analysis

The agentic-engineering skill by affaan-m is an open-source official AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance. Optimized for agentic-engineering install, eval-first execution loop, cost-aware model routing.

Ideal Agent Persona

Ideal for Advanced AI Agents like AutoGPT and LangChain requiring eval-first execution and cost-aware model routing for complex task management.

Core Value

Empowers agents to operate as agentic engineers, leveraging decomposition, eval-first execution, and regression checks to ensure quality and risk controls, while utilizing model tiers for efficient task complexity management.

Capabilities Granted for agentic-engineering

Decomposing complex tasks into agent-sized units for efficient execution
Implementing eval-first loops for capability evaluation and regression testing
Routing model tiers based on task complexity for optimized performance

! Prerequisites & Limits

  • Requires defined completion criteria before execution
  • Dependent on human oversight for quality and risk controls
  • Needs eval and regression checks for effective operation
Labs Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

agentic-engineering

Install agentic-engineering, an AI agent skill for AI agent workflows and automation. Works with Claude Code, Cursor, and Windsurf with one-command setup.

SKILL.md
Readonly

Agentic Engineering

Use this skill for engineering workflows where AI agents perform most implementation work and humans enforce quality and risk controls.

Operating Principles

  1. Define completion criteria before execution.
  2. Decompose work into agent-sized units.
  3. Route model tiers by task complexity.
  4. Measure with evals and regression checks.

Eval-First Loop

  1. Define capability eval and regression eval.
  2. Run baseline and capture failure signatures.
  3. Execute implementation.
  4. Re-run evals and compare deltas.

Task Decomposition

Apply the 15-minute unit rule:

  • each unit should be independently verifiable
  • each unit should have a single dominant risk
  • each unit should expose a clear done condition

Model Routing

  • Haiku: classification, boilerplate transforms, narrow edits
  • Sonnet: implementation and refactors
  • Opus: architecture, root-cause analysis, multi-file invariants

Session Strategy

  • Continue session for closely-coupled units.
  • Start fresh session after major phase transitions.
  • Compact after milestone completion, not during active debugging.

Review Focus for AI-Generated Code

Prioritize:

  • invariants and edge cases
  • error boundaries
  • security and auth assumptions
  • hidden coupling and rollout risk

Do not waste review cycles on style-only disagreements when automated format/lint already enforce style.

Cost Discipline

Track per task:

  • model
  • token estimate
  • retries
  • wall-clock time
  • success/failure

Escalate model tier only when lower tier fails with a clear reasoning gap.

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

? Frequently Asked Questions

What is agentic-engineering?

Ideal for Advanced AI Agents like AutoGPT and LangChain requiring eval-first execution and cost-aware model routing for complex task management. agentic-engineering is an AI agent skill for engineering workflows that uses an eval-first execution loop, decomposes work into agent-sized units, and routes model tiers by task complexity to enforce quality and risk controls.

How do I install agentic-engineering?

Run the command: npx killer-skills add affaan-m/everything-claude-code/agentic-engineering. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for agentic-engineering?

Key use cases include: Decomposing complex tasks into agent-sized units for efficient execution, Implementing eval-first loops for capability evaluation and regression testing, Routing model tiers based on task complexity for optimized performance.

Which IDEs are compatible with agentic-engineering?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for agentic-engineering?

Requires defined completion criteria before execution. Dependent on human oversight for quality and risk controls. Needs eval and regression checks for effective operation.

How To Install

  1. 1. Open your terminal

    Open the terminal or command line in your project directory.

  2. 2. Run the install command

    Run: npx killer-skills add affaan-m/everything-claude-code/agentic-engineering. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. 3. Start using the skill

    The skill is now active. Your AI agent can use agentic-engineering immediately in the current project.

Related Skills

Looking for an alternative to agentic-engineering or another official skill for your workflow? Explore these related open-source skills.

View All

flags

Logo of facebook
facebook

Use when you need to check feature flag states, compare channels, or debug why a feature behaves differently across release channels.

243.6k
0
Developer

extract-errors

Logo of facebook
facebook

Use when adding new error messages to React, or seeing unknown error code warnings.

243.6k
0
Developer

fix

Logo of facebook
facebook

Use when you have lint errors, formatting issues, or before committing code to ensure it passes CI.

243.6k
0
Developer

flow

Logo of facebook
facebook

Use when you need to run Flow type checking, or when seeing Flow type errors in React code.

243.6k
0
Developer