checkpoint-ambiguity-review

v1.0.0

About this Skill

Part of SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement.

SprocketLab
Updated: 3/5/2026

Agent Capability Analysis

The checkpoint-ambiguity-review skill by SprocketLab is an open-source community AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance.

Ideal Agent Persona

Perfect for Code Review Agents needing advanced checkpoint analysis and test validation capabilities.

Core Value

Empowers agents to review checkpoint specifications and tests, identifying ambiguous interpretations and providing rationales and fixes using Markdown and Python files, specifically targeting `problems/{problem}/checkpoint_{N}.md` and `problems/{problem}/tests/test_checkpoint_{N}.py` file formats.

Capabilities Granted for checkpoint-ambiguity-review

Automating checkpoint ambiguity detection in iterative specification refinement
Generating test case reviews for reasonable but non-explicit interpretations
Debugging test failures due to ambiguous checkpoint specifications

Prerequisites & Limits

  • Requires access to problem and checkpoint directories
  • Limited to Markdown and Python file formats
  • Needs the ability to infer spec and test file paths when they are not provided

SKILL.md

Checkpoint Ambiguity Review

Overview

Review a checkpoint's spec and tests to find tests that enforce a reasonable but non-explicit interpretation, and report those cases with rationale and fixes.

Workflow

1) Collect inputs

  • Problem name and checkpoint number (N).
  • Test file path(s) and checkpoint spec path (if not provided, infer):
    • Spec: problems/{problem}/checkpoint_{N}.md
    • Tests: problems/{problem}/tests/test_checkpoint_{N}.py
    • Also scan problems/{problem}/tests/conftest.py and problems/{problem}/tests/data/ if they influence expectations.
  • Optional: snapshot path for ambiguity verification.
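The path inference in step 1 can be sketched as follows (a minimal illustration; the `root` argument and the helper name `infer_paths` are assumptions for this sketch, not part of the skill):

```python
from pathlib import Path


def infer_paths(root: Path, problem: str, n: int) -> dict:
    """Infer spec and test paths for checkpoint N of a problem,
    following the problems/{problem}/... layout described above."""
    base = root / "problems" / problem
    paths = {
        "spec": base / f"checkpoint_{n}.md",
        "tests": base / "tests" / f"test_checkpoint_{n}.py",
    }
    # Optional files that can influence test expectations.
    conftest = base / "tests" / "conftest.py"
    if conftest.exists():
        paths["conftest"] = conftest
    data_dir = base / "tests" / "data"
    if data_dir.is_dir():
        paths["data"] = data_dir
    return paths


print(infer_paths(Path("."), "word_count", 2)["spec"])
# problems/word_count/checkpoint_2.md
```

The optional `conftest.py` and `data/` entries are only included when they exist, mirroring the "also scan ... if they influence expectations" guidance above.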

2) Read the spec and tests

  • Extract explicit requirements from the spec.
  • Map each test assertion to a specific spec clause or an implied behavior.
  • Note any test assumptions that are not spelled out in the spec.
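One way to enumerate test assumptions for step 2 is to list every assertion in the test file so each can be mapped to a spec clause by hand; a minimal sketch using Python's standard `ast` module (the function name and sample are illustrative):

```python
import ast


def list_assertions(source: str) -> list[tuple[int, str]]:
    """Return (line number, code) pairs for every assert statement
    in a test module, ready to be mapped against spec clauses."""
    tree = ast.parse(source)
    return [
        (node.lineno, ast.unparse(node))
        for node in ast.walk(tree)
        if isinstance(node, ast.Assert)
    ]


sample = """
def test_sorted_output():
    result = ["b", "a"]
    assert sorted(result) == ["a", "b"]
"""
for lineno, code in list_assertions(sample):
    print(lineno, code)
```

This only surfaces the assertions; deciding whether each one maps to an explicit clause or an implied behavior remains a manual judgment.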

3) Flag ambiguous interpretations

Only report tests that enforce an interpretation that could reasonably differ given the spec wording. Do not report tests that are simply incorrect against explicit requirements.

Common ambiguity cues:

  • Output ordering when the spec does not mandate order.
  • Tie-breaking rules that are unstated.
  • Whitespace, casing, or formatting details not defined by the spec.
  • Rounding or precision requirements not defined.
  • Error handling for invalid inputs when not specified.
  • Boundary behavior (inclusive/exclusive) not stated.
  • Default values or optional fields not defined.
  • Determinism or randomness expectations not specified.
  • Multiple reasonable data structure representations (list vs set, map order).

4) Optional snapshot verification

If a snapshot is provided, run:

```bash
slop-code --quiet eval-snapshot {snapshot} -p {problem} -o /tmp/eval -c {N} -e configs/environments/docker-python3.12-uv.yaml --json
```

Use failures to corroborate ambiguity, not to invent it. A failing test is ambiguous only if the spec supports multiple reasonable interpretations.

5) Report format

Use the following structure for each ambiguous test:

```markdown
## {test name} ({path}::{node_id})

**Why:** {spec language + alternate interpretation that could be valid}
**Fix:** {proposed test relaxation or spec clarification}
```

Keep entries concise and actionable. If no ambiguity is found, state that clearly (e.g., "No ambiguity issues found.").
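A hypothetical filled-in entry may look like this (the test name and paths below are invented for illustration):

```markdown
## test_tags_strict (problems/word_count/tests/test_checkpoint_2.py::test_tags_strict)

**Why:** The spec says "return the record's tags" but does not mandate an
order; the test asserts one specific ordering, so an unordered
implementation also satisfies the spec.
**Fix:** Compare as a sorted list (or a set), or amend the spec to require
alphabetical order.
```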

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

Frequently Asked Questions

What is checkpoint-ambiguity-review?

checkpoint-ambiguity-review is an open-source community AI agent skill from SprocketLab's SlopCodeBench. It reviews a checkpoint's spec and tests to find tests that enforce a reasonable but non-explicit interpretation, reporting each case with a rationale and a proposed fix.

How do I install checkpoint-ambiguity-review?

Run the command: npx killer-skills add SprocketLab/slop-code-bench/checkpoint-ambiguity-review. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for checkpoint-ambiguity-review?

Key use cases include: Automating checkpoint ambiguity detection in iterative specification refinement, Generating test case reviews for reasonable but non-explicit interpretations, Debugging test failures due to ambiguous checkpoint specifications.

Which IDEs are compatible with checkpoint-ambiguity-review?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for checkpoint-ambiguity-review?

Requires access to problem and checkpoint directories. Limited to Markdown and Python file formats. Needs inferencing capabilities for spec and test file paths.

How To Install

  1. Open your terminal

    Open the terminal or command line in your project directory.

  2. Run the install command

    Run: npx killer-skills add SprocketLab/slop-code-bench/checkpoint-ambiguity-review. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. Start using the skill

    The skill is now active. Your AI agent can use checkpoint-ambiguity-review immediately in the current project.

Related Skills

Looking for an alternative to checkpoint-ambiguity-review or another community skill for your workflow? Explore these related open-source skills.


  • widget-generator (f)
  • flags (vercel) — a Next.js feature-management skill for adding or modifying framework feature flags in React applications.
  • zustand (lobehub)
  • data-fetching (lobehub)