agent-skill-tdd — Install & Live Demo

Why use it

Key features

6-step workflow: Serena -> Investigate -> Clarify -> Document -> Design Review -> TDD
Timestamped .requirements/ docs per task
Explicit requirements confirmation before code
Architecture review via Lad MCP before implementation
Per-change peer review on every edit
Agent-agnostic — Claude Code, Codex, Cursor, Windsurf, Antigravity

Live Demo

What it looks like in practice

agent-skill-tdd-skill.replay ▶ ready

0/0

Install

Pick your client

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "agent-skill-tdd-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
        "~/.claude/skills/agent-skill-tdd"
      ],
      "_inferred": true
    }
  }
}

Open Claude Desktop → Settings → Developer → Edit Config. Restart after saving.

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "agent-skill-tdd-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
        "~/.claude/skills/agent-skill-tdd"
      ],
      "_inferred": true
    }
  }
}

Cursor uses the same mcpServers schema as Claude Desktop. Project config wins over global.

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "agent-skill-tdd-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
        "~/.claude/skills/agent-skill-tdd"
      ],
      "_inferred": true
    }
  }
}

Click the MCP Servers icon in the Cline sidebar, then "Edit Configuration".

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "agent-skill-tdd-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
        "~/.claude/skills/agent-skill-tdd"
      ],
      "_inferred": true
    }
  }
}

Same shape as Claude Desktop. Restart Windsurf to pick up changes.

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "agent-skill-tdd-skill",
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
        "~/.claude/skills/agent-skill-tdd"
      ]
    }
  ]
}

Continue uses an array of server objects rather than a map.

~/.config/zed/settings.json

{
  "context_servers": {
    "agent-skill-tdd-skill": {
      "command": {
        "path": "git",
        "args": [
          "clone",
          "https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd",
          "~/.claude/skills/agent-skill-tdd"
        ]
      }
    }
  }
}

Add to context_servers. Zed hot-reloads on save.

claude mcp add agent-skill-tdd-skill -- git clone https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd ~/.claude/skills/agent-skill-tdd

One-liner. Verify with claude mcp list. Remove with claude mcp remove.

Use Cases

Real-world ways to use agent-skill-tdd

Implement a new feature with full TDD discipline

👤 Teams where AI coders jump to implementation and miss requirements ⏱ ~120 min intermediate

When to use: Non-trivial feature where 'just write it' has burned you before.

Prerequisites

Skill cloned — git clone https://github.com/Shelpuk-AI-Technology-Consulting/agent-skill-tdd ~/.claude/skills/agent-skill-tdd
Serena available — Install Serena MCP server
Lad MCP for design review — Install Lad MCP server

Flow

Kick off with the skill

Use agent-skill-tdd. I want to add a 'saved searches' feature. Start from step 1 — activate Serena and investigate.✓ Copied

→ Claude reads code, summarizes current state
Clarify requirements

Confirm what I just said maps to these acceptance criteria — don't code yet.✓ Copied

→ Back-and-forth until confirmed; .requirements/ file saved
Design review via Lad

Run design review — architecture fit, hotspots.✓ Copied

→ Design critique, not just approval
TDD loop

Now begin TDD — write a failing test, minimum code to pass, refactor. Peer review each step.✓ Copied

→ Red-green-refactor cycles, each reviewed

Outcome: Feature shipped with tests, requirements history, and design rationale preserved.

Pitfalls

Skipping clarify to 'save time' — That's where most AI coding value is lost — hold the line

Combine with: filesystem · github

Fix a bug after writing down what correct behavior should be

👤 Devs tired of AI 'fixes' that break other paths ⏱ ~60 min intermediate

When to use: A subtle bug where the reproduction is half the battle.

Flow

Investigate + document

Use agent-skill-tdd bug mode. Symptom: <description>. Write the reproduction + expected behavior to .requirements/ first.✓ Copied

→ Timestamped doc with As-Is and To-Be
Write failing test

Test that expresses the expected behavior — currently failing.✓ Copied

→ Red test
Fix + peer review

Minimum fix to go green. Peer review.✓ Copied

→ Green + review notes

Outcome: A bug fix with a test preventing regression.

Stop before implementing — validate you understand the spec

👤 Teams where spec-misreading causes rework ⏱ ~30 min beginner

When to use: The ticket is ambiguous and you suspect you're about to build the wrong thing.

Flow

Only run steps 1–4

Use agent-skill-tdd — investigate + clarify + document + design review. Stop before coding.✓ Copied

→ Requirements doc + design review without touching code

Outcome: You either discover you misread the ticket or greenlight with confidence.

Combinations

Pair with other MCPs for X10 leverage

agent-skill-tdd-skill + filesystem

Keep .requirements/ in the repo for future agents to read

Commit .requirements/ to git so the next session has context.✓ Copied

agent-skill-tdd-skill + github

Link requirements doc in the PR description

When opening the PR, paste the .requirements/ file path in the description.✓ Copied

Tools

What this MCP exposes

Tool	Inputs	When to call	Cost
activate_serena	repo path	Step 1	0 — local
investigate	task description	Step 2	0
clarify	task, user answers	Step 3	0
document_requirements	confirmed requirements	Step 4	0
design_review	requirements + repo	Step 5	0
tdd_cycle	requirements	Step 6	0

Cost & Limits

What this costs to run

API quota: None from the skill; Serena and Lad have their own footprints
Tokens per call: Moderate — each step is a round trip
Monetary: Free
Tip: Full 6-step is overkill for trivial tasks — run steps 1–4 only on small work

Security

Permissions, secrets, blast radius

Credential storage: None

Data egress: Only via Serena/Lad if configured

Troubleshooting

Common errors and fixes

Serena not available

Install the Serena MCP server and register it with your agent

Requirements doc not created

Ensure filesystem write access to .requirements/

Verify: ls .requirements/

Design review step feels rubber-stamp

Force an adversarial framing: 'what's the strongest objection to this design?'

Alternatives

agent-skill-tdd vs others

Alternative	When to use it instead	Tradeoff
claude-gpt-workflow	You want cross-model review instead of self-TDD discipline	Higher API cost
planning-with-files-skill	You want persistence across sessions but less TDD rigor	No test-first enforcement

More

Resources

📖 Read the official README on GitHub

🐙 Browse open issues

🔍 Browse all 400+ MCP servers and Skills