Pressure-test your specs with LLM reasoning before writing code.
Vibe testing is a technique for validating specification documents by simulating real-world scenarios against them. An LLM reads your spec docs, traces through a concrete user scenario step by step, and flags every gap, conflict, and ambiguity — before anyone writes a line of implementation.
We test code obsessively. Unit tests, integration tests, E2E tests. But specifications? We "review" them in a meeting.
Vibe testing moves the discovery of design flaws to the cheapest possible moment: before implementation begins.
1. Write a scenario: a named persona, a concrete goal, step-by-step interaction
2. Give an LLM all your spec docs + the scenario
3. The LLM traces each step, identifies governing specs, flags gaps
4. You get a structured gap report with severity ratings
No code. No test harness. Just reasoning.
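The reasoning itself needs no code, but if you want to script step 2 rather than paste files by hand, the input can be assembled as below. This is a minimal sketch, not the skill's own implementation: `Scenario`, `build_prompt`, the prompt wording, and the one-line `payments.md` text are all hypothetical placeholders. The templates shipped in `references/simulator-prompt.md` are the authoritative version.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    """A vibe-test scenario: who, what they want, and how they interact."""
    persona: str
    goal: str
    steps: list[str]


def build_prompt(spec_docs: dict[str, str], scenario: Scenario) -> str:
    """Combine every spec document and one scenario into a single prompt.

    The wording here is illustrative only (an assumption, not the skill's
    actual template).
    """
    # One labelled section per spec file, in a stable (sorted) order.
    spec_section = "\n\n".join(
        f"=== {name} ===\n{text.strip()}" for name, text in sorted(spec_docs.items())
    )
    # Number the steps so findings can refer back to "step 2", "step 3", ...
    step_section = "\n".join(
        f"{number}. {step}" for number, step in enumerate(scenario.steps, start=1)
    )
    return (
        "Trace the scenario below against the specification documents.\n"
        "For each step, name the governing spec and flag any gap, conflict, "
        "or ambiguity.\n"
        "Rate each finding as BLOCKING, DEGRADED, or COSMETIC.\n\n"
        f"SPECS\n{spec_section}\n\n"
        f"SCENARIO\nPersona: {scenario.persona}\nGoal: {scenario.goal}\n"
        f"Steps:\n{step_section}\n"
    )


scenario = Scenario(
    persona="First-time buyer",
    goal="Complete checkout after a declined payment",
    steps=[
        "Add item to cart",
        "Pay with a card that is declined",
        "Retry with a new card",
    ],
)
# Placeholder spec text; in practice, read every file in your docs directory.
prompt = build_prompt({"payments.md": "Cards are charged at order submission."}, scenario)
print(prompt)
```

Numbering the steps up front makes it easier for the report to point at the exact step where a spec stops answering.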
Works with Claude Code, OpenAI Codex, Gemini CLI, Cursor, GitHub Copilot, OpenCode, and any tool supporting the Agent Skills open standard. To install, clone the repository into the skills directory your agent reads:
Claude Code (personal, available in all projects):
git clone https://github.com/knot0-com/vibe-testing.git ~/.claude/skills/vibe-testing
Codex:
git clone https://github.com/knot0-com/vibe-testing.git ~/.codex/skills/vibe-testing
Gemini CLI:
git clone https://github.com/knot0-com/vibe-testing.git ~/.gemini/skills/vibe-testing
Generic `~/.agent` location:
git clone https://github.com/knot0-com/vibe-testing.git ~/.agent/skills/vibe-testing
Project-level (run from the project root; applies to that repository only):
git clone https://github.com/knot0-com/vibe-testing.git .claude/skills/vibe-testing
Once installed, the skill activates when you ask your coding agent to validate specs:
> /vibe-testing
> "Test my specs against a realistic scenario"
> "Find gaps in the architecture docs before we start building"
> "Vibe test the design docs in docs/v2/"
vibe-testing/
├── SKILL.md                      # The skill definition (Agent Skills standard)
├── references/
│   └── simulator-prompt.md       # Copy-paste prompt templates
└── examples/
    └── example-vibe-test.md      # Complete example: e-commerce checkout flow
Vibe tests produce a structured gap report:
| Severity | Meaning |
|---|---|
| BLOCKING | Spec cannot answer. Implementation impossible without resolution. |
| DEGRADED | Workaround exists but it's fragile. |
| COSMETIC | Missing convenience. Not a correctness issue. |
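To make the severity levels concrete, here is one way a gap report could be represented and triaged once the LLM returns it. This is a sketch under assumptions: `Severity`, `Gap`, and `summarize` are hypothetical names, the report format the skill actually emits may differ, and the three findings are placeholders rather than results from the included example.

```python
from collections import Counter
from dataclasses import dataclass
from enum import IntEnum


class Severity(IntEnum):
    """Lower value = more urgent, so sorting puts BLOCKING first."""
    BLOCKING = 0   # Spec cannot answer; must be resolved before implementation.
    DEGRADED = 1   # A workaround exists but is fragile.
    COSMETIC = 2   # Missing convenience, not a correctness issue.


@dataclass
class Gap:
    step: int            # Scenario step where the gap surfaced.
    severity: Severity
    description: str


def summarize(gaps: list[Gap]) -> tuple[list[Gap], Counter, bool]:
    """Order gaps by urgency, count them per severity, and report readiness."""
    ordered = sorted(gaps, key=lambda gap: (gap.severity, gap.step))
    counts = Counter(gap.severity.name for gap in gaps)
    # Any BLOCKING gap means the spec is not ready to implement.
    ready = counts["BLOCKING"] == 0
    return ordered, counts, ready


# Placeholder findings for illustration only.
gaps = [
    Gap(step=3, severity=Severity.COSMETIC, description="placeholder finding C"),
    Gap(step=2, severity=Severity.BLOCKING, description="placeholder finding A"),
    Gap(step=2, severity=Severity.DEGRADED, description="placeholder finding B"),
]
ordered, counts, ready = summarize(gaps)
for gap in ordered:
    print(f"[{gap.severity.name}] step {gap.step}: {gap.description}")
print("Ready to implement:", ready)
```

Treating any BLOCKING gap as a hard stop mirrors the table: those are the findings that make implementation impossible until the spec is fixed.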
The included example tests an e-commerce checkout against specs for auth, payments, inventory, orders, notifications, and shipping. A single scenario — "first-time buyer, payment declined, retries with new card" — found:
Each would have been a rewrite-level discovery weeks into implementation.
License: MIT
Built by Knot0 — software that assembles itself.