A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Structured AI development workflow template for Claude Code. Multi-agent orchestration with specialist coding agents, pa
A structured multi-agent development workflow for Claude Code. Named after a capybara who watches from the sidelines.
You tell Claude Code to "implement issue #42" and it writes code. Sometimes great code. Sometimes it misses edge cases, skips tests, introduces scope creep, or produces something that doesn't match the acceptance criteria. You catch it in review — after it already committed.
Graybark Workflow adds structure to the chaos. Instead of one AI session doing everything, it orchestrates a team of specialist agents with built-in quality gates that catch problems before they reach your PR.
Issue
│
▼
┌─────────────────────────┐
│ ORCHESTRATOR │ ← Main Claude Code session
│ (reads, plans, routes) │
└──────┬──────────┬────────┘
│ │
┌────▼───┐ ┌───▼────┐
│Backend │ │Frontend│ ← Specialist agents in isolated worktrees
│ agent │ │ agent │ (code + tests, run in parallel)
└────┬───┘ └───┬────┘
│ │
▼ ▼
┌─────────────────────────┐
│ MERGE + VERIFY │ ← verify.sh: lint, types, tests, secrets, scope
└──────────┬──────────────┘
│
┌────────┼────────┐
▼ ▼ ▼
┌────┐ ┌────┐ ┌──────┐
│Corr│ │Sec │ │ Conv │ ← Review agents (parallel, JSON verdicts)
└──┬─┘ └──┬─┘ └──┬───┘
│ │ │
▼ ▼ ▼
┌─────────────────────────┐
│ EVALUATE & COMMIT │ ← Auto-parse reviews, commit if clean
└─────────────────────────┘
│
▼
Compound Learning ← Document what was learned
If verify or review fails → the orchestrator sends feedback to the specialist and retries. Max 4 iterations, then it escalates with a graded label: needs-human-p2 (quick fix possible), needs-human-p1 (unclear how to proceed), or needs-human-p0 (requires architectural or product decision).
| Layer | What | Purpose |
|---|---|---|
| Workflow | implement-issue.md | 11-step issue→PR pipeline |
| Specialists | frontend, backend, architect agents | Code + test in worktree isolation |
| Advisors | UX designer, scope guardian | Advise without coding |
| Reviewers | correctness, security, conventions | Parallel diff review with JSON verdicts |
| Quality gate | verify.sh | Blocks on lint, type, test, secret, scope failures |
| Ship-check | ready.sh | Branch-ship-readiness audit before landing |
| Skills | compound-learning, parallel-dispatch, ideate, compress-logs | Reusable agent routines |
| Rules | always, on-frontend, on-backend, on-migration, on-testing | Injected into agent prompts based on affected files |
| Hooks | .claude/settings.json | Workflow-gate, git add -A-blocker, verify-before-commit nudge |
# Option A: Clone and copy
git clone git@github.com:PontusNyberg/Graybark-workflow.git
cp -r Graybark-workflow/.ai your-project/.ai
cp -r Graybark-workflow/.claude your-project/.claude
cp Graybark-workflow/CLAUDE.md your-project/CLAUDE.md
# Option B: Use as GitHub template
# Click "Use this template" on GitHub
Every project-specific part is marked with TODO:. Find them all:
grep -r "TODO:" .ai/ .claude/ CLAUDE.md
The main things to customize:
CLAUDE.md — Your project name, tech stack, file paths, hard requirements.claude/agents/*.md — Your specialists' expertise and framework knowledge.ai/scripts/verify.sh — Your linter, type checker, and test commands.ai/rules/on-*.md — Your coding conventions per domainYou: implement #42
Claude: *reads issue, loads rules, plans work packages, dispatches specialists...*
That's it. The workflow handles the rest.
.ai/
├── CLAUDE.md # Agent system overview
├── workflows/
│ └── implement-issue.md # The 11-step workflow
├── rules/
│ ├── always.md # Universal rules (scope, types, git, quality)
│ ├── on-frontend.md # Frontend conventions
│ ├── on-backend.md # Backend conventions
│ ├── on-migration.md # Database migration rules
│ └── on-testing.md # Test writing rules
├── agents/
│ ├── reviewer-correctness.md # "Does it fulfill the acceptance criteria?"
│ ├── reviewer-security.md # "Is it secure?"
│ └── reviewer-conventions.md # "Does it follow our patterns?"
├── scripts/
│ ├── verify.sh # Quality gate (customize this!)
│ ├── ready.sh # Branch-ship-readiness audit before landing
│ └── evaluate-reviews.sh # Parses reviewer JSON verdicts
├── skills/
│ ├── compound-learning.md # Post-implementation learning capture
│ ├── parallel-dispatch.md # Dependency analysis for parallel agents
│ ├── ideate.md # Proactive improvement identification
│ ├── compress-logs.md # Compress iteration logs for long sessions
│ └── triggers.yml # When to inject which skill
└── logs/ # Session-local (gitignored)
.claude/
├── agents/
│ ├── frontend-developer.md # Codes UI, components, pages
│ ├── backend-developer.md # Codes API, database, auth
│ ├── tech-lead.md # Codes config, CI/CD, architecture
│ ├── product-designer.md # UX advice (no code)
│ ├── product-skeptic.md # Scope control (no code)
│ └── TEAM.md # Who does what
└── settings.json # Hooks: workflow-gate, git-add-A blocker, verify nudge
CLAUDE.md # Entry point — Claude reads this first
docs/
├── solutions/ # Compound learning artifacts
├── brainstorms/ # Ideation output
├── plans/ # Technical plans
└── retros/ # Sprint retrospectives
Each specialist agent gets a full git worktree — an isolated copy of the repo. This means:
Three reviewers with different lenses catch more than one reviewer trying to check everything:
They run in parallel (fast) and output structured JSON (parseable).
Without a limit, the agent will retry forever on a fundamentally broken approach. Max 4 iterations forces escalation to a human when the problem is deeper than a quick fix.
Every multi-iteration issue teaches something. The compound-learning skill captures why things went wrong and feeds it back into rules, skills, or verify.sh checks — so the same mistake doesn't happen twice.
| Step | What happens | Who |
|---|---|---|
| 1. Prepare | Read issue, load rules | Orchestrator |
| 2. Validate | Check acceptance criteria are clear | Orchestrator |
| 3. Consult | Ask UX/scope advisors (optional) | Advisors |
| 4. Plan | Break into work packages, dependency analysis | Orchestrator |
| 4b. Match skills | Check triggers.yml for matching skills | Orchestrator |
| 5. Dispatch | Spawn specialists in worktrees | Specialists |
| 5b. Merge | Merge worktree branches to feature branch | Orchestrator |
| 6. Verify completeness | All ACs covered? All planned files changed? | Orchestrator |
| 7. Verify quality | Run verify.sh (lint, types, tests, secrets) | verify.sh |
| 8. Review | Parallel review (3 generic + cross-review) | Reviewers |
| 9. Evaluate | Parse JSON verdicts, check for blockers | evaluate-reviews.sh |
| 10. Land | Stage, commit, push, create PR — verify branch is up-to-date (Landing Protocol) | Orchestrator |
| 11. Learn | Document insights in docs/solutions/ | Orchestrator |
Step 10's Landing Protocol treats an issue as incomplete until git status confirms the branch is committed, pushed, and up to date with the remote — no "I committed locally" half-states.
If step 7 or 9 fails → back to step 5 with error context. Max 4 loops.
After copying into your project:
CLAUDE.md — Project name, tech stack, file paths, hard requirements.claude/agents/frontend-developer.md — Your frontend framework and patterns.claude/agents/backend-developer.md — Your backend framework and patterns.claude/agents/tech-lead.md — Your architecture and CI/CD.ai/rules/always.md — Adjust or remove monorepo-specific rules.ai/rules/on-frontend.md — Component structure, styling, testing approach.ai/rules/on-backend.md — API framework, database, auth patterns.ai/rules/on-migration.md — Your migration tool and naming conventions.ai/rules/on-testing.md — Test runner, file placement, mocking strategy.ai/scripts/verify.sh — Critical: add your linter, type checker, and test commands.ai/scripts/ready.sh — Replace TODO/REPO placeholder with your owner/repo.ai/skills/triggers.yml — Add skills for your recurring patterns.claude/settings.json — Review hooks (workflow-gate paths, etc.) and adjust to your project.gitignore — Your build output, dependencies, etc.gh) for issue reading and PR creationjq for review evaluation (optional but recommended)This workflow was extracted from a production project (PennyKoll) where it orchestrated 17+ sprints of development. The project-specific parts were replaced with TODO: markers, but the orchestration patterns, quality gates, and agent definitions are battle-tested.
MIT
Portable skills, agents, and templates that add Spec-Driven Development and TDD workflows to any Claude Code project. De
Template para nuevos proyectos con Claude Code: CLAUDE.md, slash commands y documentación automática
ATLAS: a senior-engineer layer for Claude Code. Explore with wireframes & prototypes, clarify the essentials, capture it
Claude Code governance templates by tech stack : CLAUDE.md, scoped rules, architecture docs, cost control & dev-level ad