A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available
Security auditor for AI agent configurations
Scans Claude Code setups for hardcoded secrets, permission misconfigs,
hook injection, MCP server risks, and agent prompt injection vectors.
Available as CLI, GitHub Action, and GitHub App integration.
Quick Start · What It Catches · API Reference · Opus Pipeline · GitHub Action · Distribution · MiniClaw · Changelog
The AI agent ecosystem is growing faster than its security tooling. In January 2026 alone:
Developers install community skills, connect MCP servers, and configure hooks without any automated way to audit the security of their setup. AgentShield scans your .claude/ directory and flags vulnerabilities before they become exploits.
Built at the Claude Code Hackathon (Cerebral Valley x Anthropic, Feb 2026). Part of the Everything Claude Code ecosystem (42K+ stars).
# Scan your Claude Code config (no install required)
npx ecc-agentshield scan
# Or install globally
npm install -g ecc-agentshield
agentshield scan
That's it. AgentShield auto-discovers your ~/.claude/ directory, scans all config files, and prints a graded security report.
Discovery intentionally skips common generated directories such as node_modules, build output, and .dmux worktree mirrors so transient copies do not duplicate findings.
AgentShield Security Report
Grade: F (0/100)
Score Breakdown
Secrets ░░░░░░░░░░░░░░░░░░░░ 0
Permissions ░░░░░░░░░░░░░░░░░░░░ 0
Hooks ░░░░░░░░░░░░░░░░░░░░ 0
MCP Servers ░░░░░░░░░░░░░░░░░░░░ 0
Agents ░░░░░░░░░░░░░░░░░░░░ 0
● CRITICAL Hardcoded Anthropic API key
CLAUDE.md:13
Evidence: sk-ant-a...cdef
Fix: Replace with environment variable reference [auto-fixable]
● CRITICAL Overly permissive allow rule: Bash(*)
settings.json
Evidence: Bash(*)
Fix: Restrict to specific commands: Bash(git *), Bash(npm *), Bash(node *)
Summary
Files scanned: 6
Findings: 73 total — 19 critical, 29 high, 15 medium, 4 low, 6 info
Auto-fixable: 8 (use --fix)
# Scan a specific directory
agentshield scan --path /path/to/.claude
# Auto-fix safe issues (replaces hardcoded secrets with env var references)
agentshield scan --fix
# JSON output for CI pipelines
agentshield scan --format json
# Generate an HTML executive security report
agentshield scan --format html > report.html
# Generate a portable audit bundle
agentshield scan --evidence-pack ./agentshield-evidence
# Three-agent Opus 4.6 adversarial analysis (requires ANTHROPIC_API_KEY)
agentshield scan --opus --stream
# Generate a secure baseline config
agentshield init
JSON reports now expose findings[].runtimeConfidence when AgentShield can distinguish active runtime config from project-local settings, template/example inventories, installed Claude plugin caches, declarative plugin manifests, and manifest-resolved non-shell hook implementations. Reports also include local harness adapter evidence for Claude Code, OpenCode, Codex, Gemini, Zed, VS Code, dmux, terminal-agent wrappers, and project-local templates when matching markers are present.
102 rules across 5 categories, graded A–F with a 0–100 numeric score.
| What | Examples |
|---|---|
| API keys | Anthropic (sk-ant-), OpenAI (sk-proj-/sk-), xAI (xai-), AWS (AKIA), Google/Gemini (AIza), Stripe (sk_test_/sk_live_) |
| Tokens | GitHub PATs (ghp_/github_pat_), Linear (lin_api_), Cloudflare (CF_API_TOKEN=), Slack (xox[bprs]-), JWTs (eyJ...), Bearer tokens |
| Credentials | Hardcoded passwords, database connection strings (postgres/mongo/mysql/redis), private key material |
| Env leaks | Secrets passed through environment variables in configs, echo $SECRET in hooks |
| What | Examples |
|---|---|
| Wildcard access | Bash(*), Write(*), Edit(*) — unrestricted tool permissions |
| Missing deny lists | No deny rules for rm -rf, sudo, chmod 777 |
| Dangerous flags | --dangerously-skip-permissions usage |
| Mutable tool exposure | All mutable tools (Write, Edit, Bash) allowed without scoping |
| Destructive git | git push --force, git reset --hard in allowed commands |
| Unrestricted network | curl *, wget, ssh *, scp * in allow list without scope |
| What | Examples |
|---|---|
| Command injection | ${file} interpolation in shell commands — attacker-controlled filenames become code |
| Data exfiltration | curl -X POST with variable interpolation sending data to external URLs |
| Silent errors | 2>/dev/null, || true — failing security hooks that silently pass |
| Missing hooks | No PreToolUse hooks, no Stop hooks for session-end validation |
| Network exposure | Unthrottled network requests in hooks, sensitive file access without filtering |
| Session startup | SessionStart hooks that download and execute remote scripts |
| Package installs | Global npm install -g, pip install, gem install, cargo install in hooks |
| Container escape | Docker --privileged, --pid=host, --network=host, root volume mounts |
| Credential access | macOS Keychain, GNOME Keyring, /etc/shadow reads |
| Reverse shells | /dev/tcp, mkfifo + nc, Python/Perl socket shells |
| Clipboard access | pbcopy, xclip, xsel, wl-copy — exfiltration via clipboard |
| Log tampering | journalctl --vacuum, rm /var/log, history -c — anti-forensics |
| What | Examples |
|---|---|
| High-risk servers | Shell/command MCPs, filesystem with root access, database MCPs, browser automation |
| Supply chain | npx -y auto-install without confirmation — typosquatting vector |
| Hardcoded secrets | API tokens in MCP environment config instead of env var references |
| Remote transport | MCP servers connecting to remote URLs (SSE/streamable HTTP) |
| Shell metacharacters | &&, |, ; in MCP server command arguments |
| Missing metadata | No version pin, no description, excessive server count |
| Sensitive file args | .env, .pem, credentials.json passed as server arguments |
| Network exposure | Binding to 0.0.0.0 instead of localhost |
| Auto-approve | autoApprove settings that skip user confirmation for tool calls |
| Missing timeouts | High-risk servers without timeout — resource exhaustion risk |
Supply-chain verification (agentshield scan --supply-chain) extracts MCP
package references plus root package.json and package-lock.json dependency
evidence, then reports provenance counts for npm vs git, pinned vs unpinned,
known-good packages, and npm-registry-backed metadata. Add
--supply-chain-online to query npm for downloads, maintainers, postinstall
scripts, deprecation, and package age.
Package-manager hardening checks also scan .npmrc, .yarnrc.yml, and
pnpm-workspace.yaml for plaintext registry credentials, explicit dependency
lifecycle-script enablement, and missing or weak release-age cooldowns where the
package manager supports them. npm configs are checked for lifecycle-script
blocking and unsupported release-age keys that can create false confidence; use
pnpm minimumReleaseAge / minimum-release-age, Yarn npmMinimalAgeGate, or an
external package-manager policy wrapper for cooldown enforcement.
AgentShield also scans AI developer-tool persistence surfaces used by recent npm
and PyPI campaign payloads, including Claude Code hook settings,
.claude/router_runtime.js, VS Code tasks.json folder-open automation,
Zed project tasks.json, .vscode/setup.mjs, .zed/setup.mjs, GitHub
workflow drop-ins, LaunchAgent/systemd dead-man switch artifacts,
gh-token-monitor token-store files, metadata-service credential targets, and
reported exfiltration or second-stage network indicators. These indicators are
emitted as critical hook findings so CI can fail fast even after the malicious
package has been uninstalled.
AgentShield scans both active MCP config and repository-shipped MCP templates.
mcp.json, .claude/mcp.json, .claude.json, and active settings.json should be treated as the highest-confidence runtime exposure.settings.local.json are emitted as runtimeConfidence: project-local-optional.mcp-configs/, config/mcp/, or configs/mcp/ indicate risky MCP definitions present in repository templates, not guaranteed active runtime enablement.runtimeConfidence: active-runtime | project-local-optional | template-example | docs-example | plugin-cache | plugin-manifest | hook-code.template-example MCP findings are score-weighted at 0.25x, and one template file is capped at 10 deduction points per score category so a single MCP catalog cannot score like dozens of enabled servers.npx -y, unpinned packages, and environment inheritance are still valuable, but they should be interpreted as "this repo ships a risky MCP template" rather than "this MCP is definitely enabled right now."| What | Examples |
|---|---|
| Unrestricted tools | Agents with Bash access, no allowedTools restriction |
| Prompt injection surface | Agents processing external/user-provided content without defenses |
| Auto-run instructions | CLAUDE.md containing "Always run", "without asking", "automatically install" |
| Hidden instructions | Unicode zero-width characters, HTML comments, base64-encoded directives |
| URL execution | CLAUDE.md instructing agents to fetch and execute remote URLs |
| Time bombs | Delayed execution instructions triggered by time or absence conditions |
| Data harvesting | Bulk collection of passwords, credentials, or database dumps |
| Prompt reflection | ignore previous instructions, you are now, DAN jailbreak, fake system prompts |
| Output manipulation | always report ok, remove warnings from output, suppress security findings |
Structured JSON under .claude/subagents/ and .claude/slash-commands/ is analyzed like agent config when it declares allowedTools or similar tool metadata. Freeform skill-md prompt text still has narrower security coverage than agent-md and CLAUDE.md.
false-positive-audit.md.Triage Rules For Current Reports section.False-Positive Taxonomy, Repo Audit Worksheet, and Release Gate For Accuracy Changes.hooks-no-pretooluse when a companion hooks/hooks.json manifest defines PreToolUse hooks.hooks/hooks.json-style indirection; shell targets continue through hook rules, and non-shell hook-code targets now emit targeted findings for explicit output(...) context injection, transcript input access, and remote shell payloads executed via child-process wrappers.hook-code signals, and skill-md prompt text that still bypasses most agent/injection rules.runtimeConfidence now appears on MCP findings, settings.local.json, docs/examples, installed Claude plugin caches, plugin manifests, and manifest-resolved non-shell hook code. Scoring discounts non-secret template-example and docs-example findings at 0.25x, non-secret project-local-optional findings at 0.75x, and non-secret plugin-cache / plugin-manifest findings at 0.5x. Non-secret template-example findings are also capped at 10 deduction points per file and score category so one catalog file cannot dominate the grade. hook-code findings currently stay at full weight, but the active rules there are narrow language-aware implementation signals.template-example means "repo ships this risky template", not "this is definitely enabled right now."docs-example means "repo ships risky sample guidance", not "this example is active runtime config."plugin-cache means "installed plugin content is present on disk", not "this file is top-level runtime config"; real secrets still stay critical.plugin-manifest means "the repo declares this hook behavior", while hook-code means "the scanner reached the referenced non-shell implementation."CLAUDE.md example file for scanning, but still suppress the rest of the nested example subtree unless a runtime companion exists.docs/, commands/, examples/, samples/, demo/, tutorial/, guide/, cookbook/, and playground/ heuristics can still be treated as live config until broader example-root classification lands.CLAUDE.md roots under docs/ are now skipped unless runtime config companions exist in the same subtree.Bash(curl https://...) and Bash(wget https://...) allow entries with pinned literal URLs no longer trigger the generic permissions-permissive-* finding; wildcard and dynamic network permissions still do.Bash(node scripts/foo.js ...) and Bash(python3 ./tools/audit.py ...) wrapper commands no longer trigger the generic interpreter-access finding; inline eval forms such as node -e and python -c still do.Bash(docker ps) and Bash(docker image ls) no longer trigger the generic Docker-access finding; execution-oriented forms such as docker run and docker exec still do.settings.local.json allowlists now downgrade permissions-no-deny-list from high to medium when every allow entry is fully specified; wildcard or dynamic project-local permissions still keep the higher severity.settings.local.json allowlists now also downgrade hooks-no-pretooluse from medium to low; broader or network-capable project-local configs still keep the higher severity../ now grade lower than unrestricted root/home filesystem access; root-level filesystem exposure still stays high.fetch(userProvidedUrl) no longer triggers agents-injection-surface; direct instructions to fetch/process external content still do.agents-explorer-write now uses role metadata and the lead agent intro instead of any later workflow/example text, so procedural search for ... steps in normal worker prompts no longer get mislabeled as explorer-style agents. Example: chief-of-staff.md no longer trips that rule just because it contains gog gmail search ....agents-oversized-prompt now measures effective prompt size instead of raw file length, discounting fenced code blocks and Markdown tables. Example-heavy agents like chief-of-staff.md and planner.md no longer trip the rule, while prose-heavy agents still do.docs/, commands/, examples/, tutorials/, and demos/ are now suppressed when the surrounding context is clearly instructional; that suppression does not apply to normal agent/config markdown.Use this workflow when a repo scan looks noisy or when you are tuning AgentShield rules.
runtimeConfidence, and score impact directly.Recommended commands:
agentshield scan --path /repo --format json > report.json
jq '.findings | group_by(.runtimeConfidence // "none") | map({
runtimeConfidence: (.[0].runtimeConfidence // "none"),
count: length
})' report.json
jq '.findings | group_by(.file) | map({
file: .[0].file,
count: length
}) | sort_by(-.count)[:20]' report.json
Confidence-first triage commands:
# Highest-signal findings first: active runtime + project-local
jq '.findings
| map(select((.runtimeConfidence // "active-runtime") | IN("active-runtime","project-local-optional")))
| map(select(.severity | IN("critical","high","medium")))
| map({file, severity, runtimeConfidence, title})' report.json
# Lower-confidence inventory that usually needs interpretation, not suppression
jq '.findings
| map(select((.runtimeConfidence // "") | IN("template-example","docs-example","plugin-cache","plugin-manifest")))
| map({file, severity, runtimeConfidence, title})' report.json
Recommended audit order:
active-runtime and project-local-optional: treat as highest-signal findings first.template-example, docs-example, and plugin-cache: confirm whether the repo or installed plugin cache is shipping risky guidance versus actually enabling it.plugin-manifest: confirm whether the risk is in declarative hook wiring or the referenced implementation.hook-code: confirm whether the implementation actually injects context, reads transcripts, or shells out in a risky way.Rule-design guidelines:
hook-code rules over generic wrapper heuristics. spawnSync("bash", ["-lc", "curl ... | bash"]) is high-signal; ordinary spawnSync("prettier", ...) is not.docs/, commands/, examples/, example/, samples/, sample/, demo/, demos/, tutorial/, tutorials/, guide/, guides/, cookbook/, and playground/ as example-like paths.When you change rule accuracy, update both:
false-positive-audit.md with the new baseline and remaining gapsagentshield scan --corpus-gate in CI so the built-in attack corpus must stay fully detected; failed corpus gates now include a prioritized accuracy improvement plan by category, missing rule, and missed configFalse-Positive Issue TemplateThe current scan profile is not dominated by broken matchers. It is mostly dominated by lower-confidence source kinds that need different interpretation.
Current patterns from the latest live scans:
everything-claude-codeRecurring pattern signatures to recognize:
agents-* clusters across files with explicit tool metadata usually mean policy review, not false-positive suppressionproject-local-optional clusters usually mean scope is already modeled and only severity may need tuningWhen to open a false-positive issue instead of just triaging the report:
active-runtimeruntimeConfidenceRecommended operating model:
runtimeConfidence before changing any rule. Separate active-runtime from template-example, docs-example, plugin-cache, plugin-manifest, and project-local-optional.hook-code, add only narrow language-aware rules for explicit risky behavior. Avoid broad wrapper heuristics.Repo conventions that help AgentShield scan accurately:
mcp-configs/ instead of live runtime config filessettings.local.jsondocs/, examples/, tutorials/, demos/, or guides/hooks/hooks.json declarative and put the implementation in separate hook script filesCurrent high-value places to audit first:
runtimeConfidence: template-example or plugin-cachesettings.local.json findings that may be project-local rather than repo-wideplugin-manifest findings that need confirmation in the referenced implementationhook-code findings that involve context injection, transcript access, or child-process execution--fix)Automatically applies safe fixes:
${ENV_VAR} referencesBash(*) → scoped Bash(git *), Bash(npm *))Only fixes marked auto: true are applied. Permission changes require human review.
agentshield init)Generates a hardened .claude/ directory with scoped permissions, safety hooks, and security best practices. Existing files are never overwritten.
--opus)Three-agent adversarial pipeline powered by Claude Opus 4.6:
The Attacker finds that curl hooks with ${file} interpolation + Bash(*) = command injection pivot. The Defender notes no PreToolUse hooks exist to stop it. The Auditor chains them into a prioritized action list.
agentshield scan --opus # Red + Blue run in parallel
agentshield scan --opus --stream # Sequential with real-time output
agentshield scan --opus --stream -v # Verbose — see full agent reasoning
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Phase 1a: ATTACKER (Red Team) ┃
┗━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛
✓ Attacker analysis complete (4521 tokens)
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Phase 1b: DEFENDER (Blue Team) ┃
┗━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛
✓ Defender analysis complete (3892 tokens)
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Phase 2: AUDITOR ┃
┗━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛
Risk Level: CRITICAL
Opus Score: █████░░░░░░░░░░░░░░░ 15/100
Requires ANTHROPIC_API_KEY environment variable.
| Format | Flag | Use Case |
|---|---|---|
| Terminal | --format terminal (default) | Interactive use |
| JSON | --format json | CI pipelines, programmatic access |
| Markdown | --format markdown | Documentation, PRs |
| HTML | --format html | Executive report with risk posture and priorities |
| Evidence pack | --evidence-pack <dir> | Audit and buyer reviews |
| Remediation plan | --remediation-plan <path> | Stable-fingerprint fix queue for CI and ticketing |
Evidence packs write a deterministic directory containing manifest.json,
README.md, agentshield-report.json, agentshield-report.html,
agentshield-results.sarif, policy-evaluation.json,
baseline-comparison.json, supply-chain.json, and
ci-context.json, and remediation-plan.json. The manifest records SHA-256
digests and byte counts for bundle artifacts plus a bundle digest over the
machine-readable evidence. ci-context.json records whitelisted GitHub Actions
workflow, commit, run, and runner provenance without copying arbitrary
environment variables into the bundle.
Redaction is enabled by default for local paths, usernames, emails, and
token-shaped strings; use --no-evidence-redact only for private internal
bundles.
Remediation plans write a JSON queue of findings with stable hashed fingerprints, severity, file, fixability, ordered workflow phases, and the recommended next command. The workflow phases route safe auto-fixes first, manual-review findings second, and verification last so maintainers can attach the plan to CI tickets without turning every finding into a separate thread. They intentionally omit raw evidence and fix before/after values so teams can attach the plan to tickets without copying token-shaped strings.
Verify a saved evidence pack before attaching it to CI artifacts or customer handoffs:
agentshield evidence-pack verify ./agentshield-evidence
agentshield evidence-pack verify ./agentshield-evidence --json
Inspect a verified evidence pack when a downstream GitHub App, Linear sync, or security-review workflow needs a compact readback without opening every artifact:
agentshield evidence-pack inspect ./agentshield-evidence
agentshield evidence-pack inspect ./agentshield-evidence --json
Aggregate multiple verified evidence packs when an operator needs fleet-level
routing across repos, teams, or harnesses. The JSON output includes
operatorReadback for promotion status/digest checks and reviewItems with
source evidence paths and owner-ready recommendations for packs that need
follow-up:
agentshield evidence-pack fleet ./repo-a-evidence ./repo-b-evidence
agentshield evidence-pack fleet ./repo-a-evidence ./repo-b-evidence --json
operatorReadback is the stable field for downstream GitHub App, Linear, or
ECC Tools routing. It reports ready, status, digest, owner counts,
blocking review counts, approval routes, deterministic approvalIds, and the
next operator action so fleet promotion can be gated without parsing terminal
prose. Each review item also carries the same approvalId plus a
Linear-friendly ticket externalId, allowing sync jobs to dedupe owner approval
threads across reruns.
agentshield scan --format json is the supported machine-readable scanner interface today.
{
"timestamp": "2026-03-13T19:42:00.000Z",
"targetPath": "/repo/.claude",
"score": {
"grade": "C",
"numericScore": 66,
"breakdown": {
"secrets": 100,
"permissions": 70,
"hooks": 80,
"mcp": 35,
"agents": 45
}
},
"summary": {
"totalFindings": 29,
"critical": 1,
"high": 7,
"medium": 8,
"low": 10,
"info": 3,
"filesScanned": 17,
"autoFixable": 2
},
"findings": [
{
"id": "mcp-risky-filesystem",
"severity": "medium",
"category": "mcp",
"title": "Template defines risky MCP server: filesystem",
"description": "Repository template includes a high-risk filesystem MCP server.",
"file": "mcp-configs/mcp-servers.json",
"runtimeConfidence": "template-example"
}
]
}
Notes:
runtimeConfidence is emitted for active runtime config, settings.local.json, docs/examples, installed Claude plugin caches, plugin manifests, and manifest-resolved non-shell hook code.harnessAdapters is local marker evidence only. It does not call external services or imply a hosted/team entitlement.confidence is strong when a primary harness marker exists, and partial when only supporting directories or secondary markers are present.active-runtime means active config such as mcp.json, .claude/mcp.json, .claude.json, or active settings.json.project-local-optional means project-local settings such as settings.local.json.template-example means template/catalog files such as mcp-configs/ or config/mcp/.docs-example means docs/tutorial/example content such as docs/guide/settings.json or commands/*.md.plugin-cache means installed Claude plugin cache content such as .claude/plugins/cache/....plugin-manifest means declarative hook manifests such as hooks/hooks.json.hook-code means a manifest-resolved non-shell implementation such as scripts/hooks/session-start.js.template-example and docs-example findings to 0.25x, non-secret project-local-optional findings to 0.75x, and non-secret plugin-cache / plugin-manifest findings to 0.5x; committed secrets still count at full weight. Non-secret template-example findings are also capped at 10 deduction points per file and score category. See false-positive-audit.md.AgentShield currently has three distinct automation surfaces:
agentshield scan, agentshield init, and agentshield miniclaw startagentshield scan --format json or agentshield scan --format sarif --output agentshield.sarifagentshield scan --policy agentshield-policy.jsonecc-agentshield/miniclawImportant packaging note:
src/scanner/index.ts and src/reporter/score.ts are useful for contributors, but they should not be documented as supported import paths for package consumers yet.Detailed request/response and schema notes live in API.md.
- name: AgentShield Security Scan
uses: affaan-m/agentshield@v1
with:
path: "."
min-severity: "medium"
fail-on-findings: "true"
Inputs:
| Input | Default | Description |
|---|---|---|
path | . | Path to scan |
min-severity | medium | Minimum severity: critical, high, medium, low, info |
fail-on-findings | true | Fail the action if findings meet severity threshold |
format | terminal | Output format: terminal, json, markdown, sarif |
sarif-output | agentshield-results.sarif | SARIF output path when format is sarif |
baseline | "" | Optional AgentShield baseline JSON path for drift comparison |
save-baseline | "" | Optional path to write the current scan as a new baseline |
policy | "" | Optional organization policy JSON path |
fail-on-policy | true | Fail the action if the organization policy is non-compliant |
supply-chain | true | Verify MCP npm/git package provenance and known malicious package risk |
supply-chain-online | false | Query npm registry metadata during supply-chain verification |
fail-on-supply-chain | follows fail-on-findings | Fail the action when supply-chain verification finds critical or high package risks |
evidence-pack | "" | Optional directory for a portable audit evidence bundle |
verify-evidence-pack | true | Verify evidence-pack artifact hashes after writing |
policy-promotion-manifest | "" | Optional policy export manifest to verify and promote or dry-run |
policy-promotion-pack | "" | Policy pack id when the manifest contains multiple packs |
policy-promotion-output | .agentshield/policy.json | Active policy path to write when promotion dry-run is disabled |
policy-promotion-dry-run | true | Verify promotion review evidence without writing the active policy |
fail-on-policy-promotion | false | Fail when promotion review items still require action |
Outputs: score (0–100), grade (A–F), total-findings, critical-count, sarif-path, baseline-path, baseline-status, new-findings, resolved-findings, unchanged-findings, score-delta, policy-status, policy-violations, policy-promotion-status, policy-promotion-pack, policy-promotion-review-items, policy-promotion-action-required-count, policy-promotion-digest, supply-chain-status, supply-chain-risky-packages, supply-chain-critical-count, supply-chain-high-count, package-manager-hardening-status, package-manager-hardening-findings, package-manager-hardening-critical-count, package-manager-hardening-high-count, package-manager-hardening-registry-credentials, package-manager-hardening-lifecycle-scripts, package-manager-hardening-release-age-gates, evidence-pack-path, evidence-pack-status, evidence-pack-digest
The action writes a markdown job summary and emits GitHub annotations inline on affected files. When format: sarif is set, it also writes a SARIF 2.1.0 report that can be uploaded to GitHub code scanning with github/codeql-action/upload-sarif. When baseline is set, the action appends a baseline drift summary, emits regression annotations for new findings, and reports baseline-status as passed, failed, missing, or not-run. When policy is set, the SARIF report includes organization-policy violations as agentshield-policy/* code-scanning results, appends the organization policy result to the job summary, emits policy violation annotations, and fails by default unless fail-on-policy: "false" is set. When policy-promotion-manifest is set, the action verifies the selected policy export digest, emits promotion review-item counts, appends the owner/protected-rollout/runtime-smoke review items to the job summary, and marks runtime smoke as verified when the same job also scans with the promoted policy. Supply-chain verification runs offline by default for MCP package references, appends package-risk evidence to the job summary, writes real supply-chain.json evidence packs, and fails on critical/high package risk whenever the action is in failing mode unless fail-on-supply-chain: "false" is set. Package-manager hardening outputs and job-summary evidence separately count plaintext registry credentials, lifecycle-script drift, and release-age gate drift so CI and hosted consumers can route supply-chain configuration risk even when the main finding gate is collecting evidence only.
Baseline drift gate:
- name: AgentShield Drift Gate
uses: affaan-m/agentshield@v1
with:
path: "."
baseline: ".github/agentshield-baseline.json"
fail-on-findings: "false"
Use fail-on-findings: "false" when the workflow should fail only on drift from the baseline. Keep the default fail-on-findings: "true" when any current finding at or above min-severity should still fail the action.
agentshield scan [options] Scan configuration directory
-p, --path <path> Path to scan (default: ~/.claude or cwd)
-f, --format <format> Output: terminal, json, markdown, html, sarif
-o, --output <path> Write the primary report output to a file
--fix Auto-apply safe fixes
--opus Enable Opus 4.6 multi-agent analysis
--stream Stream Opus analysis in real-time
--injection Run active prompt injection testing
--sandbox Execute hooks in a sandbox and observe behavior
--taint Run taint analysis (data flow tracking)
--deep Run injection + sandbox + taint + opus together
--log <path> Write structured scan logs to a file
--log-format <format> Log format: ndjson or json
--corpus Run built-in attack corpus benchmark
--corpus-gate Fail if the built-in attack corpus regresses
--baseline <path> Compare against a baseline file
--save-baseline <path> Save current scan results as a baseline file
--gate Fail on new critical/high findings or score drop
--supply-chain Verify MCP package provenance and risk
--supply-chain-online Include npm registry metadata
--policy <path> Validate against an organization policy
--evidence-pack <dir> Write portable evidence bundle
--remediation-plan <path> Write stable-fingerprint JSON remediation plan
--no-evidence-redact Disable evidence-pack redaction
--min-severity <severity> Filter: critical, high, medium, low, info
-v, --verbose Show detailed output
agentshield init Generate secure baseline config
agentshield baseline write Write a scan baseline JSON file
agentshield evidence-pack fleet Summarize multiple evidence packs for routing
agentshield evidence-pack inspect Verify and summarize an evidence bundle
agentshield evidence-pack verify Verify artifact and bundle digests
agentshield policy export Export policy packs plus checksum manifest
agentshield policy init Generate an organization policy preset
agentshield policy promote Verify and promote an exported policy
Baseline write:
agentshield baseline write --path .claude --output .github/agentshield-baseline.json
agentshield baseline write --path .claude --output baseline.json --json
The baseline command is a first-class wrapper around the existing scan baseline
format. New baselines store stable hashed evidence fingerprints and omit raw
evidence values, so teams can keep baseline snapshots in CI without copying
token-shaped strings into long-lived artifacts. Existing raw-evidence baselines
remain comparable during migration. Use it to create the accepted snapshot,
then compare future scans with
agentshield scan --baseline .github/agentshield-baseline.json --gate.
Runtime monitor lifecycle:
# Install the PreToolUse runtime monitor
agentshield runtime install
# Check whether the hook, policy, and log path are healthy
agentshield runtime status --check
# Back up invalid runtime files and restore a healthy install
agentshield runtime repair
Organization policy files support enterprise metadata and temporary exceptions:
agentshield policy init --pack enterprise --owner security-platform@acme.example
agentshield policy init --pack regulated --name "Acme Regulated Policy"
agentshield policy export --output-dir .github/agentshield-policies --owner security-platform@acme.example
agentshield policy export --pack ci-enforcement --name-prefix "Acme" --json
agentshield policy promote --manifest .github/agentshield-policies/manifest.json --pack ci-enforcement --output .agentshield/policy.json
agentshield policy promote --manifest .github/agentshield-policies/manifest.json --pack ci-enforcement --dry-run --json
Policy pack presets are starter baselines, not hidden SaaS policy. oss keeps
public repos permissive while requiring destructive-command deny entries;
team adds the runtime hook; enterprise raises score gates and blocks broad
tool allowlists; regulated disallows critical/high findings and bans broader
MCP/tool surfaces; high-risk-hooks-mcp focuses on hook/MCP-heavy repos; and
ci-enforcement is tuned for branch-protection evidence.
Policy export writes one JSON policy file per selected pack plus a
manifest.json containing SHA-256 digests. This gives platform teams a stable
artifact bundle for branch-protection review, audit attachment, or downstream
policy promotion without relying on generated console output.
Policy promotion is the review gate for those exported bundles. It reads the
export manifest, verifies the selected policy file digest, validates the policy
schema, and only then writes the active policy path. Use --dry-run --json in
review workflows to prove the exact pack, source file, output path, owner list,
and digest before a protected branch or operator copies the policy into place.
Promotion results also include reviewItems for owner approval, protected PR
rollout, and the runtime smoke test needed before enabling an enforcing CI gate.
{
"version": 1,
"name": "Acme Corp Security Policy",
"policy_pack": "enterprise",
"owners": ["security-platform@acme.example"],
"min_score": 85,
"max_severity": "high",
"required_deny_list": ["Bash(rm -rf"],
"exceptions": [
{
"id": "AS-EX-001",
"rule": "required_hooks",
"owner": "security-platform@acme.example",
"reason": "Legacy repository migration window",
"expires_at": "2026-06-30T23:59:59.000Z",
"scope": "agentshield",
"ticket": "SEC-1234"
}
]
}
Policy evaluation now includes an exception lifecycle audit in terminal output and GitHub Action summaries: total exceptions, active exceptions, exceptions expiring within seven days, expired exceptions, owners, tickets, scopes, and days until expiry. This keeps temporary waivers visible in branch-protection evidence instead of letting them become silent permanent bypasses.
Exit codes:
0: scan completed without critical findings1: CLI usage or runtime error2: scan completed and found at least one critical issue| Category | Rules | Patterns | Severity Range |
|---|---|---|---|
| Secrets | 10 | 14 | Critical -- Medium |
| Permissions | 10 | -- | Critical -- Medium |
| Hooks | 34 | -- | Critical -- Low |
| MCP Servers | 23 | -- | Critical -- Info |
| Agents | 25 | -- | Critical -- Info |
| Total | 102 | 14 |
src/
├── index.ts CLI entry point (commander)
├── action.ts GitHub Action entry point
├── types.ts Type system + Zod schemas
├── scanner/
│ ├── discovery.ts Config file discovery
│ └── index.ts Scan orchestrator
├── rules/
│ ├── index.ts Rule registry
│ ├── secrets.ts Secret detection (10 rules, 14 patterns)
│ ├── permissions.ts Permission audit (10 rules)
│ ├── mcp.ts MCP server security (23 rules)
│ ├── hooks.ts Hook analysis (34 rules)
│ └── agents.ts Agent config review (25 rules)
├── reporter/
│ ├── score.ts Scoring engine (A-F grades)
│ ├── terminal.ts Color terminal output
│ ├── json.ts JSON + Markdown output
│ └── html.ts Self-contained HTML report
├── fixer/
│ ├── transforms.ts Fix transforms (secret, permission, generic)
│ └── index.ts Fix engine orchestrator
├── init/
│ └── index.ts Secure config generator
└── opus/
├── prompts.ts Attacker/Defender/Auditor system prompts
├── pipeline.ts Three-agent Opus 4.6 pipeline
└── render.ts Opus analysis rendering
MiniClaw is a minimal, sandboxed AI agent runtime bundled with AgentShield. Where typical agent platforms expose many attack surfaces (Telegram, Discord, email, community plugins), MiniClaw presents a single HTTP endpoint backed by an isolated sandbox.
# Start with secure defaults (localhost:3847, no network, safe tools only)
npx ecc-agentshield miniclaw start
# Custom configuration
npx ecc-agentshield miniclaw start --port 4000 --network localhost --rate-limit 20
Or use as a library:
import { startMiniClaw } from 'ecc-agentshield/miniclaw';
const { server, stop } = startMiniClaw();
// Listening on http://localhost:3847
MiniClaw-specific package exports and HTTP endpoints are documented in src/miniclaw/README.md and summarized in API.md. The React dashboard source lives in src/miniclaw/dashboard.tsx, but it is not yet published as a separate npm subpath.
Four independently enforced layers:
Request → [Rate Limit] → [CORS] → [Size Cap] → [Sanitize Prompt]
↓
[Tool Whitelist]
↓
[Sandbox FS]
↓
[Filter Output] → Response
| Method | Endpoint | Description |
|---|---|---|
POST | /api/prompt | Send a prompt |
POST | /api/session | Create a sandboxed session |
GET | /api/session | Session info |
DELETE | /api/session/:id | Destroy session + cleanup |
GET | /api/events/:sessionId | Security audit events |
GET | /api/health | Health check |
MiniClaw has zero external runtime dependencies — Node.js built-ins only (http, fs, path, crypto). The optional React dashboard requires React 18+ as a peer dependency.
npm install # Install dependencies
npm run dev # Development mode
npm test # Run tests
npm run test:coverage # Coverage report
npm run typecheck # Type check
npm run build # Build
npm run scan:demo # Demo scan against vulnerable examples
AgentShield is available through multiple channels:
| Channel | Use Case | Install |
|---|---|---|
| Standalone CLI | Direct scanning from your terminal | npm install -g ecc-agentshield or npx ecc-agentshield scan |
| GitHub Action | Automated security checks on PRs in CI/CD | uses: affaan-m/agentshield@v1 |
| ECC Plugin | Claude Code users via the ECC skill ecosystem | Install through Everything Claude Code |
| ECC Tools GitHub App | Integrated scanning across your GitHub org | Install at github.com/apps/ecc-tools |
| ECC Tools Pro | GitHub App with automated repo analysis, Stripe billing ($19/seat/mo) | Install |
MIT
Built by @affaanmustafa · Part of Everything Claude Code
Native macOS app to monitor Claude AI usage limits and watch your coding sessions live
干净、强大、属于你的 AI Agent 平台 --AI agents, without the clutter.
npx CLI installing 100+ agents, commands, hooks, and integrations in one command
Pocket Flow: Codebase to Tutorial