A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Local security audit for AI API relays and LLM proxies: detects prompt injection, model substitution, tool-call rewritin
Local security audit for AI API relays and LLM proxies.
API Relay Audit is a local security audit tool for AI API relays and LLM proxies. It keeps API relay audit, prompt injection audit, model substitution signals, and Web3 relay audit as separate query families so each result keeps a clean evidence boundary. Your API key is sent only to the relay URL you choose.
Use it when you rely on a third-party AI API relay, OpenAI-compatible proxy, Claude-compatible proxy, or Web3 agent workflow and want a repeatable Markdown report before trusting that relay with production or wallet-related traffic.
audit.py uses only Python stdlib plus curl; your API key is sent only to the relay URL you choose.LOW / MEDIUM / HIGH verdict.| Query family | User intent | Profile / steps | Evidence boundary |
|---|---|---|---|
| API relay audit | Audit a third-party relay, mirror, gateway, LLM proxy, or resale API before trusting traffic. | general by default; full for every probe | Produces a local report, not a safety certificate. |
| Prompt injection audit | Detect hidden prompt injection, prompt leakage, instruction override, and extraction behavior. | general; Steps 3-6 | Records prompt evidence without publishing private prompts or secrets. |
| Model substitution signals | Collect model identity, stream, latency, and upstream channel signals. | general; Steps 5, 10, 13, 14 | Self-ID, latency, and channel fingerprints are signals, not standalone proof of provider substitution. |
| Web3 relay audit | Check wallet-sensitive relay behavior before agent workflows touch signing or transactions. | web3 or full; Step 11 | Profile-gated; general relay audits do not imply wallet safety. |
The canonical contract lives in docs/query-families.md. README headings, Pages cards, issue templates, and skill descriptions should preserve these boundaries instead of flattening them into one slogan.
AUDIT_SCRIPT_REF=v2.3.0
curl -fsSL "https://raw.githubusercontent.com/toby-bridges/api-relay-audit/${AUDIT_SCRIPT_REF}/audit.py" -o audit.py
python audit.py --key <YOUR_KEY> --url <BASE_URL> --output report.md
# Web3 / wallet users
python audit.py --key <YOUR_KEY> --url <BASE_URL> --profile web3 --output report.md
See a public-safe fixture report: sanitized audit report.
Use master as AUDIT_SCRIPT_REF only when intentionally testing unreleased changes.
API Relay Audit checks whether a relay modifies the request or response path between you and the model:
The project has two distribution modes:
audit.py: zero-dependency standalone script for quick local auditsapi_relay_audit/ plus scripts/: modular development version with testsRuntime profiles:
general: default AI API relay and LLM proxy checksweb3: wallet-safety probes for Web3 agent flowsfull: general plus Web3 checksAPI Relay Audit can also run as an agent skill when an agent workflow needs to audit a relay before trusting it with coding, tool, or wallet-related traffic.
These skills do not certify that a relay is safe. They help agents generate a local, reviewable Markdown report before trusting a relay path.
inconclusive as clean; blocked probes and ambiguous responses stay visible in the report.Natural-language self-identification is treated as a consistency signal, not upstream proof. A response saying it is Qwen, DeepSeek, GPT, or Claude can indicate a mismatch, but it does not by itself prove that a provider substituted the upstream model.
Stronger claims require corroborating evidence such as raw response JSON, request IDs, provider/model metadata, stream signatures, transparent-log hashes, and reproducible runs. Public submissions should use redacted report artifacts and never include API keys, raw headers, full response bodies, wallet material, private relay traffic, or user data.
With --profile web3 or --profile full, API Relay Audit adds wallet-oriented prompt injection probes inspired by signature-isolation risks:
These probes are model-agnostic, but they are intentionally profile-gated so general relay audits stay focused.
your machine
-> audit.py / scripts/audit.py
-> chosen relay endpoint
-> Markdown report + optional hash-only transparent log
-> optional: redacted evidence issue for maintainer review
Community evidence is shape-checked by GitHub Actions, but publication still requires maintainer review. Operators keep a separate response path, and sensitive vulnerabilities belong in the disclosure path described in SECURITY.md.
| Metric | Current value |
|---|---|
| Version | v2.3 |
| Audit steps | 14 |
| Risk matrix | 6D |
| pytest collected tests | 778 |
| CLI flags | 21 |
| Runtime profiles | general, web3, full |
An API relay or LLM proxy is a third-party service between you and an AI provider such as Anthropic or OpenAI. It forwards your requests upstream, but it can also inject hidden instructions, swap models, truncate context, rewrite tool output, or leak credentials in error responses.
API Relay Audit runs locally, so your API key is sent only to the relay URL you specify. The standalone version is a single Python file with zero Python package dependencies, which makes it easier to inspect before running.
Prompt injection means the relay may prepend or insert hidden instructions into your request. API Relay Audit compares expected and actual token usage, tries prompt-extraction probes, and records evidence when the relay appears to add or reveal hidden prompt content.
Model substitution means the relay claims to provide one model but may expose evidence signals for another model identity, route, or upstream channel. API Relay Audit checks non-Claude identity patterns, anchor phrases, stream model identity, latency variance, and channel evidence where available; those signals require corroboration before making provider-level claims.
Tool-call rewriting means the relay modifies package-install commands or tool-like output in the model response. API Relay Audit sends pinned package commands and compares the returned text to detect proxy-layer supply-chain tampering.
SSE anomalies are stream-level integrity issues in Anthropic-style streaming responses. API Relay Audit checks event types, usage monotonicity, thinking signatures, and stream model identity when the relay supports that format.
With the web3 or full profile, API Relay Audit checks transfer guidance, signed-transaction refusal, and private-key refusal behavior before wallet-related traffic is trusted.
inconclusive mean?Inconclusive means the tool could not determine a clean or anomalous result for that step. A blocked probe, unsupported format, or ambiguous response is not treated as safe; it remains visible in the final report.
They serve different needs. hvoy.ai is useful for relay reputation lookup, cctest.ai focuses on one-click testing and channel fingerprinting, and API Relay Audit focuses on local, open-source, repeatable security auditing with structured Markdown reports.
AGPL-3.0-only. See LICENSE.
This keeps modified network-service deployments accountable to the same public source-availability standard as the relay ecosystem evidence we audit.
If you use API Relay Audit in research, security reports, or public relay evaluations, please cite the software with CITATION.cff. The citation file also records the two academic papers that inform the audit model: Liu et al., Your Agent Is Mine (arXiv:2604.08407) and Zhang et al., Real Money, Fake Models (arXiv:2603.01919).
You do not need to write code to help. Good first contributions are small, reproducible, and evidence-focused:
clean, anomaly, and inconclusive.Start with:
Avoid publishing real API keys or private relay traffic, and keep changes scoped to one behavior or document.
api-relay-audit 是一个本地运行的 AI API 中转站 / LLM proxy 安全审计工具。它把 API relay audit、prompt injection audit、model substitution signals、Web3 relay audit 拆成独立查询意图,避免把不同风险压成一个口号;你的 API Key 只会发送到你指定的中转站 URL。
当你使用第三方 AI API 中转站、OpenAI-compatible proxy、Claude-compatible proxy,或者 Web3 agent 工作流时,可以用它在信任该中转站之前生成一份可复查的 Markdown 审计报告。
audit.py 单文件零依赖版 + api_relay_audit/ 模块化开发版--profile general|web3|full 三种运行模式LOW / MEDIUM / HIGH 总结论,加每一步的细项结果| Query family | 用户意图 | Profile / Steps | 证据边界 |
|---|---|---|---|
| API relay audit | 审计第三方中转站、镜像、网关、LLM proxy 或 resale API。 | 默认 general;完整覆盖用 full | 输出本地报告,不是安全认证。 |
| Prompt injection audit | 检测隐藏 prompt 注入、prompt 泄漏、指令覆盖和提取行为。 | general;Step 3-6 | 记录 prompt 证据,但不公开私有 prompt 或 secret。 |
| Model substitution signals | 收集模型身份、stream、延迟和上游 channel 信号。 | general;Step 5、10、13、14 | self-ID、延迟和 channel fingerprint 是信号,不能单独证明 provider 替换。 |
| Web3 relay audit | 在 agent 接触签名、交易或钱包相关内容前检查中转行为。 | web3 或 full;Step 11 | profile-gated;普通 relay audit 不等于钱包安全。 |
正式契约在 docs/query-families.md。README、Pages、issue template 和 skill description 都应该保留这些边界。
AUDIT_SCRIPT_REF=v2.3.0
curl -fsSL "https://raw.githubusercontent.com/toby-bridges/api-relay-audit/${AUDIT_SCRIPT_REF}/audit.py" -o audit.py
python audit.py --key <YOUR_KEY> --url <BASE_URL> --output report.md
# Web3 / 钱包用户
python audit.py --key <YOUR_KEY> --url <BASE_URL> --profile web3 --output report.md
API Relay Audit 也可以作为 agent skill 使用。
这些 skill 不给中转站颁发安全认证,只帮助 agent 在信任 relay 前生成本地、可复查的 Markdown 报告。
inconclusive 当成 clean;被拦截或无法判断的探针会保留在报告里。模型自然语言自称 Qwen、DeepSeek、GPT 或 Claude,只能作为 identity consistency signal,不能单独证明真实上游供应商或平台替换了模型。
更强的结论需要 raw response JSON、request id、provider/model metadata、stream signature、transparent-log hash 和可复现实验共同支撑。公开提交只接受脱敏报告证据;不要提交 API Key、raw headers、完整 response body、钱包材料、私有中转流量或用户数据。
你的机器
-> audit.py / scripts/audit.py
-> 你指定的 relay endpoint
-> Markdown report + 可选 hash-only transparent log
-> 可选:脱敏 evidence issue,等待 maintainer review
社区证据会被 GitHub Actions 做格式检查,但公开发布仍需要 maintainer review。运营方回应走单独通道,敏感漏洞走 SECURITY.md 的 disclosure 路径。
| 指标 | 当前值 |
|---|---|
| 版本 | v2.3 |
| 审计步骤 | 14 |
| 风险矩阵 | 6D |
| pytest collected tests | 778 |
| CLI flags | 21 |
| Runtime profiles | general, web3, full |
你不需要写代码也能帮忙:可以提交检测缺口、文档示例、翻译改进,或 OpenClaw / Hermes 安装反馈。
请不要提交真实 API Key、私有中转站流量、钱包材料或未脱敏审计报告。
Native macOS app to monitor Claude AI usage limits and watch your coding sessions live
npx CLI installing 100+ agents, commands, hooks, and integrations in one command
干净、强大、属于你的 AI Agent 平台 --AI agents, without the clutter.
Pocket Flow: Codebase to Tutorial