A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Lightweight proxy for LLM
A single-binary LLM proxy that connects coding assistants and AI agents to local and upstream models. Translates between API protocols, routes requests across backends, and adds tools that local backends lack — web search, image description, PDF text extraction, and OCR. Works with Claude Code, Codex, OpenCode, Qwen Code, OpenClaw, and any OpenAI/Anthropic-compatible client.
Landing page · Config generator · Releases
You need data security and self-host models or have upstream secure vendors (Azure, Bedrock, etc) that don't have all the expected tooling you're used to. You want to use glm-5.1 for planning and MiniMax-M2.5 for implementation and agent work, with Qwen3-VL-8B as your vision processor. You want to connect using Claude Code and Codex and have it 'just work'. You upload a PDF and it works, you upload an image and that works too. Call for a web search? The proxy intercepts natively and sends it through Tavily or Brave.
/mcp/sse exposes web search for OpenCode, Qwen Code, and any MCP-compatible agent./qdrant/* proxies to a Qdrant vector database with separate app key auth and automatic multi-tenant isolation../go-llm-proxy -config config.yaml
Or with Docker (limited testing):
docker compose -f docker/docker-compose.yml up -d
listen: ":8080"
models:
- name: my-model
backend: http://192.168.1.10:8000/v1
keys:
- key: sk-your-secret-key
name: admin
See config.yaml.example for a fully annotated starter config with all options.
What works with each coding assistant through the proxy.
Protocol
| Claude Code | Codex CLI | OpenCode | Qwen Code | |
|---|---|---|---|---|
| Native API | Anthropic Messages | OpenAI Responses | Chat Completions | Chat Completions |
| Translation | auto-translated | auto-translated | passthrough | passthrough |
Core features
| Claude Code | Codex CLI | OpenCode | Qwen Code | |
|---|---|---|---|---|
| Text + streaming | ✓ | ✓ | ✓ | ✓ |
| Tool calling | ✓ | ✓ | ✓ | ✓ |
| Multi-turn tool loops | ✓ | ✓ | ✓ | ✓ |
| Reasoning display | ✓ | ✓ | — | — |
| Extended thinking | ✓ | ✓ | — | — |
Proxy-side processing (details)
| Claude Code | Codex CLI | OpenCode | Qwen Code | |
|---|---|---|---|---|
| Web search (Tavily / Brave) | ✓ proxy | ✓ proxy | ✓ MCP | ✓ MCP |
| Image description | ✓ vision | ✓ vision | ✓ vision | ✓ vision |
| PDF text extraction | ✓ proxy | client-side | ✓ | ✓ |
| Scanned PDF / OCR | ✓ OCR model | ✓ OCR model | ✓ | ✓ |
| Context compaction | — | ✓ | — | — |
| Usage logging & reports | ✓ | ✓ | ✓ | ✓ |
Each assistant speaks a different API protocol. The proxy detects this and translates automatically — no per-model configuration needed for the common case.
Optional. Handles content that local backends don't support natively:
processors:
vision: Qwen3-VL-8B # vision model for image descriptions
ocr: paddleOCR # fast model for PDF page text extraction (optional; vision is tried automatically if OCR fails)
web_search_key: tvly-... # Tavily or Brave Search key (auto-detected from prefix)
Without processors, the proxy just translates and routes. With it, images, PDFs, and search work on text-only backends.
| Processor | Model | Notes |
|---|---|---|
| Vision | Qwen3-VL-8B | Best quality/speed balance for image description. Handles charts, screenshots, diagrams. |
| OCR | PaddleOCR-VL-1.5 (0.9B) | Purpose-built for documents. 94.5% accuracy, 109 languages, ~2s/page. Tiny VRAM footprint. |
| Web search | Tavily or Brave Search | Tavily free: 1,000 req/month. Brave free: $5/month credit. Auto-detected from key prefix. |
| Topic | Link |
|---|---|
| Configuration reference | docs/config-reference.md |
| Claude Code | docs/claude-code.md |
| Codex CLI | docs/codex.md |
| OpenCode | docs/opencode.md |
| Qwen Code | docs/qwen-code.md |
| Processing pipeline | docs/pipeline.md |
| Docker deployment | docs/docker.md |
| Production deployment | docs/deployment.md |
| Usage monitoring | docs/usage.md |
| Qdrant proxy | docs/qdrant.md |
| Security | docs/security.md |
Claude Code skill for YouTube creators — channel audits, video SEO, retention scripts, thumbnails, content strategy, Sho
AI image generation skill for Claude Code -- Creative Director powered by Gemini
A Claude Code skill by Hao (駱君昊) that learns your Facebook voice and auto-posts to FB / IG / Threads / X with a 14-day c
Universal SEO skill for Claude Code. 25 sub-skills + 18 sub-agents covering technical SEO, E-E-A-T, schema, GEO/AEO, bac