A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
36 packages found
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress
15 AI-powered GTM skills for Claude Code. Campaign tested frameworks for cold email, ICP research, signal scoring, campa
Screen-reader navigation cost analyzer — models the real effort to discover, reach, and operate interactive web content
A native desktop application for developing, testing, and debugging Model Context Protocol servers.
📄 Production-ready MCP server for PDF processing - 5-10x faster with parallel processing and 94%+ test coverage
Local context engine for AI coding agents. Routes tasks to relevant files, tests, rules, and skills, supports prompt cac
✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models
A curated system of production-ready Claude Code skills with quantitative evaluation reports, golden test fixtures, and
The open-source web-browsing backend for AI agents & workflow engines. Ships a 42-tool MCP server for Claude Code/Cursor
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
Combining a five-level AI framework with git-native memory overcomes session amnesia, enabling anticipation of problems
A curated list of developer tools, SDKs, libraries, and testing utilities for Model Context Protocol (MCP) server develo
Claude Code skills for medical research — literature search, reporting guidelines, statistical analysis, publication fig
Build Claude Code–style deep agents in Python: tool-calling, sandboxed execution, multi-agent teams, skills, checkpoints
ConcoLLMic: the first language- and theory-agonistic concolic execution engine via LLM agents
Self-installing personal AI orchestrator. Hand the latest blueprint file to a Claude Code session and it builds a full m
Synthetic monitoring and CI smoke tests for LLM inference endpoints.
63 deterministic quant computation tools for autonomous financial agents. Options, derivatives, risk, portfolio, statist
Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ mod
AI-native framework for building trading systems with polyglot bindings.
CLI-first API testing tool. YAML-defined tests, structured JSON output, built for AI-assisted workflows.
A Claude Code skill that burns tokens on demand. Stress test, inflate metrics, or just set money on fire.
Regression testing for AI agents. Snapshot behavior,diff tool calls,catch regressions in CI. Works with LangGraph, CrewA
🧠 Kotlin MCP Server for Android app development using OpenAI, Gemini, or OpenRouter. Enables AI-assisted coding via Aid
A Claude Code skill that encodes battle-tested editorial principles, section-specific rhetorical moves, and a structured
AI skill for Claude Code and Codex that helps agents write correct R for Six Sigma and SPC work, including control chart
High-performance Rust hooks for Claude Code skill auto-activation. ~2ms startup, zero dependencies, production-tested pa
Automated Claude Code QA gate for AI-assisted development — Codex code review + Playwright browser smoke testing before
Autonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), pa
AI-powered E2E testing for 10 platforms. 253 MCP tools. Zero config. Works with Claude, Cursor, Windsurf, Copilot. Test
CLI, MCP server, and npm library that turns any website into an API — no docs, no SDK, no browser.
Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text — no LLM ne
Adversarial multi-model reasoning verification MCP server for AI agents. Claude, Grok, and DeepSeek challenge each decis
Pre-execution governance for AI agents. Sub-millisecond tool call validation, drift detection, circuit breakers, human-i
A Claude Code skill that adds a rubric-based eval layer to any agent project. Framework-agnostic — generates rubric, tes