A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
6 packages found
Benchmark for evaluating LLM agents on smart-contract vulnerability discovery and exploitation
A collection of structured AI agent skills that enable Claude Code, Cursor, GitHub Copilot, and other AI coding assi…
Status: Experimental. This repository provides a multi-language reference implementation of the variants proposal for th…
Jagged Frontier: LLM vulnerability detection benchmark harnesses (API + Claude Code agentic)