A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
1 packages found
ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks.