A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
48 packages found
A repo listing papers related to LLM-based agents
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
A curated list of Generative AI tools, works, models, and references
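NoneLinear (非线智能) - ReLE Evaluation: a capability evaluation of Chinese AI large models (continuously updated). Currently includes 374 large models, covering chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE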
[Up-to-date] A curated list of resources on graph-empowered agents and agent-facilitated graph learning (Graphs Meet Age
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
A Systematic Survey of Deep Research
[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Awesome papers involving LLMs in Social Science.
Awesome LLM papers and repos covering a comprehensive range of topics.
xLAM: A Family of Large Action Models to Empower AI Agent Systems
Yunjue Agent: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks
The benchmark tasks and evaluation harness for "PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments".
[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation Discovery and Symbolic Regressi
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution
Odyssey: Empowering Minecraft Agents with Open-World Skills
ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks.
Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Huma
YantrikDB memory provider for NousResearch/hermes-agent — self-maintaining memory with canonicalization, contradiction t
LLM Agent that leverages cheminformatics tools to provide informed responses.
Agentic Theorem Prover for Rocq for Program Verification
ICML 2026 · Plug-and-play long-term memory for LLM agents
[CVPR 2025 🔥] A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
🧙🏻 Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucinatio
A general purpose scientific writer
🔴 VERY LARGE AI TOOL LIST! 🔴 Curated list of AI Tools - Updated 2026
A curated list of awesome things related to Anthropic Claude
[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
[ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
The pretty much "official" DSPy framework for TypeScript
RepairAgent is an autonomous LLM-based agent for software repair.
Towards Large Multimodal Models as Visual Foundation Agents
AI Observability & Evaluation
MLX Studio - Home of JANG_Q - Image Gen/Edit + Chat/Code All in one - + OpenClaw (Anthropic API)
Agent Interface Discovery
[ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/
Declarative agent config for Claude Code, Codex, and Cursor: an agent.json manifest for skills, MCP servers, and stack p
Structured deep research skill for Claude Code/Open Code/Codex with human-in-the-loop control
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
🤖 Awesome list of AGI Agents. A curated collection of agent resources.