A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
4 packages found
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
Awesome LLM Papers and repos on very comprehensive topics.
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural langu
[CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos