A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Desktop app to control your computer with AI using your terminal, browser, mouse & keyboard
Local AI agent that controls your computer. Give it natural language instructions and watch it take screenshots, move your mouse, click, type, and run terminal commands.
Built with Tauri, React, and Rust.
https://github.com/user-attachments/assets/8edd92a7-7d3e-472a-9e48-3b561f0257d6
Here, I used it to autonomously read and reply to tweets, lol. This is purely for demonstration/research, you should not attempt to do the same, lol.
Computer Use Mode - Takes over your screen. Sees what you see via screenshots and controls your cursor and keyboard directly. Use when the task spans multiple apps or needs full desktop access. You step away while it works.
Background Mode - Runs async while you do other things. Uses Chrome DevTools Protocol for web automation and terminal for everything else. Doesn't touch your mouse or keyboard. Faster and more reliable for web + terminal tasks.
Requirements:
# install deps
npm install
# add your api key
echo "ANTHROPIC_API_KEY=your-key-here" > .env
# run dev
npm run tauri dev
# or build for production
npm run tauri build
On macOS, you'll need to grant accessibility permissions when prompted (System Settings → Privacy & Security → Accessibility).
⌃⇧C - push-to-talk → computer use mode⌃⇧B - push-to-talk → background mode⌘⇧H - help mode (screenshot + quick prompt)⌘⇧S - stop agentPRs welcome. Hit me up on Twitter @ishanxnagpal.
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
Deterministic multi-agent pipeline for end-to-end software development, orchestrating CLI-based AI tools (e.g. Gemini, C
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
干净、强大、属于你的 AI Agent 平台 --AI agents, without the clutter.