A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Give Claude a video input. /watch downloads from YouTube/Instagram/X/Vimeo/any yt-dlp site, extracts frames, and transcr
Let Claude watch a video from almost anywhere — and transcribe it 100% on your machine, no API key.
Paste a URL (YouTube, Instagram, X/Twitter, Vimeo, TikTok, Loom, and ~1800 more) or a local file, ask a question, and Claude downloads the video, extracts frames, pulls a timestamped transcript, and reads every frame as an image. By the time it answers, it has seen the video and heard the audio.
/watch https://youtu.be/dQw4w9WgXcQ what happens at the 30 second mark?
This is a fork of bradautomates/claude-video. The big change: transcription runs locally via mlx-whisper (Apple Silicon) or openai-whisper (CPU) instead of a paid Whisper API. No keys, no config file, no audio ever leaves your machine. Also adds browser-cookie support so login-gated sources (Instagram, X) work.
Claude Code (plugin):
/plugin marketplace add mathiaschu/claude-video
/plugin install watch@claude-video
Generic skill folder (Codex, etc.):
git clone https://github.com/mathiaschu/claude-video.git ~/.claude/skills/watch
Then run once to install dependencies (or let the first /watch do it):
python3 ~/.claude/skills/watch/scripts/setup.py
Windows: use
pythoninstead ofpython3(thepython3alias is a Microsoft Store stub).ffmpeg+yt-dlpinstall via thewingetcommands the setup script prints; for transcription usepip install openai-whisper(mlx-whisper is Apple Silicon only).
pip3 install mlx-whisper — preferred on Apple Silicon, runs on the GPU/Neural Engine.pip3 install openai-whisper — CPU fallback, cross-platform.There is no API key. Captions cover most public videos for free; when a video has no captions, the audio is transcribed locally.
.mp4, .mov, .mkv, .webm, …).yt-dlp downloads it into a temp working directory. Local files are probed in place, no download.ffmpeg extracts frames at an auto-scaled rate. Duration-aware budget — ≤30s gets ~30 frames, 1-3min ~60, 3-10min ~80, longer 100 sparsely. Hard ceilings: 2 fps, 100 frames. JPEGs at 512px wide (bump with --resolution 1024 to read on-screen text).Reads each frame as an image and aligns them to the timestamped transcript.Public videos download with no auth. For sources that require a login — Instagram, X/Twitter, age-restricted or private/unlisted YouTube — /watch needs your own browser cookies so yt-dlp can authenticate as you. You don't have to set anything up in advance: just run /watch <url> normally, and if it fails with a login/private error, do this.
/watch https://www.instagram.com/reel/XXXX/ --cookies-from-browser chrome
Supported: chrome, firefox, safari, edge, brave, chromium, opera, vivaldi.
macOS gotchas (the usual culprits):
If browser extraction keeps fighting you, export the cookies manually (works everywhere):
instagram.com).~/Downloads/cookies.txt./watch https://www.instagram.com/reel/XXXX/ --cookies ~/Downloads/cookies.txt
Your cookies are read live from your own machine and piped straight into yt-dlp. The skill never copies, stores, logs, or transmits them. If you exported a cookies.txt, it stays on your disk — delete it whenever you want.
| Flag | What it does |
|---|---|
--start T / --end T | Focus on a section (SS, MM:SS, or HH:MM:SS); denser frame rate |
--max-frames N | Lower the frame cap for a tighter token budget |
--resolution W | Frame width in px (default 512; 1024 to read on-screen text) |
--fps F | Override auto-fps (clamped to 2 fps) |
--cookies-from-browser B | Read cookies from a local browser for login-gated sources |
--cookies FILE | Use a Netscape-format cookies.txt instead |
--whisper mlx|openai-whisper | Force a specific local engine |
--no-whisper | Skip transcription entirely (frames only) |
.env, no config file, no secret of any kind.Original skill: bradautomates/claude-video (MIT). This fork swaps the Whisper API for local transcription and adds browser-cookie support. Licensed MIT — see LICENSE.
A Claude Code skill by Hao (駱君昊) that learns your Facebook voice and auto-posts to FB / IG / Threads / X with a 14-day c
1000+ skills curated from Anthropic, Vercel, Stripe, and other engineering teams
Human + AI music production workflow for Suno - skills, templates, and tools
Claude Code skill for YouTube creators — channel audits, video SEO, retention scripts, thumbnails, content strategy, Sho