Gaudio Lab Audio AI MCP Server — Stem Separation, DME Separation, AI Text Sync
MCP server for the Gaudio Lab Audio AI API. Separate vocals, instruments, dialogue, music, and effects from any audio or video file, or sync lyrics to timestamps, all through natural language in your AI tools.
Works with Claude, ChatGPT, Cursor, VS Code, GitHub Copilot, and any MCP-compatible client.
Add to your MCP client config:

    {
      "mcpServers": {
        "gaudio": {
          "command": "npx",
          "args": ["-y", "@gaudiolab/mcp-developers"],
          "env": {
            "GAUDIO_API_KEY": "your-api-key-here"
          }
        }
      }
    }

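If you would rather generate this entry than paste it by hand, the following is a minimal sketch. It assumes your client reads a JSON file with a top-level `mcpServers` object, as in the snippet above. The helper name `add_gaudio_server` and the config file path are hypothetical, so check your client's documentation for the real location.

```python
import json
from pathlib import Path


def add_gaudio_server(config_path: Path, api_key: str) -> dict:
    """Merge the gaudio entry into an MCP client config file (hypothetical helper)."""
    # Load the existing config if there is one, so other servers are preserved.
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}
    servers = config.setdefault("mcpServers", {})
    # Same command, args, and env variable name as the snippet above.
    servers["gaudio"] = {
        "command": "npx",
        "args": ["-y", "@gaudiolab/mcp-developers"],
        "env": {"GAUDIO_API_KEY": api_key},
    }
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

A call such as `add_gaudio_server(Path("mcp.json"), os.environ["GAUDIO_API_KEY"])` keeps the key out of your shell history. The file name `mcp.json` is only a placeholder.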
Then just ask in natural language:
| Tool | Description |
|---|---|
| `gaudio_get_key_info` | Get API key info: credits, project, permitted models |
| `gaudio_list_models` | List available AI models by category |
| `gaudio_upload_file` | Upload audio/video/text file (multipart, auto-chunked) |
| `gaudio_create_job` | Create a processing job |
| `gaudio_get_job` | Check job status and get download URLs |
| `gaudio_separate_audio` | All-in-one: upload → process → download URLs |
| `gaudio_sync_lyrics` | All-in-one lyrics sync with timestamps |
| Model | Description | Type Options |
|---|---|---|
| `gsep_music_hq_v1` | Multi-instrument separation | vocal, drum, bass, electric_guitar, acoustic_piano |
| `gsep_music_shq_v1` | Super HQ vocal + accompaniment | vocal |
| `gsep_speech_hq_v1` | Speech / noise removal | speech |
Max: 1GB / 20 min per file. Types can be combined (e.g. vocal,drum).
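As an illustration of combining types, here is a small sketch that checks a requested combination against the table above and builds the comma-separated string. The helper name `build_type_string` is hypothetical, and rejecting an empty list is an assumption rather than documented API behavior.

```python
# Type options copied from the stem separation table above.
STEM_TYPES = {
    "gsep_music_hq_v1": ("vocal", "drum", "bass", "electric_guitar", "acoustic_piano"),
    "gsep_music_shq_v1": ("vocal",),
    "gsep_speech_hq_v1": ("speech",),
}


def build_type_string(model: str, types: list[str]) -> str:
    """Validate the requested types for a model and join them with commas."""
    allowed = STEM_TYPES.get(model)
    if allowed is None:
        raise ValueError(f"unknown stem separation model: {model}")
    if not types:
        # Assumption: at least one type must be requested.
        raise ValueError("at least one type is required")
    unknown = [t for t in types if t not in allowed]
    if unknown:
        raise ValueError(f"{model} does not support: {', '.join(unknown)}")
    # Drop duplicates while keeping the caller's order.
    return ",".join(dict.fromkeys(types))


print(build_type_string("gsep_music_hq_v1", ["vocal", "drum"]))  # vocal,drum
```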
| Model | Description |
|---|---|
| `gsep_dme_dtrack_v1` | Dialogue extraction |
| `gsep_dme_d2track_v1` | Dialogue + vocals |
| `gsep_dme_metrack_v1` | Music + effects |
| `gsep_dme_me2track_v1` | Music + effects v1 |
| `gsep_dme_me2track_v2` | Music + effects v2 (high quality) |
| `gsep_dme_mtrack_v1` | Music only |
| `gsep_dme_etrack_v1` | Effects only |
Max: 10GB / 200 min per file.
| Model | Description | Languages |
|---|---|---|
| `gts_lyrics_line_v1` | Lyrics line sync | en, ko, ja, zh-cn |
Max: 1GB / 10 min. Text: .txt (UTF-8), min 2 lines, max 60 chars/line.
Output: CSV (timestamp, lyric_text, confidence_score) + JSON report.
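The text constraints above are easy to check locally before uploading. The sketch below is one way to do that. The function names are hypothetical, and two details are assumptions: blank lines count toward the line total, and line length is measured in Unicode code points. The service may count either differently.

```python
from pathlib import Path

MAX_CHARS_PER_LINE = 60  # from the limits above
MIN_LINES = 2            # from the limits above


def check_lyrics_text(text: str) -> list[str]:
    """Return a list of problems; an empty list means the text passes."""
    problems = []
    lines = text.splitlines()
    if len(lines) < MIN_LINES:
        problems.append(f"need at least {MIN_LINES} lines, found {len(lines)}")
    for number, line in enumerate(lines, start=1):
        if len(line) > MAX_CHARS_PER_LINE:
            problems.append(
                f"line {number} has {len(line)} chars (max {MAX_CHARS_PER_LINE})"
            )
    return problems


def check_lyrics_file(path: Path) -> list[str]:
    """Check extension and encoding, then apply the line rules."""
    if path.suffix.lower() != ".txt":
        return [f"expected a .txt file, got {path.suffix or 'no extension'}"]
    try:
        text = path.read_bytes().decode("utf-8")
    except UnicodeDecodeError as exc:
        return [f"not valid UTF-8: {exc}"]
    return check_lyrics_text(text)
```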
| Type | Formats |
|---|---|
| Audio | WAV, FLAC, MP3, M4A |
| Video | MOV, MP4 (audio auto-extracted) |
| Text | TXT (UTF-8) |
Output: MP3 (48kHz/320kbps) + WAV (same as input). Download URLs valid for 48 hours.
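The supported formats and per-file limits listed above can be folded into a pre-flight check that runs before any upload. This is a sketch under stated assumptions: the family labels `stem`, `dme`, and `text_sync` are hypothetical names for the three model groups, "GB" is taken to mean 10^9 bytes, and the caller supplies the duration from their own tooling.

```python
from pathlib import Path
from typing import Optional

# Extensions from the supported formats table above.
FORMATS = {
    "audio": {".wav", ".flac", ".mp3", ".m4a"},
    "video": {".mov", ".mp4"},
    "text": {".txt"},
}

GB = 10**9  # assumption: "GB" means 10^9 bytes

# (max bytes, max minutes) per model family, from the limits listed above.
LIMITS = {
    "stem": (1 * GB, 20),
    "dme": (10 * GB, 200),
    "text_sync": (1 * GB, 10),
}


def media_kind(filename: str) -> Optional[str]:
    """Map a file name to 'audio', 'video', or 'text'; None if unsupported."""
    suffix = Path(filename).suffix.lower()
    for kind, extensions in FORMATS.items():
        if suffix in extensions:
            return kind
    return None


def preflight(filename: str, size_bytes: int, duration_min: float, family: str) -> list[str]:
    """Return a list of problems for a media file; empty means it looks OK."""
    problems = []
    if media_kind(filename) not in ("audio", "video"):
        problems.append(f"unsupported media format: {filename}")
    max_bytes, max_min = LIMITS[family]
    if size_bytes > max_bytes:
        problems.append(f"file is {size_bytes} bytes (max {max_bytes})")
    if duration_min > max_min:
        problems.append(f"duration is {duration_min} min (max {max_min})")
    return problems
```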
Upload file → Create job → Poll status → Get download URLs
The high-level tools (gaudio_separate_audio, gaudio_sync_lyrics) handle this entire flow automatically. Upload IDs are valid for 72 hours and can be reused across multiple jobs.
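For readers driving the low-level tools themselves, the four steps can be sketched as a polling loop. Everything API-specific here is an assumption: the `upload`, `create_job`, and `get_job` callables stand in for whatever wraps `gaudio_upload_file`, `gaudio_create_job`, and `gaudio_get_job` in your client, and the `status` field with values `"success"` and `"failed"` is a hypothetical response shape, not the documented one.

```python
import time
from typing import Callable


def run_job(
    upload: Callable[[str], str],
    create_job: Callable[[str, str], str],
    get_job: Callable[[str], dict],
    path: str,
    model: str,
    poll_seconds: float = 5.0,
    timeout_seconds: float = 1800.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Upload, create a job, and poll until it finishes or times out."""
    upload_id = upload(path)               # step 1: upload file
    job_id = create_job(upload_id, model)  # step 2: create job
    deadline = time.monotonic() + timeout_seconds
    while True:
        job = get_job(job_id)              # step 3: poll status
        status = job.get("status")         # hypothetical field name
        if status == "success":            # hypothetical status value
            return job                     # step 4: caller reads the download URLs
        if status == "failed":             # hypothetical status value
            raise RuntimeError(f"job {job_id} failed: {job}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} did not finish in {timeout_seconds}s")
        sleep(poll_seconds)
```

Passing the three operations in as callables keeps the loop testable with fakes. Because upload IDs can be reused, a caller running several models on one file could call `upload` once and skip step 1 for later jobs.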
License: MIT