A community-driven registry for the Claude Code ecosystem. Not affiliated with Anthropic.
Are you the author? Sign in to claim
Convert documentation sites, GitHub repos, and PDFs into structured Claude AI skills
English | 简体中文 | 日本語 | 한국어 | Español | Français | Deutsch | Português | Türkçe | العربية | हिन्दी | Русский
🧠 The data layer for AI systems. Skill Seekers turns documentation sites, GitHub repos, PDFs, videos, notebooks, wikis, and 10+ more source types into structured knowledge assets—ready to power AI Skills (Claude, Gemini, OpenAI), RAG pipelines (LangChain, LlamaIndex, Pinecone), and AI coding assistants (Cursor, Windsurf, Cline) in minutes, not hours.
🌐 Visit SkillSeekersWeb.com - Browse 24+ preset configs, share your configs, and access complete documentation!
📋 View Development Roadmap & Tasks - 134 tasks across 10 categories, pick any to contribute!
Skill Seekers is a multi-repo project. Here's where everything lives:
| Repository | Description | Links |
|---|---|---|
| Skill_Seekers | Core CLI & MCP server (this repo) | PyPI |
| skillseekersweb | Website & documentation | Live |
| skill-seekers-configs | Community config repository | |
| skill-seekers-action | GitHub Action for CI/CD | |
| skill-seekers-plugin | Claude Code plugin | |
| homebrew-skill-seekers | Homebrew tap for macOS |
Want to contribute? The website and configs repos are great starting points for new contributors!
Skill Seekers is the universal preprocessing layer that sits between raw documentation and every AI system that consumes it. Whether you are building Claude skills, a LangChain RAG pipeline, or a Cursor .cursorrules file — the data preparation is identical. You do it once, and export to all targets.
# One command → structured knowledge asset
skill-seekers create https://docs.react.dev/
# or: skill-seekers create facebook/react
# or: skill-seekers create ./my-project
# Export to any AI system
skill-seekers package output/react --target claude # → Claude AI Skill (ZIP)
skill-seekers package output/react --target langchain # → LangChain Documents
skill-seekers package output/react --target llama-index # → LlamaIndex TextNodes
skill-seekers package output/react --target cursor # → .cursorrules
skill-seekers package output/react --target ibm-bob # → IBM Bob skill directory
| Output | Target | What it powers |
|---|---|---|
| Claude Skill (ZIP + YAML) | --target claude | Claude Code, Claude API |
| Gemini Skill (tar.gz) | --target gemini | Google Gemini |
| OpenAI / Custom GPT (ZIP) | --target openai | GPT-4o, custom assistants |
| LangChain Documents | --target langchain | QA chains, agents, retrievers |
| LlamaIndex TextNodes | --target llama-index | Query engines, chat engines |
| Haystack Documents | --target haystack | Enterprise RAG pipelines |
| Pinecone-ready (Markdown) | --target markdown | Vector upsert |
| ChromaDB / FAISS / Qdrant | --target chroma/faiss/qdrant | Local vector DBs |
| IBM Bob Skill (directory) | --target ibm-bob | IBM Bob project/global skills |
Cursor .cursorrules | --target markdown → copy SKILL.md | Cursor IDE .cursorrules |
| Windsurf / Cline / Continue | --target claude → copy | VS Code, IntelliJ, Vim |
# 1. Install
pip install skill-seekers
# 2. Create skill from any source
skill-seekers create https://docs.django.com/
# 3. Package for your AI platform
skill-seekers package output/django --target claude
That's it! You now have output/django-claude.zip ready to use.
# Use a different AI agent for enhancement (default: claude)
skill-seekers create https://docs.django.com/ --agent kimi
skill-seekers create https://docs.django.com/ --agent codex
skill-seekers create https://docs.django.com/ --agent-cmd "my-custom-agent run"
Point scan at any project and an AI agent reads its manifests, README,
Dockerfile/CI and sampled source imports — then emits one config per detected
framework plus a <project>-codebase.json for your own code. Pins the
detected version so re-running reports bumps:
skill-seekers scan ./my-react-app --out ./configs/scanned/
# → react.json, vite.json, tailwind.json, jest.json, my-react-app-codebase.json
# Then build any of them
skill-seekers create ./configs/scanned/react.json
If a detection has no existing preset, the AI generates a fresh config; on exit you can optionally publish it back to the community registry.
# GitHub repository
skill-seekers create facebook/react
# Local project
skill-seekers create ./my-project
# PDF document
skill-seekers create manual.pdf
# Word document
skill-seekers create report.docx
# EPUB e-book
skill-seekers create book.epub
# Jupyter Notebook
skill-seekers create notebook.ipynb
# OpenAPI spec
skill-seekers create openapi.yaml
# PowerPoint presentation
skill-seekers create presentation.pptx
# AsciiDoc document
skill-seekers create guide.adoc
# Local HTML file (auto-detected by extension)
skill-seekers create page.html
# Whole directory of HTML files (auto-detected for HTML-dominant dirs)
skill-seekers create ./mirror_output/site/
# Force HTML mode on a mixed/code-heavy directory
skill-seekers create ./repo/ --html-path ./repo/docs/build/html/
# RSS/Atom feed
skill-seekers create feed.rss
# Man page
skill-seekers create curl.1
# Video (YouTube, Vimeo, or local file — requires skill-seekers[video])
skill-seekers create --video-url https://www.youtube.com/watch?v=... --name mytutorial
# First time? Auto-install GPU-aware visual deps:
skill-seekers create --setup
# Confluence wiki
skill-seekers create --space-key TEAM --name wiki
# Notion pages
skill-seekers create --database-id ... --name docs
# Slack/Discord chat export
skill-seekers create --chat-export-path ./slack-export --name team-chat
# Package for multiple platforms
for platform in claude gemini openai langchain; do
skill-seekers package output/django --target $platform
done
Skill Seekers is the data layer for AI systems. It transforms 18 source types—documentation websites, GitHub repositories, PDFs, videos, Jupyter Notebooks, Word/EPUB/AsciiDoc documents, OpenAPI specs, PowerPoint presentations, RSS feeds, man pages, Confluence wikis, Notion pages, Slack/Discord exports, and more—into structured knowledge assets for every AI target:
| Use Case | What you get | Examples |
|---|---|---|
| AI Skills | Comprehensive SKILL.md + references | Claude Code, Gemini, GPT |
| RAG Pipelines | Chunked documents with rich metadata | LangChain, LlamaIndex, Haystack |
| Vector Databases | Pre-formatted data ready for upsert | Pinecone, Chroma, Weaviate, FAISS |
| AI Coding Assistants | Context files your IDE AI reads automatically | Cursor, Windsurf, Cline, Continue.dev |
| I want to... | Read this |
|---|---|
| Get started quickly | Quick Start - 3 commands to first skill |
| Understand concepts | Core Concepts - How it works |
| Scrape sources | Scraping Guide - All source types |
| Enhance skills | Enhancement Guide - AI enhancement |
| Export skills | Packaging Guide - Platform export |
| Look up commands | CLI Reference - All 20 commands |
| Configure | Config Format - JSON specification |
| Fix issues | Troubleshooting - Common problems |
Complete documentation: docs/README.md
Instead of spending days on manual preprocessing, Skill Seekers:
security-focus, architecture-comprehensive, or custom YAML presetsDocuments, LlamaIndex TextNodes, Haystack Documents.cursorrules / .windsurfrules / .clinerules automatically--start-time and --end-time--target flag selects platform| Platform | Format | Upload | Enhancement | API Key | Custom Endpoint |
|---|---|---|---|---|---|
| Claude AI | ZIP + YAML | ✅ Auto | ✅ Yes | ANTHROPIC_API_KEY | ANTHROPIC_BASE_URL |
| Google Gemini | tar.gz | ✅ Auto | ✅ Yes | GOOGLE_API_KEY | - |
| OpenAI ChatGPT | ZIP + Vector Store | ✅ Auto | ✅ Yes | OPENAI_API_KEY | - |
| MiniMax AI | ZIP + Knowledge Files | ✅ Auto | ✅ Yes | MINIMAX_API_KEY | - |
| Generic Markdown | ZIP | ❌ Manual | ❌ No | - | - |
# Claude (default - no changes needed!)
skill-seekers package output/react/
skill-seekers upload react.zip
# Google Gemini
pip install skill-seekers[gemini]
skill-seekers package output/react/ --target gemini
skill-seekers upload react-gemini.tar.gz --target gemini
# OpenAI ChatGPT
pip install skill-seekers[openai]
skill-seekers package output/react/ --target openai
skill-seekers upload react-openai.zip --target openai
# MiniMax AI
pip install skill-seekers[minimax]
skill-seekers package output/react/ --target minimax
skill-seekers upload react-minimax.zip --target minimax
# Generic Markdown (universal export)
skill-seekers package output/react/ --target markdown
# Use the markdown files directly in any LLM
The optional AI enhancement step (used by create, scan, and enhance) does not require an Anthropic key. You have three ways to power it:
1. Use a subscription you already pay for — no API credits at all (LOCAL agent mode)
Skill Seekers can shell out to a coding-agent CLI you're already logged into, so enhancement runs on your existing plan instead of metered API tokens:
skill-seekers create <source> --agent codex # OpenAI Codex CLI → your ChatGPT Plus
skill-seekers create <source> --agent claude # Claude Code → your Claude Pro/Max
Supported agents: claude, codex, copilot, opencode, kimi, and custom
(pair --agent custom with --agent-cmd "<your-cli> ..." to drive any other tool).
2. Any OpenAI-compatible provider (OpenRouter, Groq, Cerebras, Mistral, NVIDIA NIM, …)
All of these expose an OpenAI-compatible /v1 endpoint. Point Skill Seekers at one with three env vars — it detects OPENAI_API_KEY, and the OpenAI SDK honors OPENAI_BASE_URL automatically:
export OPENAI_API_KEY="<your provider key>"
export OPENAI_BASE_URL="https://openrouter.ai/api/v1" # provider endpoint (see table)
export OPENAI_MODEL="<a model that provider offers>" # required — default gpt-4o won't exist elsewhere
skill-seekers create <source>
| Provider | OPENAI_BASE_URL |
|---|---|
| OpenRouter | https://openrouter.ai/api/v1 |
| Groq | https://api.groq.com/openai/v1 |
| Cerebras | https://api.cerebras.ai/v1 |
| Mistral | https://api.mistral.ai/v1 |
| NVIDIA NIM | https://integrate.api.nvidia.com/v1 |
Provider detection picks the first API-key env var it finds (
ANTHROPIC_API_KEY→MOONSHOT_API_KEY→GOOGLE_API_KEY→OPENAI_API_KEY). If you want the OpenAI-compatible route, make sure the higher-priority keys are unset.
3. Claude-compatible endpoints (e.g. GLM, proxies)
export ANTHROPIC_API_KEY="your-key"
export ANTHROPIC_BASE_URL="https://your-claude-compatible-endpoint/v1"
Google Gemini (GOOGLE_API_KEY) and Kimi/Moonshot (MOONSHOT_API_KEY) are also supported natively. See Environment Variables Reference for the full list, including per-provider model overrides.
Installation:
# Install with Gemini support
pip install skill-seekers[gemini]
# Install with OpenAI support
pip install skill-seekers[openai]
# Install with MiniMax support
pip install skill-seekers[minimax]
# Install with all LLM platforms
pip install skill-seekers[all-llms]
✅ LangChain Documents - Direct export to Document format with page_content + metadata
✅ LlamaIndex TextNodes - Export to TextNode format with unique IDs + embeddings
✅ Pinecone-Ready Format - Optimized for vector database upsert
Quick Export:
# LangChain Documents (JSON)
skill-seekers package output/django --target langchain
# → output/django-langchain.json
# LlamaIndex TextNodes (JSON)
skill-seekers package output/django --target llama-index
# → output/django-llama-index.json
# Markdown (Universal)
skill-seekers package output/django --target markdown
# → output/django-markdown/SKILL.md + references/
Complete RAG Pipeline Guide: RAG Pipelines Documentation
Transform any framework documentation into expert coding context for 4+ AI assistants:
✅ Cursor IDE - Generate .cursorrules for AI-powered code suggestions
✅ Windsurf - Customize Windsurf's AI assistant context with .windsurfrules
✅ Cline (VS Code) - System prompts + MCP for VS Code agent
✅ Continue.dev - Context servers for IDE-agnostic AI
Quick Export for AI Coding Tools:
# For any AI coding assistant (Cursor, Windsurf, Cline, Continue.dev)
skill-seekers create --config configs/django.json
skill-seekers package output/django --target claude # or --target markdown
# Copy to your project (example for Cursor)
cp output/django-claude/SKILL.md my-project/.cursorrules
# Or for Windsurf
cp output/django-claude/SKILL.md my-project/.windsurf/rules/django.md
# Or for Cline
cp output/django-claude/SKILL.md my-project/.clinerules
# Or for Continue.dev (HTTP server)
python examples/continue-dev-universal/context_server.py
# Configure in ~/.continue/config.json
Integration Hub: All AI System Integrations
Three Streams Explained:
from skill_seekers.cli.unified_codebase_analyzer import UnifiedCodebaseAnalyzer
# Analyze GitHub repo with all three streams
analyzer = UnifiedCodebaseAnalyzer()
result = analyzer.analyze(
source="https://github.com/facebook/react",
depth="c3x", # or "basic" for fast analysis
fetch_github_metadata=True
)
# Access code stream (C3.x analysis)
print(f"Design patterns: {len(result.code_analysis['c3_1_patterns'])}")
print(f"Test examples: {result.code_analysis['c3_2_examples_count']}")
# Access docs stream (repository docs)
print(f"README: {result.github_docs['readme'][:100]}")
# Access insights stream (GitHub metadata)
print(f"Stars: {result.github_insights['metadata']['stars']}")
print(f"Common issues: {len(result.github_insights['common_problems'])}")
See complete documentation: Three-Stream Implementation Summary
~/.config/skill-seekers/config.json (600 permissions)prompt, wait, switch, fail--non-interactive flag fails fast without prompts--profile flag to select specific GitHub accountQuick Setup:
# One-time configuration (5 minutes)
skill-seekers config --github
# Use specific profile for private repos
skill-seekers create mycompany/private-repo --profile work
# CI/CD mode (fail fast, no prompts)
skill-seekers create owner/repo --non-interactive
# Resume interrupted job
skill-seekers resume --list
skill-seekers resume github_react_20260117_143022
Rate Limit Strategies Explained:
Generate skill-seekers as a skill to use within your AI agent (Claude Code, Kimi, Codex, etc.):
# Generate the skill
./scripts/bootstrap_skill.sh
# Install to Claude Code
cp -r output/skill-seekers ~/.claude/skills/
What you get:
C3.4: Configuration Pattern Extraction with AI Enhancement
extract_config_patterns tool with enhancement supportC3.3: AI-Enhanced How-To Guides
Usage:
# Quick analysis (1-2 min, basic features only)
skill-seekers scan tests/ --quick
# Comprehensive analysis with AI (20-60 min, all features)
skill-seekers scan tests/ --comprehensive
# With AI enhancement
skill-seekers scan tests/ --enhance
Full Documentation: docs/features/HOW_TO_GUIDES.md
Reusable YAML-defined enhancement pipelines that control how AI transforms your raw documentation into a polished skill.
default, minimal, security-focus, architecture-comprehensive, api-documentation~/.config/skill-seekers/workflows/# Apply a single workflow
skill-seekers create ./my-project --enhance-workflow security-focus
# Chain multiple workflows (applied in order)
skill-seekers create ./my-project \
--enhance-workflow security-focus \
--enhance-workflow minimal
# Manage presets
skill-seekers workflows list # List all (bundled + user)
skill-seekers workflows show security-focus # Print YAML content
skill-seekers workflows copy security-focus # Copy to user dir for editing
skill-seekers workflows add ./my-workflow.yaml # Install a custom preset
skill-seekers workflows remove my-workflow # Remove a user preset
skill-seekers workflows validate security-focus # Validate preset structure
# Copy multiple at once
skill-seekers workflows copy security-focus minimal api-documentation
# Add multiple files at once
skill-seekers workflows add ./wf-a.yaml ./wf-b.yaml
# Remove multiple at once
skill-seekers workflows remove my-wf-a my-wf-b
YAML preset format:
name: security-focus
description: "Security-focused review: vulnerabilities, auth, data handling"
version: "1.0"
stages:
- name: vulnerabilities
type: custom
prompt: "Review for OWASP top 10 and common security vulnerabilities..."
- name: auth-review
type: custom
prompt: "Examine authentication and authorisation patterns..."
uses_history: true
--async flag)--agent flag--agent-cmd to specify a custom agent CLI command for enhancement--agent and --agent-cmd available on all commands (create, scrape, github, pdf, etc.)# Basic install (documentation scraping, GitHub analysis, PDF, packaging)
pip install skill-seekers
# With all LLM platform support
pip install skill-seekers[all-llms]
# With MCP server
pip install skill-seekers[mcp]
# Everything
pip install skill-seekers[all]
Need help choosing? Run the setup wizard:
skill-seekers-setup
| Install | Features |
|---|---|
pip install skill-seekers | Scraping, GitHub analysis, PDF, all platforms |
pip install skill-seekers[gemini] | + Google Gemini support |
pip install skill-seekers[openai] | + OpenAI ChatGPT support |
pip install skill-seekers[all-llms] | + All LLM platforms |
pip install skill-seekers[mcp] | + MCP server for Claude Code, Cursor, etc. |
pip install skill-seekers[video] | + YouTube/Vimeo transcript & metadata extraction |
pip install skill-seekers[video-full] | + Whisper transcription & visual frame extraction |
pip install skill-seekers[jupyter] | + Jupyter Notebook support |
pip install skill-seekers[pptx] | + PowerPoint support |
pip install skill-seekers[confluence] | + Confluence wiki support |
pip install skill-seekers[notion] | + Notion pages support |
pip install skill-seekers[rss] | + RSS/Atom feed support |
pip install skill-seekers[chat] | + Slack/Discord chat export support |
pip install skill-seekers[asciidoc] | + AsciiDoc document support |
pip install skill-seekers[all] | Everything enabled |
Video visual deps (GPU-aware): After installing
skill-seekers[video-full], runskill-seekers create --setupto auto-detect your GPU and install the correct PyTorch variant + easyocr. This is the recommended way to install visual extraction dependencies.
The fastest way to go from config to uploaded skill - complete automation:
# Install React skill from official configs (auto-uploads to Claude)
skill-seekers install --config react
# Install from local config file
skill-seekers install --config configs/custom.json
# Install without uploading (package only)
skill-seekers install --config django --no-upload
# Preview workflow without executing
skill-seekers install --config react --dry-run
Time: 20-45 minutes total | Quality: Production-ready (9/10) | Cost: Free
Phases executed:
📥 PHASE 1: Fetch Config (if config name provided)
📖 PHASE 2: Scrape Documentation
✨ PHASE 3: AI Enhancement (MANDATORY - no skip option)
📦 PHASE 4: Package Skill
☁️ PHASE 5: Upload to Claude (optional, requires API key)
Requirements:
--agent to select a different AI agentSkill Seekers supports 21 LLM platforms, 8 RAG/vector targets, 18 source types, and full feature parity across all targets.
Platforms: Claude AI, Google Gemini, OpenAI ChatGPT, MiniMax AI, Generic Markdown, OpenCode, Kimi (Moonshot AI), DeepSeek AI, Qwen (Alibaba), OpenRouter, Together AI, Fireworks AI Source Types: Documentation websites, GitHub repos, PDFs, Word (.docx), EPUB, Video, Local codebases, Jupyter Notebooks, Local HTML, OpenAPI/Swagger, AsciiDoc, PowerPoint (.pptx), RSS/Atom feeds, Man pages, Confluence wikis, Notion pages, Slack/Discord chat exports
See Complete Feature Matrix for detailed platform and feature support.
| Feature | Claude | Gemini | OpenAI | MiniMax | Markdown |
|---|---|---|---|---|---|
| Format | ZIP + YAML | tar.gz | ZIP + Vector | ZIP + Knowledge | ZIP |
| Upload | ✅ API | ✅ API | ✅ API | ✅ API | ❌ Manual |
| Enhancement | ✅ Sonnet 4 | ✅ 2.0 Flash | ✅ GPT-4o | ✅ M2.7 | ❌ None |
| All Skill Modes | ✅ | ✅ | ✅ | ✅ | ✅ |
# Scrape documentation website
skill-seekers create --config configs/react.json
# Quick scrape without config
skill-seekers create https://react.dev --name react
# With async mode (3x faster)
skill-seekers create --config configs/godot.json --async --workers 8
# Use a specific AI agent for enhancement
skill-seekers create --config configs/react.json --agent kimi
# Basic PDF extraction
skill-seekers create --pdf docs/manual.pdf --name myskill
# Advanced features
skill-seekers create --pdf docs/manual.pdf --name myskill \
--extract-tables \ # Extract tables
--parallel \ # Fast parallel processing
--workers 8 # Use 8 CPU cores
# Scanned PDFs (requires: pip install pytesseract Pillow)
skill-seekers create --pdf docs/scanned.pdf --name myskill --ocr
# Install video support
pip install skill-seekers[video] # Transcripts + metadata
pip install skill-seekers[video-full] # + Whisper + visual frame extraction
# Auto-detect GPU and install visual deps (PyTorch + easyocr)
skill-seekers create --setup
# Extract from YouTube video
skill-seekers create --video-url https://www.youtube.com/watch?v=dQw4w9WgXcQ --name mytutorial
# Extract from a YouTube playlist
skill-seekers create --video-playlist https://www.youtube.com/playlist?list=... --name myplaylist
# Extract from a local video file
skill-seekers create --video-file recording.mp4 --name myrecording
# Extract with visual frame analysis (requires video-full deps)
skill-seekers create --video-url https://www.youtube.com/watch?v=... --name mytutorial --visual
# With AI enhancement (cleans OCR + generates polished SKILL.md)
skill-seekers create --video-url https://www.youtube.com/watch?v=... --visual --enhance-level 2
# Clip a specific section of a video (supports seconds, MM:SS, HH:MM:SS)
skill-seekers create --video-url https://www.youtube.com/watch?v=... --start-time 1:30 --end-time 5:00
# Use Vision API for low-confidence OCR frames (requires ANTHROPIC_API_KEY)
skill-seekers create --video-url https://www.youtube.com/watch?v=... --visual --vision-ocr
# Re-build skill from previously extracted data (skip download)
skill-seekers create --from-json output/mytutorial/video_data/extracted_data.json --name mytutorial
Full guide: See docs/VIDEO_GUIDE.md for complete CLI reference, visual pipeline details, AI enhancement options, and troubleshooting.
# Basic repository scraping
skill-seekers create facebook/react
# With authentication (higher rate limits)
export GITHUB_TOKEN=ghp_your_token_here
skill-seekers create facebook/react
# Customize what to include
skill-seekers create django/django \
--include-issues \ # Extract GitHub Issues
--max-issues 100 \ # Limit issue count
--include-changelog # Extract CHANGELOG.md
Combine documentation + GitHub + PDF into one unified skill with conflict detection:
# Use existing unified configs
skill-seekers create --config configs/react_unified.json
skill-seekers create --config configs/django_unified.json
# Or create unified config
cat > configs/myframework_unified.json << 'EOF'
{
"name": "myframework",
"merge_mode": "rule-based",
"sources": [
{
"type": "documentation",
"base_url": "https://docs.myframework.com/",
"max_pages": 200
},
{
"type": "github",
"repo": "owner/myframework",
"code_analysis_depth": "surface"
}
]
}
EOF
skill-seekers create --config configs/myframework_unified.json
Conflict Detection automatically finds:
Full Guide: See docs/features/UNIFIED_SCRAPING.md for complete documentation.
Share custom configs across teams using private git repositories:
# Option 1: Using MCP tools (recommended)
# Register your team's private repo
add_config_source(
name="team",
git_url="https://github.com/mycompany/skill-configs.git",
token_env="GITHUB_TOKEN"
)
# Fetch config from team repo
fetch_config(source="team", config_name="internal-api")
Supported Platforms:
GITHUB_TOKEN), GitLab (GITLAB_TOKEN), Gitea (GITEA_TOKEN), Bitbucket (BITBUCKET_TOKEN)Full Guide: See docs/reference/GIT_CONFIG_SOURCES.md for complete documentation.
graph LR
A[Documentation Website] --> B[Skill Seekers]
B --> C[Scraper]
B --> D[AI Enhancement]
B --> E[Packager]
C --> F[Organized References]
D --> F
F --> E
E --> G[AI Skill .zip]
G --> H[Upload to AI Platform]
--agent).zip fileThe system is organized into 8 core modules and 5 utility modules (~200 classes total):

| Module | Purpose | Key Classes |
|---|---|---|
| CLICore | Git-style command dispatcher | CLIDispatcher, SourceDetector, CreateCommand |
| Scrapers | 18 source-type extractors | DocToSkillConverter, GitHubScraper, UnifiedScraper |
| Adaptors | 20+ output platform formats | SkillAdaptor (ABC), ClaudeAdaptor, LangChainAdaptor |
| Analysis | C3.x codebase analysis pipeline | UnifiedCodebaseAnalyzer, PatternRecognizer, 10 GoF detectors |
| Enhancement | AI-powered skill improvement via AgentClient | AgentClient, AIEnhancer, UnifiedEnhancer, WorkflowEngine |
| Packaging | Package, upload, install skills | PackageSkill, InstallAgent |
| MCP | FastMCP server (40 tools) | SkillSeekerMCPServer, 10 tool modules |
| Sync | Doc change detection | ChangeDetector, SyncMonitor, Notifier |
Utility modules: Parsers (28 CLI parsers), Storage (S3/GCS/Azure), Embedding (multi-provider vectors), Benchmark (performance), Utilities (16 shared helpers).
Full UML diagrams: docs/UML_ARCHITECTURE.md | StarUML project: docs/UML/skill_seekers.mdj | HTML API reference: docs/UML/html/
Before you start, make sure you have:
python3 --versiongit --versionFirst time user? → Start Here: Bulletproof Quick Start Guide 🎯
Once your skill is packaged, you need to upload it to Claude:
# Set your API key (one-time)
export ANTHROPIC_API_KEY=sk-ant-...
# Package and upload automatically
skill-seekers package output/react/ --upload
# OR upload existing .zip
skill-seekers upload output/react.zip
# Package skill
skill-seekers package output/react/
# → Creates output/react.zip
# Then manually upload:
# - Go to https://claude.ai/skills
# - Click "Upload Skill"
# - Select output/react.zip
In Claude Code, just ask:
"Package and upload the React skill"
Skill Seekers can automatically install skills to 19 AI coding agents.
# Install to specific agent
skill-seekers install-agent output/react/ --agent cursor
# Install to IBM Bob (project-local .bob/skills/)
skill-seekers install-agent output/react/ --agent bob
# Install to all agents at once
skill-seekers install-agent output/react/ --agent all
# Preview without installing
skill-seekers install-agent output/react/ --agent cursor --dry-run
| Agent | Path | Type |
|---|---|---|
| Claude Code | ~/.claude/skills/ | Global |
| Cursor | .cursor/skills/ | Project |
| VS Code / Copilot | .github/skills/ | Project |
| Amp | ~/.amp/skills/ | Global |
| Goose | ~/.config/goose/skills/ | Global |
| OpenCode | ~/.opencode/skills/ | Global |
| Windsurf | ~/.windsurf/skills/ | Global |
| Roo Code | .roo/skills/ | Project |
| Cline | .cline/skills/ | Project |
| Aider | ~/.aider/skills/ | Global |
| Bolt | .bolt/skills/ | Project |
| Kilo Code | .kilo/skills/ | Project |
| Continue | ~/.continue/skills/ | Global |
| Kimi Code | ~/.kimi/skills/ | Global |
| IBM Bob | .bob/skills/ | Project |
Skill Seekers ships an MCP server for use from Claude Code, Cursor, Windsurf, VS Code + Cline, or IntelliJ IDEA.
# stdio mode (Claude Code, VS Code + Cline)
python -m skill_seekers.mcp.server_fastmcp
# HTTP mode (Cursor, Windsurf, IntelliJ)
python -m skill_seekers.mcp.server_fastmcp --transport http --port 8765
# Auto-configure all agents at once
./setup_mcp.sh
All 40 tools available:
list_configs, generate_config, validate_config, estimate_pages, scrape_docs, package_skill, upload_skill, enhance_skill, install_skillscrape_github, scrape_pdf, unified_scrape, merge_sources, detect_conflicts, add_config_source, fetch_config, list_config_sources, remove_config_source, split_configexport_to_chroma, export_to_weaviate, export_to_faiss, export_to_qdrantcloud_upload, cloud_download, cloud_listFull Guide: docs/guides/MCP_SETUP.md
# List all presets
# skill-seekers list-configs # Not available in v3.7.0
| Category | Presets |
|---|---|
| Web Frameworks | react, vue, angular, svelte, nextjs |
| Python | django, flask, fastapi, sqlalchemy, pytest |
| Game Development | godot, pygame, unity |
| Tools & DevOps | docker, kubernetes, terraform, ansible |
| Unified (Docs + GitHub) | react-unified, vue-unified, nextjs-unified, and more |
# Option 1: Interactive
skill-seekers create --interactive
# Option 2: Copy and edit a preset
cp configs/react.json configs/myframework.json
nano configs/myframework.json
skill-seekers create --config configs/myframework.json
{
"name": "myframework",
"description": "When to use this skill",
"base_url": "https://docs.myframework.com/",
"selectors": {
"main_content": "article",
"title": "h1",
"code_blocks": "pre code"
},
"url_patterns": {
"include": ["/docs", "/guide"],
"exclude": ["/blog", "/about"]
},
"categories": {
"getting_started": ["intro", "quickstart"],
"api": ["api", "reference"]
},
"rate_limit": 0.5,
"max_pages": 500
}
The tool searches in this order:
./configs/ (current directory)~/.config/skill-seekers/configs/ (user config directory)output/
├── godot_data/ # Scraped raw data
│ ├── pages/ # JSON files (one per page)
│ └── summary.json # Overview
│
└── godot/ # The skill
├── SKILL.md # Enhanced with real examples
├── references/ # Categorized docs
│ ├── index.md
│ ├── getting_started.md
│ ├── scripting.md
│ └── ...
├── scripts/ # Empty (add your own)
└── assets/ # Empty (add your own)
main_content selectorarticle, main, div[role="main"]# Force re-scrape
rm -rf output/myframework_data/
skill-seekers create --config configs/myframework.json
Edit the config categories section with better keywords.
# Delete old data and re-scrape
rm -rf output/godot_data/
skill-seekers create --config configs/godot.json
# Check if API key is set
echo $ANTHROPIC_API_KEY
# Try LOCAL mode instead (uses Claude Code Max, no API key needed)
skill-seekers enhance output/react/ --mode LOCAL
# Monitor background enhancement status
skill-seekers enhance-status output/react/ --watch
# Set a GitHub token (5000 req/hour vs 60/hour anonymous)
export GITHUB_TOKEN=ghp_your_token_here
# Or configure multiple profiles
skill-seekers config --github
| Task | Time | Notes |
|---|---|---|
| Scraping (sync) | 15-45 min | First time only, thread-based |
| Scraping (async) | 5-15 min | 2-3x faster with --async flag |
| Building | 1-3 min | Fast rebuild from cache |
| Re-building | <1 min | With --skip-scrape |
| Enhancement (LOCAL) | 30-60 sec | Uses Claude Code Max |
| Enhancement (API) | 20-40 sec | Requires API key |
| Video (transcript) | 1-3 min | YouTube/local, transcript only |
| Video (visual) | 5-15 min | + OCR frame extraction |
| Packaging | 5-10 sec | Final .zip creation |
Control analysis depth with --preset:
skill-seekers create https://docs.react.dev/ --preset quick # Fast, surface-level
skill-seekers create https://docs.react.dev/ --preset standard # Balanced (default)
skill-seekers create https://docs.react.dev/ --preset comprehensive # Deep, exhaustive
skill-seekers create https://docs.react.dev/ --dry-run # Preview without scraping
skill-seekers create https://docs.react.dev/ --fresh # Ignore cache, full re-scrape
skill-seekers create https://docs.react.dev/ --resume # Resume interrupted job
skill-seekers create https://docs.react.dev/ --skip-scrape # Re-package existing output
skill-seekers doctor # Diagnose installation & environment
skill-seekers sync-config # Detect config drift
skill-seekers stream <source> # Streaming ingestion for large docs
skill-seekers update output/react/ # Incremental update
skill-seekers multilang <source> # Multi-language skill generation
skill-seekers quality output/react/ # Quality scoring
skill-seekers package output/react/ --chunk-for-rag --chunk-tokens 512 --chunk-overlap-tokens 50
skill-seekers package output/react/ --marketplace --marketplace-category frontend
| Extra | Install | Purpose |
|---|---|---|
browser | pip install "skill-seekers[browser]" | Headless Playwright for SPA sites |
embedding | pip install "skill-seekers[embedding]" | Embedding server support |
s3 / gcs / azure | pip install "skill-seekers[s3]" etc. | Cloud storage upload |
rag-upload | pip install "skill-seekers[rag-upload]" | Combined vector DB upload deps |
MIT License - see LICENSE file for details
Happy skill building! 🚀
ML engineering — model training, deployment, MLOps, monitoring
DevOps practices — CI/CD, containers, monitoring, infrastructure automation
Professional skills marketplace with production-ready skills for enhanced development
Self-learning system that captures corrections and syncs them to CLAUDE.md and AGENTS.md
Community Package
@yusufkaraaslan on GitHub