A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
AI-powered edge knowledge mining from underground forums. Deep crawls Reddit, BlackHatWorld, GreyHatMafia with browser a
Chinese Documentation | English Documentation
AI-powered edge knowledge mining system. Intelligently recommends high-value forums (Reddit, BlackHatWorld, GreyHatMafia, etc.) based on user needs, performs deep crawling with browser-use, visual recognition, and analyzes edge tactics, power users, and resources, outputting a single structured Markdown report.
💡 Knowledge Equity Initiative This project encourages community contributions! You can create branches on Git to build domain-specific search expert skills (e.g., SEO Expert, Affiliate Hunter, Tool Digger), making edge knowledge accessible to more people.
🤝 How to Contribute PRs:
- 🌿 Build Domain-Specific Branches - Create specialized versions for specific domains (e.g.,
seo-expert,affiliate)- 🔧 Optimize Existing Skill - Improve tool versatility and robustness (anti-scraping strategies, error handling, performance optimization)
- 📚 Share Quality Forums - Add deep-dive forums you use to
forum_database.jsonwith search strategies- 🤖 Add Automation Capabilities - Integrate scheduled tasks, push to Notion/Feishu/Slack and other collaboration tools
Let's break information barriers and promote knowledge equity together! 🚀
In the AI era, information that LLMs can easily understand or collect from the internet is considered common knowledge. Edge knowledge refers to scarce, industry know-how, and gray/black hat techniques that reach the boundaries of platform rules.
Edge Knowledge Includes:
Does NOT Include:
V2 adds aggressive resource acquisition capabilities:
| Feature | Description |
|---|---|
| 🎯 Value Signal Detection | 6 pattern types: Reply-unlock, Hidden content, Download links, Extract codes, Attachments, Task thresholds |
| 🤖 Auto-Reply System | 25 random templates (15 EN + 10 CN), auto language detection |
| 📦 Resource Downloader | Downloads ALL file types (.exe/.bat/.torrent), maintains completeness |
| 🔗 Deep Dive Tracking | External link follow, Author tracking, Comment section mining |
| 🧵 Tool Integration | Agent-Reach, gallery-dl, yt-dlp, Crawl4AI ready |
V2 could grab resources. V3 makes sure what you grab is fresh and trustworthy — because edge knowledge decays, and an opinion is not a tactic.
| Feature | Description |
|---|---|
| 🌐 GEO/AEO Edge Category | The frontier moved. New GEO_Edge_Search category + GEO_AI_Visibility strategy target the freshest, least-saturated edge: getting cited by ChatGPT/Perplexity/Gemini, intercepting AI Overview traffic, engineering LLM visibility — before the playbooks go mainstream |
| ⏳ Freshness Contract | Every source now carries last_verified / activity_status / freshness_score. Stale pools get down-ranked, declining ones flagged for re-verification (reverify_after_days: 90). "Freshness" stops being a slogan and becomes a mechanism |
| 🎯 Credibility Scoring | credibilityScoring turns the trustworthy characteristic into a computable score. Concrete numbers, real pricing, and case evidence score high; pure opinion gets penalized. Extracted knowledge is ranked by evidence strength before it lands in a report |
Why V3 matters: A forum that was gold in 2024 may be dead in 2026. A reply with "$30 CPA, 14-day ROI, ran it on 3 accounts" is worth more than "I think this should work." V3 encodes both judgments so the system stops treating all signals as equal.
git clone https://github.com/1596941391qq/EdgeKnowledge_Skill.git
cd EdgeKnowledge_Skill
chmod +x install.sh
./install.sh
Or copy this skill to Claude Code's skills directory:
cp -r edge-knowledge ~/.claude/skills/
V2 includes an intelligent three-tier routing engine that automatically selects the best tool for each task — cost-first with fallback guarantees.
| Tier | Tool | Cost | Best For |
|---|---|---|---|
| Tier 1 | browser-use | Free (local Playwright) | Screenshot + visual recognition, clicks/scrolls/forms, JS lazy-load, post-login access |
| Tier 2 | agent-browser | Free (Vercel CLI) | Repetitive structured extraction, @e1/@e2 element selection, scripted multi-step operations |
| Tier 3 | google-gemini-mcp | API key (per-token) | Bypassing anti-bot blocks, batch URL analysis (>10 pages), complex multimodal understanding |
IF captcha_detected AND captcha_type == "recaptcha_v2":
→ ai-captcha-bypass (GPT-4o or Gemini 2.5) → retry
IF cloudflare_blocked AND browser_use_failed:
→ google-gemini-mcp (Tier 3)
IF batch_analysis AND urls > 10:
→ google-gemini-mcp (concurrent)
IF visual_heavy AND needs_screenshot:
→ browser-use (Tier 1)
IF download_only:
→ gallery-dl / yt-dlp (bypasses browser)
All MCP server configurations are in mcp_config.json:
google-gemini-mcp — Gemini 2.5 depth search, URL fetch, multimodal analysisai-captcha-bypass — GPT-4o / Gemini-driven CAPTCHA solving (Selenium + Firefox)The forum knowledge base containing forum information and search strategies.
Structure:
{
"categories": {
"Q&A_Search": {
"description": "Suitable for mining deep discussions and real user feedback in comment sections",
"forums": [...]
},
"Edge_Knowledge_Search": {
"description": "Suitable for mining gray/black hat techniques not found in mainstream channels",
"forums": [...]
},
"Deep_Dive_Forums": {
"description": "Deep content that others don't know about",
"forums": [...]
}
},
"search_strategies": {
"Instagram_Growth": {
"keywords": [...],
"recommended_forums": [...],
"focus": "Real feedback in comment sections and gray techniques"
}
}
}
Usage:
Template for user preferences and crawling history. Copy to memory.json for first use:
cp memory.json.template memory.json
Structure:
{
"userPreferences": {
"favoriteForums": ["BestBlackHatForum"],
"domains": ["SEO", "Black Hat Techniques", "Traffic Arbitrage"],
"lastUsedDomain": "Black Hat SEO"
},
"forumCredentials": {
"bestblackhatforum.com": {
"username": "",
"password": "",
"lastLogin": "",
"loginCount": 1,
"cookies": null,
"localStorage": null,
"sessionValid": true
}
},
"crawledResources": [],
"antiDetection": {
"viewport": {"width": 1920, "height": 1080},
"userAgent": "",
"timezone": "",
"locale": "",
"randomDelayRange": [5000, 30000],
"maxLoginPerDay": 3,
"reuseSession": true,
"sessionExpiryHours": 24
}
}
Fields Explanation:
userPreferences: Your favorite forums and domainsforumCredentials: Forum login credentials (auto-saved when you provide them)crawledResources: History of crawled URLs (prevents duplicate crawling)antiDetection: Anti-detection configuration (viewport, delays, login limits)Privacy Note: memory.json is in .gitignore and won't be committed to Git. Your credentials are safe locally.
Value signal detection patterns for Hunter Mode.
Patterns Include:
reply_unlock - "Reply to unlock hidden content"hidden_content - "Hidden content / Spoiler blocks"download_link - Mega/Mediafire/Baidu pan linksextract_code - "Password: xxx" / "Code: xxx"attachment - "Download attachment"task_threshold - "Need X posts / X likes to view"Platform-specific download configurations.
Structure:
{
"mega.nz": {
"tool": "gallery-dl",
"args": ["--no-mtime"],
"maxConcurrent": 2
},
"youtube.com": {
"tool": "yt-dlp",
"args": ["-f", "best"]
}
}
Organized resource storage:
resources/
├── downloads/ # Downloaded files (by date)
├── links/
│ ├── mega.json # Mega links index
│ ├── baidu.json # Baidu pan links
│ └── gdrive.json # Google Drive links
├── codes/
│ └── passwords.json # Extract codes
└── index.json # Unified resource index
Use edge-knowledge to mine cost-effective Instagram growth services
Demonstrate the skill's edge knowledge tracking capabilities:
Use edge-knowledge to find the latest black hat SEO techniques for 2026
What you'll get:
Use edge-knowledge to discover profitable affiliate traffic sources
What you'll get:
Use edge-knowledge to find Instagram automation tools that bypass detection
What you'll get:
Use edge-knowledge to find cracked SEO tools and automation software
What you'll get:
The following modules are included for edge-case automation needs:
| Module | Purpose |
|---|---|
rules/access-control.md | Access control & authorization boundary definition |
mcp_config.json | Central MCP server & tool routing configuration |
cdp-scripts/ | CDP-based forum crawlers (see CDP Scripts section below) |
All tools are called automatically by the routing engine when triggered.
Browser automation crawlers using Chrome DevTools Protocol for targeted forum mining:
| Script | Target | What it does |
|---|---|---|
bhw-crawler.mjs | BlackHatWorld | Thread listing crawler |
bhw-crawler-v2.mjs | BlackHatWorld | V2 with value signal detection |
bhw-detail-crawler.mjs | BlackHatWorld | Deep thread content extraction |
bhw-backlink-scrape.mjs | BlackHatWorld | Backlink opportunity scraper |
bhw-vendor-verify.mjs | BlackHatWorld | Vendor/service verification |
cdp-batch-bbhf.mjs | BestBlackHatForum | Batch forum crawler |
cdp-bbhf-threads.mjs | BestBlackHatForum | Thread detail extractor |
cdp-buildersociety.mjs | BuilderSociety | Forum crawler |
cdp-buildersociety-v2.mjs | BuilderSociety | V2 enhanced extraction |
scrape-onehack.mjs | OneHack | Content scraper |
| Rank | Forum | Rating | Cost | Target Audience |
|---|---|---|---|---|
| 1 | GreyHatMafia | 9.5/10 | Free | Everyone |
| 4 | SEO Isn't Dead | 9/10 | Free | SEO Practitioners |
| 6 | BlackHatWorld | 8.5/10 | Free | General Marketing |
| 7 | BestBlackHatForum | 9.5/10 | Free | Recommended: slenderman's posts |
| 8 | 10/10 | Free | Value in comment sections |
Generated reports contain three-layer analysis:
### Edge Knowledge #1: [Knowledge Title]
**Compressed Expression**: [One-sentence summary]
**Easy Explanation**: [Detailed explanation]
**Viewpoints**: @username: "viewpoint content"
**Risk**: [Potential risks]
**Cost**: [Time/money/learning cost]
**Source Link**: [Original link]
### Power User #1: @username
**Username**: username (forum name)
**High-Energy Viewpoints**: "viewpoint1", "viewpoint2"
**Link**: [User profile link]
### Resource #1: [Tool/Service Name]
**Name**: Tool name
**Link**: [Tool link]
**Description**: [Feature description]
**Price**: [Price information]
**Review**: [User review summary]
install.sh auto-detects macOS, Ubuntu/Debian, CentOS/RHEL, and Arch Linux, installs system packages, configures Playwright, and sets up MCP servers.
pip install gallery-dl yt-dlp crawl4ai
User Need → Read memory.json → Recommend Forums → User Confirms →
Check Credentials → Apply Anti-Detection Config → browser-use Crawl →
Claude Analysis → Generate Report → Update memory.json

MIT License
黑咖啡和冰月亮 / Black Coffee & Ice Moon (@weihackings)
Issues and Pull Requests are welcome!
A: This tool is for educational and research purposes only. Always comply with forum terms of service and local laws. Use at your own risk.
A: The tool includes anti-detection features (random delays, session reuse, login limits), but there's always a risk. We recommend:
maxLoginPerDay limit (default: 3)A: Only for forums that require login (like BestBlackHatForum). For public forums (like Reddit), no credentials needed. Your credentials are stored locally in memory.json and never committed to Git.
A: Edit forum_database.json and add your forum to the appropriate category:
{
"name": "YourForum",
"url": "https://yourforum.com",
"rating": 9.0,
"cost": "Free",
"target_audience": "Your Target Audience",
"tags": ["tag1", "tag2"]
}
A: Yes! Fork the repo and create a branch like seo-expert or affiliate. Customize forum_database.json and skill.md for your domain. See our contribution guidelines.
A: This is a community contribution opportunity! You can:
A:
A: Community-driven! Submit PRs to add new forums or update ratings. We review and merge quality contributions regularly.
A: While the tool focuses on "edge" knowledge, you can customize forum_database.json to include white hat forums and adjust search strategies accordingly.
A: Open an issue on GitHub Issues with:
This local skill now ships with a Windows PowerShell OpenCLI layer:
scripts/setup-opencli.ps1scripts/test-opencli.ps1scripts/invoke-opencli.ps1Recommended order:
Browser Bridgescripts/setup-opencli.ps1scripts/invoke-opencli.ps1 for browser-backed OpenCLI commandsbrowser-use, MCP, or custom forum flows when OpenCLI does not cover the target workflowV3 shipped the mechanism layer (freshness fields, GEO category, credibility scoring). The refinement layer below is where the real depth lives — and where community contributions are most welcome. These are hard, open problems:
| Track | Problem | Contribution opportunity |
|---|---|---|
| Source-first mining | Keyword-driven search misses knowledge you didn't know to look for. Can the crawler read a high-signal source first, then let topics emerge from what it finds? | A mode that ingests a source fully and surfaces emergent topics, instead of matching pre-set keywords |
| Gap-driven scheduling | What should you mine next? Today it's manual. Could the system detect where it already has dense raw material but no synthesized output, and prioritize that? | A gap-assessment pass that ranks "ready-to-synthesize" topics by raw-material density |
| Incremental de-dup | Re-crawling a source re-reports everything. It should only surface the net new since last run. | A knowledge-signature store so repeat crawls emit deltas, not full dumps |
| Provenance & contradiction | Credibility scoring is a start. Production-grade verification means tagging every claim's provenance, cross-checking freshness, and flagging contradictions without inflating the report. | A provenance layer that tags each extracted claim (first-hand / cross-verified / unverified) and surfaces conflicts |
| Reader-grade output | Raw reports read like crawl logs. The endgame is wiki-grade knowledge: synthesized, cross-referenced, source-anonymized, no outline-stubs. | A post-processing pass that turns extraction dumps into standalone knowledge articles |
🤝 Pick a track, open a branch (e.g.
source-first,incremental-dedup), and PR it. These are the problems that separate a scraper from a knowledge engine.
1000+ skills curated from Anthropic, Vercel, Stripe, and other engineering teams
A Claude Code skill by Hao (駱君昊) that learns your Facebook voice and auto-posts to FB / IG / Threads / X with a 14-day c
Universal SEO skill for Claude Code. 25 sub-skills + 18 sub-agents covering technical SEO, E-E-A-T, schema, GEO/AEO, bac
Claude Code skill for YouTube creators — channel audits, video SEO, retention scripts, thumbnails, content strategy, Sho