Crawl WeChat public account articles, export structured data and Markdown, and automatically download images to bypass hotlink protection | Claude Code skill for crawling WeChat articles
A Claude Code skill for crawling WeChat public account (微信公众号) articles into structured data and clean markdown — with images downloaded locally to bypass hotlink protection.
- Handles lazy-loaded images (`data-src` → `src`)
- Downloads images with a `Referer` header to bypass CDN hotlink protection

Copy this directory to `.claude/skills/crawl-wechat` in your project:
```bash
# Clone into your project's skills directory
git clone https://github.com/gxcsoccer/wechat-article-crawler.git .claude/skills/crawl-wechat
```
Then, in Claude Code, paste a WeChat article URL or say "抓取这篇微信文章" ("crawl this WeChat article").
```bash
# Install dependencies
pip install crawl4ai aiohttp && crawl4ai-setup

# Crawl an article with images
python scripts/crawl_wechat.py "https://mp.weixin.qq.com/s/xxx" \
  --download-images \
  --save-markdown \
  --save-html \
  --output-dir ./output
```
```python
import asyncio

from scripts.crawl_wechat import crawl_wechat_article

article = asyncio.run(crawl_wechat_article(
    "https://mp.weixin.qq.com/s/xxx",
    images_dir="./output/images",
))

print(article["title"])
print(article["markdown"])  # images reference local paths
```
| Field | Description |
|---|---|
| `title` | Article title |
| `author` | Public account name |
| `publish_time` | Publication timestamp |
| `account_desc` | Account description/bio |
| `markdown` | Clean markdown with images |
| `html` | Raw HTML of article body |
| `url` | Final URL after redirects |
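As an illustration of working with the fields above, here is a minimal sketch that persists a returned article dict to disk. The helper name `save_article` and the file names `article.json` and `article.md` are assumptions made for this example; they are not taken from the project and may differ from what its CLI writes.

```python
import json
from pathlib import Path


def save_article(article: dict, output_dir: str) -> dict:
    """Write metadata to article.json and the body to article.md.

    Hypothetical helper: the file names are chosen for this sketch only.
    """
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)

    # Keep the large body fields out of the metadata file.
    meta = {k: v for k, v in article.items() if k not in ("markdown", "html")}
    json_path = out / "article.json"
    # ensure_ascii=False keeps Chinese titles readable in the JSON file.
    json_path.write_text(
        json.dumps(meta, ensure_ascii=False, indent=2), encoding="utf-8"
    )

    md_path = out / "article.md"
    md_path.write_text(article.get("markdown", ""), encoding="utf-8")
    return {"json": json_path, "markdown": md_path}
```

Calling `save_article(article, "./output")` with the dict returned by `crawl_wechat_article` would leave the metadata and the markdown body side by side in `./output`.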
```
usage: crawl_wechat.py [-h] [--save-html] [--save-markdown] [--download-images] [--output-dir DIR] url

positional arguments:
  url                WeChat article URL (mp.weixin.qq.com/s/...)

options:
  --save-html        Save HTML to file
  --save-markdown    Save markdown to file
  --download-images  Download images locally to bypass hotlink protection
  --output-dir DIR   Directory for saved files (default: .)
```
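For readers who want to wrap or extend the command line, the usage text above maps onto a small `argparse` definition. This is a sketch reconstructed from the help output only; the function name `build_parser` is hypothetical, and the project's actual parser may be organised differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Rebuild the documented flags; a sketch, not the project's own code."""
    parser = argparse.ArgumentParser(prog="crawl_wechat.py")
    parser.add_argument("url", help="WeChat article URL (mp.weixin.qq.com/s/...)")
    parser.add_argument("--save-html", action="store_true", help="Save HTML to file")
    parser.add_argument(
        "--save-markdown", action="store_true", help="Save markdown to file"
    )
    parser.add_argument(
        "--download-images",
        action="store_true",
        help="Download images locally to bypass hotlink protection",
    )
    parser.add_argument(
        "--output-dir",
        metavar="DIR",
        default=".",
        help="Directory for saved files (default: .)",
    )
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["https://mp.weixin.qq.com/s/xxx", "--download-images", "--output-dir", "./output"]
)
print(args.download_images, args.save_html, args.output_dir)
```

All three boolean flags default to off, so a bare URL crawls the article without writing any files under this reading of the help text.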
How it works:

- The User-Agent includes `MicroMessenger/8.0.43` so WeChat serves the full article
- `wait_for="css:#js_content"` ensures the body is rendered
- Rewrites `data-src` → `src` on all `<img>` tags before scraping
- `JsonCssExtractionStrategy` targets WeChat's DOM (`#activity-name`, `#js_name`, `#publish_time`, `#js_content`)
- Downloads images from `mmbiz.qpic.cn` with `Referer: https://mp.weixin.qq.com/`, replaces URLs in output
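The two image-related steps described above can be sketched with the standard library alone. This is an illustration under stated assumptions, not the project's code: the project depends on `aiohttp` and appears to rewrite attributes before scraping, whereas this sketch swaps in `re` on an HTML string and `urllib.request` for the request. The names `promote_lazy_src` and `build_image_request` are hypothetical.

```python
import re
import urllib.request

_IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)
_DATA_SRC = re.compile(r'\bdata-src\s*=\s*"([^"]*)"', re.IGNORECASE)
# The lookbehind stops this pattern from matching the "src" inside "data-src".
_SRC = re.compile(r'(?<![-\w])src\s*=\s*"[^"]*"', re.IGNORECASE)


def promote_lazy_src(html: str) -> str:
    """Copy each <img> tag's data-src value into its src attribute."""

    def fix(match: re.Match) -> str:
        tag = match.group(0)
        found = _DATA_SRC.search(tag)
        if not found:
            return tag  # not a lazy-loaded image, leave untouched
        new_src = 'src="%s"' % found.group(1)
        if _SRC.search(tag):
            # Overwrite an existing placeholder src.
            return _SRC.sub(lambda _: new_src, tag, count=1)
        # No src attribute yet: add one right after "<img".
        return tag[:4] + " " + new_src + tag[4:]

    return _IMG_TAG.sub(fix, html)


def build_image_request(url: str) -> urllib.request.Request:
    """Prepare a request that presents the WeChat article site as Referer."""
    return urllib.request.Request(
        url, headers={"Referer": "https://mp.weixin.qq.com/"}
    )
```

Passing the result of `build_image_request(...)` to `urllib.request.urlopen` would perform the download; the request is only constructed here so the sketch runs without network access.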