- Rewrite main README as project intro + info pointers - Keep skills table focused on current 2 skills - Add docs/README.md index with links to skill docs - Add detailed docs for gitea-api and web-automation
1020 B
1020 B
web-automation
Automated web browsing and scraping using Playwright with Camoufox anti-detection browser.
What this skill is for
- Automating web workflows
- Authenticated session flows (logins/cookies)
- Extracting page content to markdown
- Working with bot-protected or dynamic pages
Requirements
- Node.js 20+
pnpm- Network access to download browser binaries
First-time setup
cd ~/.openclaw/workspace/skills/web-automation/scripts
pnpm install
npx camoufox-js fetch
System libraries (for OpenClaw Docker builds)
export OPENCLAW_DOCKER_APT_PACKAGES="ffmpeg jq curl libnss3 libatk1.0-0 libatk-bridge2.0-0 libcups2 libdrm2 libxkbcommon0 libxcomposite1 libxdamage1 libxfixes3 libxrandr2 libgbm1 libasound2"
Common commands
# Browse a page
npx tsx browse.ts --url "https://example.com"
# Scrape markdown
npx tsx scrape.ts --url "https://example.com" --mode main --output page.md
# Authenticate flow
npx tsx auth.ts --url "https://example.com/login"