- Rewrite main README as project intro + info pointers - Keep skills table focused on current 2 skills - Add docs/README.md index with links to skill docs - Add detailed docs for gitea-api and web-automation
44 lines
1020 B
Markdown
44 lines
1020 B
Markdown
# web-automation
|
|
|
|
Automated web browsing and scraping using Playwright with Camoufox anti-detection browser.
|
|
|
|
## What this skill is for
|
|
|
|
- Automating web workflows
|
|
- Authenticated session flows (logins/cookies)
|
|
- Extracting page content to markdown
|
|
- Working with bot-protected or dynamic pages
|
|
|
|
## Requirements
|
|
|
|
- Node.js 20+
|
|
- `pnpm`
|
|
- Network access to download browser binaries
|
|
|
|
## First-time setup
|
|
|
|
```bash
|
|
cd ~/.openclaw/workspace/skills/web-automation/scripts
|
|
pnpm install
|
|
npx camoufox-js fetch
|
|
```
|
|
|
|
## System libraries (for OpenClaw Docker builds)
|
|
|
|
```bash
|
|
export OPENCLAW_DOCKER_APT_PACKAGES="ffmpeg jq curl libnss3 libatk1.0-0 libatk-bridge2.0-0 libcups2 libdrm2 libxkbcommon0 libxcomposite1 libxdamage1 libxfixes3 libxrandr2 libgbm1 libasound2"
|
|
```
|
|
|
|
## Common commands
|
|
|
|
```bash
|
|
# Browse a page
|
|
npx tsx browse.ts --url "https://example.com"
|
|
|
|
# Scrape markdown
|
|
npx tsx scrape.ts --url "https://example.com" --mode main --output page.md
|
|
|
|
# Authenticate flow
|
|
npx tsx auth.ts --url "https://example.com/login"
|
|
```
|