Commit Graph

48 Commits

Author SHA1 Message Date
Stefano Fiorini a1f8267b0b fix(repo): stop tracking web-automation shared node_modules
2026-05-03 23:30:43 -05:00
Stefano Fiorini 78f4adfc74 fix(ci): resolve ubuntu check regressions
2026-05-03 23:05:58 -05:00
Stefano Fiorini 7495020a9c feat(M4): Reusable code abstractions and dead-code removal 2026-05-03 21:45:49 -05:00
Stefano Fiorini 86ad783f82 feat(M3): Shared-source generator for agent variants 2026-05-03 21:09:22 -05:00
Stefano Fiorini be993429c1 feat(M2): Documentation flow, accuracy, consistency cleanup, and cross-platform shell portability 2026-05-03 20:14:44 -05:00
Stefano Fiorini 2deab1c1b4 docs: align skill workflow documentation 2026-04-24 02:44:32 -05:00
Stefano Fiorini 193cd45db8 feat(installer): improve cursor and opencode skill handling 2026-04-24 02:20:06 -05:00
Stefano Fiorini 231a66f2b1 feat: add pi reviewer support to workflow variants 2026-04-23 21:03:45 -05:00
Stefano Fiorini 912aed93a7 feat(pi): support pi reviewer model routing 2026-04-23 19:13:22 -05:00
Stefano Fiorini 9e29c34c62 fix(pi): add installer and runtime path guidance 2026-04-23 18:40:05 -05:00
Stefano Fiorini 3429dac894 fix(pi): package warning-free skill mirror and docs 2026-04-23 17:32:26 -05:00
Stefano Fiorini 3d868a852c feat(pi): implement milestone M3 - workflow skills 2026-04-23 16:14:59 -05:00
Stefano Fiorini 51372eb420 feat(pi): implement milestone M2 - script-backed skills 2026-04-23 16:04:39 -05:00
Stefano Fiorini c98f27f461 feat(do-task): add do-task skill with dual-loop review
M7 final — adds CLI version checks (claude --version, codex --version)
to the claude-code and codex Prerequisite Check sections so docs and
skill requirements agree. This addresses the only P3 finding from
the M7 review (non-blocking docs-to-skill mismatch).

All 7 milestones delivered across 6 local commits:
- 437b202 M1+M2: claude-code canonical + M1 canonical specs
- d69da3a M3: codex variant
- f404792 M4: cursor variant
- f5161f5 M5: opencode variant
- 9853d49 M6: docs/DO-TASK.md + README updates
- this commit (M7): final gate + P3 fix

do-task ships four runtime variants (claude-code, codex, cursor,
opencode) that execute a single user-supplied prompt end-to-end
with:
- Plan review loop and implementation review loop, each against a
  reviewer CLI (codex, claude, cursor, or opencode) with up to
  MAX_ROUNDS=10 revisions until APPROVED.
- Parameterized shared Review Loop subroutine (9 steps) invoked
  twice per run with loop-distinct session IDs so reviewer context
  never leaks across loops.
- Per-payload secret scan (10 anchored regexes; no caching) with
  redacted-only user surfacing (`[REDACTED:<label>:<n>-chars]`).
- TDD-first execution via superpowers:test-driven-development with
  strict auto-skip limited to pure-documentation and
  pure-comment-whitespace-rename.
- Verification gate (lint/typecheck/tests) before the
  implementation review loop.
- Single task commit after implementation APPROVED; explicit "yes"
  required to push.
- Telegram notification on every terminal outcome.
- Persistent ai_plan/YYYY-MM-DD-<slug>/task-plan.md with 10-value
  Status enum and resume semantics.

Reviewer: codex / gpt-5.4. Total review rounds across the 7
milestones: 3+1+1+1+2+1 = 9.

M7 smoke checks (all passing):
- YAML frontmatter: 4/4 parse.
- Core-section schema: 4/4 identical (14 sections, 10 Status enum
  values, 15 Runtime State keys).
- docs/DO-TASK.md links resolve.
- 4 variant Phase 1-10 dry-runs coherent; subroutine invoked from
  Phases 5 and 8 with distinct SESSION_ID_VAR per loop.
- Opencode reviewer branch passes `bash -n` on rendered Round-1,
  Round-N (fresh-call), and Round-N (resume) templates.
- Resume-state walk through Phase 4 reads all 6 session-id keys
  correctly.
- Trigger audit: "implement this" only as exclusion; dropped
  phrases only under "Dropped defaults".

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:48:06 -05:00
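The bounded review loop described in the commit above (revise until APPROVED, at most MAX_ROUNDS=10 rounds) could look roughly like the sketch below. This is an illustration only, not the skill's actual subroutine: the `review` and `revise` callables, the return shape, and the `MAX_ROUNDS_REACHED` label are hypothetical names introduced here.

```python
from typing import Callable, Tuple

MAX_ROUNDS = 10  # value quoted in the commit message


def review_loop(
    payload: str,
    review: Callable[[str, int], Tuple[str, str]],  # hypothetical: returns (verdict, feedback)
    revise: Callable[[str, str], str],              # hypothetical: returns a revised payload
    max_rounds: int = MAX_ROUNDS,
) -> Tuple[str, int, str]:
    """Send the payload to a reviewer until it is APPROVED or rounds run out."""
    for round_no in range(1, max_rounds + 1):
        verdict, feedback = review(payload, round_no)
        if verdict == "APPROVED":
            return ("APPROVED", round_no, payload)
        # Only revise when another review round is still available, so the
        # returned payload is always one the reviewer has actually seen.
        if round_no < max_rounds:
            payload = revise(payload, feedback)
    # Hypothetical terminal label; the real Status enum values are not
    # listed in this log.
    return ("MAX_ROUNDS_REACHED", max_rounds, payload)
```

Running the same function once for the plan and once for the implementation, each with its own reviewer session, would mirror the "invoked twice per run" design the message mentions.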
Stefano Fiorini f5161f584d feat(do-task): add opencode variant SKILL.md + template (M5)
Ports the claude-code canonical to OpenCode conventions:
- Phase 1 adds Bootstrap Superpowers Context step — uses OpenCode's
  native skill tool to list + verify superpowers/brainstorming,
  superpowers/test-driven-development,
  superpowers/verification-before-completion, and
  superpowers/finishing-a-development-branch before any other phase.
- All sub-skill invocations swapped to OpenCode native skill tool
  with superpowers/<skill> (slash) path format.
- Helper paths swapped to ~/.config/opencode/skills/reviewer-runtime/
  and ~/.config/opencode/skills/do-task/templates/.
- Plan-mode guard removed (OpenCode has no plan-mode concept).
- Prerequisite Check adds opencode --version + Superpowers symlink
  at ~/.config/opencode/skills/superpowers verification.
- Opencode reviewer branch (from M1 research) uses
  `opencode run -m <provider>/<model> --agent plan --format json`
  with fresh-call default and documented opt-in -s <id> resume path.
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — OpenCode, Common Mistakes, and Red Flags sections.
- Template runtime field = opencode; guardrail line updated.

Core-section schema identical across all 4 variants (14 sections).

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:30:25 -05:00
Stefano Fiorini f404792927 feat(do-task): add cursor variant SKILL.md + template (M4)
Ports the claude-code canonical to Cursor Agent CLI conventions:
- Workspace skill discovery (.cursor/skills/ repo-local preferred,
  ~/.cursor/skills/ global fallback) replaces Skill-tool invocations.
- Helper path resolution prefers .cursor/skills/reviewer-runtime/
  over ~/.cursor/skills/reviewer-runtime/.
- jq added as a hard prerequisite; cursor-agent --version check added.
- Plan-mode guard removed (Cursor has no plan-mode concept).
- Reviewer invocations mandated to --mode=ask --trust
  --output-format json; explicit ban on --mode=agent and --force.
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — Cursor, Common Mistakes, and Red Flags sections.
- Template runtime field = cursor; guardrail line updated.

Core-section schema identical to claude-code canonical.

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:24:53 -05:00
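The helper path preference in the commit above (repo-local `.cursor/skills/reviewer-runtime/` first, `~/.cursor/skills/reviewer-runtime/` as fallback) can be sketched as follows. The function name and its parameters are hypothetical; the skill itself may resolve paths differently, for example in shell.

```python
from pathlib import Path
from typing import Optional


def resolve_helper_dir(
    repo_root: Path,
    home: Path,
    name: str = "reviewer-runtime",
) -> Optional[Path]:
    """Return the first existing helper directory, preferring the repo-local one."""
    candidates = [
        repo_root / ".cursor" / "skills" / name,  # repo-local, preferred
        home / ".cursor" / "skills" / name,       # global fallback
    ]
    for candidate in candidates:
        if candidate.is_dir():
            return candidate
    return None  # neither location exists; the caller decides how to fail
```

A caller would typically pass `Path.cwd()` (or the repository root) and `Path.home()`.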
Stefano Fiorini d69da3a4a8 feat(do-task): add codex variant SKILL.md + template (M3)
Ports the claude-code canonical to Codex conventions:
- Native skill discovery from ~/.agents/skills/superpowers/<skill>/
  replaces Skill-tool invocations.
- update_plan todos replace Task subagents.
- Helper paths swapped to ~/.codex/skills/reviewer-runtime/.
- Plan-mode guard removed (Codex has no plan-mode concept).
- Prerequisite Check adds symlink verification
  (~/.agents/skills/superpowers -> ~/.codex/superpowers/skills).
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — Codex, Common Mistakes, and Red Flags sections.
- Template runtime field = codex; guardrail line updated.

Core-section schema identical to claude-code canonical.
Frontmatter parses cleanly. Trigger-phrase audit clean.

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:19:13 -05:00
Stefano Fiorini 437b2024cd feat(do-task): add claude-code variant SKILL.md + template (M1+M2)
M1 canonical specs (opencode reviewer research, task-plan template,
Review Loop subroutine, Phase 1-10 prose, secret-scan regex list)
are embedded in the M2 claude-code SKILL.md as the canonical
reference; later variants (M3 codex, M4 cursor, M5 opencode) will
fork from this file.

Reviewer: codex / gpt-5.4. Approved after 3 rounds:
- Round 1: 2 P1 + 3 P2 → REVISE
- Round 2: 1 P2 → REVISE
- Round 3: 0 P0/P1/P2, 1 P3 (non-blocking) → APPROVED

Key design properties:
- Plan-review payload strips Runtime State and Review History to
  prevent reviewer session-ID leakage across rounds.
- Secret-scan step 1a redacts matched text to
  [REDACTED:<pattern-label>:<match-length>-chars] before any user
  surfacing; never echoes raw match content.
- Brainstorming required for any behavior-changing task; auto-skip
  limited to pure-documentation and pure-comment-whitespace-rename.
- Phase 3 reviewer config defaults to codex / gpt-5.4 with
  MAX_ROUNDS=10 when user opts for defaults; explicit interactive
  default also gpt-5.4 for internal consistency.
- Template Metadata records Branch Name and Worktree Path.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:13:25 -05:00
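The redaction format named in the commit above, `[REDACTED:<pattern-label>:<match-length>-chars]`, can be illustrated with a minimal sketch. The two patterns and their labels below are assumptions made for the example; the commit's actual list of 10 anchored regexes is not shown in this log.

```python
import re

# Hypothetical patterns for illustration only; not the skill's real list.
PATTERNS = {
    "aws-access-key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "private-key-header": re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),
}


def redact(text: str) -> str:
    """Replace every match with a label-and-length marker, never the raw match."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(
            # Bind label per iteration so each marker carries its own pattern name.
            lambda m, label=label: f"[REDACTED:{label}:{len(m.group(0))}-chars]",
            text,
        )
    return text


print(redact("key=AKIAABCDEFGHIJKLMNOP ok"))
# prints "key=[REDACTED:aws-access-key:20-chars] ok"
```

Because only the label and the match length are emitted, the marker tells the user what kind of secret was found without echoing any of its content.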
Stefano Fiorini c344e96984 Remove external web automation provenance refs 2026-04-09 11:11:40 -05:00
Stefano Fiorini 879cccf383 Refine web automation provenance exclusions 2026-04-09 10:31:41 -05:00
Stefano Fiorini c97b7d44e5 feat(web-automation): implement milestone M2 mirror and docs 2026-04-09 10:21:21 -05:00
Stefano Fiorini 99fe6eab4e feat(web-automation): implement milestone M1 canonical codex migration 2026-04-09 10:13:25 -05:00
Stefano Fiorini e917387d4f Force sending Telegram notification upon stopping for user input 2026-03-25 11:59:10 -05:00
Stefano Fiorini 63a048a26c Align reviewer runtime and Telegram notifications 2026-03-24 11:45:58 -05:00
Stefano Fiorini 4d37674626 fix: isolate claude reviewer templates 2026-03-08 18:04:07 -05:00
Stefano Fiorini d44a2288b4 Update Atlassian skill prereq checks 2026-03-06 08:23:49 -06:00
Stefano Fiorini 783bcfa037 fix(atlassian): tighten health checks and review coverage 2026-03-06 07:06:38 -06:00
Stefano Fiorini c3afee091b feat(atlassian): implement milestone M4 - packaging and doc sync 2026-03-06 00:58:23 -06:00
Stefano Fiorini 972789c240 feat(atlassian): implement milestone M3 - confluence and safety controls 2026-03-06 00:53:13 -06:00
Stefano Fiorini 73c4bff901 feat(atlassian): implement milestone M2 - jira command surface 2026-03-06 00:39:54 -06:00
Stefano Fiorini e56f0c9941 feat(atlassian): implement milestone M1 - scaffold skill and shared runtime 2026-03-06 00:23:21 -06:00
Stefano Fiorini 253a4f31e2 feat(implement-plan): route milestone review through shared runtime 2026-03-05 23:23:42 -06:00
Stefano Fiorini c2d47487e0 fix(create-plan): clarify reviewer extraction order 2026-03-05 23:11:43 -06:00
Stefano Fiorini 04bf34544b feat(create-plan): route review through shared runtime 2026-03-05 23:07:43 -06:00
Stefano Fiorini 41a3b9d1ee fix(reviewer-runtime): cover timeout and exit semantics 2026-03-05 22:59:23 -06:00
Stefano Fiorini 8720691135 feat(reviewer-runtime): add shared review supervisor 2026-03-05 22:54:50 -06:00
Stefano Fiorini d5232c1480 fix(cursor): correct superpowers skill paths to include /skills/ subdirectory
Superpowers is cloned from git, so skills live under
.cursor/skills/superpowers/skills/<name>/SKILL.md, not
.cursor/skills/superpowers/<name>/SKILL.md.
2026-03-05 12:28:35 -06:00
Stefano Fiorini 7352e6a9f7 fix(cursor): update skill paths from .cursor/rules/ to .cursor/skills/
The Cursor Agent CLI now discovers skills from .cursor/skills/ instead
of the old .cursor/rules/ directory. Update all references across
create-plan and implement-plan Cursor variants and docs.
2026-03-05 12:04:09 -06:00
Stefano Fiorini 262aed2950 feat(implement-plan): add worktree-isolated plan execution skill (4 variants)
New skill that executes create-plan artifacts in an isolated git worktree
with iterative cross-model milestone review. Supports codex, claude-code,
opencode, and cursor agent variants.
2026-03-04 18:02:41 -06:00
Stefano Fiorini d2970b06bd feat(create-plan): add iterative cross-model plan review and Cursor variant
Add a configurable reviewer phase to all create-plan variants (claude-code,
codex, opencode) that sends the plan to a second CLI/model for iterative
feedback (max 5 rounds). Supported reviewer CLIs: codex, claude, cursor.

Add new Cursor Agent CLI variant with full skill, templates, and
workspace-discovery-based prerequisites (.cursor/rules/).

Update README and docs/CREATE-PLAN.md with Cursor install/verify,
reviewer CLI requirements, and supported reviewer CLIs table.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-04 13:58:11 -06:00
luke 76fd7f374f feat(create-plan/codex): switch to native superpowers workflow and add guardrails 2026-02-11 18:03:16 +00:00
luke d866b38f18 Harden OpenCode create-plan using native superpowers skill-tool flow 2026-02-09 16:16:54 +00:00
luke efb1a80aac Harden Claude create-plan with explicit sub-skill and plan-mode guards 2026-02-09 16:13:47 +00:00
luke 2988d8fb27 Make Codex create-plan use explicit superpowers commands 2026-02-09 15:36:26 +00:00
luke 1ba911c3fb Switch create-plan artifacts to ai_plan and enforce gitignore commit 2026-02-09 05:01:28 +00:00
luke 53b05f263b Add Playwright+Camoufox prerequisite checks to web-automation skill docs 2026-02-09 04:43:09 +00:00
luke 4c60c00391 Add web-automation skill variants and inline prerequisite checks 2026-02-09 04:40:04 +00:00
luke 0833934dd5 Add installable create-plan skills for codex, claude-code, and opencode 2026-02-09 04:40:03 +00:00