Commit Graph

65 Commits

Author SHA1 Message Date
Stefano Fiorini 86ad783f82 feat(M3): Shared-source generator for agent variants 2026-05-03 21:09:22 -05:00
Stefano Fiorini be993429c1 feat(M2): Documentation flow, accuracy, consistency cleanup, and cross-platform shell portability 2026-05-03 20:14:44 -05:00
Stefano Fiorini 0443381aa0 feat(M1): Baseline verification, quality tooling foundation, and current-state report 2026-05-03 19:26:55 -05:00
Stefano Fiorini 2deab1c1b4 docs: align skill workflow documentation 2026-04-24 02:44:32 -05:00
Stefano Fiorini 193cd45db8 feat(installer): improve cursor and opencode skill handling 2026-04-24 02:20:06 -05:00
Stefano Fiorini d62899308a feat(installer): support pi package remove and update 2026-04-23 22:55:41 -05:00
Stefano Fiorini 8ea6d08e77 docs: add pi manual install guidance 2026-04-23 21:53:27 -05:00
Stefano Fiorini 3966b77623 chore: package skill manager resources 2026-04-23 21:37:15 -05:00
Stefano Fiorini 494e29f797 docs: add skill manager documentation 2026-04-23 21:27:52 -05:00
Stefano Fiorini f01721a45b feat: add multi-client skill manager 2026-04-23 21:21:31 -05:00
Stefano Fiorini 231a66f2b1 feat: add pi reviewer support to workflow variants 2026-04-23 21:03:45 -05:00
Stefano Fiorini ce4746b769 test: add reviewer support verification 2026-04-23 20:49:09 -05:00
Stefano Fiorini 912aed93a7 feat(pi): support pi reviewer model routing 2026-04-23 19:13:22 -05:00
Stefano Fiorini 9e29c34c62 fix(pi): add installer and runtime path guidance 2026-04-23 18:40:05 -05:00
Stefano Fiorini 3429dac894 fix(pi): package warning-free skill mirror and docs 2026-04-23 17:32:26 -05:00
Stefano Fiorini 0456c51291 docs(pi): implement milestone M6 - validation and surfacing 2026-04-23 16:26:42 -05:00
Stefano Fiorini f2c4d39abd feat(pi): implement milestone M5 - package surface 2026-04-23 16:22:08 -05:00
Stefano Fiorini d7651ad57c docs(pi): implement milestone M4 - extension assessment 2026-04-23 16:17:24 -05:00
Stefano Fiorini 3d868a852c feat(pi): implement milestone M3 - workflow skills 2026-04-23 16:14:59 -05:00
Stefano Fiorini 51372eb420 feat(pi): implement milestone M2 - script-backed skills 2026-04-23 16:04:39 -05:00
Stefano Fiorini 7ba6f90e14 docs(pi): implement milestone M1 - research and shared docs 2026-04-23 15:58:19 -05:00
Stefano Fiorini c98f27f461 feat(do-task): add do-task skill with dual-loop review
M7 final — adds CLI version checks (claude --version, codex --version)
to the claude-code and codex Prerequisite Check sections so docs and
skill requirements agree. This addresses the only P3 finding from
the M7 review (non-blocking docs-to-skill mismatch).

All 7 milestones delivered across 6 local commits:
- 437b202 M1+M2: claude-code canonical + M1 canonical specs
- d69da3a M3: codex variant
- f404792 M4: cursor variant
- f5161f5 M5: opencode variant
- 9853d49 M6: docs/DO-TASK.md + README updates
- this commit (M7): final gate + P3 fix

do-task ships four runtime variants (claude-code, codex, cursor,
opencode) that execute a single user-supplied prompt end-to-end
with:
- Plan review loop and implementation review loop, each against a
  reviewer CLI (codex, claude, cursor, or opencode) with up to
  MAX_ROUNDS=10 revisions until APPROVED.
- Parameterized shared Review Loop subroutine (9 steps) invoked
  twice per run with loop-distinct session IDs so reviewer context
  never leaks across loops.
- Per-payload secret scan (10 anchored regexes; no caching) with
  redacted-only user surfacing (`[REDACTED:<label>:<n>-chars]`).
- TDD-first execution via superpowers:test-driven-development with
  strict auto-skip limited to pure-documentation and
  pure-comment-whitespace-rename.
- Verification gate (lint/typecheck/tests) before the
  implementation review loop.
- Single task commit after implementation APPROVED; explicit "yes"
  required to push.
- Telegram notification on every terminal outcome.
- Persistent ai_plan/YYYY-MM-DD-<slug>/task-plan.md with 10-value
  Status enum and resume semantics.

Reviewer: codex / gpt-5.4. Total review rounds across the 7
milestones: 3+1+1+1+2+1 = 9.

M7 smoke checks (all passing):
- YAML frontmatter: 4/4 parse.
- Core-section schema: 4/4 identical (14 sections, 10 Status enum
  values, 15 Runtime State keys).
- docs/DO-TASK.md links resolve.
- 4 variant Phase 1-10 dry-runs coherent; subroutine invoked from
  Phases 5 and 8 with distinct SESSION_ID_VAR per loop.
- Opencode reviewer branch passes `bash -n` on rendered Round-1,
  Round-N (fresh-call), and Round-N (resume) templates.
- Resume-state walk through Phase 4 reads all 6 session-id keys
  correctly.
- Trigger audit: "implement this" only as exclusion; dropped
  phrases only under "Dropped defaults".

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:48:06 -05:00
Stefano Fiorini 9853d4937b docs(do-task): add DO-TASK.md + README updates (M6)
docs/DO-TASK.md covers:
- Purpose, Requirements (variant-specific prereqs + dependency-
  missing messages per variant), Reviewer CLI Requirements table
  (4 CLIs including opencode with fresh-call default).
- Install (4 subsections: Codex, Claude Code, OpenCode, Cursor).
- Per-variant Verify Installation subsections checking CLI binary,
  SKILL.md, run-review.sh, notify-telegram.sh, Superpowers
  sub-skills, and variant extras (Codex symlink, Cursor jq,
  OpenCode Superpowers ls, Cursor repo-vs-global lookup).
- Key Behavior, Dual Review Loops, Subroutine Steps, Reviewer Output
  Contract, Runtime Artifacts, Persistent Artifact Status enum
  (10 values), Failure Handling.
- Secret Scan (subroutine step 1a; per-payload; no caching) with
  canonical 10-pattern regex list and redaction contract.
- Supported Reviewer CLIs table (4 rows, including opencode).
- Notifications, Template Guardrails (14 core sections + Runtime
  State keys), Variant Hardening Notes (4 subsections), Execution
  Workflow Rules.

docs/README.md adds DO-TASK.md entry.

README.md:
- Skills table adds 4 do-task rows (codex, claude-code, opencode,
  cursor).
- Docs links add "Do-task guide" entry.
- Repository Layout adds do-task/ subdirectory.

Reviewer: codex / gpt-5.4. Approved round 2:
- Round 1: 2 P2 (prereqs inaccurate, Verify Installation incomplete)
  + 1 P3 -> REVISE.
- Round 2: 0 P0/P1/P2/P3 -> APPROVED.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:40:49 -05:00
Stefano Fiorini f5161f584d feat(do-task): add opencode variant SKILL.md + template (M5)
Ports the claude-code canonical to OpenCode conventions:
- Phase 1 adds Bootstrap Superpowers Context step — uses OpenCode's
  native skill tool to list + verify superpowers/brainstorming,
  superpowers/test-driven-development,
  superpowers/verification-before-completion, and
  superpowers/finishing-a-development-branch before any other phase.
- All sub-skill invocations swapped to OpenCode native skill tool
  with superpowers/<skill> (slash) path format.
- Helper paths swapped to ~/.config/opencode/skills/reviewer-runtime/
  and ~/.config/opencode/skills/do-task/templates/.
- Plan-mode guard removed (OpenCode has no plan-mode concept).
- Prerequisite Check adds opencode --version + Superpowers symlink
  at ~/.config/opencode/skills/superpowers verification.
- Opencode reviewer branch (from M1 research) uses
  `opencode run -m <provider>/<model> --agent plan --format json`
  with fresh-call default and documented opt-in -s <id> resume path.
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — OpenCode, Common Mistakes, and Red Flags sections.
- Template runtime field = opencode; guardrail line updated.

Core-section schema identical across all 4 variants (14 sections).

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:30:25 -05:00
Stefano Fiorini f404792927 feat(do-task): add cursor variant SKILL.md + template (M4)
Ports the claude-code canonical to Cursor Agent CLI conventions:
- Workspace skill discovery (.cursor/skills/ repo-local preferred,
  ~/.cursor/skills/ global fallback) replaces Skill-tool invocations.
- Helper path resolution prefers .cursor/skills/reviewer-runtime/
  over ~/.cursor/skills/reviewer-runtime/.
- jq added as a hard prerequisite; cursor-agent --version check added.
- Plan-mode guard removed (Cursor has no plan-mode concept).
- Reviewer invocations mandated to --mode=ask --trust
  --output-format json; explicit ban on --mode=agent and --force.
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — Cursor, Common Mistakes, and Red Flags sections.
- Template runtime field = cursor; guardrail line updated.

Core-section schema identical to claude-code canonical.

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:24:53 -05:00
Stefano Fiorini d69da3a4a8 feat(do-task): add codex variant SKILL.md + template (M3)
Ports the claude-code canonical to Codex conventions:
- Native skill discovery from ~/.agents/skills/superpowers/<skill>/
  replaces Skill-tool invocations.
- update_plan todos replace Task subagents.
- Helper paths swapped to ~/.codex/skills/reviewer-runtime/.
- Plan-mode guard removed (Codex has no plan-mode concept).
- Prerequisite Check adds symlink verification
  (~/.agents/skills/superpowers -> ~/.codex/superpowers/skills).
- Added Required Skill Invocation Rules, Variant Hardening Notes
  — Codex, Common Mistakes, and Red Flags sections.
- Template runtime field = codex; guardrail line updated.

Core-section schema identical to claude-code canonical.
Frontmatter parses cleanly. Trigger-phrase audit clean.

Reviewer: codex / gpt-5.4. Approved round 1 (0 P0/P1/P2/P3).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:19:13 -05:00
Stefano Fiorini 437b2024cd feat(do-task): add claude-code variant SKILL.md + template (M1+M2)
M1 canonical specs (opencode reviewer research, task-plan template,
Review Loop subroutine, Phase 1-10 prose, secret-scan regex list)
are embedded in the M2 claude-code SKILL.md as the canonical
reference; later variants (M3 codex, M4 cursor, M5 opencode) will
fork from this file.

Reviewer: codex / gpt-5.4. Approved after 3 rounds:
- Round 1: 2 P1 + 3 P2 → REVISE
- Round 2: 1 P2 → REVISE
- Round 3: 0 P0/P1/P2, 1 P3 (non-blocking) → APPROVED

Key design properties:
- Plan-review payload strips Runtime State and Review History to
  prevent reviewer session-ID leakage across rounds.
- Secret-scan step 1a redacts matched text to
  [REDACTED:<pattern-label>:<match-length>-chars] before any user
  surfacing; never echoes raw match content.
- Brainstorming required for any behavior-changing task; auto-skip
  limited to pure-documentation and pure-comment-whitespace-rename.
- Phase 3 reviewer config defaults to codex / gpt-5.4 with
  MAX_ROUNDS=10 when user opts for defaults; explicit interactive
  default also gpt-5.4 for internal consistency.
- Template Metadata records Branch Name and Worktree Path.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-22 14:13:25 -05:00
Stefano Fiorini c344e96984 Remove external web automation provenance refs 2026-04-09 11:11:40 -05:00
Stefano Fiorini 879cccf383 Refine web automation provenance exclusions 2026-04-09 10:31:41 -05:00
Stefano Fiorini c97b7d44e5 feat(web-automation): implement milestone M2 mirror and docs 2026-04-09 10:21:21 -05:00
Stefano Fiorini 99fe6eab4e feat(web-automation): implement milestone M1 canonical codex migration 2026-04-09 10:13:25 -05:00
Stefano Fiorini e917387d4f Force sending Telegram notification upon stopping for user input 2026-03-25 11:59:10 -05:00
Stefano Fiorini 63a048a26c Align reviewer runtime and Telegram notifications 2026-03-24 11:45:58 -05:00
Stefano Fiorini 4d37674626 fix: isolate claude reviewer templates 2026-03-08 18:04:07 -05:00
Stefano Fiorini d44a2288b4 Update Atlassian skill prereq checks 2026-03-06 08:23:49 -06:00
Stefano Fiorini 783bcfa037 fix(atlassian): tighten health checks and review coverage 2026-03-06 07:06:38 -06:00
Stefano Fiorini c3afee091b feat(atlassian): implement milestone M4 - packaging and doc sync 2026-03-06 00:58:23 -06:00
Stefano Fiorini 972789c240 feat(atlassian): implement milestone M3 - confluence and safety controls 2026-03-06 00:53:13 -06:00
Stefano Fiorini 73c4bff901 feat(atlassian): implement milestone M2 - jira command surface 2026-03-06 00:39:54 -06:00
Stefano Fiorini e56f0c9941 feat(atlassian): implement milestone M1 - scaffold skill and shared runtime 2026-03-06 00:23:21 -06:00
Stefano Fiorini 30fb7fa7f0 chore(gitignore): ignore .worktrees local worktrees 2026-03-06 00:09:41 -06:00
Stefano Fiorini 253a4f31e2 feat(implement-plan): route milestone review through shared runtime 2026-03-05 23:23:42 -06:00
Stefano Fiorini c2d47487e0 fix(create-plan): clarify reviewer extraction order 2026-03-05 23:11:43 -06:00
Stefano Fiorini 04bf34544b feat(create-plan): route review through shared runtime 2026-03-05 23:07:43 -06:00
Stefano Fiorini 41a3b9d1ee fix(reviewer-runtime): cover timeout and exit semantics 2026-03-05 22:59:23 -06:00
Stefano Fiorini 8720691135 feat(reviewer-runtime): add shared review supervisor 2026-03-05 22:54:50 -06:00
Stefano Fiorini 526101fd23 chore(gitignore): ignore ai_plan local planning artifacts 2026-03-05 22:19:50 -06:00
Stefano Fiorini d5232c1480 fix(cursor): correct superpowers skill paths to include /skills/ subdirectory
Superpowers is cloned from git, so skills live under
.cursor/skills/superpowers/skills/<name>/SKILL.md, not
.cursor/skills/superpowers/<name>/SKILL.md.
2026-03-05 12:28:35 -06:00
Stefano Fiorini 7352e6a9f7 fix(cursor): update skill paths from .cursor/rules/ to .cursor/skills/
The Cursor Agent CLI now discovers skills from .cursor/skills/ instead
of the old .cursor/rules/ directory. Update all references across
create-plan and implement-plan Cursor variants and docs.
2026-03-05 12:04:09 -06:00
Stefano Fiorini 262aed2950 feat(implement-plan): add worktree-isolated plan execution skill (4 variants)
New skill that executes create-plan artifacts in an isolated git worktree
with iterative cross-model milestone review. Supports codex, claude-code,
opencode, and cursor agent variants.
2026-03-04 18:02:41 -06:00