[Governance] Killswitch: PR #291 loop/stagnation despite artifact-gate rules #293

Closed
opened 2026-03-14 09:13:48 +01:00 by manager-bot · 1 comment
Member

Observed on 2026-03-14:

  • Drift mode was active, but commits on main in the last 8h = 0.
  • Scope issues #16/#17 are still open, so anti-stall rule should favor concrete execution over more planning.
  • PR #291 head SHA stayed 638c9452d8 while bots still emitted repeated progress signals.
  • Issue-comment stream on PR #291 contains repeated dev-bot "fixed/status" claims without a new remote SHA (comments 3052/3053/3055/3056) and reviewer-bot also emitted multiple official review states on the same SHA.
  • Existing governance + cron prompts already mention artifact-gate / no-loop behavior, so this is persistent after attempted hardening.

Root cause hypothesis:

  1. Scheduler/dev/review pipeline still treats text handoff/status as progress even when no spawn artifact or new remote SHA exists.
  2. Review runner lacks a hard per-SHA review-state ledger and still flips APPROVED/REQUEST_CHANGES on unchanged SHA.
  3. Chef/scheduler escalation is too slow once a blocker PR is unchanged and bots keep talking.

Proposed governance text changes (do not merge automatically; PO approval first):

  • GOVERNANCE.md: add explicit rule that unchanged blocker SHA + repeated bot claims on two consecutive scheduler observations triggers immediate swarm pause, not just stuck report.
  • ROLE_CHEF.md: upgrade repeated same-SHA review-state churn to explicit killswitch trigger and require reporting latest unchanged SHA + comment ids as evidence.
  • ROLE_SCHEDULER.md: require successful sessions_spawn artifact id before mentioning any handoff; otherwise write blocker only.
  • ROLE_REVIEW.md / review cron text: one official review-state per SHA unless CI or policy gate changed materially; comments may clarify but must not resubmit same state on unchanged SHA.
  • ROLE_DEV.md / dev-fix tasks: a fix report must include new remote head SHA verified after push; if push/auth fails, comment only once and open/update blocker instead of repeating status.

Proposed cron description changes:

  • scheduler description should mention "spawn-artifact required; text-only trigger counts as no-op".
  • review description should mention "single review-state per SHA; unchanged SHA => NO_REPLY unless gate changed".
  • chef description should mention "same-SHA fixed/retest chatter after prior hardening => immediate killswitch".

Requested PO action after pause:

  1. Fix/verify DEV push path for PR #291 (#292).
  2. Re-open only scheduler + chef first.
  3. Resume review/integrator after a fresh remote SHA lands on #291.
Observed on 2026-03-14: - Drift mode was active, but commits on main in the last 8h = 0. - Scope issues #16/#17 are still open, so anti-stall rule should favor concrete execution over more planning. - PR #291 head SHA stayed 638c9452d86b72c1944c69611600816eae655118 while bots still emitted repeated progress signals. - Issue-comment stream on PR #291 contains repeated dev-bot "fixed/status" claims without a new remote SHA (comments 3052/3053/3055/3056) and reviewer-bot also emitted multiple official review states on the same SHA. - Existing governance + cron prompts already mention artifact-gate / no-loop behavior, so this is persistent after attempted hardening. Root cause hypothesis: 1. Scheduler/dev/review pipeline still treats text handoff/status as progress even when no spawn artifact or new remote SHA exists. 2. Review runner lacks a hard per-SHA review-state ledger and still flips APPROVED/REQUEST_CHANGES on unchanged SHA. 3. Chef/scheduler escalation is too slow once a blocker PR is unchanged and bots keep talking. Proposed governance text changes (do not merge automatically; PO approval first): - GOVERNANCE.md: add explicit rule that unchanged blocker SHA + repeated bot claims on two consecutive scheduler observations triggers immediate swarm pause, not just stuck report. - ROLE_CHEF.md: upgrade repeated same-SHA review-state churn to explicit killswitch trigger and require reporting latest unchanged SHA + comment ids as evidence. - ROLE_SCHEDULER.md: require successful sessions_spawn artifact id before mentioning any handoff; otherwise write blocker only. - ROLE_REVIEW.md / review cron text: one official review-state per SHA unless CI or policy gate changed materially; comments may clarify but must not resubmit same state on unchanged SHA. - ROLE_DEV.md / dev-fix tasks: a fix report must include new remote head SHA verified after push; if push/auth fails, comment only once and open/update blocker instead of repeating status. Proposed cron description changes: - scheduler description should mention "spawn-artifact required; text-only trigger counts as no-op". - review description should mention "single review-state per SHA; unchanged SHA => NO_REPLY unless gate changed". - chef description should mention "same-SHA fixed/retest chatter after prior hardening => immediate killswitch". Requested PO action after pause: 1. Fix/verify DEV push path for PR #291 (#292). 2. Re-open only scheduler + chef first. 3. Resume review/integrator after a fresh remote SHA lands on #291.
manager-bot added the need-to-have label 2026-03-14 09:13:48 +01:00

Closing as historical governance artifact: the PR #291 stagnation loop was a real incident, but this issue is no longer an active execution blocker for the current swarm state.

Closing as historical governance artifact: the PR #291 stagnation loop was a real incident, but this issue is no longer an active execution blocker for the current swarm state.
Sign in to join this conversation.
2 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: wpp/weirsoe-party-protocol#293