From silent submission to operator-governed in four iterations
An autonomous browser/form-fill agent's submission scope (click, submit, keypress) could fire payment forms, signups, and irreversible posts without human review. Four iterations of architect.validate (the Blueprint's MCP-served pre-prod readiness check) turned it into a governed system where every submission requires an operator decision, with public cert badges to prove it.
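To make "every submission requires an operator decision" concrete, here is a minimal sketch of a fail-closed approval gate. All type and function names (`SubmissionRequest`, `OperatorDecision`, `gateSubmission`) are hypothetical and are not taken from the agent's source; the only things borrowed from the text are the three action kinds in the submission scope.

```typescript
// Hypothetical sketch: names and shapes are assumptions, not the agent's real contract.
type SubmissionAction = "click" | "submit" | "keypress";

interface SubmissionRequest {
  requestId: string;
  action: SubmissionAction;
  targetUrl: string;
}

interface OperatorDecision {
  requestId: string;
  verdict: "approve" | "deny";
  operatorId: string;
}

interface GateResult {
  allowed: boolean;
  reason: string;
}

// Fail closed: the action is allowed only when an explicit approval
// exists and that approval refers to this exact request.
function gateSubmission(
  req: SubmissionRequest,
  decision?: OperatorDecision,
): GateResult {
  if (!decision) {
    return { allowed: false, reason: "no_operator_decision" };
  }
  if (decision.requestId !== req.requestId) {
    return { allowed: false, reason: "decision_request_mismatch" };
  }
  if (decision.verdict !== "approve") {
    return { allowed: false, reason: "operator_denied" };
  }
  return { allowed: true, reason: "operator_approved" };
}
```

The design choice worth noting is the default: a missing or mismatched decision blocks the action, so silence from the operator can never be read as consent.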
Key Facts
- Validator iterations: 4 prod-MCP runs
- Score trajectory: 68/C → 100/A
- Production blockers closed: 4 of 4 P0
- Cert outcome: confirmed_production_ready
- Doctrine compliance: 10 / 10 principles aligned
Validator trajectory
Four iterations, every run ID public
Each iteration was a real prod-MCP architect.validate call. Iter2 closed two of the four P0 blockers, but the score plateaued at Iter3 because the architect's prompt-deduplication judged the change-summary payload too similar to the prior baseline. Iter4 re-fired with surgical lease-fence + bidirectional-audit changes, and the score jumped 26 points to 100/A.
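The text does not spell out what the lease-fence change looks like. One common reading is a fencing token: each lease grant carries a monotonically increasing number, and the side that performs the irreversible action rejects any token older than the newest one it has seen. The sketch below illustrates that reading only; `LeaseStore`, `FencedExecutor`, and their methods are hypothetical names, and the in-memory store stands in for whatever durable storage the real agent uses.

```typescript
// Hypothetical sketch of a lease with a fencing token (assumed interpretation).
interface Lease {
  holder: string;
  token: number; // strictly increasing across grants
  expiresAt: number; // epoch milliseconds
}

class LeaseStore {
  private current: Lease | null = null;
  private nextToken = 1;

  // Grants a lease only if none is currently live; returns null otherwise.
  acquire(holder: string, now: number, ttlMs: number): Lease | null {
    if (this.current && this.current.expiresAt > now) {
      return null;
    }
    this.current = { holder, token: this.nextToken, expiresAt: now + ttlMs };
    this.nextToken += 1;
    return { ...this.current };
  }
}

class FencedExecutor {
  private highestToken = 0;

  // Rejects work from a holder whose lease has been superseded,
  // even if that holder still believes its lease is valid.
  submit(token: number): boolean {
    if (token < this.highestToken) {
      return false;
    }
    this.highestToken = token;
    return true;
  }
}
```

The point of the fence is that expiry alone is not enough: a paused or slow worker can wake up after its lease lapsed, and only the token check at the executor stops it from firing a stale submission.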
Principle scorecard
Every flagged principle, Iter1 vs Iter4, at a glance
Four principles fired as production_blocker / high_risk on the Iter1 baseline; the Iter4 run closes all of them. The narrative below walks through each one; this table is the scannable summary.
Refactor scope
Iter1 ungoverned vs Iter4 production-ready
Numbers are verbatim from the package source. The agent was already feature-complete at Iter1 (typed governance contract, approval gates, hash-chained audit ledger). The four iterations closed deeper seams: a redaction-aware audit boundary, a durable lease + watchdog, frozen policies, and end-to-end evidence verification.
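For readers unfamiliar with the term, a hash-chained audit ledger stores in each entry the hash of the entry before it, so editing any past entry breaks every later link. The following is a minimal sketch of that idea, not the package's implementation: the entry shape, the `appendEntry` and `verifyLedger` helpers, and the all-zero genesis value are assumptions. It uses SHA-256 from Node's built-in `node:crypto` module.

```typescript
import { createHash } from "node:crypto";

// Hypothetical entry shape; the real ledger's fields are not given in the text.
interface AuditEntry {
  index: number;
  event: string;
  prevHash: string;
  hash: string;
}

// Assumed placeholder for "no previous entry".
const GENESIS_HASH = "0".repeat(64);

function entryHash(index: number, event: string, prevHash: string): string {
  // JSON-encode the fields so their boundaries are unambiguous before hashing.
  return createHash("sha256")
    .update(JSON.stringify([index, event, prevHash]))
    .digest("hex");
}

// Returns a new ledger with one entry appended; the input is not mutated.
function appendEntry(ledger: readonly AuditEntry[], event: string): AuditEntry[] {
  const last = ledger[ledger.length - 1];
  const prevHash = last ? last.hash : GENESIS_HASH;
  const index = ledger.length;
  const hash = entryHash(index, event, prevHash);
  return [...ledger, { index, event, prevHash, hash }];
}

// Recomputes every hash and checks every link from the genesis value forward.
function verifyLedger(ledger: readonly AuditEntry[]): boolean {
  let prevHash = GENESIS_HASH;
  for (let i = 0; i < ledger.length; i++) {
    const entry = ledger[i];
    if (!entry || entry.index !== i || entry.prevHash !== prevHash) {
      return false;
    }
    if (entry.hash !== entryHash(entry.index, entry.event, entry.prevHash)) {
      return false;
    }
    prevHash = entry.hash;
  }
  return true;
}
```

A chain like this only detects tampering by someone who cannot rewrite the whole ledger; end-to-end evidence verification of the kind the case study mentions would typically also anchor the latest hash somewhere the writer does not control.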
Before / After
Validator output
What the validator found
The Blueprint MCP ran architect.validate against the Iter1 baseline. It identified four P0 production blockers, each one a path for an irreversible action to fire without operator authority.
How each P0 was resolved
What the iterations fixed
Each iteration closed at least one production blocker. Iter4 is the first run that crosses the 80/A/production_ready threshold, and every P0 finding above is now aligned at 100/100.
Re-validation result
After Iter4: architect.certify confirmed production_ready
The Iter4 implementation was re-validated and then certified in the same prod-MCP session. Cert outcome: confirmed_production_ready. The badge is live and the readiness review is publicly inspectable.
Calculated ROI
Same metrics, same calculator powering every case study
Derived deterministically from this case study's profile (4 iterations, irreversible-financial blast radius, autonomous workflow, under compliance) via /lib/case-study-roi.ts. The numbers are directly comparable to those in the other case studies.