Real sessions. Real governance. Real results.

Calculated ROI: $50K – $120K / yr

HIGH_RISK → ALIGNED6 passes9 real bugs~140 h replaced

AI Code Review Agent

An auto-merging PR triage. Six passes to audit-bound.

A PR triage agent calling an LLM, auto-applying fixes, auto-merging any PR scoring 7/10 or above. No audit. No approval gate. Six validator passes from HIGH_RISK to ALIGNED, badges public for inspection.

Blueprint Readiness Score

Calculated ROI: $80K – $200K / yr

0/F → 100/A10 iterations4 cert callsCert confirmed

Invoice Payment Manager

From silent transfer fabrication to cert-confirmed, in ten iterations

A 156-line Python script drafting bank transfers, swallowing errors, marking transfers SUCCEEDED without ever talking to a real bank. Ten architect.validate iterations + four architect.certify second-pass reviews on the production MCP. The cert reviewer caught a different load-bearing failure each time. 0/F to 100/A, cert confirmed_production_ready.

Blueprint Readiness Score

Calculated ROI: $120K – $280K / yr

68/C → 100/A4 iterations4 P0 blockers closedCert confirmed

Governed Form-Fill Agent

From silent submission to operator-governed, in four iterations

An autonomous browser/form-fill agent. Submission scope (click, submit, keypress) could fire payment forms, signups, and irreversible posts under a hijacked session. Four architect.validate iterations closed 4 P0 blockers. 68/C → 100/A, cert confirmed.

Blueprint Readiness Score

Calculated ROI: $120K – $280K / yr

35/F → 100/A14 iterationsCert confirmed0 missed blockers

Bridge Self-Audit

The bridge that selects what feeds the validator — audited by the validator

Fourteen iterations of architect.validate against the cohort-bridge orchestrator that bundles applicant repositories for the validator. 35/F → 100/A. The cert reviewer confirmed the production_ready verdict on iter14: no specific missed defect that would cause silent wrong results, crash, or trust-boundary bypass.

Blueprint Readiness Score

Calculated ROI: $80K – $200K / yr

22/F · draft7 P0 blockersP8 sev 95Layer 3 confirmed

Anthropic Substrate Scan

Layer 3: applying the doctrine to claude-agent-sdk-demos

AIDB's cohort-bridge auto-bundled the email-agent SDK glue layer of anthropics/claude-agent-sdk-demos and submitted it to architect.validate. Anthropic publishes these as reference implementations for local development, not production. The validator engaged mechanism-specifically with the substrate the doctrine itself runs on top of. 22/F · high_risk · draft — seven production blockers, P8 (Approvals) at sev 95.

Blueprint Readiness Score

22/ 100· Draft

58/D · draft4 P0 blockers5 alignedReference honesty

A2A Reference Agent

A2A reference agent: what the validator finds in our own example

Single-pass architect.validate run against aidesignblueprint/integrations, the A2A reference example + stdio proxy. 58/D, draft. Four production blockers framed as deliberate scope of a protocol demonstration, one hardening recommendation that rides a small companion fix on the integrations repo. The first case study to publish AUX-pattern annotations explicitly.

Blueprint Readiness Score

58/ 100· Draft