The deployment decision that changes everything
Choosing where a model runs is an infrastructure and governance decision, not a performance optimisation. It changes your privacy posture, your reliability obligations, your compliance evidence trail, and how much your orchestration layer needs to compensate for capability gaps.
Key Facts
- Deployment modes
- Local / open-source · Self-hosted · Managed API
- Dimensions compared
- 12 design and governance dimensions
- Applies to
- All 10 Blueprint principles
- Primary question
- Where does the data go, and what happens when the model fails?
The decision, not the hype
Model choice for agentic systems is about control, privacy, and what breaks — not about benchmark scores. The relevant questions are: Where does the data go? Who can observe the inference? What happens when the model or the provider is unavailable? What compliance evidence can you produce?
Deployment mode comparison
Data boundary
Local / open-source
Data never leaves device or org
Self-hosted
Data stays on org infrastructure
Managed API
Data is processed by provider
Privacy posture
Local / open-source
Strongest — no external exposure
Self-hosted
Strong — depends on infrastructure controls
Managed API
Requires provider trust and DPA
Latency
Local / open-source
Variable — hardware-dependent
Self-hosted
Controlled — predictable on known infrastructure
Managed API
Provider-dependent — varies by region and load
Cost behaviour
Local / open-source
Fixed compute cost
Self-hosted
Fixed compute plus operational overhead
Managed API
Variable — billed per token
Tool calling
Local / open-source
Limited in most open models
Self-hosted
Model-dependent
Managed API
Strong in frontier models
Context window
Local / open-source
Often smaller
Self-hosted
Model-dependent
Managed API
Largest available
Reliability tier
Local / open-source
Experimental to production-capable
Self-hosted
Production-capable
Managed API
Production to enterprise-governed
Offline capability
Local / open-source
Full
Self-hosted
Partial
Managed API
None
Vendor dependency
Local / open-source
None on model weights
Self-hosted
Infrastructure dependency only
Managed API
High — provider availability is a dependency
Compliance evidence
Local / open-source
Org-controlled audit trail
Self-hosted
Org-controlled audit trail
Managed API
Provider attestation required
Review burden
Local / open-source
Higher — no provider audit trail
Self-hosted
Medium
Managed API
Lower — provider handles infrastructure audit
Orchestration needs
Local / open-source
Higher — compensates for capability gaps
Self-hosted
Medium
Managed API
Lower — offloads reasoning complexity