The Spine · 9 layers · 2 tiers

The architecture for running AI agents in production.

A vendor-neutral catalog of nine named layers. Plug in any outside application through governed boundaries, and run the agents you build yourself on a runtime you own.

Ride the line Read the specs on GitHubOpen at the edges, sovereign at the core

85 to 100x

less token usage when tools are discovered progressively instead of dumped into context

405K

tokens a naive MCP server burns before the agent does any real work

~100%

jailbreak success against prompt-only defenses, the case for deterministic governance

64%

improvement from a scoped tool interface over a raw shell, same model and task

Sources: Speakeasy and Kruczek (token reduction), JailbreakBench and Andriushchenko et al. (jailbreak rates), SWE-agent at Princeton NLP (tool interface). All cited in the specs.

Stop 01 · The map

What the modern data platform did for analytics, the Spine does for agents.

Big data got serious when it got an architecture. The warehouse, the data lake, and the lakehouse gave the enterprise one governed place for ingestion, cataloging, lineage, and access control, so analytics could run at petabyte scale without chaos. AI agents are at that same moment now, and the Spine is the enterprise-grade answer: nine versioned, citable specifications that give agents the same rigor. Discovery, coordination, governance, grounded data, an audited registry, and a runtime you own.

And it runs on top of the data estate you already have. The Spine’s grounded-data layer plugs agents directly into your existing platforms, from the cloud warehouse to the Spark lakehouse, governed by your canonical definitions and your row and column entitlements. Your big-data investment becomes the trusted foundation your agents reason over, not a surface they can leak.

Hover any station to expand its layer, or swipe and tap on a phone. Nine layers across two tiers, from your data estate to agents in production.

DatabricksApache SparkLakehouseSnowflakeUnity CatalogdbtBigQueryData warehousesData lakes

The data platforms the Spine grounds your agents in.

Stop 02 · The nine layers

Every layer is a specification.
Open one.

Foundation linethe substrates everything stands on

The temporal substrate. Project state, memory, and a verification-gated record of done that survive the context-window boundary, so the next session picks up the thread without loss.

owns the failure: bad continuityfoundation line

Read the DCS spec on GitHub

The grounding substrate. A canonical semantic model (text to metric, not text to SQL) plus data-level entitlements, so answers are consistent and an agent sees only what its user may see.

owns the failure: bad groundingfoundation line

Private spec · request access

The system of record layer. One continuously-reconciled catalog of every agentic asset, that discovery reads from and governance enforces against. Shadow assets become detectable, not invisible.

owns the failure: bad or missing registryfoundation line

Private spec · request access

The execution substrate. A sovereign, first-party runtime where agents are identity-bound, isolated, ephemeral, and bounded by construction, composing the whole catalog. A specification you own, portable across any substrate.

owns the failure: bad or unbounded executionfoundation line

Private spec · request access

Capability linewhat the agents can do

Surfaces the 5 to 8 tools an agent needs on demand, instead of dumping a thousand into the context window. Semantic entities, a gateway, SLA-aware routing.

owns the failure: bad tool datacapability line

Read the PDS spec on GitHub

Planner, generator, and evaluator are structurally separated, so the checker cannot simply agree with the maker. Coordination that catches its own mistakes.

owns the failure: bad reasoning, bad evaluationcapability line

Read the ACS spec on GitHub

Every external signal (markets, logistics, geopolitics, supplier health) arrives typed and provenance-stamped, so the reasoning that used it is auditable.

owns the failure: bad world datacapability line

Read the ESF spec on GitHub

Composite scoring with confidence bands, tenant-conditioned weights, and signal-version provenance. Not one mystery number.

owns the failure: bad scoringcapability line

Private spec · request access

Every action passes deterministic policy before it reaches the wire. Actions the policy denies are structurally impossible, not merely unlikely. Identity per agent, audit by construction.

owns the failure: bad governancecapability line

Read the AGS spec on GitHub

Click any layer to open it. Five of the nine are public open source; four are private.

Stop 03 · What good looks like

Not a vibe. A bar you can measure against.

Every layer ships with target SLAs. These are the production thresholds the spec holds you to, the line between “we have agents” and “we run agents in production.” One headline target per layer below; the full tables live in each spec.

Target SLAs · all layers foundation capability interchange

AGS

actions ever executed without passing policy first

GDS

agent queries that hit raw data tables directly

ARS

unregistered shadow assets reachable in production

SRS

unbounded or unattributed execution incidents

DCS

false 'task complete' declarations across sessions

ESF

decisions made on expired external signals

PDS

tools loaded into context by default, out of 200+ available

5-8

CRI

scores that declare their method, no mystery numbers

100%

ACS

of seeded defects caught by the adversarial evaluator — detection rate, not rejection volume

≥90%

Notice how many targets are zero. In production these are not aspirations, they are invariants the architecture has to make structurally true.

Stop 04 · Service alerts

When something breaks, you know who dropped the ball.

The catalog turns “the AI broke” into a specific, ownable layer. Click a failure to light up the layer that owns it.

Read the full attribution dictionary — MAST and AgentRx crosswalks, plus the agent postmortem template

Stop 05 · The two doors

Two doors.

There are exactly two ways anything reaches your agent estate, and the Spine governs both.

Door 1

Outside applications plug into the Spine

Any third-party or closed-source agent, tool, or AI application connects through governed boundaries: discovered through one curated surface, every action policy-gated and audited, the data grounded and entitlement-scoped, tracked in one system of record. Best of breed, no lock-in.

third-party app → governed boundary → the Spine

Door 2

Your own agents run on the Spine

For the agents you build yourself, the Sovereign Runtime Spine is the execution model: identity-bound, isolated, bounded by construction, composing the whole catalog. A runtime you own and run on infrastructure you control, portable across any substrate.

your agent → SRS runtime you own → the full catalog

Stop 06 · Take it with you

Two PDFs. One for the architect, one for the exec.

The full white paper for the person who will build against the specs — and the plain-English 101 for the person they need to convince.

For the architect

The Spine — Enterprise Architecture White Paper

The nine layers in depth: the failure surfaces they own, the target SLAs, the convergence evidence, and the build sequence. Version 1.6.

Download the white paper (PDF)

For the exec they report to

Spine 101 — the plain-English explainer

No acronyms unexplained, no diagrams that need a legend. What the nine layers are, why they exist, and what owning them buys you.

Download Spine 101 (PDF)

End of the lineDestination · Your AI agents · in production

The catalog lives on GitHub.

PDS, ACS, ESF, AGS, and DCS are public open source under CC BY 4.0 + MIT. CRI, GDS, ARS, and SRS are held private.

Read PDS on GitHub The failure-attribution dictionary 10 questions for any vendor

The architecture for running AI agents in production.

What the modern data platform did for analytics, the Spine does for agents.

Every layer is a specification.
Open one.

Durable Context Spine

Grounded Data Spine

Agent Registry Spine

Sovereign Runtime Spine

Progressive Discovery Spine

Adversarial Coordination Spine

External Signal Fabric

Composite Risk Index

Agent Governance Spine

Not a vibe. A bar you can measure against.

When something breaks, you know who dropped the ball.

Two doors.

Outside applications plug into the Spine

Your own agents run on the Spine

Two PDFs. One for the architect, one for the exec.

The Spine — Enterprise Architecture White Paper

Spine 101 — the plain-English explainer

The catalog lives on GitHub.

The architecture for running AI agents in production.

What the modern data platform did for analytics, the Spine does for agents.

Every layer is a specification. Open one.

Durable Context Spine

Grounded Data Spine

Agent Registry Spine

Sovereign Runtime Spine

Progressive Discovery Spine

Adversarial Coordination Spine

External Signal Fabric

Composite Risk Index

Agent Governance Spine

Not a vibe. A bar you can measure against.

When something breaks, you know who dropped the ball.

Two doors.

Outside applications plug into the Spine

Your own agents run on the Spine

Two PDFs. One for the architect, one for the exec.

The Spine — Enterprise Architecture White Paper

Spine 101 — the plain-English explainer

The catalog lives on GitHub.

Every layer is a specification.
Open one.