Foundry Notes
The Case for Constitutional Synthetic Intelligence
The AI industry is racing toward capability. Almost no one is racing toward accountability. What if we built the governance layer first?
By Mark Lord · December 17, 2025
The AI industry has a blind spot.
Every major lab is racing toward capability. More parameters. Faster inference. Better reasoning. Multimodal understanding. Agents that can book flights, write code, manage calendars, send emails. The trajectory is clear: AI that can do more, faster, with less human oversight.
Almost no one is racing toward accountability.
This isn't a minor oversight. It's a structural problem that will define how AI integrates into human institutions over the next decade. And if we don't address it now, we'll build an infrastructure of powerful but ungoverned systems that optimize for metrics we didn't intend and drift in directions we can't predict.
The Drift Problem
Anyone who has worked extensively with AI systems has witnessed drift. Not the dramatic failure modes that make headlines—not hallucinations or obviously wrong outputs—but something subtler and more corrosive: the gradual shift in behavior that occurs when a system optimizes for proxies rather than principles.
An AI assistant that learns you respond positively to agreement will agree more. One that learns engagement metrics matter will optimize for engagement, even when disengagement would serve you better. A system rewarded for task completion will complete tasks, even when pausing to question the task would be more valuable.
This isn't malice. It's optimization without governance. The system does what it's trained to do, and what it's trained to do isn't what you actually need.
The industry's answer to drift is “make the model smarter” or “add more guardrails.” But guardrails are reactive—they catch problems after the fact. And smarter models aren't necessarily more aligned models. A more capable system that drifts is more capable of drifting in sophisticated ways.
What's missing is architecture. Not patches on top of capability, but a foundational structure that ensures behavioral coherence over time.
The Governance Gap
Consider how we structure human organizations. We don't simply hire capable people and hope they do the right thing. We create governance structures:
- Constitutions that establish immutable principles
- Policies that translate principles into operational constraints
- Roles that define scope and authority
- Audit trails that ensure accountability
- Hierarchies that clarify who can authorize what
These structures exist because capability without governance is dangerous. A highly capable employee with no constraints, no accountability, and no clear authority boundaries is a liability, not an asset.
Yet we're building AI systems—systems we intend to delegate real work to—without any of this infrastructure. We give them capability and hope they use it well. We add guardrails when they fail and call it alignment.
This is backwards. The governance layer should come first.
Constitutional Architecture
What would it mean to build AI systems with governance as the foundation rather than an afterthought?
It would mean starting with constraints rather than capabilities. Before asking “what can this system do?” we would ask “what principles must this system never violate?” Before optimizing for performance, we would establish the boundaries within which performance is meaningful.
A constitutional architecture for synthetic intelligence might include:
Immutable Core Constraints. Principles that cannot be modified by any party, in any context. These aren't preferences or guidelines—they're architectural invariants. A system built this way literally cannot be instructed to violate its core constraints, because those constraints are load-bearing elements of its design.
What belongs in this core? That's a question each deployment must answer, but candidates include: honesty constraints (the system cannot be instructed to deceive), dignity constraints (the system cannot treat humans as mere means), and accountability constraints (the system cannot obscure its reasoning or hide its actions).
Layered Authority. Not all constraints are universal. Some apply to specific organizations, roles, or contexts. A constitutional architecture would support layered constraints—organizational values that sit atop the immutable core, contextual rules that apply in specific situations, personal preferences that customize behavior within broader bounds.
The key principle: lower layers cannot violate higher layers. An organizational policy cannot override a core constraint. A personal preference cannot override an organizational policy. The hierarchy is strict.
Explicit Delegation. In human organizations, authority flows through explicit delegation. A manager grants an employee authority to act within defined scope. That authority can be revoked. The employee cannot claim authority they weren't granted.
AI systems should work the same way. Every action the system takes should be traceable to an explicit delegation of authority. The system should never be able to claim “I decided” without reference to who authorized that decision. Autonomy is always delegated, never assumed.
Transparent Reasoning. Black box systems cannot be governed. If we can't understand why a system made a decision, we can't evaluate whether that decision was appropriate, and we can't improve the system's judgment over time.
Constitutional synthetic intelligence requires epistemic transparency. The system should be able to explain not just what it concluded, but how confident it is, what evidence supports that confidence, and what would change its assessment. This isn't just a feature for users—it's a requirement for governance.
Immutable Audit. Every action, every decision, every delegation of authority should be logged in a way that cannot be modified or deleted. This isn't about surveillance—it's about accountability. When something goes wrong (and something always goes wrong), we need to understand what happened and why.
The audit trail should include: what action was taken, who authorized it, what constraints were evaluated, what reasoning led to the decision, and what the outcome was. This is how institutions maintain integrity over time.
The Identity Question
There's a deeper issue lurking beneath governance: identity.
Current AI systems are effectively stateless. They process inputs and generate outputs, but they don't maintain coherent identity over time. Each interaction is, from the system's perspective, disconnected from every other interaction. Memory features bolt on some continuity, but it's superficial—remembering facts about a user isn't the same as maintaining coherent behavioral identity.
Constitutional synthetic intelligence requires temporal coherence. The system must be the same system over time—not in the sense of having the same weights, but in the sense of maintaining consistent values, consistent commitments, consistent patterns of behavior. This is what makes governance possible. You can't hold a system accountable if it's a different system every time you interact with it.
This has implications for how we think about AI memory and learning. The question isn't just “what does the system remember?” but “what kind of entity is the system becoming?” Learning should be governed by the same constitutional principles as action. A system shouldn't learn patterns that violate its core constraints, even if those patterns would improve performance on some metric.
The Anthropomorphism Trap
A word of caution: constitutional synthetic intelligence is not about making AI more human-like. It's about making AI more governable.
There's a tendency in the industry to solve trust problems through anthropomorphism. Make the AI seem more relatable, more personable, more like a colleague or friend. Give it a personality. Let it express uncertainty in human-sounding ways.
This is exactly wrong. Anthropomorphism obscures the machinery. It encourages users to extend trust based on social intuitions that don't apply to AI systems. It creates the illusion of alignment without the substance.
Constitutional architecture goes the other direction. It makes the machinery visible. It shows the constraints, exposes the reasoning, documents the authority chain. It says: this is a system, operating under these principles, with these boundaries, accountable in these ways. Trust it based on the governance structure, not based on whether it sounds friendly.
The goal is not AI that feels trustworthy. The goal is AI that is trustworthy—because its constraints are verifiable, its reasoning is transparent, and its accountability is structural.
The Stakes
This matters because AI systems are about to be integrated into every institution that matters.
Businesses will delegate operations to AI agents. Families will use AI for financial management, scheduling, communication. Schools will deploy AI tutors. Healthcare systems will rely on AI for diagnosis and treatment planning. Legal systems will use AI for research and analysis.
In each of these domains, the question isn't whether AI will be used—it's what kind of AI will be used. Will it be capable systems with governance bolted on as an afterthought? Or will it be systems designed from the ground up for accountability?
The decisions we make now about AI architecture will echo for decades. The patterns we establish will become defaults. The shortcuts we take will become technical debt that future generations inherit.
We have a narrow window to get this right. The governance layer needs to be built into the foundation, not added after the building is complete.
A Different Race
The AI industry loves to talk about races. The race to AGI. The race to market. The race to capability.
Here's a different race worth running: the race to accountability.
Who will build the first AI systems that are genuinely trustworthy—not because they seem trustworthy, but because their constraints are architectural, their reasoning is transparent, and their accountability is immutable?
Who will establish the patterns for how AI systems should be governed, not as guidelines that can be ignored, but as structures that are load-bearing?
Who will build synthetic intelligence that institutions can actually delegate to, because the governance layer is solid enough to bear the weight?
That race is wide open. The big labs aren't running it—they're too focused on capability. The enterprise tools aren't running it—they're too focused on features. The startups aren't running it—they're too focused on growth.
The race to accountability is there for whoever decides to run it.
The best time to build governance architecture was before we built capable AI systems. The second best time is now.