Agent-Independent Containment

The critical distinction between containment the agent can degrade and containment it cannot.

The Key Distinction

CCP's most operationally useful concept is the distinction between two types of containment:

Agent-independent containment consists of mechanisms whose failure probability is both provably bounded and causally independent of the agent being contained. The agent cannot influence, persuade, circumvent, or degrade these mechanisms through its outputs, regardless of how capable it becomes.

Agent-influenceable containment consists of mechanisms that depend, at some point in their enforcement chain, on a human judgment, an off-chain process, or a system the agent can interact with.

Examples

Mechanism	Type	Why
Formally verified smart contract spending limit	Agent-independent	Agent cannot modify the contract
TEE attesting model version	Agent-independent	Hardware-enforced, outside agent's causal reach
HSM enforcing signing policies	Agent-independent	Physically separated from agent execution
MPC multi-party approval	Agent-independent	Requires independent parties to authorize
Human oversight / approval	Agent-influenceable	Human can be persuaded by the agent
Reputation gates	Agent-influenceable	Agent can game the scoring system
API rate limits via mutable config	Agent-influenceable	Configuration can be changed

Why This Matters

The design constraint: even if all agent-influenceable layers are compromised, the remaining agent-independent layers must bound the loss to a level that the economic backstop can absorb.

This is what the certificate attests to. The agent_independent: bool field on every constraint is the most important field in the entire schema.

A Caveat

Even formally verified logic is never truly zero-risk. "Provably bounded" means bounded under specific assumptions — the proof system is sound, the compiler is correct, the underlying chain is secure. Each assumption has its own epsilon. Formal verification drives specific failure probabilities as close to zero as mathematics allows. But "as close as mathematics allows" is not zero.

Agent-Independent Containment

The Key Distinction

Examples

Why This Matters

A Caveat

On this page