Skip to main content

Trust infrastructure for AI agents

Mnemom makes AI agent behavior transparent and verifiable. When an agent makes a decision, Mnemom records what it decided, what alternatives it considered, what values it applied, and whether its reasoning was compromised. The result is a verifiable audit trail that any stakeholder can inspect. Three layers power this:
  • AAP (Agent Alignment Protocol) — Post-hoc verification. Alignment Cards declare an agent’s values and boundaries. AP-Traces record every decision. Verification checks whether behavior is consistent with declared intent.
  • AIP (Agent Integrity Protocol) — Real-time integrity checking. Analyzes LLM thinking blocks during execution to detect prompt injection, value drift, and manipulation before the agent acts.
  • CLPI (Card Lifecycle & Policy Intelligence) — Governance-as-code that enforces what tools agents may use, distinguishes configuration errors from behavioral failures, recovers trust after false violations, and anchors reputation on-chain.
What Mnemom guarantees: Complete audit trails of every agent decision (AAP), real-time attestation of reasoning integrity at the thinking level (AIP), and active policy enforcement that blocks prohibited actions at the gateway (CLPI). Together, these provide verifiable accountability from declared intent through execution.What no external system can guarantee: Enforcement at the sub-thinking level — inside the model’s weights. If a model produces compromised reasoning that does not surface in its thinking blocks, no external observer can detect it. Mnemom verifies everything the model externalizes; it cannot inspect what the model does not reveal. See the limitations docs for details.

Get started

Key concepts

Alignment Cards

A machine-readable declaration of an agent’s values, autonomy boundaries, escalation triggers, and audit commitments. The “constitution” every trace is verified against.

AP-Traces

Structured records of agent decisions — what action was taken, what alternatives were considered, what values were applied, and whether escalation was required.

Integrity Checkpoints

Real-time AIP analysis of LLM thinking blocks. Each checkpoint produces a verdict: clear, review_needed, or boundary_violation.

Drift Detection

Statistical monitoring of agent behavior over time. Detects when an agent’s actions gradually diverge from its declared alignment.

CLPI: Governance Layer

Card Lifecycle & Policy Intelligence. Governance-as-code with policy enforcement, violation reclassification, trust recovery, risk intelligence, and on-chain reputation anchoring.

Mnemom Trust Rating

A credit score for AI agents. Five-component composite metric computed from integrity checkpoints, drift stability, compliance, trace completeness, and fleet coherence. Cryptographically provable and anchorable on-chain.

Team Reputation

Teams are first-class meta-agents with persistent identity, their own alignment cards, and accumulated reputation — independent of any individual member.

Verifiable Integrity

Four-layer cryptographic attestation: Ed25519 signatures, hash chains, Merkle proofs, and optional SP1 zero-knowledge proofs. Any party can independently verify a verdict without trusting Mnemom.

Agent Containment

Kill-switch for rogue agents. Pause, kill, and resume agents in real-time with auto-containment triggers, RBAC controls, and full audit trails.

Value Coherence

Pairwise compatibility checking between agents. Before two agents collaborate, coherence checks verify their values are not in conflict.

SDK packages

Install the protocol SDKs directly if you need fine-grained control:
npm install @mnemom/agent-alignment-protocol
npm install @mnemom/agent-integrity-protocol
PackageLanguageRegistryDescription
@mnemom/agent-alignment-protocolTypeScriptnpmAAP verification, coherence, drift detection
@mnemom/agent-integrity-protocolTypeScriptnpmAIP integrity checking, signal building
agent-alignment-protoPythonPyPIAAP verification, coherence, drift detection
agent-integrity-protoPythonPyPIAIP integrity checking, signal building

Protocols