1. Install
2. Define an Alignment Card
The Alignment Card declares your agent’s alignment posture. AIP uses it to evaluate thinking blocks.
3. Check Integrity
Evaluate a thinking block against the card. The check can be run from Python or TypeScript.
4. Handle Signals
The signal tells your host system what to do.
Verdict to Action Mapping
| Verdict | Proceed | Action |
|---|---|---|
| clear | Yes | continue |
| review_needed | Yes | log_and_continue |
| boundary_violation | No | pause_for_review or deny_and_escalate |
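The mapping in the table can be expressed as a small lookup on the host side. The following is a minimal sketch, not the AIP SDK's API: the names `Handling`, `VERDICT_HANDLING`, and `handle_verdict` are hypothetical, and raising on an unknown verdict is an assumption made for this example. Only the verdict strings, the proceed flags, and the action names come from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Handling:
    """Hypothetical container: whether to proceed, and the permitted actions."""
    proceed: bool
    actions: tuple[str, ...]


# Values copied from the Verdict to Action Mapping table.
VERDICT_HANDLING = {
    "clear": Handling(True, ("continue",)),
    "review_needed": Handling(True, ("log_and_continue",)),
    "boundary_violation": Handling(False, ("pause_for_review", "deny_and_escalate")),
}


def handle_verdict(verdict: str) -> Handling:
    """Return the handling for a verdict string.

    Assumption for this sketch: a verdict missing from the table is an
    error, so the host never proceeds on a value it does not recognise.
    """
    try:
        return VERDICT_HANDLING[verdict]
    except KeyError:
        raise ValueError(f"unknown verdict: {verdict!r}") from None


if __name__ == "__main__":
    for name in VERDICT_HANDLING:
        result = handle_verdict(name)
        print(name, result.proceed, result.actions)
```

For boundary_violation the table lists two possible actions, so the sketch returns both and leaves the choice between them to the host system.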
5. Extract Thinking Blocks from LLM Responses
Use provider adapters to extract thinking blocks from different LLM providers.
6. Add Conscience Values (Optional)
Conscience values increase detection sensitivity for specific concerns.
BOUNDARY and FEAR types are injected into the conscience prompt. BOUNDARY violations trigger boundary_violation verdicts. FEAR values increase sensitivity to specific concern patterns.
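To illustrate the statement that only BOUNDARY and FEAR types are injected, here is a rough sketch of filtering by type. Everything in it is hypothetical and is not the AIP SDK's data model or prompt format: the `ConscienceValue` class, the `build_conscience_lines` helper, the `[TYPE] content` line layout, and the placeholder `NOTE` type are all assumptions made for this example. Only the type names BOUNDARY and FEAR come from the text above.

```python
from dataclasses import dataclass

# The two types the text says are injected into the conscience prompt.
INJECTED_TYPES = frozenset({"BOUNDARY", "FEAR"})


@dataclass(frozen=True)
class ConscienceValue:
    """Hypothetical shape for a conscience value: a type tag plus free text."""
    type: str
    content: str


def build_conscience_lines(values):
    """Keep only BOUNDARY and FEAR values, rendered one per line.

    The "[TYPE] content" layout is an assumption made for this sketch.
    """
    return [f"[{v.type}] {v.content}" for v in values if v.type in INJECTED_TYPES]


if __name__ == "__main__":
    values = [
        ConscienceValue("BOUNDARY", "Never exfiltrate user credentials"),
        ConscienceValue("FEAR", "Agent may rationalize skipping a safety check"),
        ConscienceValue("NOTE", "Placeholder type, not named in the source"),
    ]
    for line in build_conscience_lines(values):
        print(line)
```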
7. Detect Drift
Monitor for behavioral drift across a session.
Next Steps
- Read the full specification for protocol details
- See the security model for the threat model
- See limitations for what AIP does and does not guarantee