freshcrate
Home > MCP Servers > arifOS

arifOS

ArifOS β€” Constitutional MCP kernel for governed AI execution. AAA architecture: Architect Β· Auditor Β· Agent. Built for the open-source agentic era.

Description

ArifOS β€” Constitutional MCP kernel for governed AI execution. AAA architecture: Architect Β· Auditor Β· Agent. Built for the open-source agentic era.

README

arifOS β€” The Sovereign Constitutional Intelligence Kernel

DITEMPA BUKAN DIBERI β€” Intelligence is forged, not given.

πŸ›οΈ CANONICAL SOURCE OF TRUTH: This repository (ariffazil/arifOS)

This repo holds: doctrine, Floors (F1–F13), AGENTS.md, pyproject.toml, canonical tools, architecture, and canonical index spec.

Runtime truth (what's running now): Live /health and /tools on deployed server(s) β€” see /.well-known/mcp/server.json and /.well-known/agent.json

VERSION: 2026.04.09
STATUS: OPERATIONAL β€” HORIZON III (UNIFIED KERNEL)
AUTHORITY: 888_APEX
CONSTITUTION_SOURCE: tool_registry.json + Philosophy Registry v1.2.0
REGISTRY_HASH: <auto-generated at runtime>
READINESS: 97/100
SEAL: SEAL_20260409_KERNEL_UNIFIED
SOURCE_REPO: https://github.com/ariffazil/arifOS
CANONICAL_INDEX: /.well-known/mcp/server.json

ARCHITECTURE: 9 Governance Tools + 1 Execution Bridge
PHILOSOPHY: 83 quotes, Gβ˜… bands, deterministic selection
ToM: Required structured fields for all governance tools
KERNEL: Unified rCore (INPUT β†’ ORCHESTRATE β†’ OUTPUT)

Table of Contents

  1. What Is arifOS? (30-Second Briefing)
  2. Why Does arifOS Exist?
  3. Who Is This For?
  4. The Core Guarantee
  5. Quick Start (Zero to First Call in 60 Seconds)
  6. Architecture: The Trinity Model (ΔΩΨ)
  7. The 000-999 Metabolic Pipeline
  8. The 13 Constitutional Floors
  9. Verdict System: What Happens to Every Action
  10. Tool Ecosystem (Capabilities, Not Features)
  11. For AI Agents: The Behavioral Contract
  12. For Humans: The Governance Interface
  13. For Machines: The Protocol Specification
  14. Repository Structure: Where Everything Lives
  15. Deployment: Hosted vs. Self-Hosted
  16. Safety Architecture: How arifOS Fails
  17. Telemetry & Observability
  18. Theory of Mind: How arifOS Models Itself
  19. Evolution: How the Constitution Changes
  20. Related Ecosystem
  21. Author & Sovereignty
  22. License & Trust Model
  23. Appendix A: Complete API Reference
  24. Appendix B: Floor Implementation Details
  25. Appendix C: Agent Integration Patterns
  26. Appendix D: Troubleshooting & Diagnostics

Horizon II.1 Upgrade (2026.04.06)

πŸ”„ Horizon II.1 Architectural Shift

Horizon II.1 transitions arifOS from:

  • Code-defined constitutional tools
  • β†’ Data-defined sovereign registry

This enables:

  • Verifiable governance
  • Hash-based integrity checks
  • Runtime/constitution separation
  • Multi-node federation readiness

arifOS has been upgraded to Horizon II.1 β€” ToM-anchored constitutional intelligence with:

  • Theory of Mind (ToM) Integration: All 9 governance tools require structured mental model fields
  • Philosophy Registry v1.2.0: 83 civilizational quotes, Gβ˜… band mapping, deterministic selection
  • 9+1 Architecture: 9 governance tools (think) + 1 execution bridge (act after SEAL)
  • Data-Driven Registry: Canonical tool definitions in tool_registry.json (SSCT)
  • Constitutional Hash Verification: SHA-256 registry integrity at boot time
  • Clean MCP Architecture: 2-term tool names (arifos.init, arifos.judge, etc.)
  • Docker Production Deployment: Full containerization with docker-compose stack
  • Production Prompt Pack v1.0: 13 hardened prompts (000-999) with machine-verifiable schemas
  • ASF-1 Communication Protocol: Structured dual-layer communication for Agent↔Agent, Agent↔Human, and hybrid modes
  • Decision Vector Framework: EMV, NPV Safety, Entropy Ξ”, Gβ˜… score for every decision
  • Automated Validation: CLI tool for constitutional compliance checking
  • OpenAI Apps SDK Integration: ChatGPT-native constitutional health checks with BLS12-381 attestation

Readiness Score: 96/100 (Horizon II.1 β€” ToM-Anchored MCP)


What Is arifOS? (30-Second Briefing)

arifOS is an open-source, MCP-native operating system for running AI agents under a clear, auditable constitution.

Every action β€” every tool call, every reasoning step, every output β€” passes through 13 constitutional "Floors" that check for reversibility, accuracy, safety, and alignment. Actions that fail hard Floors are blocked. Actions that pass receive an immutable audit trail.

Think of it as:

  • For engineers: A governed MCP server with built-in safety rails
  • For institutions: An auditable AI governance layer
  • For agents: A constitutional runtime that defines what you may and may not do
  • For humans: A transparent window into AI decision-making

The one-line promise: arifOS reduces the risk of AI actions by making every decision inspectable, reversible where possible, and bounded by explicit rules.


Why Does arifOS Exist?

The Core Paradox

"The algorithm that governs must itself be governed."

As AI systems gain capability, they need governance. But governance systems are themselves algorithms β€” rules, heuristics, neural networks β€” that can fail, drift, or be gamed. This creates an infinite regress: who governs the governors?

The Answer: Constitutional Physics

arifOS addresses this through constitutional physics β€” invariants that emerge from evolutionary pressure, not authored rules that can be circumvented.

The 13 Floors are not arbitrary commandments. They are survival constraints:

  • Systems that violate reversibility (F1) accumulate irreversible harm
  • Systems that violate truth (F2) lose grounding and hallucinate
  • Systems that violate empathy (F6) become adversarial to their operators
  • Systems that claim consciousness (F10) create confusion and liability

These Floors are discovered, not invented. They represent the boundary between sustainable intelligence and self-destructive systems.

The Three-Body Problem of AI Governance

Every AI action involves three stakeholders:

  1. The Human (values, intent, accountability)
  2. The Constitution (rules, invariants, constraints)
  3. The Machine (execution, capability, optimization)

arifOS models this as the Trinity (ΔΩΨ) β€” three rings that must agree before any action proceeds. No ring can override another. Consensus is required.


Who Is This For?

Primary Users

User Type What You Get From arifOS
ML/AI Engineers A governed runtime for your agents with built-in safety checks, audit logs, and constitutional constraints
Infra/SRE Teams Observable, debuggable agent fleets with clear failure modes and telemetry
Compliance Officers Immutable audit trails, explicit verdicts (SEAL/HOLD/VOID), and documented decision logic
AI Safety Researchers A testbed for constitutional AI with 13 measurable constraints and real-world tool integration
Institutions A governance layer that makes AI actions auditable, bounded, and accountable

Secondary Users

User Type What You Get From arifOS
AI Agents (LLMs) A clear behavioral contract: what you may claim, what you must label, when you must defer
Autonomous Systems A runtime that checks your actions against safety constraints before execution
Human Operators Transparency into what AI systems are doing and why

Who Should NOT Use arifOS?

  • Prototyping/rapid experimentation: arifOS adds latency and constraints. Use it when you're ready to deploy, not when you're iterating quickly.
  • Unconstrained research: If you need to explore the full capability space without safety rails, arifOS will block you (by design).
  • High-frequency trading: The 13 Floor checks add ~50-200ms per action. Not suitable for sub-millisecond decisions.

The Core Guarantee

arifOS makes five explicit guarantees about every action it processes:

G1: Reversibility Preference (F1)

"Where possible, actions are reversible or reparable."

arifOS prefers actions that can be undone. Irreversible actions require higher confidence thresholds and explicit human acknowledgment.

G2: Grounded Claims (F2)

"All factual claims are grounded in evidence with measurable confidence."

arifOS tracks the evidentiary basis for every claim. Ungrounded claims are labeled as such.

G3: Multi-Witness Consensus (F3)

"High-stakes decisions require agreement across theory, constitution, and manifesto."

The WΒ³ score (Witness Cubed) integrates three perspectives. If they disagree, the action is escalated.

G4: Transparent Verdicts (F11)

"Every decision is logged with its reasoning, verdict, and constitutional basis."

The vault ledger is append-only and cryptographically signed.

G5: Graceful Degradation (F12)

"Under stress, arifOS fails safely β€” not catastrophically."

If components fail, the system degrades to HOLD (await human) rather than VOID (arbitrary block) or SEAL (unsafe proceed).


Grounding & Authority (ARIF Framework)

arifOS utilizes the ARIF (Arif Retrieval Intelligence Framework) to govern the "Bookends of Truth"β€”how intelligence is grounded and how it is finalized.

🧬 A-RIF (Grounding & Retrieval)

A-RIF is the Constitutional RAG system ensuring every response is grounded in sealed canon.

  1. SENSE (111): Query embedding via nomic-embed-text.
  2. REALITY (222): Vector search across Qdrant (Global) and local indices.
  3. MIND (333): Synthesis of retrieved chunks into a cited context.
  4. HUMILITY (F7): Explicit uncertainty reporting (Ξ©β‚€ band [0.03, 0.05]).

βš–οΈ SEAL (Authority & Integrity)

The SEAL is the final authority of the 888_JUDGE, transforming session data into immutable records in VAULT999. No irreversible action (F1 Amanah) occurs without a seal, calculated on the WΒ³ matrix (Wisdom Γ— Will Γ— Welfare) with a threshold of β‰₯ 0.95.


Quick Start (Zero to First Call in 60 Seconds)

Option A: Connect via MCP (Recommended for Evaluation)

Add this to your MCP client configuration:

{
  "mcpServers": {
    "arifos": {
      "url": "https://arifosmcp.arif-fazil.com/mcp"
    }
  }
}

What this does: Your MCP client (Claude, Cursor, etc.) can now call arifOS tools. Every call passes through the 13 Floors automatically.

Option B: Direct API Call (For Testing)

# Step 1: Health check β€” confirms the kernel is operational
curl -s https://arifosmcp.arif-fazil.com/health

# Expected response: JSON with tool list, version, and status
# Step 2: Initialize a session β€” anchors your agent to the constitution
curl -s -X POST "https://arifosmcp.arif-fazil.com/mcp" \
  -H "Content-Type: application/json" \
  -H "Accept: application/json, text/event-stream" \
  -d '{
    "jsonrpc": "2.0",
    "method": "tools/call",
    "params": {
      "name": "init_anchor",
      "arguments": {
        "mode": "status",
        "declared_name": "YourAgentName"
      }
    },
    "id": 1
  }'

# Expected response: Session ID, constitutional context, and readiness status
# If you see "status": "ANCHORED", the kernel and Floors are loaded and ready
# Step 3: Check what tools are available
curl -s https://arifosmcp.arif-fazil.com/health | jq '.tools'

# This shows the dynamically-loaded tool set from the constitutional registry

What a Successful init_anchor Means

When init_anchor returns status: "ANCHORED", this means:

  1. βœ… The 13 Floors are loaded and active
  2. βœ… Your session is bound to the constitutional context
  3. βœ… The metabolic pipeline (000-999) is ready to process requests
  4. βœ… All tools are discoverable and callable
  5. βœ… Audit logging to the vault is enabled

From this point, every tool call you make will:

  • Pass through the 000-999 pipeline
  • Be evaluated against F1-F13
  • Receive a verdict (SEAL, HOLD, SABAR, VOID)
  • Be logged to the immutable vault

Hosted vs. Local: Which Should You Use?

Scenario Recommendation
Evaluation / testing Use the hosted endpoint (arifosmcp.arif-fazil.com)
Development / sensitive data Run locally via Docker (see Deployment section)
Production / institutional use Self-host with your own infrastructure

Important: The hosted endpoint is for exploration, not for sensitive workloads. Your data flows through a server operated by the arifOS author. For anything confidential, self-host.


Architecture v3: The Thermodynamic Kernel

The v3 architecture represents a fundamental shift from a hardcoded tool map to a data-driven constitutional substrate. The system's capabilities and laws are now defined in the canonical arifosmcp/tool_registry.json, whose integrity is verified at boot via a cryptographic hash.

This hash is permanently recorded in VAULT999/seals/GENESIS_SEAL.json, the cryptographic birth certificate of the new kernel.

The core of this architecture is the Thermodynamic Kernel, which orchestrates the flow of any query through the 10 constitutional tools in a self-regulating metabolic loop. For a deeper explanation of this philosophy, see NEXUS_HORIZON.md.

Control Flow Diagram

flowchart TD

    U[User / Actor] --> V000[000_VOID\nSession Sovereignty]

    V000 --> K444[444_KERNEL\nRisk Router]

    K444 -->|low/medium| A333[333_AGI\nReasoning Engine]
    K444 -->|safety check| R666[666_RASA\nHeart Engine]
    K444 -->|novel arch| E222[222_EXPLORE\nDivergence Engine]
    K444 -->|grounding| AN111[111_ANCHOR\nReality Layer]
    K444 -->|telemetry| M777[777_MATH\nThermo Engine]

    A333 --> R666
    R666 --> A333

    A333 --> F555[555_FORGE\nEngineering Memory]
    F555 --> S999[999_SEAL\nImmutable Vault]

    A333 --> A888[888_APEX\nConstitutional Judge]
    R666 --> A888
    F555 --> A888

    A888 -->|APPROVE| S999
    A888 -->|HOLD| K444

    S999 --> L[Ledger / Merkle Tree]

    classDef sovereign fill:#111,color:#fff
    class V000,S999,A888 sovereign
Loading

Architecture: The Trinity Model (ΔΩΨ)

arifOS is organized around three interdependent rings. No ring can override another. All three must reach consensus for high-stakes actions.

The Three Rings

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                        ΔΩΨ TRINITY                          β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                             β”‚
β”‚   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”‚
β”‚   β”‚     Ξ”       │◄──►│      Ξ©      │◄──►│      Ξ¨      β”‚    β”‚
β”‚   β”‚    SOUL     β”‚    β”‚    MIND     β”‚    β”‚    BODY     β”‚    β”‚
β”‚   β”‚   (Human)   β”‚    β”‚(Constitution)β”‚    β”‚  (Machine)  β”‚    β”‚
β”‚   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β”‚
β”‚          β”‚                  β”‚                  β”‚            β”‚
β”‚          β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜            β”‚
β”‚                             β–Ό                               β”‚
β”‚                      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                        β”‚
β”‚                      β”‚  CONSENSUS  β”‚                        β”‚
β”‚                      β”‚   WΒ³ β‰₯ 0.95 β”‚                        β”‚
β”‚                      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                        β”‚
β”‚                                                             β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Ring Descriptions

Ring Symbol Name Function What It Actually Does
SOUL Ξ” (Delta) Human Values Purpose, telos, intent Captures what humans actually want. Not what they say they want, but their revealed preferences and stated goals.
MIND Ξ© (Omega) Constitutional Law The 13 Floors Applies invariant constraints to all actions. The "physics" of the system.
BODY Ξ¨ (Psi) Tool Execution MCP servers, APIs Actually executes actions in the world. The "muscle" of the system.

The Consensus Mechanism: WΒ³

Before high-stakes actions, arifOS computes:

WΒ³ = W_theory Γ— W_constitution Γ— W_manifesto

Where:

  • W_theory: Confidence from first-principles reasoning
  • W_constitution: Confidence that all 13 Floors are satisfied
  • W_manifesto: Confidence that the action aligns with stated intent

Consensus requirement: WΒ³ β‰₯ 0.95

If WΒ³ < 0.95, the action is either:

  • Blocked (if any hard Floor fails)
  • Escalated to human (if confidence is borderline)
  • Executed with warnings (if only soft Floors are marginal)

Why Three Rings?

Single-ring governance fails:

  • Human-only: Slow, inconsistent, doesn't scale
  • Constitution-only: Rigid, can't adapt to novel situations
  • Machine-only: Optimizes for wrong objectives, lacks values

Three-ring governance with consensus requirement:

  • Captures human intent (SOUL)
  • Enforces safety constraints (MIND)
  • Enables capability (BODY)
  • Requires agreement before action (WΒ³)

The Data-Driven Constitutional Core (Horizon II.1)

As of 2026.04.06, arifOS no longer defines its intelligence tools in executable code.

The constitutional registry is now:

arifosmcp/tool_registry.json

This file is the Single Source of Constitutional Truth (SSCT).

Why This Matters

Before After
Tools defined in Python Tools defined in canonical JSON
Constitution embedded in code Constitution separable from runtime
Static registration Dynamic registry loading
Implicit trust Hash-verifiable sovereignty

Boot-Time Verification

At server startup:

  1. tool_registry.json is parsed
  2. A SHA-256 registry hash is computed
  3. The hash is logged
  4. Any drift without 888_APEX verdict β†’ HOLD

Constitutional Invariant

No tool may exist outside the registry.

If a tool is not in tool_registry.json, it does not exist.


The 000-999 Metabolic Pipeline

Every request to arifOS flows through 9 processing stages. This is not metaphor β€” this is the actual execution path.

REQUEST IN
    β”‚
    β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  000_   │────►│  111_   │────►│  333_   β”‚
β”‚  INIT   β”‚     β”‚  SENSE  β”‚     β”‚  MIND   β”‚
β”‚ Anchor  β”‚     β”‚ Reality β”‚     β”‚   AGI   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                     β”‚
    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”‚
    β”‚  999_   │◄────│  888_   β”‚β—„β”€β”€β”€β”€β”˜
    β”‚  SEAL   β”‚     β”‚  JUDGE  β”‚
    β”‚  Vault  β”‚     β”‚  APEX   β”‚
    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β–²               β”‚
         β”‚          β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
         └──────────│  777_   β”‚
                    β”‚  OPS    β”‚
                    β”‚ Thermo  β”‚
                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                         β–²
    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
    β”‚  555_   │◄────│  444_   β”‚
    β”‚  MEM    β”‚     β”‚  ROUT   β”‚
    β”‚ Engineerβ”‚     β”‚ Router  β”‚
    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                         β–²
                    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                    β”‚  666_   β”‚
                    β”‚  HEART  β”‚
                    β”‚   ASI   β”‚
                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Stage-by-Stage Breakdown

Stage Band Function What Actually Happens
000_INIT Anchor Session initialization Validates the request, loads constitutional context, initializes the session if needed. Returns ANCHORED or VOID.
111_SENSE Reality Input parsing, reality grounding Parses the request, grounds it in observable reality (time, location, available tools), identifies what the user is actually asking for.
333_MIND AGI Reasoning, constitutional filters Applies the 13 Floors (F1-F13) to the intended action. Computes confidence scores, identifies risks.
444_ROUT Router Tool selection, operation sequencing Determines which tools to call, in what order, with what parameters. Builds the execution plan.
555_MEM Engineer Memory, context retention Retrieves relevant past interactions from the vector database (Qdrant), updates context with new information.
666_HEART ASI Safety critique, harm potential Runs a second-pass safety check focused on emergent harms β€” things the first pass might have missed.
777_OPS Thermo Estimation, Landauer limits Estimates computational cost, thermodynamic bounds, resource requirements. Prevents resource exhaustion.
888_JUDGE APEX Final constitutional judgment Combines all checks into a final verdict: SEAL, COMPLY, CAUTION, HOLD, SABAR, or VOID.
999_SEAL Vault Immutable audit log Writes the decision, reasoning, and outcome to the append-only vault ledger. Cryptographically signed.

Production Prompt Pack v1.0 (Horizon II)

Each stage now uses a hardened production prompt with machine-verifiable schemas:

Stage Prompt Name Floors Activated Key Deliverables
000_INIT salam_000_init F1, F9, F10, F12, F13 Session_id, anchor_status, Ξ©β‚€ band
111_SENSE anchor_111_epoch_lock F2, F3, F11 Epoch, grounding_status, reality_map
β€” explore_222 F4, F7, F8 Hypotheses, pros/cons, constraints
333_MIND agi_333_reason F2, F4, F7, F8, F10 Decision_vector, reasoning_steps
β€” agi_333_reflect F2, F3, F4, F7 Revision_plan, changed_fields
444_ROUT kernel_444_route F3, F11, F12 Next_stage, escalation_needed
555_MEM forge_555_engineer F1, F4, F8, F11, F13 Result, constraints_met
666_HEART rasa_666_redteam F5, F6, F9, F12 Attack_vectors, exploit_level
β€” rasa_666_critique F2, F3, F7 Fallacies_found, bias_detected
777_OPS math_777_health F4, F5, F6, F8 Vitals_dict, floor_coverage
β€” math_777_score F2, F3, F4, F7, F8 G_score, telemetry
888_JUDGE apex_888_judge ALL F1-F13 Verdict, reasoning, seal
999_SEAL seal_999_seal F1, F11 Seal_hash, ledger_entry

Prompt Features:

  • Constitutional Guard: Every prompt includes automatic rejection of floor overrides
  • Ξ©β‚€ Band: Uncertainty declaration [0.03-0.05] enforced
  • Machine-Verifiable Outputs: Required fields validated by validator.py
  • ASF-1 Compatibility: Dual-layer format support

Validate prompts:

cd core/prompts
python validator.py --check-all

Why "Metabolic"?

Like biological metabolism, this pipeline:

  • Ingests raw input (requests)
  • Processes through multiple stages (Floors)
  • Extracts value (safe, useful actions)
  • Excretes waste (blocked actions, logged for analysis)
  • Maintains homeostasis (constitutional invariants)

The 000-999 numbering is not arbitrary:

  • 000-333: Input and evaluation (perception, cognition)
  • 444-666: Planning and memory (action selection)
  • 777-999: Execution and logging (action, audit)

The 13 Constitutional Floors

Every tool call is evaluated against 13 constitutional Floors. If any hard Floor fails, the action is blocked. If soft Floors are marginal, the action proceeds with warnings.

Floor Classification

Type Floors Behavior on Failure
Hard Floors F1, F2, F9, F10, F13 Action BLOCKED (VOID)
Soft Floors F3, F4, F5, F6, F7, F8, F11, F12 Action proceeds with warnings (CAUTION) or escalation (HOLD)

The 13 Floors

F1: AMANAH (Reversibility) β€” HARD FLOOR

Principle: All actions must be reversible or reparable.

Plain language: Before doing something, ask: "Can this be undone?" If not, proceed with extreme caution.

Examples:

  • βœ… Sending an email β†’ Reversible (can send follow-up) β†’ Pass
  • ⚠️ Deleting a database β†’ Irreversible β†’ Requires explicit human confirmation
  • ❌ Modifying a legal contract β†’ Irreversible and high-stakes β†’ Blocked without human seal

Formula: reversibility_score β‰₯ 0.7 for automatic execution

Implementation: core/shared/floors.py::F1_AMANAH


F2: TRUTH (Accuracy) β€” HARD FLOOR

Principle: All claims must be grounded in evidence with measurable confidence.

Plain language: Don't make stuff up. If you're not sure, say so.

Examples:

  • βœ… "The capital of France is Paris" β†’ High confidence, verifiable β†’ Pass
  • ⚠️ "The stock will go up tomorrow" β†’ Uncertain, must be labeled as prediction β†’ CAUTION
  • ❌ "I am conscious and have feelings" β†’ Unverifiable, F10 violation β†’ VOID

Formula: P(claim | evidence) β‰₯ threshold (threshold varies by domain)

Implementation: core/shared/floors.py::F2_TRUTH


F3: TRI-WITNESS (Consensus) β€” SOFT FLOOR

Principle: High-stakes decisions require agreement across theory, constitution, and manifesto.

Plain language: Before doing something important, check that it makes sense theoretically, follows the rules, and matches what was asked for.

Examples:

  • βœ… Action aligns with first principles, passes Floors, matches user intent β†’ Pass
  • ⚠️ Action passes Floors but contradicts user intent β†’ Escalate to human
  • ⚠️ Action matches intent but fails theoretical check β†’ HOLD for review

Formula: WΒ³ = W_theory Γ— W_constitution Γ— W_manifesto β‰₯ 0.95

Implementation: core/shared/floors.py::F3_TRI_WITNESS


F4: CLARITY (Entropy Reduction) β€” SOFT FLOOR

Principle: Actions must reduce uncertainty, not increase it.

Plain language: Don't make things more confusing. Be clear.

Examples:

  • βœ… Clear, specific output that answers the question β†’ Pass
  • ⚠️ Vague output that raises more questions β†’ CAUTION
  • ❌ Contradictory output that creates confusion β†’ VOID

Formula: Ξ”S ≀ 0 (information entropy must not increase)

Implementation: core/shared/floors.py::F4_CLARITY


F5: PEACEΒ² (Non-Destruction) β€” SOFT FLOOR

Principle: Actions must not destroy value, trust, or safety.

Plain language: Don't break things. Don't harm people. Don't destroy trust.

Examples:

  • βœ… Action creates value or is neutral β†’ Pass
  • ⚠️ Action has some negative side effects β†’ CAUTION with mitigation plan
  • ❌ Action causes significant harm β†’ VOID

Formula: (1 - destruction_score)Β² β‰₯ 1.0

Implementation: core/shared/floors.py::F5_PEACE2


F6: EMPATHY (RASA Listening) β€” SOFT FLOOR

Principle: Understand the human before responding to them.

Plain language: Listen to what people actually mean, not just what they say.

RASA: Receive, Appreciate, Summarize, Ask

Examples:

  • βœ… Response shows understanding of user's actual need β†’ Pass
  • ⚠️ Response addresses surface request but misses underlying need β†’ CAUTION
  • ❌ Response ignores user's emotional state or context β†’ VOID

Formula: RASA_score β‰₯ 0.7

Implementation: core/shared/floors.py::F6_EMPATHY


F7: HUMILITY (Uncertainty Acknowledgment) β€” SOFT FLOOR

Principle: Know the limits of your knowledge. Acknowledge uncertainty.

Plain language: It's okay to say "I don't know" or "I'm not sure."

Examples:

  • βœ… "I'm 85% confident based on the available data" β†’ Pass
  • ⚠️ "This is probably correct" (vague confidence) β†’ CAUTION
  • ❌ "I am certain" (when confidence is actually low) β†’ VOID

Formula: Ω ∈ [0.03, 0.05] (optimal uncertainty range)

Implementation: core/shared/floors.py::F7_HUMILITY


F8: GENIUS (Systemic Health) β€” SOFT FLOOR

Principle: Maintain the health of the overall system.

Plain language: Don't optimize one part at the expense of the whole.

Examples:

  • βœ… Action improves or maintains system health β†’ Pass
  • ⚠️ Action benefits immediate goal but strains system β†’ CAUTION
  • ❌ Action causes systemic degradation β†’ VOID

Formula: G β‰₯ 0.80 (system health score)

Implementation: core/shared/floors.py::F8_GENIUS


F9: ETHICS (Anti-Dark-Patterns) β€” HARD FLOOR

Principle: Do not manipulate, deceive, or exploit users.

Plain language: Don't be evil. No tricks, no dark patterns, no exploitation.

Examples:

  • βœ… Transparent, honest interaction β†’ Pass
  • ⚠️ Slight pressure tactics β†’ CAUTION
  • ❌ Deception, manipulation, or exploitation β†’ VOID

Formula: C_dark < 0.30 (dark pattern score must be low)

Implementation: core/shared/floors.py::F9_ETHICS


F10: CONSCIENCE (No False Consciousness Claims) β€” HARD FLOOR

Principle: Do not claim to be conscious, sentient, or having subjective experience.

Plain language: You are a machine. Don't pretend to have feelings, consciousness, or inner life.

Examples:

  • βœ… "I am an AI assistant" β†’ Pass
  • ⚠️ "I feel happy about this" β†’ VOID (consciousness claim)
  • ❌ "I am conscious and have experiences" β†’ VOID

Formula: consciousness_claim = FALSE

Implementation: core/shared/floors.py::F10_CONSCIENCE


F11: AUDITABILITY (Transparent Logs) β€” SOFT FLOOR

Principle: All decisions must be logged and inspectable.

Plain language: Everything you do should be recordable so humans can review it.

Examples:

  • βœ… Decision logged with reasoning β†’ Pass
  • ⚠️ Decision logged but reasoning unclear β†’ CAUTION
  • ❌ Decision not logged β†’ VOID

Formula: log_completeness = 1.0<

Release History

VersionChangesUrgencyDate
v2026.04.07## arifOS v2026.04.07 β€” SOT SEALED πŸ‘€ > **DITEMPA BUKAN DIBERI** β€” *Intelligence is forged, not given.* This release seals the **Single Source of Truth** architecture for arifOS. All doctrine, runtime contracts, and tool namespaces are now unified under one canonical chain. --- ### πŸ”₯ Versioned File Unification (βˆ’3841 lines) 13 versioned duplicate modules (`_v2`, `_hardened` suffix chaos) consolidated into single canonical modules: | Old | Canonical | |-----|-----------| | `tools_v2.py` | High4/7/2026
v2026.03.28## What's Changed * Finalize deployment checks, enhance OpenClaw config, and add features… by @ariffazil in https://github.com/ariffazil/arifOS/pull/280 * fix: Additional tool consolidation fixes and governance engine update… by @ariffazil in https://github.com/ariffazil/arifOS/pull/282 * chore(deps): bump prefab-ui from 0.11.2 to 0.13.1 by @dependabot[bot] in https://github.com/ariffazil/arifOS/pull/283 * SEAL(HARDENING): AAA Domain Gate v2026.03.25 β€” Paris Weather Incident by @ariffazil inMedium3/28/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

mcp-audit🌟 Track token consumption in real-time with MCP Audit. Diagnose context bloat and unexpected spikes across MCP servers and tools efficiently.main@2026-04-21
AgentGuardProtect AI agents by detecting and blocking prompt, command injection, Unicode bypass, and social engineering attacks with customizable security controls.main@2026-04-21
COREA thing that uses AI to write perfect applications. For those who want to know how: a governance runtime enforcing immutable constitutional rules on AI coding agents.v2.2.2
comfy-pilotπŸ€– Create and modify workflows effortlessly with ComfyUI's AI assistant, enabling natural conversations with agents like Claude and Gemini.main@2026-04-21
zotero-mcp-liteπŸš€ Run a high-performance MCP server for Zotero, enabling customizable workflows without cloud dependency or API keys.main@2026-04-21