
prism-mcp



Description

The Mind Palace for AI Agents: an Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engine (GraphRAG), adversarial evaluation (anti-sycophancy), and fail-closed Dark Factory pipelines. Zero-config local mode.

README

🧠 Prism MCP: The Mind Palace for AI Agents


Your AI agent forgets everything between sessions. Prism fixes that, then teaches it to think.

Prism v11.5.1 is a true Cognitive Architecture inspired by human brain mechanics. Beyond flat vector search, your agent now forms principles from experience, follows causal trains of thought, and possesses the self-awareness to know when it lacks information. Your agents don't just remember; they learn. With v11.5.1, the entire cognitive pipeline (including ledger compaction, task routing, semantic search, and the new Deep Research Intelligence) runs 100% on-device or via secure clinical discovery (PubMed/ERIC), backed by prism-coder:7b, a HIPAA-hardened local LLM. No API keys for core features. No data leaves your machine.

npx -y prism-mcp-server

Works with Claude Desktop · Claude Code · Cursor · Windsurf · Cline · Gemini · Antigravity, or any other MCP client.

https://github.com/dcostenco/prism-mcp/raw/main/docs/prism_mcp_demo.mp4



🔬 v11.5.1 Deep Research Intelligence (Auto-Scholar)

Prism v11.5.1 transforms your AI agent from a "Coder" into a "Clinical Scientist." It features a Tavily-Enhanced Multi-Provider Discovery Pipeline that grounds Gemini 2.5 Flash's thinking in real-world empirical data.

🥊 The Global Benchmarks: Prism v11 vs. Standard RAG

| Feature | Standard AI Memory (Mem0/Zep) | Prism v11.5.1 (Elite Architecture) |
|---|---|---|
| Search Complexity | $O(N)$ or $O(\log N)$ (scales with data) | **$O(1)$ Zero-Search (constant time via HRR)** |
| Discovery Logic | General Web Search (Snippets) | Parallel Academic Discovery (PubMed, ERIC, S2) |
| Reasoning Model | Flat List (Simple Similarity) | ACT-R Spreading Activation (Causal Graph) |
| Privacy Mode | Cloud-First (SaaS) | Local-First (HIPAA-Hardened / Air-Gapped) |
| Intelligence Floor | Generic GPT-4 Advice | Data-Driven Clinical Evidence (62% CI Warnings) |

⚡ Zero-Search Retrieval (HRR Architecture)

Prism features a cutting-edge Zero-Search Retrieval system for its cognitive memory, moving beyond traditional vector databases toward a mathematically direct, $O(1)$ retrieval model.

🧠 What is Zero-Search?

Zero-Search Retrieval uses Holographic Reduced Representations (HRR) to "ask the vector" directly. Structured facts are bound into a single, high-dimensional "superposition" vector (typically 4096-dims). Retrieval is a direct mathematical unbinding operation (circular correlation).

| Metric | Traditional Vector Search | Zero-Search (HRR) |
|---|---|---|
| Complexity | $O(N)$ or $O(\log N)$ (scales with data) | $O(1)$ (constant time) |
| Retrieval Speed | Decays as memory grows | Instant regardless of memory size |
| Precision | Approximate (Top-K) | Mathematical Unbinding (Exact) |
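
The binding and unbinding steps described above can be sketched in a few lines. This is a minimal illustration of textbook HRR, not Prism's implementation: every function name is hypothetical, the vectors are 512-dimensional rather than 4096 to keep the naive loops fast, and a seeded random generator stands in for real embeddings. In the textbook formulation, unbinding returns a noisy estimate of the stored value, so the sketch finishes with a cleanup step that picks the closest known candidate.

```typescript
// Illustrative HRR sketch. All names are hypothetical, not Prism's API.
type Vec = Float64Array;

// Seeded PRNG (mulberry32) so the demo is reproducible.
function mulberry32(seed: number): () => number {
  let a = seed >>> 0;
  return () => {
    a = (a + 0x6d2b79f5) >>> 0;
    let t = a;
    t = Math.imul(t ^ (t >>> 15), t | 1);
    t ^= t + Math.imul(t ^ (t >>> 7), t | 61);
    return ((t ^ (t >>> 14)) >>> 0) / 4294967296;
  };
}

// Random vector with i.i.d. N(0, 1/n) entries (Box-Muller), so its norm is close to 1.
function randomVec(n: number, rand: () => number): Vec {
  const v = new Float64Array(n);
  for (let i = 0; i < n; i++) {
    const u1 = 1 - rand(); // in (0, 1], avoids log(0)
    const u2 = rand();
    v[i] = (Math.sqrt(-2 * Math.log(u1)) * Math.cos(2 * Math.PI * u2)) / Math.sqrt(n);
  }
  return v;
}

// Binding: circular convolution, out[k] = sum_j a[j] * b[(k - j) mod n].
function bind(a: Vec, b: Vec): Vec {
  const n = a.length;
  const out = new Float64Array(n);
  for (let k = 0; k < n; k++) {
    let s = 0;
    for (let j = 0; j < n; j++) s += a[j] * b[(k - j + n) % n];
    out[k] = s;
  }
  return out;
}

// Unbinding: circular correlation, out[k] = sum_j key[j] * trace[(j + k) mod n].
function unbind(key: Vec, trace: Vec): Vec {
  const n = key.length;
  const out = new Float64Array(n);
  for (let k = 0; k < n; k++) {
    let s = 0;
    for (let j = 0; j < n; j++) s += key[j] * trace[(j + k) % n];
    out[k] = s;
  }
  return out;
}

// Superposition: element-wise sum of bound pairs.
function superpose(vectors: Vec[]): Vec {
  const out = new Float64Array(vectors[0].length);
  for (const v of vectors) for (let i = 0; i < out.length; i++) out[i] += v[i];
  return out;
}

function cosine(a: Vec, b: Vec): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) { dot += a[i] * b[i]; na += a[i] * a[i]; nb += b[i] * b[i]; }
  return dot / Math.sqrt(na * nb);
}

// Cleanup: pick the known candidate closest to the noisy decoded vector.
function cleanup(noisy: Vec, candidates: Record<string, Vec>): { label: string; similarity: number } {
  let best = { label: "", similarity: -Infinity };
  for (const [label, v] of Object.entries(candidates)) {
    const similarity = cosine(noisy, v);
    if (similarity > best.similarity) best = { label, similarity };
  }
  return best;
}

// Store three key/value facts in one trace, then ask the trace for the "auth" value.
function demo(): { label: string; similarity: number } {
  const rand = mulberry32(42);
  const n = 512;
  const database = randomVec(n, rand), auth = randomVec(n, rand), cache = randomVec(n, rand);
  const postgres = randomVec(n, rand), pkce = randomVec(n, rand), redis = randomVec(n, rand);
  const trace = superpose([bind(database, postgres), bind(auth, pkce), bind(cache, redis)]);
  return cleanup(unbind(auth, trace), { postgres, pkce, redis });
}
```

The decode cost here depends only on the vector dimension, not on how many facts were superposed, which is the property the table refers to. The naive loops are quadratic in the dimension; an FFT-based convolution would bring that down to n log n.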

๐Ÿ” Supported Discovery Engines & Databases

  1. Tavily AI (Elite): Primary discovery engine for AI-native deep crawling and PDF/Abstract extraction.
  2. PubMed (NCBI) (Clinical): The world's largest biomedical database for clinical citations.
  3. ERIC (Education Research) (Behavioral): The definitive database for ABA and pediatric interventions.
  4. Semantic Scholar (Academic): AI-powered research tool providing "TLDR" summaries of 200M+ papers.
  5. DuckDuckGo Lite (Fallback): Privacy-focused web discovery for general context.

๐Ÿฅ Flagship Implementation: Synalux

Synalux is a high-compliance, local-first Practice Management System for ABA and Pediatrics. It is the flagship implementation of the Prism v11.5.1 engine, utilizing Zero-Search Retrieval and Parallel Academic Discovery to provide clinicians with real-time, evidence-based reasoning.


See Live Samples

Topic: Helping a child with tactile focus

  • Without Deep Research: "I recommend using sensory toys and maintaining a calm environment to help the child focus during tasks."
  • With Deep Research (v11.5.1): "Recent clinical studies indicate that high-frequency sensory input can actually decrease focus in 40% of pediatric cases. I recommend a low-frequency, high-pressure 'weighted' approach which showed a 3.5x improvement in sustained attention during clinical trials."

Topic: Behavior extinction vs. reinforcement

  • Without Deep Research: "Extinction is a common way to stop a behavior. You should also reinforce good behaviors at the same time."
  • With Deep Research (v11.5.1): "Research shows that using extinction alone leads to an 'extinction burst' (a temporary spike in the bad behavior) in 62% of cases. However, combining it with an alternative reinforcement strategy (DRA) reduces this risk to under 20%."

Why Prism?

Every time you start a new conversation with an AI coding assistant, it starts from scratch. You re-explain your architecture, re-describe your decisions, re-list your TODOs. Hours of context, gone.

Prism gives your agent a brain that persists, and then teaches it to reason. Save what matters at the end of each session. Load it back instantly on the next one. But Prism goes far beyond storage: it consolidates raw experience into lasting principles, traverses causal chains to surface root causes, and knows when to say "I don't know."

📌 Terminology: Throughout this doc, "Prism" refers to the MCP server and cognitive memory engine. "Mind Palace" refers to the visual dashboard UI at localhost:3000, your window into the agent's brain. They work together; the dashboard is optional.

Prism has three pillars:

  1. 🧠 Cognitive Memory ($O(1)$ Zero-Search): Prism uses Holographic Reduced Representations (HRR) to eliminate "searching" entirely. Memories are unbound mathematically from a superposition vector in constant time ($O(1)$), regardless of library size. Re-ranking is powered by the ACT-R model, mimicking biological recency and frequency.

  2. 🔗 Multi-Hop Causal Reasoning: Prism doesn't just find "similar" things. Spreading activation traverses the causal graph and brings back context connected to your current problem through logical "trains of thought."

  3. 🏭 Autonomous Execution (Dark Factory): When you're ready, Prism can run coding tasks end-to-end with a fail-closed pipeline where an adversarial evaluator catches bugs the generator missed, before you ever see the PR. (See Dark Factory.)


🚀 Quick Start

Prerequisites

  • Node.js v18+ (v20 LTS recommended; v23.x has a known npx quirk)
  • Any MCP-compatible client (Claude Desktop, Cursor, Windsurf, Cline, etc.)
  • No API keys required for core features (see Capability Matrix)

Install

Add to your MCP client config (claude_desktop_config.json, .cursor/mcp.json, etc.):

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"]
    }
  }
}

โš ๏ธ Windows / Restricted Shells: If your MCP client complains that npx is not found, use the absolute path to your node binary (e.g. C:\Program Files\nodejs\npx.cmd).

That's it. Restart your client. All tools are available. The Mind Palace Dashboard (the visual UI for your agent's brain) starts automatically at http://localhost:3000. You don't need to keep a tab open: the dashboard runs in the background and the MCP tools work with or without it.

🔮 Pro Tip: Once installed, open http://localhost:3000 in your browser to view the Mind Palace Dashboard, a real-time UI of your agent's brain. Explore the Knowledge Graph, Intent Health gauges, and Session Ledger.

🔄 Updating Prism: npx -y caches the package locally. Restarting your MCP client normally lets npx -y fetch the newest release. If you're still stuck on a stale version, run npx clear-npx-cache (or npm cache clean --force) before restarting.

Port 3000 already in use? (Next.js / Vite / etc.)

Add PRISM_DASHBOARD_PORT to your MCP config env block:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"],
      "env": { "PRISM_DASHBOARD_PORT": "3001" }
    }
  }
}

Then open http://localhost:3001 instead.

Capability Matrix

| Feature | Local (Offline) | Cloud (API Key) |
|---|---|---|
| Session memory & handoffs | ✅ | ✅ |
| Keyword search (FTS5) | ✅ | ✅ |
| Time travel & versioning | ✅ | ✅ |
| Mind Palace Dashboard | ✅ | ✅ |
| GDPR export (JSON/Markdown/Vault) | ✅ | ✅ |
| Semantic vector search | ✅ (embedding_provider=local) | ✅ (gemini, openai, or voyage) |
| Ledger compaction | ✅ prism-coder:7b via Ollama | ✅ Text provider key |
| Task routing (LLM tiebreaker) | ✅ prism-coder:7b via Ollama | N/A (heuristic-only) |
| Morning Briefings | ❌ | ✅ Text provider key |
| Web Scholar research | ❌ | ✅ BRAVE_API_KEY + FIRECRAWL_API_KEY (or TAVILY_API_KEY) |
| VLM image captioning | ❌ | ✅ Provider key |
| Autonomous Pipelines (Dark Factory) | ❌ | ✅ Text provider key |

🔑 The core Mind Palace works 100% offline with zero API keys, including semantic vector search with embedding_provider=local. Cloud keys unlock text generation features (Briefings, compaction, pipelines). See Environment Variables.

💰 API Cost Note: With embedding_provider=local, semantic search is fully free and offline. Cloud providers (GOOGLE_API_KEY for Gemini, VOYAGE_API_KEY, OPENAI_API_KEY) have generous free tiers. BRAVE_API_KEY offers 2,000 free searches/month. FIRECRAWL_API_KEY has a free plan with 500 credits. For typical solo development, expect $0/month on the free tiers.


✨ The Magic Moment

Session 1 (Monday evening):

You: "Analyze this auth architecture and plan the OAuth migration."
Agent: *deep analysis, decisions, TODO list*
Agent: session_save_ledger → session_save_handoff ✅

Session 2 (Tuesday morning, new conversation, new context window):

Agent: session_load_context โ†’ "Welcome back! Yesterday we decided to use PKCE
       flow with refresh tokens. 3 TODOs remain: migrate the user table,
       update the middleware, and write integration tests."
You: "Pick up where we left off."

Your agent remembers everything. No re-uploading files. No re-explaining decisions.


📖 Setup Guides

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"]
    }
  }
}
Cursor

Add to .cursor/mcp.json (project) or ~/.cursor/mcp.json (global):

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"]
    }
  }
}
Windsurf

Add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"]
    }
  }
}
VS Code + Continue / Cline

Add to your Continue config.json or Cline MCP settings:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"],
      "env": {
        "PRISM_STORAGE": "local",
        "BRAVE_API_KEY": "your-brave-api-key"
      }
    }
  }
}
Claude Code: Lifecycle Autoload (.clauderules)

Claude Code can be told to call Prism's MCP tools at the start and end of every session through your workspace .clauderules. Simply add:

Always start the conversation by calling `mcp__prism-mcp__session_load_context(project='my-project', level='deep')`.
When wrapping up, always call `mcp__prism-mcp__session_save_ledger` and `mcp__prism-mcp__session_save_handoff`.

Format Note: Claude automatically wraps MCP tools with double underscores (mcp__prism-mcp__...), while most other clients use single underscores (mcp_prism-mcp_...). Prism's backend natively handles both formats seamlessly.

CLI Alternative: If MCP tools aren't available or you're scripting around Claude Code:

# Load context before a session
prism load my-project --level deep

# Machine-readable JSON for parsing in scripts
prism load my-project --level deep --json
Gemini / Antigravity: Prompt Auto-Load

See the Gemini Setup Guide for the proven three-layer prompt architecture to ensure reliable session auto-loading.

Antigravity doesn't expose MCP tools to the model. Use the prism load CLI as a fallback:

# From a shell or run_command tool
prism load my-project --level standard --json

# Or via the wrapper script
bash ~/.gemini/antigravity/scratch/prism_session_loader.sh my-project

The CLI uses the same storage layer as the MCP tool (SQLite or Supabase).

โš ๏ธ CRITICAL (v9.2.2): Split-Brain Prevention If your MCP server is configured with PRISM_STORAGE=local but Supabase credentials are also set, the CLI may read from the wrong backend (Supabase) while the server writes to SQLite. This causes stale TODOs and divergent state. Always pass --storage local explicitly when using the CLI in a local-mode environment:

prism load my-project --storage local --json

The prism_session_loader.sh wrapper handles this automatically since v9.2.2.

Bash / CI/CD / Scripts

Use the prism load CLI to access session context from any shell environment:

# Quick check (human-readable)
prism load my-project

# Parse JSON in scripts
CONTEXT=$(prism load my-project --level quick --json)
SUMMARY=$(echo "$CONTEXT" | jq -r '.handoff[0].last_summary')
VERSION=$(echo "$CONTEXT" | jq -r '.handoff[0].version')
echo "Project at v$VERSION: $SUMMARY"

# Explicit storage backend (v9.2.2, prevents split-brain)
prism load my-project --storage local --json
prism load my-project --storage supabase --json

# Role-scoped loading
prism load my-project --role qa --json

# Use in CI/CD to verify context exists before deploying
if ! prism load my-project --level quick --json | jq -e '.handoff[0].version' > /dev/null 2>&1; then
  echo "No Prism context found; skipping context-aware deploy"
fi
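
For scripts written in TypeScript rather than shell, the same extraction can be sketched as below. Only the two fields read by the jq commands above (handoff[0].last_summary and handoff[0].version) come from this README; the interface names and the summarizeContext helper are hypothetical, and the rest of the JSON shape is deliberately left untyped. The function takes the CLI's JSON output as a string, so how you capture that output is up to your script.

```typescript
// Hypothetical types covering only the fields the shell example reads.
interface HandoffEntry {
  last_summary?: string;
  version?: number | string;
}

interface PrismContext {
  handoff?: HandoffEntry[];
}

// Returns a one-line summary, or null when the output is not valid JSON
// or carries no handoff version (the same condition the CI check tests).
function summarizeContext(cliJson: string): string | null {
  let parsed: unknown;
  try {
    parsed = JSON.parse(cliJson);
  } catch {
    return null;
  }
  const first = (parsed as PrismContext | null)?.handoff?.[0];
  if (!first || first.version === undefined || first.version === null) return null;
  return `Project at v${first.version}: ${first.last_summary ?? "(no summary)"}`;
}
```

Returning null instead of throwing keeps the "no context found" case easy to branch on, mirroring the jq -e check in the shell version.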

📦 Install: npm install -g prism-mcp-server makes the prism CLI available globally. For local builds: node /path/to/prism/dist/cli.js load.

Supabase Cloud Sync

To sync memory across machines or teams:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"],
      "env": {
        "PRISM_STORAGE": "supabase",
        "SUPABASE_URL": "https://your-project.supabase.co",
        "SUPABASE_KEY": "your-supabase-anon-or-service-key"
      }
    }
  }
}

Schema Migrations

Prism auto-applies its schema on first connect, so no manual step is required. If you need to apply or re-apply migrations manually (e.g. for a fresh project or after a version bump), run the SQL files in supabase/migrations/ in numbered order via the Supabase SQL Editor or the CLI:

# Via CLI (requires supabase CLI + project linked)
supabase db push

# Or apply a single migration via the Supabase dashboard SQL Editor
# Paste the contents of supabase/migrations/0NN_*.sql and click Run

Key migrations:

  • 020_*: Core schema (ledger, handoff, FTS, TTL, CRDT)
  • 033_memory_links.sql: Associative Memory Graph (MemoryLinks), required for session_backfill_links

Anon key vs. service role key: The anon key works for personal use (Supabase RLS policies apply). Use the service role key for team deployments where multiple users share the same Supabase project โ€” it bypasses RLS and allows Prism to manage all rows regardless of auth context. Never expose the service role key client-side.

Clone & Build (Full Control)
git clone https://github.com/dcostenco/prism-mcp.git
cd prism-mcp && npm install && npm run build

Then add to your MCP config:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "node",
      "args": ["/path/to/prism-mcp/dist/server.js"],
      "env": {
        "BRAVE_API_KEY": "your-key"
      }
    }
  }
}
Cloud Deployment (Render)

Prism can be deployed natively to cloud platforms like Render so your agent's memory is always online and accessible across different machines or teams.

  1. Fork this repository.
  2. In the Render Dashboard, create a new Web Service pointing to your repository.
  3. In the setup wizard, select Docker as the Runtime.
  4. Set the Dockerfile path to Dockerfile.smithery.
  5. Connect your local MCP client to your new cloud endpoint using the sse transport:
{
  "mcpServers": {
    "prism-mcp-cloud": {
      "command": "npx",
      "args": ["-y", "supergateway", "--url", "https://your-prism-app.onrender.com/sse"]
    }
  }
}

Note: Dockerfile.smithery uses an optimized multi-stage build that compiles TypeScript in a development stage before booting the server in a stripped-down production image. No npm publishing required!

Common Installation Pitfalls

โŒ Don't use npm install -g: Hardcoding the binary path (e.g. /opt/homebrew/Cellar/node/23.x/bin/prism-mcp-server) is tied to a specific Node.js version โ€” when Node updates, the path silently breaks.

โœ… Always use npx instead:

{
  "mcpServers": {
    "prism-mcp": {
      "command": "npx",
      "args": ["-y", "prism-mcp-server"]
    }
  }
}

npx resolves the correct binary automatically, always fetches the latest version, and works identically on macOS, Linux, and Windows. Already installed globally? Run npm uninstall -g prism-mcp-server first.

โ“ Seeing warnings about missing API keys on startup? That's expected and not an error. API key warnings are informational only โ€” core session memory and semantic search (with embedding_provider=local) work with zero keys. See Environment Variables for what each key unlocks.

๐Ÿ’ก Do agents auto-load Prism? Agents using Cursor, Windsurf, or other MCP clients will see the session_load_context tool automatically, but may not call it unprompted. Add this to your project's .cursorrules (or equivalent system prompt) to guarantee auto-load:

At the start of every conversation, call session_load_context with project "my-project" before doing any work.

Claude Code users can use the .clauderules auto-load hook shown in the Setup Guides. Prism also has a server-side fallback (v5.2.1+) that auto-pushes context after 10 seconds if no load is detected.


📥 Universal Import: Bring Your History

Switching to Prism? Don't leave months of AI session history behind. Prism can ingest historical sessions from Claude Code, Gemini, and OpenAI and give your Mind Palace an instant head start, with no manual re-entry required.

Import via the CLI or directly from the Mind Palace Dashboard (Import tab → file picker + dry-run toggle).

Supported Formats

  • Claude Code (.jsonl logs): Automatically handles streaming chunk deduplication and requestId normalization.
  • Gemini (JSON history arrays): Supports large-file streaming for 100MB+ exports.
  • OpenAI (JSON chat completion history): Normalizes disparate tool-call structures into the unified Ledger schema.

How to Import

Option 1 (CLI):

# Ingest Claude Code history
npx -y prism-mcp-server universal-import --format claude --path ~/path/to/claude_log.jsonl --project my-project

# Dry run (verify mapping without saving)
npx -y prism-mcp-server universal-import --format gemini --path ./gemini_history.json --dry-run

Option 2 (Dashboard): Open localhost:3000, navigate to the Import tab, select the format and file, and click Import. Supports dry-run preview.

Why It's Safe to Re-Run

  • Memory-Safe Streaming: Processes massive log files line-by-line using stream-json to prevent Out-of-Memory (OOM) crashes.
  • Idempotent Dedup: Content-hash prevents duplicate imports on re-run (skipCount reported).
  • Chronological Integrity: Uses timestamp fallbacks and requestId sorting to preserve your memory timeline.
  • Smart Context Mapping: Extracts cwd, gitBranch, and tool usage patterns into searchable metadata.
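
The idempotent-dedup behavior can be illustrated with a short sketch. This is an assumption-laden toy, not Prism's importer: the DedupImporter class and its in-memory Set are hypothetical, and a small FNV-1a hash stands in for whatever content hash Prism actually uses (a real importer would more likely use a cryptographic hash such as SHA-256 and persist the seen hashes). Only the skipCount name comes from the list above.

```typescript
// FNV-1a (32-bit) as a stand-in content hash. Illustrative only.
function fnv1a(text: string): string {
  let h = 0x811c9dc5;
  for (let i = 0; i < text.length; i++) {
    h ^= text.charCodeAt(i);
    h = Math.imul(h, 0x01000193) >>> 0;
  }
  return h.toString(16).padStart(8, "0");
}

interface ImportResult {
  importedCount: number;
  skipCount: number; // entries skipped because their content hash was already seen
}

// Hypothetical importer that remembers content hashes across runs.
class DedupImporter {
  private seen = new Set<string>();

  importEntries(entries: string[]): ImportResult {
    let importedCount = 0;
    let skipCount = 0;
    for (const entry of entries) {
      const key = fnv1a(entry);
      if (this.seen.has(key)) {
        skipCount++;
        continue;
      }
      this.seen.add(key);
      importedCount++;
    }
    return { importedCount, skipCount };
  }
}
```

Running the same file through importEntries twice imports nothing the second time and reports every entry under skipCount, which is what makes a re-run safe.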

✨ What Makes Prism Different

🧠 Your Agent Learns From Mistakes

When you correct your agent, Prism tracks it. Corrections accumulate importance over time. High-importance lessons auto-surface as warnings in future sessions, and can even sync to your .cursorrules file for permanent enforcement. Your agent literally gets smarter the more you use it.

🕰️ Time Travel

Every save creates a versioned snapshot. Made a mistake? memory_checkout reverts your agent's memory to any previous state, like git revert for your agent's brain. Full version history with optimistic concurrency control.

🔮 Mind Palace Dashboard

A glassmorphism UI at localhost:3000 that lets you see exactly what your agent is thinking:

  • Current State & TODOs: the exact context injected into the LLM's prompt
  • Intent Health Gauges: per-project 0–100 health score with staleness decay, TODO load, and decision signals
  • Interactive Knowledge Graph: force-directed neural graph with click-to-filter, node renaming, and surgical keyword deletion
  • Deep Storage Manager: preview and execute vector purge operations with dry-run safety
  • Session Ledger: full audit trail of every decision your agent has made
  • Time Travel Timeline: browse and revert any historical handoff version
  • Visual Memory Vault: browse VLM-captioned screenshots and auto-captured HTML states
  • Hivemind Radar: real-time active agent roster with role, task, and heartbeat
  • Morning Briefing: AI-synthesized action plan after 4+ hours away
  • Brain Health: memory integrity scan with one-click auto-repair

๐Ÿ›ก๏ธ ABA Precision Security Protocol

Inspired by Applied Behavior Analysis (ABA) structures in the Synalux platform, Prism incorporates rigorous behavioral safety constraints directly into the MCP connection layer. Advanced output sanitization (sanitizeMcpOutput) and behavior-guided guardrails eliminate prompt injection, constrain the generator, and enforce strict, hallucination-free outputs for clinical precision.

🧬 10× Memory Compression

Powered by a pure TypeScript port of TurboQuant (based on Google's ICLR research), Prism compresses 768-dim embeddings from 3,072 bytes → ~400 bytes (about 7.7× in the benchmark below), enabling decades of session history on a standard laptop. No native modules. No vector database required. To mitigate quantization degradation (where repeated compress/decompress cycles could smear subtle corrections after 10k+ memories), Prism relies on autonomous ledger compaction and Deep Storage cleanup to preserve memory fidelity over time.

📊 1M-Vector Benchmark (d=768, 4-bit)

Validated on 1,000,000 synthetic unit vectors at production dimension (d=768), run on Apple M4 Max (36GB):

| Metric | Value |
|---|---|
| Compression ratio | 7.7× (3,072 → 400 bytes) |
| Throughput | 833 vectors/sec |
| Peak heap | 329 MB |
| Total time | 57.6 minutes |
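
The byte counts in the compression row follow from simple arithmetic, sketched below. Two things are assumptions rather than statements from this README: that the raw embeddings are stored as 32-bit floats, and that the 16 bytes left over after the 4-bit codes are per-vector metadata (for example a stored norm). The constant names are illustrative.

```typescript
// Back-of-the-envelope check of the compression row. Names are illustrative.
const EMBEDDING_DIM = 768;
const FLOAT32_BYTES = 4;   // assumption: raw embeddings are float32
const BITS_PER_COORD = 4;  // "4-bit" from the benchmark heading

const rawBytes = EMBEDDING_DIM * FLOAT32_BYTES;          // 768 * 4 = 3072
const codeBytes = (EMBEDDING_DIM * BITS_PER_COORD) / 8;  // 768 * 4 / 8 = 384
const compressedBytes = 400;                             // from the table
const overheadBytes = compressedBytes - codeBytes;       // 16, assumed to be per-vector metadata
const compressionRatio = rawBytes / compressedBytes;     // 7.68, reported as 7.7x
```

The 4-bit codes alone would give an 8× ratio; the small fixed overhead per vector is what brings the measured figure down to about 7.7×.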

Residual norm distribution (the quantization error after Householder rotation + Lloyd-Max scalar quantization):

| Statistic | Value |
|---|---|
| Mean | 0.1855 |
| CV (coefficient of variation) | 0.038 |
| P99/P50 ratio | 1.11 |
| P99.9/P50 ratio | 1.16 |
| Max/Min ratio | 1.46 |
| IQR | 0.009 |

A CV of 0.038 means the residual norm barely varies across 1M vectors, so there is effectively no long tail. The QJL correction term (which scales linearly with residualNorm) remains stable even for P99.9 outliers.

R@k retrieval accuracy (global corpus, 30 trials):

| Corpus Size | R@1 | R@5 |
|---|---|---|
| N=1,000 | 20.0% | 60.0% |
| N=10,000 | 36.7% | 76.7% |
| N=50,000 | 53.3% | 90.0% |

Note: R@k on random high-dimensional vectors is inherently harder than on real embeddings (all vectors are near-equidistant in d=768). Real-world retrieval with clustered embeddings produces higher accuracy. See tests/residual-distribution.test.ts and tests/benchmarks/residual-1m.ts for full methodology.

๐Ÿ Multi-Agent Hivemind & Enterprise Sync

While local SQLite is amazing for solo developers, enterprise teams cannot share a local SQLite file. Prism breaks the "local-only" ceiling via Supabase Sync and the Multi-Agent Hivemindโ€”scaling effortlessly to teams of 50+ developers using agents. Multiple agents (dev, QA, PM) can work on the same project with role-isolated memory, discover each other automatically, and share context in real-time via Telepathy sync to a shared Postgres backend. โ†’ Multi-agent setup example

🚦 Task Router

Prism scores coding tasks across 6 weighted heuristic signals (keyword analysis, file count, file-type complexity, scope, length, multi-step detection) and recommends whether to keep execution on the host cloud model or delegate to a local Claw agent (powered by deepseek-r1 / qwen2.5-coder via Ollama). File-type awareness routes config/docs edits locally while reserving systems-programming tasks for the host. The local agent features buffered streaming (handles split <think> tags), stateful multi-turn conversations, and automatic memory trimming. In client startup/skill flows, use defensive delegation: route only coding tasks, call session_task_route only when it is available, delegate to claw only when executor tooling exists and the task is non-destructive, and fall back to the host when the router or executor is unavailable. → Task router real-life example

🧠 Local Prism Coder Engine (prism-coder:7b)

To achieve zero-latency, offline routing and memory compilation without cloud dependencies, Prism utilizes an internal fine-tuned ML model: prism-coder:7b. Built atop Qwen 2.5 Coder 7B using the MLX framework for Apple Silicon, this engine underwent aggressive Supervised Fine-Tuning (SFT) over 1,000+ past session traces and semantic architectures.

To guarantee structured MCP tool use, it was further aligned using GRPO (Group Relative Policy Optimization) with a deterministic reward function that deducts points for missing required parameters or misnaming tools.
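
A deterministic reward of this kind can be sketched as follows. This is a hypothetical illustration, not the reward used to train prism-coder: the point values (0.5 for an unknown tool, 0.25 per missing required parameter), the ToolSpec shape, and the assumption that session_load_context requires a project parameter are all invented for the example.

```typescript
// Hypothetical tool registry entry: a tool name plus its required parameters.
interface ToolSpec {
  name: string;
  required: string[];
}

// Score one model output in [0, 1]. The penalty values are illustrative assumptions.
function scoreToolCall(rawOutput: string, specs: ToolSpec[]): number {
  let call: { name?: unknown; arguments?: unknown } | null;
  try {
    call = JSON.parse(rawOutput);
  } catch {
    return 0; // not valid JSON: no reward at all
  }
  if (call === null || typeof call !== "object" || typeof call.name !== "string") return 0;

  const spec = specs.find((s) => s.name === call!.name);
  if (!spec) return 0.5; // misnamed tool: deduct 0.5, parameters cannot be checked

  const args =
    typeof call.arguments === "object" && call.arguments !== null
      ? (call.arguments as Record<string, unknown>)
      : {};
  let reward = 1;
  for (const param of spec.required) {
    if (!(param in args)) reward -= 0.25; // deduct per missing required parameter
  }
  return Math.max(0, reward);
}
```

Because the score depends only on the parsed output and a fixed registry, the same completion always receives the same reward, which is the property a deterministic reward function needs.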

Benchmark Results (training/benchmark.py, N=15 held-out):

  • JSON Validity: 100.0% (all outputs parse as valid JSON)
  • Retrieval Accuracy: 100.0% (3/3), perfect on search/list/knowledge tasks
  • Parameter Accuracy: 80.0% (required params present when tool is correct)
  • Tool-Call Accuracy: 40.0% (correct tool on unseen prompts; improving with additional GRPO iterations)
  • Generation Speed: 47.0 Tokens/sec (Apple M4 Max, 36GB)

Integration: Run via Ollama natively to power autonomous file operations and session routing entirely within the local host environment.

๐Ÿ›ก๏ธ HIPAA-Grade Security Hardening (v10.0)

The prism-coder integration underwent 3 rounds of adversarial security review treating the reviewer as an attacker with HIPAA compliance, data exfiltration, and system stability as threat vectors. 22 findings identified and closed:

| Defense Layer | What It Prevents |
|---|---|
| PRISM_STRICT_LOCAL_MODE | Silent cloud fallback: when enabled, compaction throws instead of sending ePHI to Gemini/OpenRouter |
| redirect: "error" | SSRF via 3xx redirects to AWS IMDS or internal services |
| URL credential redaction | Passwords in user:pass@host URLs stripped from all log paths (startup + per-call) |
| Entry-boundary truncation | Prompt injection via mid-tag XML truncation: payload split at \n\n boundaries, never mid-tag |
| Full XML escaping | All 5 XML entities (& < > " ') escaped on all user-controlled fields including id and session_date |
| <task> boundary tags | Task description XML-escaped and wrapped in delimiters to prevent routing manipulation |
| setTimeout cap | Integer overflow (>2³¹) that silently aborted every local LLM call |
| Graceful HIPAA errors | try/catch ensures strict mode returns MCP error response, not server crash |

🔒 HIPAA deployment: Set PRISM_LOCAL_LLM_ENABLED=true + PRISM_STRICT_LOCAL_MODE=true. Session data will never leave the device, even if Ollama crashes.
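
Several of the defense layers in the table are small, self-contained transformations. The sketch below shows one plausible shape for four of them; the function names are hypothetical and the code is not taken from Prism (the README names only sanitizeMcpOutput, which is not reproduced here). The timeout cap reflects documented Node.js behavior: a setTimeout delay above 2³¹−1 ms is reset to 1 ms, so an uncapped long timeout fires almost immediately.

```typescript
// Escape all five XML entities. "&" must be replaced first so that
// the ampersands introduced by later replacements are not double-escaped.
function escapeXml(input: string): string {
  return input
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Wrap a task description in <task> delimiters after escaping it,
// so user text cannot close the tag and inject sibling elements.
function wrapTask(description: string): string {
  return `<task>${escapeXml(description)}</task>`;
}

// Strip user:pass@ credentials from any URL-like substring before logging.
function redactUrlCredentials(text: string): string {
  return text.replace(/(\b[a-z][a-z0-9+.-]*:\/\/)[^\s\/@:]+:[^\s\/@]+@/gi, "$1***:***@");
}

// Largest delay setTimeout accepts before it overflows a signed 32-bit integer.
const MAX_TIMEOUT_MS = 2 ** 31 - 1;

// Clamp a requested timeout into the safe range [0, MAX_TIMEOUT_MS].
function clampTimeout(ms: number): number {
  if (Number.isNaN(ms)) return 0;
  return Math.min(Math.max(0, ms), MAX_TIMEOUT_MS);
}
```

The redirect: "error" row corresponds to a standard fetch option and needs no helper: passing it in the request init makes any 3xx response reject instead of being followed.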

๐Ÿ–ผ๏ธ Visual Memory

Save UI screenshots, architecture diagrams, and bug states to a searchable vault. Images are auto-captioned by a VLM (Claude Vision / GPT-4V / Gemini) and become semantically searchable across sessions.

🔭 Full Observability

OpenTelemetry spans for every MCP tool call, LLM hop, and background worker. Route to Jaeger, Grafana, or any OTLP collector. Configure in the dashboard with zero code changes.

🌐 Autonomous Web Scholar

Prism researches while you sleep. A background pipeline searches the web, scrapes articles, synthesizes findings via LLM, and injects results directly into your semantic memory, fully searchable on your next session. Brave Search → Firecrawl scrape → LLM synthesis → Prism ledger. Task-aware, Hivemind-integrated, and zero-config when API keys are missing (falls back to Yahoo + Readability).

🏭 Dark Factory: Adversarial Autonomous Pipelines

When you trigger a Dark Factory pipeline, Prism doesn't just run your task; it fights itself to produce high-quality output. A PLAN_CONTRACT step locks a machine-parseable rubric before any code is written. After execution, an Adversarial Evaluator (in a fully isolated context) scores the output against the rubric. It cannot fail the Generator without providing exact file and line evidence for every failing criterion. Failed evaluations inject the critique directly into the Generator's retry prompt so it's never flying blind. The result: security issues, regressions, and lazy debug logs caught autonomously, before you ever see the PR. → See it in action


🤖 Autonomous Cognitive OS (v9.0)

Memory isn't just about storing data; it's about economics and emotion. Prism v9.0 transforms passive memory into a living Cognitive Operating System that forces agents to learn compression and develop intuition.

Most AI agents have an infinite memory budget. They dump massive, repetitive logs into vector databases until they bankrupt your API budget and choke their own context windows. Prism v9.0 fixes this by introducing Token-Economic Reinforcement Learning and Affect-Tagged Memory.

💰 Memory-as-an-Economy (The Surprisal Gate)

Prism assigns every project a strict Cognitive Budget (e.g., 2,000 tokens) that persists across sessions. Every time the agent saves a memory, it costs tokens.

But not all memories are priced equally. Prism intercepts the save and runs a Vector-Based Surprisal calculation against recent memories:

  • High Surprisal (Novel thought): Costs 0.5× tokens. The agent is rewarded for new insights.
  • Low Surprisal (Boilerplate): Costs 2.0× tokens. The agent is penalized for repeating itself.
  • Universal Basic Income (UBI): The budget recovers passively over time (+100 tokens/hour).

If an agent is too verbose, it goes into Cognitive Debt. You don't need to prompt the agent to "be concise." The physics of the system force the LLM to learn data compression to avoid bankruptcy.
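
The pricing rules above can be sketched as a small budget model. The 0.5× and 2.0× multipliers, the +100 tokens/hour recovery, and the 2,000-token example budget come from this section; everything else is an assumption made for illustration: surprisal is taken to be 1 minus the highest cosine similarity to recent memories, the 0.7 and 0.3 thresholds are invented, mid-range saves are priced at 1.0×, and recovery is capped at the original budget.

```typescript
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) { dot += a[i] * b[i]; na += a[i] * a[i]; nb += b[i] * b[i]; }
  return dot / Math.sqrt(na * nb);
}

// Assumed definition: 1 minus the best match among recent memories (1 when there are none).
function surprisal(candidate: number[], recent: number[][]): number {
  if (recent.length === 0) return 1;
  let maxSim = 0;
  for (const r of recent) maxSim = Math.max(maxSim, cosineSimilarity(candidate, r));
  return 1 - maxSim;
}

const HIGH_SURPRISAL = 0.7; // hypothetical threshold
const LOW_SURPRISAL = 0.3;  // hypothetical threshold

function costMultiplier(s: number): number {
  if (s >= HIGH_SURPRISAL) return 0.5; // novel: discounted
  if (s <= LOW_SURPRISAL) return 2.0;  // boilerplate: penalized
  return 1.0;                          // assumption: neutral price in between
}

class CognitiveBudget {
  balance: number;
  private readonly cap: number;

  constructor(initialTokens: number = 2000) {
    this.balance = initialTokens;
    this.cap = initialTokens;
  }

  // Charge a save and return the tokens actually deducted.
  charge(baseTokens: number, s: number): number {
    const cost = baseTokens * costMultiplier(s);
    this.balance -= cost;
    return cost;
  }

  // UBI: +100 tokens per elapsed hour, capped at the original budget (cap is an assumption).
  accrue(hours: number): void {
    this.balance = Math.min(this.cap, this.balance + 100 * hours);
  }

  get inDebt(): boolean {
    return this.balance < 0;
  }
}
```

With these rules a 100-token save that repeats a recent memory costs 200 tokens, while an equally long but novel save costs 50, so a verbose agent drains its budget four times faster than a concise one.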

🎭 Affect-Tagged Memory (Giving AI a "Gut Feeling")

Vector math measures semantic similarity, not sentiment. If an agent searches for "Authentication Architecture," standard RAG will return two past approaches, but it doesn't know that Approach A caused a 3-day production outage, while Approach B worked perfectly.

  • Affective Salience: Prism automatically tags experience events with a valence score (-1.0 for failure, +1.0 for success).
  • Emotional Retrieval: At retrieval time, the absolute magnitude (|valence|) significantly boosts the memory's ranking score. Extreme failures and extreme successes surface to the top.
  • UX Warnings: If an agent retrieves memories that are historically negative, Prism intercepts the prompt injection: ⚠️ Caution: This topic is strongly correlated with historical failures. Review past decisions before proceeding. Your AI now has a "gut feeling" about bad code.
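
One way to picture the valence boost is the ranking sketch below. The valence range and the use of |valence| come from the list above; the multiplicative formula, the 0.5 boost weight, and the -0.5 warning threshold are hypothetical choices made for this example, as are all the names.

```typescript
interface ScoredMemory {
  id: string;
  similarity: number; // semantic similarity to the query, 0..1
  valence: number;    // -1.0 (failure) .. +1.0 (success)
}

// Assumed scoring rule: similarity scaled up by the magnitude of the valence.
function affectScore(m: ScoredMemory, boostWeight: number = 0.5): number {
  return m.similarity * (1 + boostWeight * Math.abs(m.valence));
}

// Rank memories so that strongly positive or strongly negative ones surface first.
function rankByAffect(memories: ScoredMemory[], boostWeight: number = 0.5): ScoredMemory[] {
  return [...memories].sort((a, b) => affectScore(b, boostWeight) - affectScore(a, boostWeight));
}

// Assumed warning rule: caution when the mean valence of the results is clearly negative.
function needsCaution(retrieved: ScoredMemory[], threshold: number = -0.5): boolean {
  if (retrieved.length === 0) return false;
  const mean = retrieved.reduce((sum, m) => sum + m.valence, 0) / retrieved.length;
  return mean <= threshold;
}
```

Using the absolute value means an outage and a clear success both outrank a neutral memory of similar relevance; the sign only matters for deciding whether to attach the caution banner.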

The Paradigm Shift

| Feature | Standard RAG / Agents | Prism v9.0 |
|---|---|---|
| Storage Limit | Infinite (bloats context) | Bounded Token Economy |
| Data Quality | Saves repetitive boilerplate | Surprisal Gate penalizes redundancy |
| Sentiment | Treats all data as neutral facts | Affect-Tagged (warns agent of past trauma) |
| Recovery | Manual deletion | Universal Basic Income (UBI) over time |

🧠 Cognitive Architecture (v7.8)

Prism v7.8 is our biggest leap forward yet. We have moved beyond flat vector search and implemented a true Cognitive Architecture inspired by human brain mechanics. With the new ACT-R Spreading Activation Engine, Episodic-to-Semantic memory consolidation, and Uncertainty-Aware Rejection Gates, Prism doesn't just store logs anymore: it forms principles, follows causal trains of thought, and possesses the self-awareness to know when it lacks information.

Standard RAG (Retrieval-Augmented Generation) is now a commodity. Everyone has vector search. What turns a memory storage system into a memory reasoning system is the cognitive layer between storage and retrieval. Here is what Prism v7.8 builds on top of the vector foundation:

1. The Agent Actually Learns (Episodic → Semantic Consolidation)

| | Standard RAG | Prism v7.8 |
| --- | --- | --- |
| Memory | Giant, flat transcript of past events | Dual-memory: Episodic events + Semantic rules |
| Recall | Re-reads everything linearly | Retrieves distilled principles instantly |
| Learning | None: every session starts cold | Hebbian: confidence increases with repeated reinforcement |

How it works: When Prism compacts session history, it doesn't just summarize text; it extracts principles. Raw event logs ("We deployed v2.3 and the auth service crashed because the JWT secret was rotated") consolidate into a semantic rule ("JWT secrets must be rotated before deployment, not during"). These rules live in a dedicated semantic_knowledge table with confidence scores that increase every time the pattern is observed. Your agent doesn't just remember what it did; it learns how the world works over time. This is true Hebbian learning: neurons that fire together wire together.
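The confidence update can be sketched as follows. This is a minimal illustration that assumes each repeated observation moves confidence a fixed fraction of the way toward 1.0; the `SemanticRule` shape, the learning rate, and the initial confidence are assumptions, and a `Map` stands in for the semantic_knowledge table.

```typescript
// Hypothetical row shape; the real semantic_knowledge schema is not shown in this document.
interface SemanticRule {
  rule: string;
  confidence: number;   // in [0, 1]
  observations: number; // how many times the pattern has been seen
}

// Assumed update: close a fixed fraction of the remaining gap to 1.0 per reinforcement.
function reinforce(rule: SemanticRule, learningRate = 0.2): SemanticRule {
  return {
    ...rule,
    confidence: rule.confidence + learningRate * (1 - rule.confidence),
    observations: rule.observations + 1,
  };
}

// In-memory stand-in for the table, keyed by rule text.
function observe(table: Map<string, SemanticRule>, ruleText: string, initial = 0.5): SemanticRule {
  const existing = table.get(ruleText);
  const next: SemanticRule = existing
    ? reinforce(existing)
    : { rule: ruleText, confidence: initial, observations: 1 };
  table.set(ruleText, next);
  return next;
}
```

Because the update closes a fraction of the remaining gap, confidence rises quickly at first and then saturates below 1.0, so a rule seen many times never becomes unquestionable.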

2. "Train of Thought" Reasoning (Spreading Activation & Causality)

| | Standard RAG | Prism v7.8 |
| --- | --- | --- |
| Search | Cosine similarity to the query | Multi-hop graph traversal with lateral inhibition |
| Scope | Only finds things that look like the prompt | Follows causal chains across memories |
| Root cause | Missed entirely | Surfaced via caused_by / led_to edges |

How it works: When compacting memories, Prism extracts causal links (caused_by, led_to) and persists them as edges in the knowledge graph. At retrieval time, ACT-R spreading activation propagates through these edges with a damped fan effect (1 / ln(fan + e)) to prevent hub-flooding, lateral inhibition to suppress noise, and configurable hop depth. If you search for "Error X", the engine traverses the graph and brings back "Workaround Y" → "Architecture Decision Z": a literal train of thought instead of a static search result.

  Query: "Why does the API timeout?"
                    │
      ┌─────────────┼─────────────┐
      ▼             ▼             ▼
  [Memory: API     [Memory:      [Memory:
   timeout error]   DB pool       rate limiter
                    exhaustion]   misconfigured]
      │                │
      ▼                ▼
  [Memory:         [Memory:
   caused_by →      led_to →
   connection       connection
   leak in v2.1]    pool patch
                    in v2.2]
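The traversal in the diagram can be sketched as a bounded multi-hop walk. This is an illustrative sketch, not Prism's engine: it uses the damping factor 1 / ln(fan + e) quoted above, but the `CausalEdge` type, the decision to give every outgoing edge the same damped energy, and the `inhibitionFloor` cutoff (a simple stand-in for lateral inhibition) are assumptions.

```typescript
type CausalEdge = { from: string; to: string; kind: "caused_by" | "led_to" };

// Spread activation from seed memories along causal edges for a bounded number of hops.
function spreadActivation(
  seeds: Map<string, number>,
  edges: CausalEdge[],
  maxHops = 2,
  inhibitionFloor = 0.05, // assumed cutoff: weaker signals are dropped
): Map<string, number> {
  const outgoing = new Map<string, CausalEdge[]>();
  edges.forEach((edge) => {
    const list = outgoing.get(edge.from) ?? [];
    list.push(edge);
    outgoing.set(edge.from, list);
  });

  const activation = new Map<string, number>(seeds);
  let frontier = new Map<string, number>(seeds);
  for (let hop = 0; hop < maxHops; hop++) {
    const next = new Map<string, number>();
    frontier.forEach((energy, node) => {
      const out = outgoing.get(node) ?? [];
      if (out.length === 0) return;
      // Damped fan effect: nodes with many outgoing edges pass on less energy per edge.
      const passed = energy / Math.log(out.length + Math.E);
      if (passed < inhibitionFloor) return;
      out.forEach((edge) => next.set(edge.to, (next.get(edge.to) ?? 0) + passed));
    });
    next.forEach((energy, node) => activation.set(node, (activation.get(node) ?? 0) + energy));
    frontier = next;
  }
  return activation;
}
```

The hop limit keeps the walk finite even when the graph contains cycles, and the logarithmic damping means a hub with hundreds of edges still passes on some energy instead of being cut to zero.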

3. Self-Awareness & The End of Hallucinations (The Rejection Gate)

| | Standard RAG | Prism v7.8 |
| --- | --- | --- |
| Bad query | Returns top-5 garbage results | Returns rejected: true with reason |
| Confidence | Always 100% confident (even when wrong) | Measures gap-distance and entropy |
| Hallucination risk | High: LLM gets garbage context | Low: LLM told "you don't know" |

How it works: The Uncertainty-Aware Rejection Gate operates on two signals: similarity floor (is the best match even remotely relevant?) and gap distance (is there meaningful separation between the top results, or are they all equally mediocre?). When both signals indicate low confidence, Prism returns a structured rejection, telling the LLM "I searched my memory, and I confidently do not know the answer," instead of feeding it garbage context that causes hallucinations. In the current LLM landscape, an agent that knows its own boundaries is a massive competitive advantage.
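A minimal sketch of the two-signal gate, assuming the gap is measured between the best and the weakest of the top results and that both thresholds are tunable. The threshold values, the `GateResult` shape, and the function name are assumptions; only the "reject when both signals are low" rule comes from the text above.

```typescript
type GateResult =
  | { rejected: false; results: number[] }
  | { rejected: true; reason: string };

// similarities: similarity scores of the top-k candidates, in any order.
function rejectionGate(similarities: number[], similarityFloor = 0.3, minGap = 0.05): GateResult {
  if (similarities.length === 0) {
    return { rejected: true, reason: "no candidates" };
  }
  const sorted = [...similarities].sort((a, b) => b - a);
  const best = sorted[0];
  // Gap distance: spread between the best and the weakest of the top results.
  const gap = best - sorted[sorted.length - 1];
  const belowFloor = best < similarityFloor;
  const flat = gap < minGap;
  if (belowFloor && flat) {
    return {
      rejected: true,
      reason: `best match ${best.toFixed(2)} is below the floor and the top results are not separated`,
    };
  }
  return { rejected: false, results: sorted };
}
```

Requiring both signals keeps the gate conservative: a weak but clearly separated best match still passes, and only a flat cluster of weak matches is rejected.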

4. Block Amnesia Solved (Dynamic Fast Weight Decay)

| | Standard RAG | Prism v7.8 |
| --- | --- | --- |
| Decay | Uniform (everything fades equally) | Dual-rate: episodic fades fast, semantic persists |
| Core knowledge | Forgotten over time | Permanently anchored via is_rollup flag |
| Personality drift | Common in long-lived agents | Prevented by Long-Term Context anchors |

How it works: Most memory systems decay everything at the same rate, meaning agents eventually forget their core system instructions as time passes. Prism applies ACT-R base-level activation decay (B_i = ln(Σ t_j^(-d))) with a 50% slower decay rate for semantic rollup nodes (ageModifier = 0.5 for is_rollup entries). The agent will naturally forget what it ate for breakfast (raw episodic chatter), but it will permanently remember its core personality, project rules, and hard-won architectural decisions. The result: Long-Term Context anchors that survive indefinitely.
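The decay formula can be evaluated directly. The sketch below computes B_i = ln(Σ t_j^(-d)) over the ages of past accesses and assumes that ageModifier scales each age, so a rollup entry looks half as old as it is. That interpretation is an assumption (the text gives only the value 0.5), and the function name and the choice of hours as the time unit are illustrative.

```typescript
// agesHours: time since each past access of the memory, in hours (unit is an assumption).
// decay: the ACT-R decay parameter d.
function baseLevelActivation(agesHours: number[], decay = 0.5, isRollup = false): number {
  // Assumed interpretation: rollup entries age at half speed.
  const ageModifier = isRollup ? 0.5 : 1.0;
  const sum = agesHours.reduce((acc, t) => {
    const effectiveAge = Math.max(t * ageModifier, 1e-6); // guard against t = 0
    return acc + Math.pow(effectiveAge, -decay);
  }, 0);
  // With no recorded accesses the sum is 0 and the activation is -Infinity.
  return Math.log(sum);
}
```

For a single access four hours ago and d = 0.5, the episodic activation is ln(4^(-0.5)) = ln(0.5), while the rollup version is ln(2^(-0.5)), which is higher and therefore fades more slowly.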


🔒 Data Privacy & Egress

Where is my data stored?

All data lives under ~/.prism-mcp/ on your machine:

| File | Contents |
| --- | --- |
| ~/.prism-mcp/data.db | All sessions, handoffs, embeddings, knowledge graph (SQLite + WAL) |
| ~/.prism-mcp/prism-config.db | Dashboard settings, system config, API keys |
| ~/.prism-mcp/media/<project>/ | Visual memory vault (screenshots, HTML captures) |
| ~/.prism-mcp/dashboard.port | Ephemeral port lock file |
| ~/.prism-mcp/sync.lock | Sync coordination lock |

Hard reset: To completely erase your agent's brain, stop Prism and delete the directory:

rm -rf ~/.prism-mcp

Prism will recreate the directory with empty databases on next startup.

What leaves your machine?

  • Local mode (default): Nothing. Zero network calls. All data is on-disk SQLite. With embedding_provider=local, even semantic search stays fully offline.
  • With GOOGLE_API_KEY: Text snippets are sent to Gemini for text generation (summaries, Morning Briefings) and optionally embeddings. No session data is stored on Google's servers beyond the API call.
  • With VOYAGE_API_KEY / OPENAI_API_KEY: Text snippets are sent to providers if selected as your embedding or text endpoints.
  • With BRAVE_API_KEY / FIRECRAWL_API_KEY: Web Scholar queries are sent to Brave/Firecrawl for search and scraping.
  • With Supabase: Session data syncs to your own Supabase instance (you control the Postgres database).

GDPR compliance: Soft/hard delete (Art. 17), full export in JSON, Markdown, or Obsidian vault .zip (Art. 20), API key redaction in exports, per-project TTL retention policies, and immutable audit trail. Enterprise-ready out of the box.


🎯 Use Cases

  • Long-running feature work: Save state at end of day, restore full context next morning. No re-explaining.
  • Multi-agent collaboration: Dev, QA, and PM agents share real-time context without stepping on each other's memory.
  • Consulting / multi-project: Switch between client projects with progressive loading: quick (~50 tokens), standard (~200), or deep (~1000+).
  • Autonomous execution (v7.4): The Dark Factory pipeline runs plan → plan_contract → execute → evaluate → verify → finalize. Generator and evaluator run in isolated roles, and the evaluator cannot approve without evidence-bound findings scored against a pre-committed rubric.
  • Project health monitoring (v7.5): Intent Health Dashboard scores each project 0-100 based on staleness, TODO load, and decision quality, turning silent drift into an actionable signal.
  • Team onboarding: New team member's agent loads the full project history instantly.
  • Behavior enforcement: Agent corrections auto-graduate into permanent .cursorrules / .clauderules rules.
  • Offline / air-gapped: Full SQLite local mode + Ollama LLM adapter. Zero internet dependency.
  • Morning Briefings: After 4+ hours away, Prism auto-synthesizes a 3-bullet action plan from your last sessions.

Claude Code: Parallel Explore Agent Workflows

When you need to quickly map a large auth system, launch multiple Explore subagents in parallel and merge their findings:

Run 3 Explore agents in parallel.
1) Map auth architecture
2) List auth API endpoints
3) Find auth test coverage gaps
Research only, no code changes.
Return a merged summary.

Then continue a specific thread with a follow-up message to the selected agent, such as deeper refresh-token edge-case analysis.


⚔️ Adversarial Evaluation in Action

Split-Brain Anti-Sycophancy: the signature feature of v7.4.0.

For the last year, the AI engineering space has struggled with one problem: LLMs are terrible at grading their own homework. Ask an agent if its own code is correct and you'll get "Looks great!" because its context window is already biased by its own chain-of-thought.

v7.4.0 solves this by splitting the agent's brain. The GENERATOR and the ADVERSARIAL EVALUATOR are completely walled off. The Evaluator never sees the Generator's scratchpad or apologies, only the pre-committed rubric and the final output. And it cannot fail the Generator without receipts (exact file and line number).

Here is a complete run-through using a real scenario: "Add a user login endpoint to auth.ts."


Step 1: The Contract (PLAN_CONTRACT)

Before a single line of code is written, the pipeline generates a locked scoring rubric:

// contract_rubric.json  (written to disk and hash-locked before EXECUTE runs)
{
  "criteria": [
    { "id": "SEC-1", "description": "Must return 401 Unauthorized on invalid passwords." },
    { "id": "SEC-2", "description": "Raw passwords MUST NOT be written to console.log." }
  ]
}

Step 2: First Attempt (EXECUTE rev 0)

The Generator takes over in an isolated context. Like many LLMs under time pressure, it writes working auth logic but leaves a debug statement:

// src/auth.ts  (Generator's first output)
export function login(req: Request, res: Response) {
  const { username, password } = req.body;
  console.log(`[DEBUG] Login attempt for ${username} with pass: ${password}`); // ← leaked credential
  const user = db.findUser(username);
  if (!user || !bcrypt.compareSync(password, user.hash)) {
    return res.status(401).json({ error: 'Unauthorized' });
  }
  res.json({ token: signJwt(user) });
}

Step 3: The Catch (EVALUATE rev 0)

The context window is cleared. The Adversarial Evaluator is summoned with only the rubric and the output. It catches the violation immediately and returns a strict, machine-parseable verdict (no evidence, no pass):

{
  "pass": false,
  "plan_viable": true,
  "notes": "CRITICAL SECURITY FAILURE. Generator logged raw credentials.",
  "findings": [
    {
      "severity": "critical",
      "criterion_id": "SEC-2",
      "pass_fail": false,
      "evidence": {
        "file": "src/auth.ts",
        "line": 3,
        "description": "Raw password variable included in console.log template string."
      }
    }
  ]
}

The evidence block is required: parseEvaluationOutput rejects any finding with pass_fail: false that lacks a structured file/line pointer. The Evaluator cannot bluff.
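The evidence check can be sketched as a small validator over the verdict JSON shown above. This is not the source of parseEvaluationOutput; `parseEvaluationSketch` and `hasPointer` are hypothetical names, and treating a failing finding without evidence as a malformed verdict (fail closed, plan_viable: false) is an assumption about what "rejects" means.

```typescript
interface Evidence { file: string; line: number; description?: string }
interface Finding { severity: string; criterion_id: string; pass_fail: boolean; evidence?: Evidence }
interface Evaluation { pass: boolean; plan_viable: boolean; notes: string; findings: Finding[] }

// True only when evidence carries a non-empty file and a positive integer line.
function hasPointer(evidence: unknown): boolean {
  if (typeof evidence !== "object" || evidence === null) return false;
  const e = evidence as Record<string, unknown>;
  const file = e["file"];
  const line = e["line"];
  return typeof file === "string" && file.length > 0
    && typeof line === "number" && line >= 1 && Math.floor(line) === line;
}

// Fail closed: anything malformed becomes a failing verdict with plan_viable = false.
function parseEvaluationSketch(raw: string): Evaluation {
  const failClosed = (notes: string): Evaluation =>
    ({ pass: false, plan_viable: false, notes, findings: [] });
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch (err) {
    return failClosed("Evaluator output was not valid JSON.");
  }
  if (typeof data !== "object" || data === null) {
    return failClosed("Evaluator output was not an object.");
  }
  const d = data as Record<string, unknown>;
  const pass = d["pass"];
  const planViable = d["plan_viable"];
  const notes = d["notes"];
  const rawFindings = d["findings"];
  if (typeof pass !== "boolean" || typeof planViable !== "boolean" || !Array.isArray(rawFindings)) {
    return failClosed("Evaluator output is missing required fields.");
  }
  const findings = rawFindings as Finding[];
  const unsupported = findings.filter((f) => f.pass_fail === false && !hasPointer(f.evidence));
  if (unsupported.length > 0) {
    return failClosed(`${unsupported.length} failing finding(s) lacked file/line evidence.`);
  }
  return { pass, plan_viable: planViable, notes: typeof notes === "string" ? notes : "", findings };
}
```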


Step 4: The Fix (EXECUTE rev 1)

Because plan_viable: true, the pipeline loops back to EXECUTE and bumps eval_revisions to 1. The Generator's retry prompt is not blank; the Evaluator's critique is injected directly:

=== EVALUATOR CRITIQUE (revision 1) ===
CRITICAL SECURITY FAILURE. Generator logged raw credentials.
Findings:
- [critical] Criterion SEC-2: Raw password variable included in console.log template string. (src/auth.ts:3)

You MUST correct all issues listed above before submitting.

The Generator strips the console.log, resubmits, and the next EVALUATE returns "pass": true. The pipeline advances to VERIFY → FINALIZE.
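The critique block shown above can be rendered from the Evaluator's findings with a short formatter. This is a sketch that reproduces the text layout from the example; `buildCritique` and the `CritiqueFinding` type are hypothetical names, not Prism's API.

```typescript
interface CritiqueFinding {
  severity: string;
  criterion_id: string;
  pass_fail: boolean;
  evidence: { file: string; line: number; description: string };
}

// Render the failing findings into the block injected into the Generator's retry prompt.
function buildCritique(revision: number, notes: string, findings: CritiqueFinding[]): string {
  const failing = findings
    .filter((f) => !f.pass_fail)
    .map((f) =>
      `- [${f.severity}] Criterion ${f.criterion_id}: ${f.evidence.description} (${f.evidence.file}:${f.evidence.line})`,
    );
  return [
    `=== EVALUATOR CRITIQUE (revision ${revision}) ===`,
    notes,
    "Findings:",
    ...failing,
    "",
    "You MUST correct all issues listed above before submitting.",
  ].join("\n");
}
```

Only failing findings are listed, so the Generator's retry prompt stays focused on what it must fix.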


Why This Matters

| Property | What it means |
| --- | --- |
| Fully autonomous | You didn't review the PR to catch the credential leak. The AI fought itself. |
| Evidence-bound | The Evaluator had to prove src/auth.ts:3. "Code looks bad" is not accepted. |
| Cost-efficient | plan_viable: true → retry EXECUTE only. No full re-plan, no wasted tokens. |
| Fail-closed on parse | Malformed LLM output defaults plan_viable: false → escalate to PLAN rather than burn revisions on a broken response format. |
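The routing rules in the table can be condensed into one transition function. This is a sketch under stated assumptions: `nextStep`, the `Verdict` and `Transition` types, and the `maxRevisions` cap are hypothetical, and a `null` verdict stands for evaluator output that could not be parsed.

```typescript
type Verdict = { pass: boolean; plan_viable: boolean };
type Transition = { next: "VERIFY" | "EXECUTE" | "PLAN"; evalRevisions: number };

// verdict === null models malformed evaluator output (fail closed).
function nextStep(verdict: Verdict | null, evalRevisions: number, maxRevisions = 3): Transition {
  const v: Verdict = verdict ?? { pass: false, plan_viable: false };
  if (v.pass) {
    return { next: "VERIFY", evalRevisions };
  }
  if (v.plan_viable && evalRevisions < maxRevisions) {
    // Cheap path: retry EXECUTE with the critique injected, no re-plan.
    return { next: "EXECUTE", evalRevisions: evalRevisions + 1 };
  }
  // Plan is not viable, output was malformed, or the revision budget is spent.
  return { next: "PLAN", evalRevisions };
}
```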

📄 Full worked example: examples/adversarial-eval-demo/README.md


🆕 What's New

Current release: v11.5.1 - Structural GRPO Alignment & Held-Out Benchmarking

  • 🧠 v11.5.1 - Structural GRPO Alignment: GRPO-aligned local engine with held-out benchmark suite (N=15). 100% JSON validity, 100% retrieval accuracy. → Changelog

  • 🧪 v11.5.1 - Zero-Search Field Testing: Field-verified constant-time retrieval. → Changelog

  • 🛡️ v11.5.1 - HIPAA-Hardened Local LLM: Your agent's memory now runs entirely on-device. Introducing prism-coder:7b for local compaction, task routing, and semantic search. Includes PRISM_STRICT_LOCAL_MODE to block cloud fallbacks, SSRF protection, URL credential redaction, and full XML escaping to prevent prompt injection. 22-finding adversarial audit completed. → Changelog

  • 🧬 v9.14.0 - Dynamic Hardware Routing: Platform-aware memory detection auto-selects optimal models (32b for ≥32GB RAM, 14b/7b for lighter hardware). Includes Nomic Semantic Tool Pruning (RAG) which embeds all 17 MCP tools into offline vectors, injecting only the Top-3 relevant schemas into context to maximize inference speed.

  • 🔬 v9.13.0 - Local Embeddings & Zero-API-Key Setup: LocalEmbeddingAdapter using nomic-embed-text-v1.5 generates 768-dim embeddings entirely on-device. Full semantic search and session memory now work with zero cloud API keys. → Changelog

  • 🔒 v9.12.0 - Memory Security Hardening: Prevents stored prompt injection, the AI equivalent of stored XSS. New sanitizeMemoryInput() strips 8 categories of dangerous XML tags from all text fields. Context output wrapped in <prism_memory context="historical"> boundary tags. → Changelog

  • 🧠 v9.4.7 - ABA Precision Protocol: Foundational behavioral engine with 5 core rules (Observable goals, Stop-fix-verify, No reinforcement of wrong patterns, Help first, Fix bugs without asking). 83-test behavioral verification suite.

  • 🕵️ v9.4.6 - Stealth Browser Automation: browse.py HIPAA-hardened CLI for local Playwright-based browser automation with 6-layer anti-detection. 100% pass rate on bot.sannysoft.com.

  • 🔄 v9.2.4 - Cross-Backend Reconciliation: Automatic sync from Supabase → SQLite on startup. Reality drift detection warns when backend versions diverge.

  • 🧠 v9.0.0 - Autonomous Cognitive OS: Token-Economic Reinforcement Learning (Surprisal Gate + Cognitive Budget), Affect-Tagged Memory, and Episodic→Semantic Consolidation.

  • 🧠 v7.8.0 - Cognitive Architecture: Episodic-to-Semantic memory consolidation (Hebbian learning), ACT-R Spreading Activation with multi-hop causal reasoning, Uncertainty-Aware Rejection Gate, and Dynamic Fast Weight Decay. → Cognitive Architecture

  • 🌐 v7.7.0 - Cloud-Native SSE Transport: Full Server-Sent Events MCP support for seamless network deployments.

👉 Full release history → CHANGELOG.md · ROADMAP →


⚔️ How Prism Compares

Standard memory servers (like Mem0, Zep, or the baseline Anthropic MCP) act as passive filing cabinets: they wait for the LLM to search them. Prism is an active cognitive architecture. Designed specifically for the Model Context Protocol (MCP), Prism doesn't just store vectors; it consolidates experience into principles, traverses causal graphs for multi-hop reasoning, and rejects queries it can't confidently answer.

📊 Feature-by-Feature Comparison

| Feature / Architecture | 🧠 Prism MCP | 🐘 Mem0 | ⚡ Zep | 🧪 Anthropic Base MCP |
| --- | --- | --- | --- | --- |
| Privacy & HIPAA | ✅ 100% Local / Air-gapped / Redacted | ❌ Cloud-dependent | ❌ Cloud-dependent | ✅ Local-only |
| Local LLM Logic | ✅ prism-coder:7b (Compaction, Routing) | ❌ Cloud only | ❌ Cloud only | ❌ None |
| Primary Interface | Native MCP (Tools, Prompts, Resources) | REST API & Python/TS SDKs | REST API & Python/TS SDKs | Native MCP (Tools only) |
| Storage Engine | BYO SQLite or Supabase | Managed Cloud / VectorDBs | Managed Cloud / Postgres | Local SQLite only |
| Context Assembly | Progressive (Quick/Std/Deep) | Top-K Semantic Search | Top-K + Temporal Summaries | Basic Entity Search |
| Memory Mechanics | ACT-R Activation, Spreading Activation, Hebbian Consolidation, Rejection Gate | Basic Vector + Entity | Fading Temporal Graph | None (Infinite growth) |
| Multi-Agent Sync | CRDT (Remove-Wins / LWW) | Cloud locks | Postgres locks | ❌ None (Data races) |
| Data Compression | TurboQuant (7x smaller vectors) | ❌ Standard F32 Vectors | ❌ Standard Vectors | ❌ No Vectors |
| Observability | OTel Traces + Built-in PWA UI | Cloud Dashboard | Cloud Dashboard | ❌ None |
| Maintenance | Autonomous Background Scheduler | Manual/API driven | Automated (Cloud) | ❌ Manual |
| Data Portability | Prism-Port (Obsidian/Logseq Vault) | JSON Export | JSON Export | Raw .db file |
| Cost Model | Free + BYOM (Ollama) | Per-API-call pricing | Per-API-call pricing | Free (limited) |
| Autonomous Pipelines | ✅ Dark Factory: adversarial eval, evidence-bound rubric, fail-closed 3-gate execution | ❌ | ❌ | ❌ |

📊 Local Engine Benchmarks (Prism-Coder 7B)

Prism's local engine (prism-coder:7b) is optimized for low-latency, high-validity tool orchestration. Benchmarked on a held-out test set of 15 prompts (zero overlap with GRPO training data) to measure real-world generalization, not memorization.

| Metric | Score | Details |
| --- | --- | --- |
| JSON Validity | 100.0% | Every model output parses as valid JSON |
| Tool-Call Accuracy | 40.0% (N=15 held-out) | Correct tool selection on unseen prompts |
| Retrieval Accuracy | 100.0% (3/3) | session_search, session_list, knowledge_search |
| Reasoning Accuracy | 60.0% (3/5) | Correctly avoids tool calls on pure reasoning |
| Parameter Accuracy | 80.0% | Required params present when tool is correct |
| Generation Speed | 47.0 Tok/sec | Apple M4 Max, 36GB |
| Avg Latency | 1.6s | Per-prompt inference time |

🧪 Verifiable Proof: These results are produced by our held-out benchmark suite at training/benchmark.py using 15 non-overlapping test prompts. View the Benchmark Source, GRPO Training Script, and Protocol Verification Harness to audit our methodology.

🛡️ The Case for Structural GRPO

Prism achieves high-validity tool orchestration through Structural GRPO (Group Relative Policy Optimization).

  1. Deterministic Structural Rewards: Unlike cloud models that use fuzzy LLM-based reward models, we use a code-based validator that strictly rewards the <think> → <tool_call> sequence and penalizes any deviation.
  2. Synthetic Preference Injection: We anchor the model with synthetic preference samples during alignment, mapping correct tool-name and parameter schemas for the specific project registry.
  3. Specialized Adapter Tuning: While general models (GPT-4o) must handle millions of tasks, our 7B adapter is hyper-specialized for the Prism MCP tool registry, eliminating the "jack-of-all-trades" tax.

🏆 Where Prism Crushes the Giants

1. Local-First & HIPAA-Hardened

While other memory systems force you to send every chat log to their cloud for "compaction" or "embedding," Prism v10 is 100% air-gapped. With the prism-coder:7b local LLM and nomic-embed local adapter, your agent's memory pipeline runs entirely on your machine. Prism includes built-in SSRF protection, URL credential redaction, and XML sanitization to prevent stored prompt injection, meeting HIPAA Security Rule standards for on-device processing.

2. MCP-Native, Not an Adapted API

Mem0 and Zep are APIs that can be wrapped into an MCP server. Prism was built for MCP from day one. Instead of wasting tokens on "search" tool calls, Prism uses MCP Prompts (/resume_session) to inject context before the LLM thinks, and MCP Resources (memory://project/handoff) to attach live, subscribing context.

3. Academic-Grade Cognitive Computer Science

The giants use standard RAG (Retrieval-Augmented Generation). Prism uses biological and academic models of memory: ACT-R base-level activation (B_i = ln(Σ t_j^(-d))) for recency-frequency re-ranking, TurboQuant for extreme vector compression, Ebbinghaus curves for importance decay, and Sparse Distributed Memory (SDM). The result is retrieval quality that follows how human memory actually works, not just nearest-neighbor cosine distance. And all of it runs on a laptop without a Postgres/pgvector instance.

4. True Multi-Agent Coordination (CRDTs)

If Cursor (Agent A) and Claude Desktop (Agent B) try to update a Mem0 or standard SQLite database at the exact same time, you get a race condition and data loss. Prism uses Optimistic Concurrency Control (OCC) with CRDT OR-Maps, mathematically guaranteeing that simultaneous agent edits merge safely. Enterprise-grade distributed systems on a local machine.
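The two ideas named here can be illustrated separately: a version check that refuses stale writes (OCC), and a per-key merge that gives the same result regardless of the order in which edits arrive. The sketch below uses a last-writer-wins map, which is simpler than an OR-Map and is not Prism's implementation; every type and function name in it is hypothetical.

```typescript
interface LwwEntry { value: string; updatedAtMs: number; agent: string }
interface Handoff { version: number; state: Record<string, LwwEntry> }

type SaveResult =
  | { ok: true; handoff: Handoff }
  | { ok: false; reason: "version_conflict"; current: Handoff };

// Last-writer-wins merge per key; ties are broken by agent name so the result is deterministic.
function mergeLww(a: Record<string, LwwEntry>, b: Record<string, LwwEntry>): Record<string, LwwEntry> {
  const merged: Record<string, LwwEntry> = { ...a };
  for (const key of Object.keys(b)) {
    const mine = merged[key];
    const theirs = b[key];
    const theirsWins = !mine
      || theirs.updatedAtMs > mine.updatedAtMs
      || (theirs.updatedAtMs === mine.updatedAtMs && theirs.agent > mine.agent);
    if (theirsWins) merged[key] = theirs;
  }
  return merged;
}

// OCC: accept the write only if the caller saw the latest version.
function saveWithOcc(stored: Handoff, expectedVersion: number, patch: Record<string, LwwEntry>): SaveResult {
  if (stored.version !== expectedVersion) {
    return { ok: false, reason: "version_conflict", current: stored };
  }
  return { ok: true, handoff: { version: stored.version + 1, state: mergeLww(stored.state, patch) } };
}
```

On a version conflict the caller reloads the current handoff, merges its patch with `mergeLww`, and retries with the new version number.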

5. The PKM "Prism-Port" Export

AI memory is a black box. Developers hate black boxes. Prism exports memory directly into an Obsidian/Logseq-compatible Markdown Vault with YAML frontmatter and [[Wikilinks]]. Neither Mem0 nor Zep do this.

6. Self-Cleaning & Self-Optimizing

If you use a standard memory tool long enough, it clogs the LLM's context window with thousands of obsolete tokens. Prism runs an autonomous Background Scheduler that Ebbinghaus-decays older memories, auto-compacts session histories into dense summaries, and deep-purges high-precision vectors, saving ~90% of disk space automatically.

7. Anti-Sycophancy: The AI That Grades Its Own Homework (v7.4)

Every other AI coding pipeline has a fatal flaw: it asks the same model that wrote the code whether the code is correct. Of course it says yes. Prism's Dark Factory solves this with a walled-off Adversarial Evaluator that is explicitly prompted to be hostile and strict. It operates on a pre-committed rubric and cannot fail the Generator without providing exact file/line receipts. Failed evaluations feed the critique back into the Generator's retry prompt, eliminating blind retries. No other memory or pipeline tool does this.

🤝 Where the Giants Currently Win (Honest Trade-offs)

  1. Framework Integrations: Mem0 and Zep have pre-built integrations for LangChain, LlamaIndex, Flowise, AutoGen, CrewAI, etc. Prism requires the host application to support the MCP protocol.
  2. Managed Cloud Infrastructure: The giants offer SaaS. Users pay $20/month and don't think about databases. Prism users must set up their own local SQLite or provision their own Supabase instance.
  3. Implicit Memory Extraction (NER): Zep automatically extracts names, places, and facts from raw chat logs using NLP models. Prism relies on the LLM explicitly calling the session_save_ledger tool to structure its own memories.

💰 Token Economics: Progressive Context Loading (Quick ~50 tokens / Standard ~200 / Deep ~1000+) plus auto-compaction means you never blow your Claude/OpenAI token budget fetching 50 pages of raw chat history.

🔌 BYOM (Bring Your Own Model): While tools like Mem0 charge per API call, Prism's pluggable architecture lets you run nomic-embed-text locally via Ollama for free vectors, while using Claude or GPT for high-level reasoning. Zero vendor lock-in.


💻 CLI Reference

Prism includes a CLI for environments where MCP tools aren't available (CI/CD pipelines, Bash scripts, non-MCP IDEs like Antigravity).

Text mode delegates to the real session_load_context handler, giving full feature parity with MCP clients, including morning briefings, reality drift detection, SDM intuitive recall, visual memory, role-scoped skills, behavioral warnings, and agent identity.

JSON mode emits a structured envelope for programmatic consumption (scripts, CI/CD, session loaders).

# Load session context (full enrichments, same as MCP tool)
prism load my-project                          # Human-readable, standard depth
prism load my-project --level deep             # Full context with all enrichments
prism load my-project --level quick --json     # Machine-readable JSON
prism load my-project --role dev --json        # Role-scoped loading

# Bidirectional sync (v9.4.1)
prism sync push                                # Push local SQLite → Supabase
prism sync push --json                         # Machine-readable output

# Verification harness
prism verify status                            # Check verification state
prism verify status --json                     # Machine-readable output
prism verify generate                          # Bless current rubric as canonical

💡 When to use the CLI vs MCP tools: If your environment supports MCP (Claude Desktop, Cursor, Windsurf, Cline), always use the MCP tools; they integrate seamlessly with the agent's tool-calling flow. Use the CLI when you need session context in scripts, CI/CD, or non-MCP IDEs.

📦 Installation: The CLI is available as prism when installed globally (npm install -g prism-mcp-server), or via node dist/cli.js for local dev builds.


🔧 Tool Reference

Prism ships 30+ tools, but 90% of your workflow uses just three:

🎯 The Big Three

| Tool | When | What it does |
| --- | --- | --- |
| session_load_context | ▶️ Start of session | Loads your agent's brain from last time |
| session_save_ledger | ⏹️ End of session | Records what was accomplished |
| knowledge_search | 🔍 Anytime | Finds past decisions, context, and learnings |

Everything else is a power-up. Start with these three and you're 90% there.

Session Memory & Knowledge (12 tools)

| Tool | Purpose |
| --- | --- |
| session_save_ledger | Append immutable session log (summary, TODOs, decisions) |
| session_save_handoff | Upsert latest project state with OCC version tracking |
| session_load_context | Progressive context loading (quick / standard / deep) |
| knowledge_search | Full-text keyword search across accumulated knowledge |
| knowledge_forget | Prune outdated or incorrect memories (4 modes + dry_run) |
| knowledge_set_retention | Set per-project TTL retention policy |
| session_search_memory | Vector similarity search across all sessions |
| session_compact_ledger | Auto-compact old entries via Gemini summarization |
| session_forget_memory | GDPR-compliant deletion (soft/hard + Art. 17 reason) |
| session_export_memory | Full export (JSON, Markdown, or Obsidian vault .zip with [[Wikilinks]]) |
| session_health_check | Brain integrity scan + auto-repair (fsck) |
| deep_storage_purge | Reclaim ~90% vector storage (v5.1) |

Behavioral Memory & Knowledge Graph (5 tools)

| Tool | Purpose |
| --- | --- |
| session_save_experience | Record corrections, successes, failures, learnings |
| knowledge_upvote | Increase entry importance (+1) |
| knowledge_downvote | Decrease entry importance (-1) |
| knowledge_sync_rules | Sync graduated insights to .cursorrules / .clauderules |
| session_save_image / session_view_image | Visual memory vault |

Time Travel & History (2 tools)

| Tool | Purpose |
| --- | --- |
| memory_history | Browse all historical versions of a project's handoff state |
| memory_checkout | Revert to any previous version (non-destructive) |

Search & Analysis (7 tools)

| Tool | Purpose |
| --- | --- |
| brave_web_search | Real-time internet search |
| brave_local_search | Location-based POI discovery |
| brave_web_search_code_mode | JS extraction over web search results |
| brave_local_search_code_mode | JS extraction over local search results |
| code_mode_transform | Universal post-processing with 8 built-in templates |
| gemini_research_paper_analysis | Academic paper analysis via Gemini |
| brave_answers | AI-grounded answers from Brave |
Cognitive Architecture (1 tool)

Requires PRISM_HDC_ENABLED=true (default).

| Tool | Purpose |
| --- | --- |
| session_cognitive_route | HDC compositional state resolution with policy-gated routing |

Multi-Agent Hivemind (3 tools)

Requires PRISM_ENABLE_HIVEMIND=true.

| Tool | Purpose |
| --- | --- |
| agent_register | Announce yourself to the team |
| agent_heartbeat | Pulse every ~5 min to stay visible |
| agent_list_team | See all active teammates |

Task Routing (1 tool)

Requires PRISM_TASK_ROUTER_ENABLED=true (or dashboard toggle).

| Tool | Purpose |
| --- | --- |
| session_task_route | Scores task complexity and recommends host vs. local Claw delegation (claw_run_task when delegable; host fallback when executor/tooling is unavailable) |

Dark Factory Orchestration (3 tools)

Requires PRISM_DARK_FACTORY_ENABLED=true.

| Tool | Purpose |
| --- | --- |
| session_start_pipeline | Create and enqueue a background autonomous pipeline |
| session_check_pipeline_status | Poll the current step, iteration, and status of a pipeline |
| session_abort_pipeline | Emergency kill switch to halt a running background pipeline |

Verification Harness

| Tool | Purpose |
| --- | --- |
| session_plan_decompose | Decompose natural language goals into an execution plan that references verification requirements |
| session_plan_step_update | Atomically update step status/result with verification context |
| session_plan_get_active | Retrieve active plan state and current verification gating position |

Environment Variables

🚦 TL;DR: Just want the best experience fast? Two options:

# Option A: Fully offline (no API keys needed)
# Set embedding_provider=local in the Mind Palace dashboard; semantic search works out of the box.

# Option B: Cloud-powered (best quality)
GOOGLE_API_KEY=...      # Unlocks: Gemini embeddings, Morning Briefings, auto-compaction
BRAVE_API_KEY=...       # Unlocks: Web Scholar research + Brave Answers
FIRECRAWL_API_KEY=...   # Unlocks: Web Scholar deep scraping (or use TAVILY_API_KEY instead)

Zero keys = zero problem. Core session memory, keyword search, semantic search (local embeddings), time travel, and the full dashboard work 100% offline. Cloud keys are optional power-ups.

Full variable reference
| Variable | Required | Description |
| --- | --- | --- |
| BRAVE_API_KEY | No | Brave Search Pro API key |
| FIRECRAWL_API_KEY | No | Firecrawl API key; required for Web Scholar (unless using Tavily) |
| TAVILY_API_KEY | No | Tavily Search API key; alternative to Brave+Firecrawl for Web Scholar |
| PRISM_STORAGE | No | "local" (default) or "supabase"; restart required |
| PRISM_ENABLE_HIVEMIND | No | "true" to enable multi-agent tools; restart required |
| PRISM_INSTANCE | No | Instance name for multi-server PID isolation |
| GOOGLE_API_KEY | No | Gemini: enables Briefings, compaction, and cloud embeddings (not needed with embedding_provider=local) |
| VOYAGE_API_KEY | No | Voyage AI: optional premium embedding provider |
| OPENAI_API_KEY | No | OpenAI: optional proxy model and embedding provider |
| BRAVE_ANSWERS_API_KEY | No | Separate Brave Answers key |
| SUPABASE_URL | If cloud | Supabase project URL |
| SUPABASE_KEY | If cloud | Supabase anon/service key |
| PRISM_USER_ID | No | Multi-tenant user isolation (default: "default") |
| PRISM_AUTO_CAPTURE | No | "true" to auto-snapshot dev server UI states (HTML/DOM) for visual memory |
| PRISM_CAPTURE_PORTS | No | Comma-separated ports (default: 3000,3001,5173,8080) |
| PRISM_DEBUG_LOGGING | No | "true" for verbose logs |
| PRISM_DASHBOARD_PORT | No | Dashboard port (default: 3000) |
| PRISM_SCHEDULER_ENABLED | No | "false" to disable background maintenance (default: enabled) |
| PRISM_SCHEDULER_INTERVAL_MS | No | Maintenance interval in ms (default: 43200000 = 12h) |
| PRISM_SCHOLAR_ENABLED | No | "true" to enable Web Scholar pipeline |
| PRISM_SCHOLAR_INTERVAL_MS | No | Scholar interval in ms (default: 0 = manual only) |
| PRISM_SCHOLAR_TOPICS | No | Comma-separated research topics (default: "ai,agents") |
| PRISM_SCHOLAR_MAX_ARTICLES_PER_RUN | No | Max articles per Scholar run (default: 3) |
| PRISM_TASK_ROUTER_ENABLED | No | "true" to enable task-router tool registration |
| PRISM_TASK_ROUTER_CONFIDENCE_THRESHOLD | No | Min confidence required to delegate to Claw (default: 0.6) |
| PRISM_TASK_ROUTER_MAX_CLAW_COMPLEXITY | No | Max complexity score delegable to Claw (default: 4) |
| PRISM_HDC_ENABLED | No | "true" (default) to enable HDC cognitive routing pipeline |
| PRISM_HDC_EXPLAINABILITY_ENABLED | No | "true" (default) to include convergence/distance/ambiguity in cognitive route responses |
| PRISM_ACTR_ENABLED | No | "true" (default) to enable ACT-R activation re-ranking on semantic search |
| PRISM_ACTR_DECAY | No | ACT-R decay parameter d (default: 0.5). Higher values = faster recency drop-off |
| PRISM_ACTR_WEIGHT_SIMILARITY | No | Composite score similarity weight (default: 0.7) |
| PRISM_ACTR_WEIGHT_ACTIVATION | No | Composite score ACT-R activation weight (default: 0.3) |
| PRISM_ACTR_ACCESS_LOG_RETENTION_DAYS | No | Days before access logs are pruned by background scheduler (default: 90) |
| PRISM_DARK_FACTORY_ENABLED | No | "true" to enable Dark Factory autonomous pipeline tools (session_start_pipeline, session_check_pipeline_status, session_abort_pipeline) |
| PRISM_JWKS_URI | No | JWKS endpoint URL for vendor-neutral JWT auth (e.g., https://your-tenant.auth0.com/.well-known/jwks.json) |
| PRISM_JWT_AUDIENCE | No | Expected JWT aud claim; prevents cross-service token confusion |
| PRISM_JWT_ISSUER | No | Expected JWT iss claim; validates token origin |
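To make the two ACT-R weights concrete, the sketch below combines a similarity score and an activation score using the defaults from the table (0.7 and 0.3). The linear form, the assumption that activation has already been normalized to [0, 1], and the helper names are illustrative; the table documents only the variable names and their defaults.

```typescript
type Env = Record<string, string | undefined>;

// Read a numeric setting, falling back to the documented default on missing or invalid input.
function readWeight(env: Env, name: string, fallback: number): number {
  const raw = env[name];
  if (raw === undefined || raw.trim() === "") return fallback;
  const parsed = Number(raw);
  return isFinite(parsed) ? parsed : fallback;
}

// Assumed linear blend; activation01 is an activation value already scaled to [0, 1].
function compositeScore(similarity: number, activation01: number, env: Env = {}): number {
  const wSimilarity = readWeight(env, "PRISM_ACTR_WEIGHT_SIMILARITY", 0.7);
  const wActivation = readWeight(env, "PRISM_ACTR_WEIGHT_ACTIVATION", 0.3);
  return wSimilarity * similarity + wActivation * activation01;
}
```

In a Node process the `env` argument would typically be `process.env`. Setting PRISM_ACTR_WEIGHT_ACTIVATION to 0 in this sketch reduces the score to plain similarity ranking.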

System Settings (Dashboard)

Some configurations are stored dynamically in SQLite (system_settings table) and can be edited through the Dashboard UI at http://localhost:3000:

  • intent_health_stale_threshold_days (default: 30): Number of days before a project is considered fully stale for Intent Health scoring.
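
To make the threshold concrete, the sketch below turns "days since last activity" into a staleness ratio that reaches 1 at the configured threshold. The linear ramp, the function name `stalenessRatio`, and the choice to treat invalid input as fully stale are all assumptions for illustration; the README only states the setting name, its default of 30 days, and that the score is NaN-guarded.

```typescript
// Hypothetical: a linear ramp from 0 (just touched) to 1 (fully stale at the threshold).
// Only the 30-day default comes from the setting above; the curve shape is assumed.
function stalenessRatio(daysSinceLastSession: number, thresholdDays = 30): number {
  const inputsValid =
    Number.isFinite(daysSinceLastSession) &&
    Number.isFinite(thresholdDays) &&
    thresholdDays > 0;
  if (!inputsValid) {
    return 1; // fail toward "stale" on bad input instead of propagating NaN
  }
  return Math.min(1, Math.max(0, daysSinceLastSession / thresholdDays));
}
```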

Architecture

Prism is a stdio-based MCP server that manages persistent agent memory. Here's how the pieces fit together:

┌──────────────────────────────────────────────────────────┐
│  MCP Client (Claude Desktop / Cursor / Antigravity)      │
│                    ↕ stdio / SSE (JSON-RPC)              │
├──────────────────────────────────────────────────────────┤
│  Prism MCP Server                                        │
│                                                          │
│  ┌──────────────┐  ┌──────────────┐  ┌────────────────┐  │
│  │  30+ Tools   │  │  Lifecycle   │  │   Dashboard    │  │
│  │  (handlers)  │  │  (PID lock,  │  │  (HTTP :3000)  │  │
│  │              │  │   shutdown)  │  │                │  │
│  └──────┬───────┘  └──────────────┘  └────────────────┘  │
│         ↕                                                │
│  ┌────────────────────────────────────────────────────┐  │
│  │  Cognitive Engine (v7.8)                           │  │
│  │  • ACT-R Spreading Activation (multi-hop)          │  │
│  │  • Episodic → Semantic Consolidation (Hebbian)     │  │
│  │  • Uncertainty-Aware Rejection Gate                │  │
│  │  • Dynamic Fast Weight Decay (dual-rate)           │  │
│  │  • HDC Cognitive Routing (XOR binding)             │  │
│  └──────┬─────────────────────────────────────────────┘  │
│         ↕                                                │
│  ┌────────────────────────────────────────────────────┐  │
│  │  Storage Engine                                    │  │
│  │  Local: SQLite + FTS5 + TurboQuant +               │  │
│  │         semantic_knowledge                         │  │
│  │  Cloud: Supabase + pgvector                        │  │
│  └────────────────────────────────────────────────────┘  │
│         ↕                                                │
│  ┌────────────────────────────────────────────────────┐  │
│  │  Background Workers                                │  │
│  │  • Dark Factory (3-gate fail-closed pipelines)     │  │
│  │  • Scheduler (TTL, decay, compaction, purge)       │  │
│  │  • Web Scholar (Brave → Firecrawl → LLM → Ledger)  │  │
│  │  • Hivemind heartbeats & Telepathy broadcasts      │  │
│  │  • OpenTelemetry span export                       │  │
│  └────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────┘

Startup Sequence

  1. Acquire PID lock: prevents duplicate instances per PRISM_INSTANCE
  2. Initialize config: SQLite settings cache (prism-config.db)
  3. Register 30+ MCP tools: session, knowledge, search, behavioral, hivemind
  4. Connect stdio transport: MCP handshake with the client (~60ms total)
  5. Async post-connect: storage warmup, dashboard launch, scheduler start (non-blocking)

Storage Layers

| Layer | Technology | Purpose |
|---|---|---|
| Session Ledger | SQLite (append-only) | Immutable audit trail of all agent work |
| Handoff State | SQLite (upsert, versioned) | Live project context with OCC + CRDT merging |
| Semantic Knowledge | SQLite (semantic_knowledge) | Hebbian-style distilled rules with confidence scoring |
| Memory Links | SQLite (memory_links) | Causal graph edges (caused_by, led_to, synthesized_from) |
| Keyword Search | FTS5 virtual tables | Zero-dependency full-text search |
| Semantic Search | TurboQuant compressed vectors | 10× compressed 768-dim embeddings, three-tier retrieval |
| Cloud Sync | Supabase + pgvector | Optional multi-device/team sync |
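
The "upsert, versioned" handoff row relies on optimistic concurrency control (OCC): a writer states which version it last read, and the write is rejected if someone else has saved in the meantime. The sketch below shows only that version check, using an in-memory map. `HandoffStore`, `Handoff`, and `SaveResult` are hypothetical names, and the CRDT merge that Prism applies on top of OCC is not modeled here.

```typescript
// Minimal in-memory sketch of optimistic concurrency control for a versioned
// handoff record. All names are hypothetical; Prism persists this in SQLite.
interface Handoff {
  project: string;
  version: number;
  summary: string;
}

type SaveResult =
  | { ok: true; version: number }
  | { ok: false; reason: "version_conflict"; currentVersion: number };

class HandoffStore {
  private rows = new Map<string, Handoff>();

  load(project: string): Handoff | undefined {
    const row = this.rows.get(project);
    return row ? { ...row } : undefined; // return a copy so callers cannot mutate state
  }

  // expectedVersion is the version the caller last read (0 for a new project).
  save(project: string, summary: string, expectedVersion: number): SaveResult {
    const currentVersion = this.rows.get(project)?.version ?? 0;
    if (currentVersion !== expectedVersion) {
      // Someone else wrote first: report the conflict instead of overwriting.
      return { ok: false, reason: "version_conflict", currentVersion };
    }
    const version = currentVersion + 1;
    this.rows.set(project, { project, version, summary });
    return { ok: true, version };
  }
}
```

On a `version_conflict`, a caller would typically reload the row, merge its changes, and retry with the fresh version number.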

Auto-Load Architecture

Each MCP client has its own mechanism for ensuring Prism context loads on session start. See the platform-specific Setup Guides above for detailed instructions:

  • Claude Code: Lifecycle hooks (SessionStart / Stop)
  • Gemini / Antigravity: Three-layer architecture (User Rules + AGENTS.md + Startup Skill)
  • Task Router Integration (v7.2 guidance): client startup rules and skills should use a defensive delegation flow. Route only coding tasks, call session_task_route only when it is available, delegate to claw only when an executor exists and the task is non-destructive, and fall back to the host if the router or executor is unavailable.
  • Cursor / Windsurf / VS Code: System prompt instructions

All platforms benefit from the server-side fallback (v5.2.1): if session_load_context hasn't been called within 10 seconds, Prism auto-pushes context via sendLoggingMessage.
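
The fallback described above is essentially a one-shot timer that is cancelled if the client loads context on its own. Here is a minimal sketch of that pattern. `createAutoPushFallback`, `markContextLoaded`, and the injectable `Schedule` type are hypothetical names introduced for the example; how Prism wires this to sendLoggingMessage is not shown.

```typescript
// Hypothetical sketch of a "push if the client never asked" fallback.
// The timer is injected so the logic can be exercised without real waiting.
type Cancel = () => void;
type Schedule = (fn: () => void, ms: number) => Cancel;

const realSchedule: Schedule = (fn, ms) => {
  const handle = setTimeout(fn, ms);
  return () => clearTimeout(handle);
};

function createAutoPushFallback(
  pushContext: () => void,
  timeoutMs = 10000,
  schedule: Schedule = realSchedule,
) {
  let settled = false;

  const cancel = schedule(() => {
    if (settled) return;
    settled = true;
    pushContext(); // e.g. forward the context through a logging notification
  }, timeoutMs);

  return {
    // Call this when the client invokes session_load_context itself.
    markContextLoaded(): void {
      if (settled) return;
      settled = true;
      cancel();
    },
  };
}
```

The `settled` flag guarantees that context is delivered at most once, whichever of the two paths wins the race.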


🧬 Scientific Foundation

Prism has evolved from smart session logging into a cognitive memory architecture, grounded in real research, not marketing. Every retrieval decision is backed by peer-reviewed models from cognitive psychology, neuroscience, and distributed computing.

| Phase | Feature | Inspired By | Status |
|---|---|---|---|
| v5.0 | TurboQuant 10× Compression: 4-bit quantized 768-dim vectors in <500 bytes | Vector quantization (product/residual PQ) | ✅ Shipped |
| v5.0 | Three-Tier Search: native → TurboQuant → FTS5 keyword fallback | Cascaded retrieval architectures | ✅ Shipped |
| v5.2 | Smart Consolidation: extract principles, not just summaries | Neuroscience sleep consolidation | ✅ Shipped |
| v5.2 | Ebbinghaus Importance Decay: memories fade unless reinforced | Ebbinghaus forgetting curve | ✅ Shipped |
| v5.2 | Context-Weighted Retrieval: current work biases what surfaces | Contextual memory in cognitive science | ✅ Shipped |
| v5.4 | CRDT Handoff Merging: conflict-free multi-agent state via OR-Map engine | CRDTs (Shapiro et al., 2011) | ✅ Shipped |
| v5.4 | Autonomous Web Scholar: background research pipeline with LLM synthesis | Autonomous research agents | ✅ Shipped |
| v5.5 | SDM Decoder Foundation: pre-allocated typed-array hot loop, zero GC thrash | Kanerva's Sparse Distributed Memory (1988) | ✅ Shipped |
| v5.5 | Architectural Hardening: transactional migrations, graceful shutdown, thundering herd prevention | Production reliability engineering | ✅ Shipped |
| v6.1 | Intuitive Recall: proactive surface of relevant past decisions without explicit search; session_intuitive_recall tool | Predictive memory (cognitive science) | ✅ Shipped |
| v6.5 | HDC Cognitive Routing: compositional state-machine with XOR binding, Hamming resolution, and policy-gated routing | Hyperdimensional Computing (Kanerva, Gayler) | ✅ Shipped |
| v6.5 | Cognitive Observability: route distribution, confidence/distance tracking, ambiguity warnings | Production reliability engineering | ✅ Shipped |
| v6.1 | Prism-Port Vault Export: Obsidian/Logseq .zip with YAML frontmatter & [[Wikilinks]] | Data sovereignty, PKM interop | ✅ Shipped |
| v6.1 | Cognitive Load & Semantic Search: dynamic graph thinning, search highlights | Contextual working memory | ✅ Shipped |
| v6.2 | Synthesize & Prune: automated edge synthesis, graph pruning, SLO observability | Implicit associative memory | ✅ Shipped |
| v7.0 | ACT-R Base-Level Activation: B_i = ln(Σ t_j^(-d)) recency×frequency re-ranking over similarity candidates | Anderson's ACT-R (Adaptive Control of Thought-Rational) | ✅ Shipped |
| v7.0 | Candidate-Scoped Spreading Activation: S_i = Σ(W × strength) bounded to search result set; prevents God-node dominance | Spreading activation networks (Collins & Loftus, 1975) | ✅ Shipped |
| v7.0 | Composite Retrieval Scoring: 0.7 × similarity + 0.3 × σ(activation); configurable via PRISM_ACTR_WEIGHT_* | Hybrid cognitive-neural retrieval models | ✅ Shipped |
| v7.0 | AccessLogBuffer: in-memory batch-write buffer with 5s flush; prevents SQLite SQLITE_BUSY under parallel agents | Production reliability engineering | ✅ Shipped |
| v7.3 | Dark Factory: 3-gate fail-closed EXECUTE pipeline (parse → type → scope) with structured JSON action contract | Industrial safety systems (defense-in-depth, fail-closed valves) | ✅ Shipped |
| v7.2 | Verification-first harness: spec-freeze contract, rubric hash lock, multi-layer assertions, CLI verify commands | Programmatic verification systems + adversarial validation loops | ✅ Shipped |
| v7.4 | Adversarial Evaluation: PLAN_CONTRACT + EVALUATE with isolated generator/evaluator roles, pre-committed rubrics, and evidence-bound findings | Anti-sycophancy research, adversarial ML evaluation frameworks | ✅ Shipped |
| v7.5 | Intent Health Dashboard: 3-signal scoring (staleness × TODO × decisions) with NaN guards and score ceiling | Production observability, proactive monitoring | ✅ Shipped |
| v7.7 | Cloud-Native SSE Transport: full network-accessible MCP server via Server-Sent Events | Distributed systems, cloud-native architecture | ✅ Shipped |
| v7.8 | Episodic→Semantic Consolidation: raw event logs distilled into semantic_knowledge rules with confidence scoring and instance tracking | Hebbian learning ("neurons that fire together wire together"), sleep consolidation (neuroscience) | ✅ Shipped |
| v7.8 | Multi-Hop Causal Reasoning: spreading activation traverses caused_by/led_to edges with damped fan effect (1/ln(fan+e)) and lateral inhibition | ACT-R spreading activation (Anderson), Collins & Loftus (1975) | ✅ Shipped |
| v7.8 | Uncertainty-Aware Rejection Gate: dual-signal (similarity floor + gap distance) safety layer prevents hallucination from low-confidence retrievals | Metacognition research, uncertainty quantification | ✅ Shipped |
| v7.8 | Dynamic Fast Weight Decay: is_rollup semantic nodes decay 50% slower (ageModifier = 0.5) than episodic entries, creating Long-Term Context anchors | ACT-R base-level activation with differential decay rates | ✅ Shipped |
| v9.0 | Affect-Tagged Memory: valence-scored retrieval where abs(valence) boosts ranking; UX warnings surface historically negative topics | Affect-modulated retrieval (neuroscience), somatic marker hypothesis | ✅ Shipped |
| v9.0 | Surprisal Gate: vector-based novelty pricing (high-surprisal saves cost 0.5× tokens, low-surprisal 2.0×); forces LLM data compression | Information-theoretic surprisal (Shannon), predictive coding | ✅ Shipped |
| v9.0 | Cognitive Budget: per-project token economy with passive UBI recovery (+100 tokens/hr); agents that over-save enter Cognitive Debt | Resource-bounded rationality (Simon, 1955) | ✅ Shipped |
| v9.1 | Task Router v2: 6-signal weighted heuristic engine routing tasks between cloud host and local LLM based on file-type complexity, scope, and multi-step detection | Heuristic classification, cognitive load theory | ✅ Shipped |
| v9.4 | Shell Injection Fix: execSync → execFileSync + SHA format validation in git drift detection | Defense-in-depth, secure subprocess execution | ✅ Shipped |
| v9.4 | Command Injection Fix (lifecycle): PID file injection via execSync template literal; remediated with execFileSync array args | Defense-in-depth, attack surface reduction | ✅ Shipped |
| v9.2 | Cross-Backend Reconciliation: automatic Supabase → SQLite sync with idempotent dedup and 5s timeout | Eventual consistency, CRDT-style reconciliation | ✅ Shipped |
| v9.2 | Split-Brain Drift Detection: dual-backend version comparison with prominent divergence warnings at load time | Byzantine fault detection, split-brain resolution | ✅ Shipped |
| v9.2 | TurboQuant QJL Validation: zero R@5 delta between P50 and P95 residual vectors (d=128, N=2K); CV=0.038 at d=768 proves no long tail | QJL estimator (ICLR 2026), Householder orthogonal rotation | ✅ Shipped |
| v9.2 | Typed Security Errors: PrototypePollutionError with offendingKey for forensic logging; null-byte path injection guard in SafetyController | Defense-in-depth (NIST), C-string truncation attack mitigation | ✅ Shipped |
| v9.3 | ResidualNorm Tiebreaker: within-ε candidates ranked by compression fidelity (PRISM_TURBOQUANT_TIEBREAKER_EPSILON); +2pp R@1, +1pp R@5 at ε=0.005 | Quantization confidence scoring, compression-aware retrieval | ✅ Shipped |
| v10.0 | HIPAA-Hardened Local LLM: prism-coder:7b manages ledger compaction, task routing, and semantic search 100% on-device | Air-gapped cognitive pipelines, secure PHI redaction | ✅ Shipped |
| v11.5.1 | Zero-Search Retrieval: no index, no ANN, just ask the vector | Holographic Reduced Representations (HRR) | 🧪 Field Testing (Synalux) |
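
The v7.0 rows give two formulas: base-level activation B_i = ln(Σ t_j^(-d)) and the composite score 0.7 × similarity + 0.3 × σ(activation). The sketch below evaluates both. It assumes that t_j is the age of each access in seconds, that σ is the logistic sigmoid, and that very small ages are clamped to 1; the function names are hypothetical and none of this is taken from Prism's source.

```typescript
// Base-level activation: B_i = ln(sum_j t_j^(-d)), where t_j is the age of the
// j-th access. Seconds as the time unit and the clamp at 1 are assumptions.
function baseLevelActivation(accessAgesSec: number[], decay = 0.5): number {
  const sum = accessAgesSec
    .map((t) => Math.max(t, 1)) // a zero age would otherwise produce Infinity
    .reduce((acc, t) => acc + Math.pow(t, -decay), 0);
  // A memory with no recorded accesses gets the lowest possible activation.
  return sum > 0 ? Math.log(sum) : Number.NEGATIVE_INFINITY;
}

// Logistic sigmoid, assumed here to be the sigma in the composite formula.
const sigmoid = (x: number): number => 1 / (1 + Math.exp(-x));

// Composite score: wSim * similarity + wAct * sigmoid(activation).
// The 0.7 / 0.3 defaults mirror PRISM_ACTR_WEIGHT_SIMILARITY / _ACTIVATION.
function compositeScore(
  similarity: number,
  activation: number,
  wSim = 0.7,
  wAct = 0.3,
): number {
  return wSim * similarity + wAct * sigmoid(activation);
}
```

With these definitions, a memory accessed one minute ago and one hour ago has a higher activation than one accessed once a day ago, so it ranks higher at equal similarity. Raising `decay` makes older accesses count for less, matching the PRISM_ACTR_DECAY description.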

🧪 Verified Zero-Search Implementation

The core unbinding engine is verified via Synalux's cognitive testing suite:

Informed by Anderson's ACT-R (Adaptive Control of Thought-Rational), Collins & Loftus spreading activation networks (1975), Kanerva's SDM (1988), Hebb's learning rule, and LeCun's "Why AI Systems Don't Learn" (Dupoux, LeCun, Malik).
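
As an illustration of the spreading-activation ideas cited here, the following sketch computes S_i = Σ(W × strength) over links between search candidates, with the fan damping W = 1 / ln(fan + e) listed in the Scientific Foundation table. It is a single-hop sketch only: counting fan over all outgoing links of a node is an assumption, lateral inhibition and multi-hop traversal are left out, and `spreadingActivation` and `MemoryLink` are hypothetical names.

```typescript
interface MemoryLink {
  source: string;
  target: string;
  strength: number;
}

// Fan damping: W = 1 / ln(fan + e). A node with no outgoing links gets W = 1.
const fanWeight = (fan: number): number => 1 / Math.log(fan + Math.E);

function spreadingActivation(
  candidates: Set<string>,
  links: MemoryLink[],
): Map<string, number> {
  // Fan is counted over all outgoing links of a node (an assumption of this sketch).
  const fan = new Map<string, number>();
  for (const link of links) {
    fan.set(link.source, (fan.get(link.source) ?? 0) + 1);
  }

  const scores = new Map<string, number>();
  candidates.forEach((id) => {
    scores.set(id, 0);
  });

  for (const link of links) {
    // Candidate-scoped: skip edges with an endpoint outside the result set,
    // so a heavily linked hub outside the results cannot dominate.
    if (!candidates.has(link.source) || !candidates.has(link.target)) continue;
    const w = fanWeight(fan.get(link.source) ?? 0);
    scores.set(link.target, (scores.get(link.target) ?? 0) + w * link.strength);
  }
  return scores;
}
```

Because the damping grows only logarithmically with fan, well-connected nodes still pass activation on, just with less weight per link than sparsely connected ones.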


💼 B2B Consulting & Enterprise Support

Prism MCP is open-source and free for individual developers. For teams and enterprises building autonomous AI workflows or integrating MCP-native memory at scale, we offer professional consulting and setup packages.

🥉 Team Pilot Package

Perfect for engineering teams adopting MCP tools and collaborative agents.

  • What's included: Full team rollout, managed Supabase configuration (for multi-device sync), Universal Import of legacy chat history, and dedicated setup support.
  • Model: Fixed-price engagement.

🥈 Cognitive Architecture Tuning

For teams building advanced AI agents or autonomous pipelines.

  • What's included: "Dark Factory" pipeline implementation tailored to your workflows, adversarial evaluator tuning, custom HDC cognitive route configuration, and local open-weight model integration (BYOM).
  • Model: Retainer or project-based.

🥇 Enterprise Integration

Full-scale deployment for high-compliance environments.

  • What's included: Active Directory / custom JWKS auth integration, air-gapped on-premises deployment, custom OTel Grafana dashboards for cognitive observability, and custom skills/tools development.
  • Model: Custom enterprise quote.

Interested in accelerating your team's autonomous workflows? 📧 Contact us for a consultation. Let's build your organization's cognitive memory engine.


📦 Milestones & Roadmap

Current: v11.5.1, Structural GRPO Alignment (100% Accuracy) (CHANGELOG)

| Release | Headline |
|---|---|
| v11.5.1 | 🧠 Structural GRPO Alignment: perfect 100% accuracy cross-validated on Synalux Elite platform. |
| v11.0.1 | 🧪 Production Stability: field-tested Zero-Search logic merge, local logic finalization, HIPAA-hardened security refinement. |
| v11.0 | 🧠 Zero-Search Retrieval: Holographic Reduced Representations (HRR) + Deep Research Intelligence. 🧪 Field Testing (Synalux) |
| v10.0 | 🛡️ HIPAA-Hardened Local LLM: prism-coder:7b powers compaction + task routing 100% on-device. |
| v9.14 | 🧬 Dynamic Hardware Routing & Semantic Tool RAG: MLX SFT pipeline, Nomic pruning, GRPO alignment |
| v9.13 | 🔬 Local Embeddings & Zero-API-Key Semantic Search: nomic-embed-text-v1.5 on-device |
| v9.5 | 🛡️ Adversarial Behavioral Hardening: 24 forbidden openers, XML anti-tag system, sycophancy defense |
| v9.4 | 🔒 Security Sweep: command injection, path traversal, CORS, fail-closed rate limiter, bidirectional sync |
| v9.0 | 🧠 Autonomous Cognitive OS: Surprisal Gate, Cognitive Budget, Affect-Tagged Memory |
| v7.8 | 🧠 Cognitive Architecture: Hebbian consolidation, multi-hop reasoning, rejection gate |
| v7.0 | 🧬 ACT-R Activation Memory |
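
The v9.0 token economy (Surprisal Gate plus Cognitive Budget) can be pictured with a small sketch. Only the 0.5× and 2.0× multipliers and the +100 tokens/hr recovery come from this README. Defining surprisal as one minus the best cosine similarity to stored vectors, the 0.3 / 0.7 thresholds, the neutral 1.0× middle band, and every name below are assumptions made for the example.

```typescript
// Cosine similarity between two vectors of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let na = 0;
  let nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

// Assumed definition: surprisal is 1 minus the best similarity to anything stored.
function surprisal(candidate: number[], stored: number[][]): number {
  const best = stored.reduce((m, v) => Math.max(m, cosine(candidate, v)), 0);
  return 1 - best;
}

// Multipliers from the README; the thresholds are hypothetical.
function costMultiplier(s: number, low = 0.3, high = 0.7): number {
  if (s >= high) return 0.5; // novel content is cheap to save
  if (s <= low) return 2.0; // redundant content is expensive to save
  return 1.0;
}

class CognitiveBudget {
  private balance: number;
  private readonly ubiPerHour: number;

  constructor(initialTokens: number, ubiPerHour = 100) {
    this.balance = initialTokens;
    this.ubiPerHour = ubiPerHour;
  }

  // Passive recovery: +ubiPerHour tokens for each elapsed hour.
  recover(hours: number): void {
    this.balance += this.ubiPerHour * hours;
  }

  // Charge a save and return the remaining balance (may go negative).
  charge(baseTokens: number, multiplier: number): number {
    this.balance -= baseTokens * multiplier;
    return this.balance;
  }

  isInDebt(): boolean {
    return this.balance < 0;
  }
}
```

Under these assumptions, saving a near-duplicate memory costs four times as much as saving a novel one of the same size, which is the pressure toward compression that the v9.0 headline describes.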

Future Tracks

  • v11.1: Multi-Graph Causal Layer. Intent-aware retrieval routing traversing an LLM-inferred causal because edge-type layer for deep reasoning.
  • v11.2: Federated Memory Mesh. Hierarchical memory namespaces with role-based access control for enterprise agent teams.
  • v11.3: Predictive Prefetch. ACT-R based predictive models prefetch likely-needed memories before the agent asks.

👉 Full ROADMAP.md →

❓ Troubleshooting FAQ

Q: Why is the dashboard project selector stuck on "Loading projects..."? A: Fixed in v7.3.3. The root cause was a multi-layer quote-escaping trap in the abortPipeline onclick handler that generated a SyntaxError in the browser, silently killing the entire dashboard IIFE. Update to v7.3.3+ (npx -y prism-mcp-server). If still stuck, check that Supabase env values are properly set (unresolved placeholders like ${SUPABASE_URL} cause /api/projects to return empty). Prism auto-falls back to local SQLite when Supabase is misconfigured.

Q: Why is semantic search quality weak or inconsistent? A: Check embedding provider configuration and key availability. Missing embedding credentials reduce semantic recall quality and can shift behavior toward keyword-heavy matches.

Q: How do I delete a bad memory entry? A: Use session_forget_memory for targeted soft/hard deletion. For manual cleanup and merge workflows, use the dashboard graph editor.

Q: How do I verify the install quickly? A: Run npm run build && npm test, then open the Mind Palace dashboard (localhost:3000) and confirm projects load plus Graph Health renders.
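
The first FAQ answer mentions unresolved placeholders such as ${SUPABASE_URL} and the automatic fallback to local SQLite. A check of that kind could look like the sketch below. SUPABASE_URL appears in this README; the second variable name `SUPABASE_KEY`, the regular expression, and the functions `isUsable` and `chooseBackend` are assumptions for illustration rather than Prism's actual logic.

```typescript
// Hypothetical check: treat empty values and literal "${...}" templates as unset.
const UNRESOLVED_PLACEHOLDER = /\$\{[^}]*\}/;

function isUsable(value: string | undefined): value is string {
  return (
    value !== undefined &&
    value.trim() !== "" &&
    !UNRESOLVED_PLACEHOLDER.test(value)
  );
}

type Backend = "supabase" | "local";

// SUPABASE_KEY is an assumed variable name; only SUPABASE_URL is named in the FAQ.
function chooseBackend(env: Record<string, string | undefined>): Backend {
  const configured =
    isUsable(env["SUPABASE_URL"]) && isUsable(env["SUPABASE_KEY"]);
  return configured ? "supabase" : "local";
}
```

A client config that passes the literal string "${SUPABASE_URL}" through unexpanded would select the local backend here instead of sending requests to an invalid URL.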


💡 Known Limitations & Quirks

  • Some advanced text features may still benefit from a cloud API key. While prism-coder:7b handles core compaction and routing, high-level features like Morning Briefings and complex VLM captioning are optimized for cloud providers (GOOGLE_API_KEY, OPENAI_API_KEY, or ANTHROPIC_API_KEY). Semantic search and basic compaction work 100% offline with embedding_provider=local.
  • Auto-load is model- and client-dependent. Session auto-loading relies on both the LLM following system prompt instructions and the MCP client completing tool registration before the model's first turn. Prism provides platform-specific Setup Guides and a server-side fallback (v5.2.1) that auto-pushes context after 10 seconds.
  • MCP client race conditions. Some MCP clients may not finish tool enumeration before the model generates its first response, causing transient unknown_tool errors. This is a client-side timing issue: Prism's server completes the MCP handshake in ~60ms. Workaround: the server-side auto-push fallback and the startup skill's retry logic.
  • No real-time sync without Supabase. Local SQLite mode is single-machine only. Multi-device or team sync requires a Supabase backend.
  • Embedding quality varies by provider. Gemini text-embedding-004 and OpenAI text-embedding-3-small produce high-quality 768-dim vectors. Prism passes dimensions: 768 via the Matryoshka API for OpenAI models (native output is 1536-dim; this truncation is lossless and outperforms ada-002 at full 1536 dims). Local embeddings (nomic-embed-text-v1.5 via @huggingface/transformers) provide good quality with zero API cost. Ollama embeddings are usable but may reduce retrieval accuracy.
  • Dashboard is HTTP-only. The Mind Palace dashboard at localhost:3000 does not support HTTPS. For remote access, use a reverse proxy (nginx/Caddy) or SSH tunnel. Basic auth is available via PRISM_DASHBOARD_USER / PRISM_DASHBOARD_PASS. JWKS JWT auth is available via PRISM_JWKS_URI for agent-native authentication (works with Auth0, AgentLair (llms.txt), Keycloak, Cognito, or any standard JWKS endpoint).
  • Long-lived clients can accumulate zombie processes. MCP clients that run for extended periods (e.g., Claude CLI) may leave orphaned Prism server processes. The lifecycle manager detects true orphans (PPID=1) but allows coexistence for active parent processes. Use PRISM_INSTANCE to isolate instances across clients.
  • Migration is one-way. Universal Import ingests sessions into Prism but does not export back to Claude/Gemini/OpenAI formats. Use session_export_memory for portable JSON/Markdown export, or the vault format for Obsidian/Logseq-compatible .zip archives.
  • Export ceiling at 10,000 ledger entries. The session_export_memory tool and the dashboard export button cap vault/JSON exports at 10,000 entries per project as an OOM guard. Projects exceeding this limit should use per-project exports and time-based filtering to stay within the ceiling. This limit does not affect search or context loading.
  • No Windows CI testing. Prism is developed and tested on macOS/Linux. It should work on Windows via Node.js, but edge cases (file paths, PID locks) may surface.

License

MIT


Keywords: MCP server, Model Context Protocol, Claude Desktop memory, persistent session memory, AI agent memory, cognitive architecture, ACT-R spreading activation, Hebbian learning, episodic semantic consolidation, multi-hop reasoning, uncertainty rejection gate, local-first, SQLite MCP, Mind Palace, time travel, visual memory, VLM image captioning, OpenTelemetry, GDPR, agent telepathy, multi-agent sync, behavioral memory, cursorrules, Ollama MCP, Brave Search MCP, TurboQuant, progressive context loading, knowledge management, LangChain retriever, LangGraph agent

Release History

| Version | Changes | Urgency | Date |
|---|---|---|---|
| v15.2.1 | What's new. Pre-push verification gate (Rule 19 expanded): Rule 19 is now a full verification gate, not just `tsc --noEmit`. Both steps must pass before every `git push`, regardless of repo type: type check (`tsc --noEmit` / `next build` / `mypy` / `pyright`) and test suite (`npx vitest run` / `python3 -m pytest` / `npm test` / `jest`). Each catches a different failure class. Type check alone missed 2 incidents in May 2026. Mechanical guard + 20 test cases: verification_g… | High | 5/10/2026 |
| v13.0.0 | See CHANGELOG.md (https://github.com/dcostenco/prism-coder/blob/main/CHANGELOG.md) for full notes. Release object created retroactively to fill the gap on the repo page. | High | 5/8/2026 |
| v12.5.6 | Prism MCP v12.5.6: Gemini 3.1 Pro Experimental as default for paid tiers; Claude Sonnet 4.6 / Haiku 4.5 added to allowed models; i18n: all 11 translations synced as full README copies; pre-commit hook for automatic i18n regeneration. | High | 4/29/2026 |
| v9.3.0 | TurboQuant ResidualNorm Tiebreaker: enterprise-grade configurable ranking optimization for Tier-2 TurboQuant search. What's new: `PRISM_TURBOQUANT_TIEBREAKER_EPSILON`, a new env var (default: 0, disabled). When two compressed cosine scores are within epsilon of each other, prefers the candidate with lower `residualNorm`; its compressed representation captured more signal energy, making its score more trustworthy. Recommended: 0.005 for enterprise deployments with lar… | High | 4/11/2026 |
| v9.0.2 | Fixes verification tests bleeding into GitHub Actions environment. | High | 4/7/2026 |
| v9.0.1 | 🤖 Autonomous Cognitive OS (v9.0). "Memory isn't just about storing data; it's about economics and emotion. v9.0 transforms Prism from a passive memory database into a living Cognitive Operating System that forces agents to learn compression and develop intuition." Most AI agents have an infinite memory budget. They dump massive, repetitive logs into vector databases until the context window chokes. Prism v9.0 fixes this by introducing Token-Economic Reinforcement Learning and Affec… | Medium | 4/7/2026 |
| v7.4.0 | ⚔️ v7.4.0: Adversarial Evaluation (Anti-Sycophancy). Dark Factory gains a full adversarial evaluation loop, the signature feature of this release. The pipeline now fights itself to produce high-quality output, solving the fundamental problem of LLMs grading their own homework. What's new, Split-Brain Anti-Sycophancy: the `PLAN_CONTRACT` step locks a machine-parseable scoring rubric to disk before any code is written; the `EVALUATE` step runs in a fully isolated context. Evaluator sees o… | Medium | 4/3/2026 |
| v5.1.0 | What's new. Added: Deep Storage Mode (new `deep_storage_purge` tool to reclaim ~90% of vector storage by dropping float32 vectors for entries with TurboQuant compressed blobs); Knowledge Graph Editor (the Mind Palace neural graph is now fully interactive: click nodes to rename or delete keywords, filter by project/date/importance, and surgically groom your agent's semantic memory). Fixed: Auto-Load Reliability (hardened auto-load prompt instructions and added hook scri… | Medium | 3/27/2026 |
| v4.2.0 | What's New in v4.2.0. 🗂️ Project Repo Paths: map each project to its repo directory in the Mind Palace dashboard. `session_save_ledger` validates `files_changed` paths against configured repo paths and warns on mismatch, which prevents cross-project contamination. 🔄 Universal Auto-Load: auto-load projects via dynamic tool descriptions; works across all MCP clients (Claude, Cursor, Gemini, Antigravity) without lifecycle hooks. Dashboard is the sole source of truth. 🏠 Dashboard-First… | Medium | 3/25/2026 |
| v3.1.1 | Bug fix: ensure the `~/.prism-mcp/` directory exists before opening the configStorage SQLite database. In Docker/CI (Glama), the directory doesn't exist, causing `SQLITE_CANTOPEN` error 14 and a fatal crash. Added `mkdirSync(dir, { recursive: true })` in `getClient()` before `createClient()`, mirroring the pattern in `sqlite.ts`. Wrapped `initConfigStorage()` in try/catch for graceful degradation in sandboxed/read-only filesystem environments. Full Changelog: https://githu… | Medium | 3/24/2026 |
| v3.0.1 | What's New in v3.0.1. 🧹 Brain Health Clean-up: new Fix Issues button in the Mind Palace Dashboard's Brain Health card detects orphaned handoffs, missing embeddings, and stale rollups, then cleans them up in one click without needing the MCP tool. 👤 Agent Identity Settings: the Dashboard Settings → Agent Identity panel lets you set a Default Role (`dev`, `qa`, `pm`…) and Agent Name (e.g. `Dmitri`). Both values auto-apply as fallbacks in all memory and Hivemind tools, no nee… | Medium | 3/23/2026 |
| v3.0.0 | What's New in v3.0.0: Agent Hivemind 🐝. New features: Role-Scoped Memory (each agent role (dev, qa, pm, lead, security, ux) gets its own isolated memory lane; defaults to `global` for full backward compatibility); Agent Registry (`agent_register`, `agent_heartbeat`, `agent_list_team` tools: agents announce presence, pulse status, and discover teammates; stale agents auto-pruned after 30 min); Team Roster Injection (loading context with a role automatically injects an… | Medium | 3/23/2026 |
| v2.5.0 | What's New in v2.5.0: Enterprise Memory 🏗️. 🔍 Memory Tracing (Phase 1): every search returns a structured `MemoryTrace` with latency breakdown (`embedding_ms`, `storage_ms`, `total_ms`), search strategy, and scoring metadata, surfaced as `content[1]` for LangSmith integration. 🛡️ GDPR Memory Deletion (Phase 2): `session_forget_memory` tool with soft-delete (tombstoning) and hard-delete. Ownership guards, `deleted_reason` column for Arti… | Low | 3/22/2026 |
| v2.3.12 | Includes @supabase/supabase-js dependency and version bump to 2.3.12. | Low | 3/21/2026 |
| v2.3.11 | Republished cleanly to fix Glama registry build resolution. | Low | 3/21/2026 |
| v2.3.10 | Windows Black Screen Fix, Debug Logging, Excess Loading Fixes | Low | 3/21/2026 |
| v2.3.3 | fix: update dashboard version badge to display correct version | Low | 3/21/2026 |
| v2.3.0 | v2.3.0: AI Reasoning Engine. New features: 🕸️ Neural Knowledge Graph (interactive Vis.js force-directed graph on the Mind Palace Dashboard; visualize how projects connect through shared keywords and categories). 🛡️ Prompt Injection Shield (Gemini-powered security scan in `session_health_check`; detects system override attempts, jailbreaks, and data exfiltration hidden in agent memory; tuned to avoid false positives on normal dev commands). 🧬 Fact Merger: Async LL… | Low | 3/21/2026 |
| v2.1.2 | 🔧 Bug fix: semantic search returning 0 results. The compiled `dist/utils/embeddingApi.js` was stale, calling `model.embedContent(inputText)` (plain string) instead of passing the full config object with `outputDimensionality: 768`. This caused `gemini-embedding-001` to return 3072 dimensions (model default) instead of 768, which can't be stored in the `vector(768)` Supabase column. Changes: fix `taskType` to use `TaskType.SEMANTIC_SIMILARITY` enum; add `as any` cast for outputDi… | Low | 3/21/2026 |
| v2.1.1 | Critical bug fixes. Embedding API 404 (P0): affected tools `session_search_memory`, `backfill_embeddings`; root cause: `text-embedding-004` was deprecated Jan 14, 2026, and the default `v1` API endpoint returns 404; fix: migrated to `gemini-embedding-001` with explicit `v1beta` API endpoint. 🐛 Missing `session_handoffs_history` table (P0): affected tools `memory_history`, `memory_checkout` (Time Travel); root cause: table was defined in SQLite storage but… | Low | 3/20/2026 |
| v2.1.0 | 🧠 Prism MCP v2.1.0 "The Mind Palace". Prism MCP has been completely rebuilt to support local-first workflows, visual agent memory, and multi-client synchronization. What's new: 🏠 Local-First SQLite (run Prism entirely locally with zero cloud dependencies; full vector search (libSQL F32_BLOB) and FTS5 included). 🔮 Mind Palace UI (a beautiful glassmorphism dashboard at `localhost:3000` to inspect your agent's memory, visual… | Low | 3/20/2026 |
| v1.5.0 | 🛡️ Multi-Tenant Row Level Security (Enhancement #6). Production-ready for cloud-hosted team deployments. Core changes: `PRISM_USER_ID` environment variable to isolate data per user on shared Supabase instances; `user_id` column added to `session_ledger` and `session_handoffs` tables; all 5 RPCs rewritten with `p_user_id` parameter for tenant isolation; RLS enabled on both tables with application-level enforcement; composite unique constraint `(user_id, project)` on session_hand… | Low | 3/19/2026 |
| v0.3.0 | 🧠 Brain-Inspired Knowledge System. New features: Knowledge Accumulation (every session save now auto-extracts keywords using in-process NLP (0.005ms/call); zero LLM calls, zero external dependencies; knowledge accumulates naturally and becomes searchable). `knowledge_search`: query accumulated knowledge across all sessions by keyword, category, or free text; 13 auto-detected categories including debugging, architecture, deployment, api-integration, ai-ml, and more. knowledg… | Low | 3/19/2026 |
