
PLUR — Your agents share the same memory

Persistent memory for AI agents. Local-first, zero-cost, works across MCP tools.

plur.ai · Benchmark · Engram Spec · npm

The idea

You correct your agent's coding style on Monday. On Tuesday, it makes the same mistake. You explain your architecture in Cursor. That night, Claude Code has no idea.

PLUR fixes this. Install it once, and corrections, preferences, and conventions persist — across sessions, tools, and machines. Your memory is stored as plain YAML on your disk. No cloud, no API calls, no black box.

The interesting part: Haiku with PLUR memory outperforms Opus without it — 2.6x better on tool routing, at roughly a tenth of the cost. Turns out the bottleneck isn't model intelligence. It's context.

Install

Tell your agent

Go to plur.ai and tell your agent to install memory for your tool — Claude Code, Cursor, Windsurf, or OpenClaw. The site has the right config for your setup.

Manual setup (Claude Code)

One command sets up everything — storage, MCP config, and Claude Code hooks:

npx @plur-ai/mcp init

This creates ~/.plur/ for storage, adds PLUR to your .mcp.json, and installs Claude Code hooks for automatic engram injection. PLUR is installed globally — one MCP server, one store, available in every project. You only run init once.
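
For reference, the entry added to .mcp.json typically has the standard MCP server shape shown below. This is a sketch; the exact command and args that init writes may differ by version:

```json
{
  "mcpServers": {
    "plur": {
      "command": "npx",
      "args": ["-y", "@plur-ai/mcp"]
    }
  }
}
```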

For multi-project setups, use domain/scope to separate knowledge:

cd ~/projects/my-app
npx @plur-ai/cli init --domain myapp --scope project:my-app

This creates a .plur.yaml in the project with defaults that hooks apply automatically. Engrams learned in that project are tagged; recall filters by scope but always includes global knowledge.
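
For illustration, the generated .plur.yaml might look like this (an assumed shape based on the flags above; the actual keys may differ):

```yaml
# .plur.yaml (assumed shape; written by init, read by the hooks)
domain: myapp
scope: project:my-app
```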

Global install (faster startup)

npm install -g @plur-ai/mcp
plur-mcp init

OpenClaw

openclaw plugins install @plur-ai/claw
openclaw config set plur.enabled true

That's it. PLUR works in the background from here. No workflow changes needed — just use your tools as usual. Corrections accumulate automatically.

Hermes Agent

pip install plur-hermes

The plugin registers automatically via Hermes' plugin system. It injects relevant memories before each LLM call, extracts learnings from agent responses, and exposes all PLUR tools to the agent. Requires the PLUR CLI (npm install -g @plur-ai/cli).

Verify it works

Ask your agent: "What's my PLUR status?" — it should call plur_status and return your engram count and storage path.

How it works

PLUR has two storage primitives:

Engrams — learned knowledge that persists across sessions. Each engram is a typed assertion ("always use blue-green deploys", "never force-push to main") with:

  • Activation — retrieval strength that decays over time (ACT-R model) and strengthens on access. Stale facts naturally fade from injection without manual cleanup.
  • Feedback signals — positive/negative ratings that train injection quality over time
  • Scope — hierarchical namespace (global, project:myapp, cluster:prod, service:api) controlling where the engram applies
  • Polarity — automatic classification of "do" vs "don't" rules, so constraints are injected separately from directives
  • Associations — links to other engrams, including co-access edges that form automatically when engrams are recalled together

Episodes — timestamped event records for "what happened when." Each episode captures a summary, timestamp, agent attribution, and channel. Use episodes for incident timelines, session logs, and operational history. Query by time range, agent, or channel.
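
To make the two primitives concrete, here is what records could look like in the YAML store. This is an illustrative sketch only; the field names mirror the descriptions above and are not guaranteed to match the canonical schema in the engram spec:

```yaml
# Hypothetical engram record (illustrative; not the canonical schema)
- id: ENG-2026-0401-001
  assertion: "never force-push to main"
  type: correction
  polarity: dont                      # "do" vs "don't" classification
  scope: project:my-app               # hierarchical namespace
  activation: 1.42                    # decays over time, strengthens on access
  feedback: { positive: 3, negative: 0 }
  associations: [ENG-2026-0322-007]   # co-access edge

# Hypothetical episode record
- summary: "Rolled back the v2 deploy after failed health checks"
  timestamp: 2026-04-01T14:32:00Z
  agent: claude-code
  channel: terminal
```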

You correct your agent  →  engram created  →  YAML on your disk
Agent fixes an incident →  episode captured →  timeline searchable
Next session starts     →  relevant engrams injected  →  agent remembers
You rate the result     →  engram strengthens or decays  →  quality improves
Unused engrams          →  activation decays  →  naturally fade from injection
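
The decay step above follows the ACT-R activation model named in the engram description: activation is the log of summed power-law decays over past accesses. A rough sketch of the idea, not PLUR's actual implementation (the decay constant is an assumption; 0.5 is the conventional ACT-R default):

```typescript
// ACT-R base-level activation: B = ln( sum over accesses of (now - t)^(-d) )
// where t is a past access time and d is the decay rate.
function baseLevelActivation(accessTimes: number[], now: number, d = 0.5): number {
  const sum = accessTimes.reduce((acc, t) => acc + Math.pow(now - t, -d), 0);
  return Math.log(sum);
}

// A recently and frequently accessed engram outranks a stale one:
const recent = baseLevelActivation([90, 95, 99], 100); // three recent accesses
const stale = baseLevelActivation([1], 100);           // one old access
console.log(recent > stale); // → true
```

Frequent or recent access raises activation; long idle periods lower it, which is how stale engrams drop out of injection without manual cleanup.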

Search is fully local: BM25 (with IDF weighting, TF saturation, length normalization) + BGE embeddings + Reciprocal Rank Fusion. Zero API calls. 86.7% on LongMemEval — on par with cloud-based solutions that charge per query.
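
The fusion step merges the BM25 and embedding result lists by summing reciprocal ranks. A minimal sketch of Reciprocal Rank Fusion (illustrative; k = 60 is the constant from the original RRF paper, and PLUR's constant and tie-breaking may differ):

```typescript
// RRF: score(id) = sum over rankings of 1 / (k + rank), with 1-based ranks.
function reciprocalRankFusion(rankings: string[][], k = 60): string[] {
  const scores = new Map<string, number>();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + index + 1));
    });
  }
  // Highest fused score first
  return [...scores.entries()].sort((a, b) => b[1] - a[1]).map(([id]) => id);
}

// An engram ranked highly by both BM25 and embeddings wins:
const bm25Ranking = ["e2", "e1", "e3"];
const embeddingRanking = ["e2", "e4", "e1"];
console.log(reciprocalRankFusion([bm25Ranking, embeddingRanking])[0]); // → "e2"
```

Because RRF only uses ranks, not raw scores, it needs no score normalization between the keyword and embedding retrievers.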

Plugins (OpenClaw, Hermes) automatically capture learnings from agent conversations — no manual saving needed. The agent's corrections become engrams without you doing anything.

See the full engram spec for schema details, activation model, and injection algorithm.

Usage

import { Plur } from '@plur-ai/core'

const plur = new Plur()

// Learn from a correction
plur.learn('toEqual() in Vitest is strict — use toMatchObject() for partial matching', {
  type: 'correction',
  scope: 'project:my-app',
  domain: 'dev/testing'
})

// Recall (hybrid: BM25 + embeddings, zero cost)
const results = await plur.recallHybrid('vitest assertion matching')

// Inject relevant engrams into agent context
const { engrams } = plur.inject('Write tests for the user service', {
  scope: 'project:my-app',
  limit: 15
})

// Feedback trains the system (rate a recalled engram)
plur.feedback(results[0].id, 'positive')

// Capture an event (episode)
plur.capture('Fixed CrashLoopBackOff on bee-3-4 by increasing memory limits', {
  agent: 'claude-code',
  channel: 'terminal'
})

// Query timeline
const incidents = plur.timeline({ agent: 'claude-code' })

// Sync across machines
plur.sync('git@github.com:you/plur-memory.git')

MCP tools

Tool What it does
plur_learn Store a correction, preference, or convention
plur_recall_hybrid Retrieve relevant memories (BM25 + embeddings)
plur_inject_hybrid Select engrams for current task within token budget
plur_feedback Rate relevance (trains quality over time)
plur_forget Retire a memory (activation decays, eventually pruned)
plur_capture Record an event — incident, resolution, session milestone
plur_timeline Query episode history by time, agent, or channel
plur_ingest Extract engrams from text automatically
plur_sync Sync across devices via git
plur_status Check system health and engram counts
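
Under the hood, an MCP client invokes these via a standard tools/call request. A sketch of what a plur_learn call might look like on the wire (the argument names here are assumptions, not the tool's documented schema):

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "plur_learn",
    "arguments": {
      "content": "never force-push to main",
      "type": "correction",
      "scope": "project:my-app"
    }
  }
}
```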

Benchmark

We ran 19 decisive contests across three Claude models (Haiku, Sonnet, Opus). Same task, same prompt — one agent with PLUR, one without. Ties removed.

Knowledge type Record What it tests
House rules 12–0 Tag conventions, file routing, project structure
Tool routing 10–2 Finding the right tool among 100+ options
Past experience 4–0 API quirks, debugging insights, infrastructure
Learned style 5–2 Communication tone, design preferences

31 wins, 4 losses (89% win rate). Without memory, agents got house rules right 10–38% of the time depending on model — with PLUR, 12–0 across every model. Memory isn't a reasoning crutch — it's information the model literally cannot infer.

The cost insight was unexpected: Haiku + PLUR scored 0.80 on discoverability. Opus alone scored 0.31. A $0.25/MTok model with memory beat a $15/MTok model without it.

Full methodology →

What PLUR is — and isn't

PLUR is agent memory — it stores corrections, preferences, conventions, and architectural decisions that an AI agent learns during work sessions, and injects them back when they're relevant.

PLUR is not a general-purpose search engine, a codebase indexer, or a replacement for code intelligence tools. It doesn't parse ASTs, navigate class hierarchies, or search your source files. If you need code-aware search (tree-sitter, language server features, symbol lookup), tools like claude-mem or your IDE's built-in search are the right choice.

The two are complementary:

  • Stores: PLUR holds learned knowledge (engrams) plus an event timeline (episodes); code tools index code structure, symbols, and definitions.
  • Search: PLUR recalls engrams (BM25 + embeddings over memory); code tools use AST traversal, symbol lookup, and semantic code search.
  • Learns: PLUR learns from agent corrections, feedback, and usage patterns; code tools learn from static analysis of source code.
  • Captures: PLUR auto-extracts learnings from conversations (via plugins); code tools have no equivalent.
  • Decays: in PLUR, unused memories fade (ACT-R model); a code index never decays, it reflects current state.
  • Timeline: PLUR's episodes track what happened when (incidents, fixes, decisions); code tools offer git log only.
  • Cross-tool: PLUR works with any MCP client (Claude Code, Cursor, Windsurf, OpenClaw, Hermes); code tools are typically tied to one tool.

While search is a core part of PLUR (finding the right engram to inject), the search targets are always engrams β€” not files, not code, not documents. PLUR's hybrid search (BM25 + embeddings + RRF) is optimized for short natural-language assertions, not source code.

Packages

Package Description
@plur-ai/core Engram engine — learn, recall, inject, search, decay
@plur-ai/mcp MCP server for Claude Code, Cursor, Windsurf
@plur-ai/claw OpenClaw ContextEngine plugin
plur-hermes Hermes Agent plugin (Python, via CLI bridge)

Architecture

@plur-ai/core
├── engrams.ts           Engram CRUD + YAML persistence
├── episodes.ts          Episode capture + timeline queries
├── fts.ts               BM25 with IDF, TF saturation (k1/b), length normalization
├── embeddings.ts        BGE-small-en-v1.5, 384-dim, local ONNX
├── hybrid-search.ts     Reciprocal Rank Fusion
├── inject.ts            Context-aware selection + spreading activation
├── decay.ts             ACT-R activation decay
├── secrets.ts           Secret detection (API keys, passwords, tokens)
├── sync.ts              Git-based sync + file locking (O_EXCL)
├── storage.ts           Path detection + YAML I/O
└── storage-indexed.ts   Optional SQLite read index

@plur-ai/mcp          Wraps core as MCP tools
@plur-ai/claw          OpenClaw ContextEngine hooks (assemble/compact/afterTurn)
plur-hermes            Python plugin for Hermes Agent (CLI subprocess bridge)

Storage

Everything is plain YAML. Open it, read it, edit it.

~/.plur/
├── engrams.yaml     # learned knowledge (source of truth)
├── episodes.yaml    # session timeline
├── config.yaml      # settings
└── engrams.db       # optional SQLite read index (auto-generated)

PLUR_PATH overrides the default location.

For large stores (>1k engrams), enable the SQLite read index for faster filtered queries. Add index: true to config.yaml. The YAML file stays the source of truth — the .db is a cache that rebuilds automatically. Delete it anytime.
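
For example (only the index key is documented above; treat the rest of the file's layout as unspecified):

```yaml
# ~/.plur/config.yaml
index: true   # enable the optional SQLite read index
```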

Requirements

  • Node.js 18+
  • 2GB RAM minimum — the embedding model (ONNX runtime) needs ~1GB for installation. On servers with less RAM, embeddings are skipped and search falls back to BM25 keyword matching.

Development

git clone https://github.com/plur-ai/plur.git
cd plur
pnpm install && pnpm build && pnpm test

~340 tests across 27 files. pnpm test:watch for development.

Contributing

  • Bug reports — issue with reproduction steps
  • Feature requests — issue describing the use case
  • Code — fork, branch, PR. Tests required.
  • Integrations — build PLUR support for other tools

Before submitting: pnpm test passes, pnpm build succeeds, no new external deps in core without discussion.

Conventions: TypeScript, Zod validation, Vitest, no external APIs in core, YAML storage, zero-cost search by default.

License

Apache-2.0

Release History

v0.8.0 (Medium · 4/8/2026)
Competitive Absorption: 50+ Features from 7 Memory Systems. 50+ improvements absorbed in one session from Mem0, Claude-Mem, Mengram, Forge, Lossless Claw, OB1, and II-Agent. Implemented across 5 sub-projects, benchmarked, zero regressions.
  • 75% faster learn/recall/inject
  • 10% fewer injection tokens
  • LLM-driven dedup (opt-in)
  • Three-memory taxonomy
Memory Intelligence (SP1): learnAsync() method: pre-store dedup pipeline — content hash → semantic recall → LLM decision (ADD/UPDAT…

v0.7.3 (Medium · 4/2/2026)
Fix OpenClaw compat: remove pluginApi:"1" that blocked install on OpenClaw >=2026.3.31

v0.7.2 (Medium · 4/2/2026)
Patch release:
  • Learning reflection hook: Stop hook nudges plur_learn every 3rd response — catches reasoning moments that tool-level hooks miss
  • Claw system prompt updated to v3: session workflow, pack commands, correction protocol, verification rules
  • Claw /packs slash command: list, install, uninstall from OpenClaw
  • 9 hooks installed by plur init (was 8)

v0.7.0 (Medium · 4/2/2026)
Knowledge Packs: Share What You Know. Knowledge Packs are thematic engram collections you can share with your team, community, or across machines. Export what you've learned about a domain, share the pack, and anyone can install it.
  • Thematic export: plur packs export react-patterns --domain code.react --tags hooks,state
  • Privacy scan on export: blocks secrets and private engrams, warns on personal paths and emails
  • Conflict detection on install: flags duplicates and contradictions with…

v0.6.0 (Medium · 4/1/2026)
Multi-Store: Share Knowledge Across Teams. PLUR now reads engrams from multiple stores. Your team's learned knowledge lives in their git repo — PLUR reads it alongside your personal memory. No copying, no syncing. Just add a store path and your agent knows what the team knows.

# ~/.plur/config.yaml
stores:
  - path: ~/projects/my-team/engrams.yaml
    scope: my-team
    readonly: true

  • Store engrams get namespaced IDs (ENG-DFD-2026-0401-001) to prevent collisions
  • Scope vali…

v0.5.1 (Medium · 4/1/2026)
Fixes:
  • Removed redundant session_firstmsg.py injection hook — caused double injection and 2-3s latency. PLUR MCP plur_session_start handles injection now.
  • Documented recall split: plur_recall_hybrid for engram memory, datacore.search for journal/knowledge files
  • Documented multi-store limitation: stores.add/stores.list are config-only, cross-store search coming later
Update:
npm update -g @plur-ai/mcp @plur-ai/cli
pip install --upgrade plur-hermes

v0.5.0 (Medium · 3/31/2026)
What's New:
Session Management
  • plur_session_start — inject relevant engrams at session start
  • plur_session_end — capture learnings + record episode at session end
Extended Learning
  • plur_learn now accepts: tags, rationale, visibility, knowledge_anchors, dual_coding
  • Pack engram feedback — rate pack engrams, not just personal
  • plur_promote — activate candidate engrams (single + batch)
Improved UX
  • Batch plur_feedback — rate multiple engrams in one call
  • Search-m…

