contextdb

An embedded database for agentic memory systems. Relational storage, graph traversal, and vector similarity search under unified MVCC transactions — in a single file, in a single process. Every agent, device, or service runs its own contextdb. They sync bidirectionally through a central server over WebSocket — knowledge learned by one becomes available to all, with per-table conflict resolution. No port forwarding, no VPN — WebSocket traverses NAT out of the box.

If you're building agent memory today, you're probably stitching together SQLite for state, a vector database for embeddings, and application code for graph traversal. contextdb replaces all three — and adds enforceable policy constraints (state machines, DAG enforcement, cascading propagation) that the database guarantees, not application code.

contextdb> UPDATE decisions SET status = 'draft' WHERE id = '550e8400...';
Error: invalid state transition: active -> draft

No triggers. No application-side validation. The database enforces it.

Familiar conventions, nothing new to learn: PostgreSQL-compatible SQL, pgvector syntax for vector search (<=>), and SQL/PGQ-style GRAPH_TABLE ... MATCH for graph queries — the subset that matters for bounded traversal, not the full standard.

Language support: contextdb is a Rust library and CLI today. Python and TypeScript bindings are on the roadmap — contributions welcome.

Website: contextdb.tech · Docs: contextdb.tech/docs

See Why contextdb? for the full problem statement, or jump to Getting Started to try it in 2 minutes.

Why Not SQLite + Extensions?

Capability	SQLite + extensions	contextdb
Vector search	sqlite-vec (separate extension, no unified transactions with relational data)	Built-in, auto-HNSW at 1K vectors, pre-filtered search, same MVCC transaction as rows
Graph traversal	Recursive CTEs (unbounded, no cycle detection)	SQL/PGQ with bounded BFS, DAG enforcement, typed edges
State machines	CHECK constraints + triggers (bypassable)	`STATE MACHINE` in DDL, enforced by the database engine
Atomic cross-model updates	Application-level coordination	Single MVCC transaction across relational + graph + vector
Sync	Build your own	Bidirectional collaborative sync — each database syncs changesets with conflict resolution, not WAL pages
Immutable tables	Not enforceable (triggers are bypassable)	`IMMUTABLE` keyword, enforced by the database engine
Cascading invalidation	Application code	`PROPAGATE` in DDL — state changes cascade along edges and FKs

Use It As a Library

contextdb is an embedded database. The primary interface is the Rust API:

use contextdb_engine::Database;
use std::collections::HashMap;
use std::sync::Arc;

let db = Database::open(std::path::Path::new("./my.db"))?;
// or: Database::open_memory() for ephemeral

let params = HashMap::new();

db.execute(
    "CREATE TABLE observations (
       id UUID PRIMARY KEY,
       data JSON,
       embedding VECTOR(384)
     ) IMMUTABLE",
    &params,
)?;

// Insert with parameters
let mut params = HashMap::new();
params.insert("id".into(), Value::Uuid(uuid::Uuid::new_v4()));
params.insert("data".into(), Value::Json(serde_json::json!({"type": "sensor"})));
params.insert("embedding".into(), Value::Vector(vec![0.1; 384]));

db.execute(
    "INSERT INTO observations (id, data, embedding) VALUES ($id, $data, $embedding)",
    &params,
)?;

// Vector similarity search
let mut query_params = HashMap::new();
query_params.insert("query".into(), Value::Vector(vec![0.1; 384]));

let result = db.execute(
    "SELECT id, data FROM observations ORDER BY embedding <=> $query LIMIT 10",
    &query_params,
)?;

// Graph traversal
let result = db.execute(
    "SELECT target_id FROM GRAPH_TABLE(
       edges MATCH (a)-[:DEPENDS_ON]->{1,3}(b)
       WHERE a.id = $start
       COLUMNS (b.id AS target_id)
     )",
    &params,
)?;

// Subscribe to commits
let rx = db.subscribe();
// rx is a std::sync::mpsc::Receiver<CommitEvent>

One Query, Three Subsystems

Find semantically similar observations within a graph neighborhood, filtered by relational predicates — a query that would take ~40 lines of Python across SQLite, ChromaDB, and a hand-rolled BFS:

WITH neighborhood AS (
  SELECT b_id FROM GRAPH_TABLE(
    edges MATCH (start)-[:RELATES_TO]->{1,3}(related)
    WHERE start.id = $entity_id
    COLUMNS (related.id AS b_id)
  )
),
candidates AS (
  SELECT o.id, o.data, o.embedding
  FROM observations o
  INNER JOIN neighborhood n ON o.entity_id = n.b_id
  WHERE o.observation_type = 'config_change'
)
SELECT id, data FROM candidates
ORDER BY embedding <=> $query_embedding
LIMIT 5

One query. One transaction. One process.

Add to your Cargo.toml:

[dependencies]
contextdb-engine = "0.3.4"
contextdb-core = "0.3.4"

Install

# Install the CLI
cargo install contextdb-cli

# Or run the server (no clone needed)
curl -O https://raw.githubusercontent.com/context-graph-ai/contextdb/main/docker-compose.yml
curl -O https://raw.githubusercontent.com/context-graph-ai/contextdb/main/nats.conf
docker compose up

Or Explore With the CLI

cargo build --release -p contextdb-cli
./target/release/contextdb-cli :memory:

contextdb> CREATE TABLE decisions (id UUID PRIMARY KEY, status TEXT NOT NULL)
           STATE MACHINE (status: draft -> [active, rejected], active -> [superseded]);
ok (rows_affected=0)

contextdb> INSERT INTO decisions VALUES ('550e8400-e29b-41d4-a716-446655440000', 'draft');
ok (rows_affected=1)

contextdb> UPDATE decisions SET status = 'active' WHERE id = '550e8400-e29b-41d4-a716-446655440000';
ok (rows_affected=1)

contextdb> UPDATE decisions SET status = 'draft' WHERE id = '550e8400-e29b-41d4-a716-446655440000';
Error: invalid state transition: active -> draft

contextdb> .schema decisions
CREATE TABLE decisions (
  id UUID NOT NULL PRIMARY KEY,
  status TEXT NOT NULL
)
STATE MACHINE (status: active -> [superseded], draft -> [active, rejected])

What It Does

Relational (PostgreSQL-compatible SQL) — SELECT, INSERT, UPDATE, DELETE, JOINs (INNER/LEFT), CTEs, upsert (ON CONFLICT DO UPDATE), DISTINCT, LIMIT, IN with subqueries, LIKE, BETWEEN, parameter binding ($name).

Graph (SQL/PGQ-style) — GRAPH_TABLE(... MATCH ...) following SQL/PGQ conventions for bounded BFS, typed edges, variable-length paths ({1,3}), and direction control. DAG constraint enforcement prevents cycles. State propagation cascades changes along graph edges.

Vector (pgvector conventions) — Cosine similarity search via <=>. Auto-switches between brute-force (< 1000 vectors) and HNSW indexing. Pre-filtered search narrows candidates before scoring.

Unified transactions — One transaction atomically updates relational rows, graph adjacency structures, and vector indexes. One read snapshot sees consistent state across all three. MVCC with consistent snapshots — readers never block writers.

Enforceable policy constraints — IMMUTABLE tables, STATE MACHINE column transitions, DAG cycle prevention, RETAIN with TTL expiry, PROPAGATE for cascading state changes along edges and foreign keys. Enforced by the database — no application code can bypass them.

Collaborative sync — Every contextdb instance is a full read-write database. Each runs a SyncClient that syncs bidirectionally with a central SyncServer over NATS (WebSocket for clients behind NAT, native protocol for server-to-server). Offline-first: each database works independently, syncing changesets when connected. Per-table conflict resolution (LatestWins, ServerWins, EdgeWins) and per-table sync direction (Push, Pull, Both, None) give you fine-grained control over what flows where. The server runs the same contextdb engine — self-host it, or point your databases at a hosted server.

Persistence — Single-file storage via redb. Crash-safe. Compute/storage separated via the WriteSetApplicator trait (local redb for open source, object store for enterprise).

Plugin system — DatabasePlugin trait with lifecycle hooks (pre_commit, post_commit, on_open, on_close, on_ddl, on_query, post_query, health, describe, on_sync_push, on_sync_pull). Applications inject plugins via Database::open_with_plugin().

Subscriptions — db.subscribe() returns a broadcast channel of CommitEvents for reactive downstream processing.

Scale Envelope

contextdb is designed for agentic memory, not data warehousing:

10K-1M rows
Sparse graphs with bounded traversal (depth <= 10)
Append-heavy writes, small transactions
Configurable memory budget via SET MEMORY_LIMIT (no hard-coded ceiling)
Configurable file-growth budget via SET DISK_LIMIT / SHOW DISK_LIMIT or --disk-limit / CONTEXTDB_DISK_LIMIT for file-backed databases
Laptops, ARM64 devices (browser and mobile via Rust's WASM target are future directions)

Documentation

Full documentation is available at contextdb.tech/docs, or browse the source files:

Doc	What it covers
Getting Started	Build, first REPL session, library embedding — 2 minutes
Why contextdb?	Problem statement, design philosophy, comparison with alternatives
Usage Scenarios	16 problem-first walkthroughs: constraints, graph queries, vector search, sync, propagation
Query Language	SQL, graph MATCH, vector search, constraints, built-in functions
CLI Reference	REPL commands, sync commands, non-interactive scripting
Architecture	Crate map, storage engine, MVCC, sync protocol, plugin system

Architecture

10-crate Rust workspace:

Crate	Role
`contextdb-core`	Types, executor traits, errors, table metadata
`contextdb-tx`	MVCC transaction manager with deferred-apply write sets
`contextdb-relational`	Relational executor (scan, insert, upsert, delete)
`contextdb-graph`	Graph executor (bounded BFS, adjacency index, DAG enforcement)
`contextdb-vector`	Vector executor (cosine similarity, HNSW, pre-filtered search)
`contextdb-parser`	SQL parser (pest grammar with GRAPH_TABLE + vector extensions)
`contextdb-planner`	Rule-based query planner
`contextdb-engine`	Database engine — wires all subsystems, plugin API, subscriptions
`contextdb-server`	Sync server and client (NATS transport, conflict resolution)
`contextdb-cli`	Interactive CLI REPL

Building

cargo build --workspace
cargo test --workspace

License

Apache-2.0 — see LICENSE.

Version	Changes	Urgency	Date
v1.0.0	Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.3.4...v1.0.0	High	4/29/2026
v0.3.4	Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.3.3...v0.3.4	High	4/21/2026
v0.3.3	Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.3.2...v0.3.3	Medium	4/12/2026
v0.3.2	Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.3.1...v0.3.2	Medium	4/3/2026
v0.3.1	Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.3.0...v0.3.1	Medium	4/3/2026
v0.3.0	## What's Changed * Remove test-side contention from DDL sync ordering checks by @lucidprogrammer in https://github.com/context-graph-ai/contextdb/pull/1 ## New Contributors * @lucidprogrammer made their first contribution in https://github.com/context-graph-ai/contextdb/pull/1 Full Changelog: https://github.com/context-graph-ai/contextdb/compare/v0.2.0...v0.3.0	Medium	4/3/2026
v0.2.0	ContextDB is now schema-agnostic. The database starts empty — consumers define their own tables with `CREATE TABLE`. Immutability and state machine constraints are declared per-table, not hardcoded. The CLI REPL supports psql-style `\dt`, `\d`, `\?`, `\q` commands alongside dot-style commands. ### What's new - Runtime table metadata registry — tables and constraints are created at runtime, not compiled in - Declarative constraints — `CREATE TABLE ... IMMUTABLE` and `CREATE TABLE ... ST	Low	3/8/2026
v0.1.0	Full Changelog: https://github.com/context-graph-ai/contextDB/commits/v0.1.0	Low	3/8/2026

contextdb

Description

README

contextdb

Why Not SQLite + Extensions?

Use It As a Library

One Query, Three Subsystems

Install

Or Explore With the CLI

What It Does

Scale Envelope

Documentation

Architecture

Building

License

Release History

Dependencies & License Audit

Similar Packages

More in Databases