๐ ็ฎไฝไธญๆ | ็น้ซไธญๆ | English | Espaรฑol | Deutsch | Franรงais | ๆฅๆฌ่ช
Give your AI coding assistant a memory โ Cross-session persistent memory MCP Server
Still using CLAUDE.md / MEMORY.md as memory? This Markdown-file memory approach has fatal flaws: the file keeps growing, injecting everything into every session and burning massive tokens; content only supports keyword matching โ search "database timeout" and you won't find "MySQL connection pool pitfall"; sharing one file across projects causes cross-contamination; there's no task tracking, so dev progress lives entirely in your head; not to mention the 200-line truncation, manual maintenance, and inability to deduplicate or merge.
AIVectorMemory is a fundamentally different approach. Local vector database storage with semantic search for precise recall (matches even when wording differs), on-demand retrieval that loads only relevant memories (token usage drops 50%+), automatic multi-project isolation with zero interference, and built-in issue tracking + task management that lets AI fully automate your dev workflow. All data is permanently stored on your machine โ zero cloud dependency, never lost when switching sessions or IDEs.
| Feature | Description |
|---|---|
| ๐ง Cross-Session Memory | Your AI finally remembers your project โ pitfalls, decisions, conventions all persist across sessions |
| ๐ Hybrid Smart Search | FTS5 full-text + vector semantic dual-path search, RRF fusion ranking + composite scoring (recency ร frequency ร importance), far more precise than pure vector search |
| ๐ Issue Tracking | Built-in Issue Tracker โ discover โ investigate โ fix โ archive, full lifecycle. AI manages bugs automatically |
| ๐ Task Management | Spec โ task breakdown โ nested subtasks โ status sync โ linked archival. AI drives the complete dev workflow |
| ๐ฆ Session State | Blocking management + breakpoint resume + progress tracking, seamless handoff across sessions and context compaction |
| ๐ช Hooks + Steering | Auto-inject workflow rules + behavior guard hooks, consistent AI behavior guaranteed โ no need to repeat instructions |
| ๐งฌ Memory Evolution | Contradiction detection auto-supersedes stale knowledge + short-term โ long-term auto-promotion + 90-day auto-archive, self-evolving memory |
| ๐ Desktop App + Web Dashboard | Native desktop app (macOS/Windows/Linux) + Web dashboard, 3D vector network reveals knowledge connections at a glance |
| ๐ฐ Save 50%+ Tokens | Stop copy-pasting project context every conversation. Semantic retrieval on demand, no more bulk injection |
| ๐ Fully Local | Zero cloud dependency. ONNX local inference, no API Key, data never leaves your machine |
| ๐ 11 IDEs Covered | Cursor / Kiro / Claude Code / Windsurf / VSCode / Copilot / OpenCode / Trae / Codex / Antigravity / OpenClaw โ one-click install & uninstall |
| ๐ Multi-Project Isolation | One DB for all projects, auto-isolated with zero interference, seamless project switching |
| ๐ Smart Dedup | Similarity > 0.95 auto-merges updates, keeping your memory store clean โ never gets messy over time |
| ๐ 7 Languages | ็ฎไฝไธญๆ / ็น้ซไธญๆ / English / Espaรฑol / Deutsch / Franรงais / ๆฅๆฌ่ช, full-stack i18n for dashboard + Steering rules |
QQ็พค๏ผ1085682431 ย |ย ๅพฎไฟก๏ผchanghuibiz
ๅ
ฑๅๅไธ้กน็ฎๅผๅๅ QQ็พคๆๅพฎไฟกไบคๆต
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ AI IDE โ
โ OpenCode / Codex / Claude Code / Cursor / ... โ
โโโโโโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ MCP Protocol (stdio)
โโโโโโโโโโโโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ AIVectorMemory Server โ
โ โ
โ โโโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโโ โ
โ โ remember โ โ recall โ โ auto_save โ โ
โ โ forget โ โ task โ โ status/track โ โ
โ โโโโโโฌโโโโโโ โโโโโโฌโโโโโโ โโโโโโโโโฌโโโโโโโโโโโ โ
โ โ โ โ โ
โ โโโโโโผโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโผโโโโโโโโโโโ โ
โ โ Embedding Engine (ONNX) โ โ
โ โ intfloat/multilingual-e5-small โ โ
โ โโโโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโ โ
โ โ โ
โ โโโโโโโโโโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโ โ
โ โ SQLite + sqlite-vec (Vector Index) โ โ
โ โ ~/.aivectormemory/memory.db โ โ
โ โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
# Install
pip install aivectormemory
# Upgrade to latest version
pip install --upgrade aivectormemory
# Navigate to your project directory, one-click IDE setup
cd /path/to/your/project
run installrun install interactively guides you to select your IDE, auto-generating MCP config, Steering rules, and Hooks โ no manual setup needed.
macOS users note:
- If you get
externally-managed-environmenterror, add--break-system-packages- If you get
enable_load_extensionerror, your Python doesn't support SQLite extension loading (macOS built-in Python and python.org installers don't support it). Use Homebrew Python instead:brew install python /opt/homebrew/bin/python3 -m pip install aivectormemory
No pip install needed, run directly:
cd /path/to/your/project
uvx aivectormemory installRequires uv to be installed.
uvxauto-downloads and runs the package โ no manual installation needed.
{
"mcpServers": {
"aivectormemory": {
"command": "run",
"args": ["--project-dir", "/path/to/your/project"]
}
}
}๐ IDE Configuration File Locations
| IDE | Config Path |
|---|---|
| Kiro | .kiro/settings/mcp.json |
| Cursor | .cursor/mcp.json |
| Claude Code | .mcp.json |
| Windsurf | .windsurf/mcp.json |
| VSCode | .vscode/mcp.json |
| Trae | .trae/mcp.json |
| OpenCode | opencode.json |
| Codex | .codex/config.toml |
For Codex, use project-scoped TOML instead of JSON:
[mcp_servers.aivectormemory]
command = "run"
args = ["--project-dir", "/path/to/your/project"]Codex only loads project-scoped
.codex/config.tomlafter the repository is marked as a trusted project.
content (string, required) Memory content in Markdown format
tags (string[], required) Tags, e.g. ["pitfall", "python"]
scope (string) "project" (default) / "user" (cross-project)
Similarity > 0.95 auto-updates existing memory, no duplicates.
query (string) Semantic search keywords
tags (string[]) Exact tag filter
scope (string) "project" / "user" / "all"
top_k (integer) Number of results, default 5
Vector similarity matching โ finds related memories even with different wording.
memory_id (string) Single ID
memory_ids (string[]) Batch IDs
state (object, optional) Omit to read, pass to update
is_blocked, block_reason, current_task,
next_step, progress[], recent_changes[], pending[]
Maintains work progress across sessions, auto-restores context in new sessions.
action (string) "create" / "update" / "archive" / "list"
title (string) Issue title
issue_id (integer) Issue ID
status (string) "pending" / "in_progress" / "completed"
content (string) Investigation content
action (string, required) "batch_create" / "update" / "list" / "delete" / "archive"
feature_id (string) Linked feature identifier (required for list)
tasks (array) Task list (batch_create, supports subtasks)
task_id (integer) Task ID (update)
status (string) "pending" / "in_progress" / "completed" / "skipped"
Links to spec docs via feature_id. Update auto-syncs tasks.md checkboxes and linked issue status.
action (string) "generate" (default) / "diff" (compare differences)
lang (string) Language: en / zh-TW / ja / de / fr / es
sections (string[]) Specify sections: header / tools / deps
Auto-generates README content from TOOL_DEFINITIONS / pyproject.toml, multi-language support.
preferences (string[]) User-expressed technical preferences (fixed scope=user, cross-project)
extra_tags (string[]) Additional tags
Auto-extracts and stores user preferences at end of each conversation, smart dedup.
run web --port 9080
run web --port 9080 --quiet # Suppress request logs
run web --port 9080 --quiet --daemon # Run in background (macOS/Linux)Visit http://localhost:9080 in your browser. Default username admin, password admin123 (can be changed in settings after first login).
- Multi-project switching, memory browse/search/edit/delete/export/import
- Semantic search (vector similarity matching)
- One-click project data deletion
- Session status, issue tracking
- Tag management (rename, merge, batch delete)
- Token authentication protection
- 3D vector memory network visualization
- ๐ Multi-language support (็ฎไฝไธญๆ / ็น้ซไธญๆ / English / Espaรฑol / Deutsch / Franรงais / ๆฅๆฌ่ช)
ย ย ย ย
Scan to join WeChat group ย |ย Scan to join QQ group
AIVectorMemory is the storage layer. Use Steering rules to tell AI when and how to call these tools.
Running run install auto-generates Steering rules and Hooks config โ no manual setup needed.
| IDE | Steering Location | Hooks |
|---|---|---|
| Kiro | .kiro/steering/aivectormemory.md |
.kiro/hooks/*.hook |
| Cursor | .cursor/rules/aivectormemory.md |
.cursor/hooks.json |
| Claude Code | CLAUDE.md (appended) |
.claude/settings.json |
| Windsurf | .windsurf/rules/aivectormemory.md |
.windsurf/hooks.json |
| VSCode | .github/copilot-instructions.md (appended) |
.claude/settings.json |
| Trae | .trae/rules/aivectormemory.md |
โ |
| OpenCode | AGENTS.md (appended) |
.opencode/plugins/*.js |
| Codex | AGENTS.md (appended) |
โ |
๐ Steering Rules Example (auto-generated)
# AIVectorMemory - Workflow Rules
## 1. New Session Startup (execute in order)
1. `recall` (tags: ["project-knowledge"], scope: "project", top_k: 100) load project knowledge
2. `recall` (tags: ["preference"], scope: "user", top_k: 20) load user preferences
3. `status` (no state param) read session state
4. Blocked โ report and wait; Not blocked โ enter processing flow
## 2. Message Processing Flow
- Step A: `status` read state, wait if blocked
- Step B: Classify message type (chat/correction/preference/code issue)
- Step C: `track create` record issue
- Step D: Investigate (`recall` pitfalls + read code + find root cause)
- Step E: Present plan to user, set blocked awaiting confirmation
- Step F: Modify code (`recall` pitfalls before changes)
- Step G: Run tests to verify
- Step H: Set blocked awaiting user verification
- Step I: User confirms โ `track archive` + clear block
## 3. Blocking Rules
Must `status({ is_blocked: true })` when proposing plans or awaiting verification.
Only clear after explicit user confirmation. Never self-clear.
## 4-9. Issue Tracking / Code Checks / Spec Task Mgmt / Memory Quality / Tool Reference / Dev Standards
(Full rules auto-generated by `run install`)๐ Hooks Config Example (Kiro only, auto-generated)
Auto-save on session end removed. Dev workflow check (.kiro/hooks/dev-workflow-check.kiro.hook):
{
"enabled": true,
"name": "Dev Workflow Check",
"version": "1",
"when": { "type": "promptSubmit" },
"then": {
"type": "askAgent",
"prompt": "Core principles: verify before acting, no blind testing, only mark done after tests pass"
}
}The embedding model (~200MB) is auto-downloaded on first run. If slow:
export HF_ENDPOINT=https://hf-mirror.comOr add env to MCP config:
{
"env": { "HF_ENDPOINT": "https://hf-mirror.com" }
}| Component | Technology |
|---|---|
| Runtime | Python >= 3.10 |
| Vector DB | SQLite + sqlite-vec |
| Embedding | ONNX Runtime + intfloat/multilingual-e5-small |
| Tokenizer | HuggingFace Tokenizers |
| Protocol | Model Context Protocol (MCP) |
| Web | Native HTTPServer + Vanilla JS |
Enhancement: Rule System Overhaul + OpenClaw Support
- ๐ง Fixed 5 missing memory system calls in AI rules: recall pitfalls before investigation (Step D), before dangerous ops (ยง7), before Spec writing (ยง8), before subtask execution (ยง8), and remember pitfalls after fix (Step I)
- ๐ฆ Added OpenClaw IDE support โ now 11 IDEs total (MCP config merges into ~/.openclaw/openclaw.json, steering appends to AGENTS.md)
- ๐ญ Playwright self-test rules strengthened โ added ToolSearch deferred tools loading requirement, banned
opencommand workaround - ๐ง Merged v2.2.0โv2.2.6 features: hooks system (bash_guard + stop_guard + check_track), scoring engine improvements, recall optimizations, web dashboard bulk delete, desktop memory delete modal
โ ๏ธ DEV_WORKFLOW_PROMPT: added 2 new violation reminders (recall before code change, remember after fix)- ๐ All 7 language rule files updated in sync
Enhancement: AI Rule System Upgrade
- ๐ CLAUDE.md completion: added Identity & Tone (ยง1), 7 Core Principles (ยง3), message type judgment examples, expanded IDE safety and self-test sections
โ ๏ธ Hook added Common Violations Reminder: โ negative examples reinforcing the 4 most frequently missed rules (self-test, recall, track create, IDE safety)- ๐ All 7 language rule files updated in sync (zh-CN/zh-TW/en/ja/es/de/fr)
- ๐ข CLAUDE.md sections renumbered to ยง1โยง11, cross-references updated accordingly
New: Smart Memory Engine + Uninstall
- ๐ง FTS5 full-text search with Chinese tokenization (jieba) โ keyword search now actually works for CJK content
- ๐ Hybrid retrieval: vector + FTS5 dual-path with RRF (Reciprocal Rank Fusion) merging
- ๐ Composite scoring: results ranked by similarity ร 0.5 + recency ร 0.3 + frequency ร 0.2, weighted by importance
- โก Conflict detection: similar memories (0.85โ0.95) auto-superseded, old facts fade automatically
- ๐ฆ Memory tiers: frequently accessed memories auto-promote to long_term and get searched first
- ๐๏ธ Auto-archive: stale short_term memories (90 days inactive + low importance) cleaned up automatically
- ๐ Relation expansion: tag overlap โฅ 2 builds related links, 1-hop expansion surfaces connected memories
- ๐ Auto-summary: long memories (>500 chars) get summaries, brief mode returns summaries to save tokens
- ๐งน Code cleanup: removed 15 dead code items, refactored 7 duplicate patterns into shared utilities
- โ
run uninstallโ cleanly removes all IDE configurations (MCP, steering, hooks, permissions) while preserving memory data
Enhancement: Security & Rule Optimization
- ๐ Fixed SQL injection, command injection, and path traversal vulnerabilities
- ๐ก๏ธ Added transaction protection for data integrity (archive, insert, update operations)
- ๐ง Unified similarity formula across all search paths
- ๐ Compressed AI workflow rules by 38% (219โ136 lines) with zero process removal
- ๐งน v12 migration cleans up legacy garbage memories automatically
- ๐ All 7 languages synchronized
New: Playwright Browser Testing Built-in
- ๐ญ
run installnow automatically configures Playwright browser testing โ AI can open a real browser to verify frontend changes instead of guessing - ๐ญ Uses a dedicated test browser (Chrome for Testing) that won't interfere with your personal browser tabs
- ๐ Simplified permission setup โ no more manual permission popups for common tools
- ๐ Updated AI rules across all 7 languages to enforce proper browser testing behavior
Enhancement: More IDE Support
- ๐ฅ๏ธ Added support for Antigravity and GitHub Copilot IDEs
- ๐
run installnow auto-configures tool permissions, reducing manual setup - ๐ Streamlined AI self-testing rules
Enhancement: Faster Startup
- โก Optimized memory loading on session start โ loads faster with less context usage
- ๐ Auto-configures Claude Code permissions during installation
- ๐ All 7 languages synchronized
Enhancement: Simpler Rules
- ๐ AI workflow rules restructured for clarity and reduced token usage
- ๐พ AI now automatically saves your preferences at the end of each session
- ๐ All 7 languages synchronized
Fix: Tool Reliability
- ๐ง Comprehensive audit and fix of all MCP tool parameters โ improved reliability across all IDEs
Enhancement: Better Search & Safety
- ๐ Memory search now combines semantic and keyword matching for more accurate recall
- ๐ก๏ธ Added cross-project protection โ AI won't accidentally modify files in other projects
Enhancement: Rule Generalization & Desktop Version Fix
- ๐ Added "recall before asking user" rule โ AI must query memory system before asking user for project information (server address, passwords, deploy config, etc.)
- ๐ Generalized pre-operation check rule โ removed specific examples to apply to all operation scenarios
- ๐ฅ๏ธ Fixed desktop app settings page showing hardcoded version "1.0.0" instead of actual app version
- ๐ All 7 language i18n steering rules and workflow prompts synchronized
Fix: Hook Cross-Project Compatibility
- ๐ง
check_track.shnow derives project path from script location instead of$(pwd), fixing track detection failure when Claude Code runs hooks from non-root working directory - ๐ง
compact-recovery.shnow uses relative path derivation instead of hardcoded absolute paths, ensuring correct behavior when installed to any project - ๐ง Removed redundant CLAUDE.md re-injection from compact-recovery (already auto-loaded by Claude Code)
- ๐ง
install.pytemplate synchronized with all hook fixes - ๐ All 7 language i18n compact-recovery hints updated
Performance: ONNX INT8 Quantization
- โก Embedding model auto-quantized from FP32 to INT8 on first load, model file from 448MB down to 113MB
- โก MCP Server memory usage reduced from ~1.6GB to ~768MB (50%+ reduction)
- โก Quantization is transparent to users โ automatic on first use, cached for subsequent loads, falls back to FP32 on failure
New: Remember Password
- ๐ Login page on both desktop and web dashboard now has a "Remember password" checkbox
- ๐ When checked, credentials are saved to localStorage and auto-filled on next login; when unchecked, saved credentials are cleared
- ๐ Checkbox is hidden in registration mode
Enhancement: Steering Rules
- ๐ IDENTITY & TONE section strengthened with more specific constraints (no pleasantries, no translating user messages, etc.)
- ๐ Self-testing requirements now distinguish between backend-only, MCP Server, and frontend-visible changes (Playwright required for frontend)
- ๐ Development rules now mandate self-testing after completing development
- ๐ All 7 language versions synchronized
- ๐ Desktop app version comparison switched to semantic versioning, fixing false upgrade prompts when local version is higher
- ๐ Health check page field names aligned with backend, fixing consistency status always showing Mismatch
- ๐ง check_track.sh hook adds Python fallback, resolving silent hook failure when system sqlite3 is unavailable (#4)
- ๐ฅ๏ธ Desktop app one-click install + upgrade detection
- ๐ฅ๏ธ Auto-detect Python and aivectormemory installation status on startup
- ๐ฅ๏ธ Show one-click install button when not installed, check PyPI and desktop new versions when installed
- ๐ Installation detection switched to importlib.metadata.version() for accurate package version
- ๐ง Fix PyPI package size anomaly (sdist from 32MB down to 230KB), excluded accidentally packaged dev files
New: Native Desktop App
- ๐ฅ๏ธ Native desktop client supporting macOS (ARM64), Windows (x64), Linux (x64)
- ๐ฅ๏ธ Desktop app shares the same database as Web dashboard, fully feature-equivalent
- ๐ฅ๏ธ Dark/light theme switching, Glass frosted visual style
- ๐ฅ๏ธ Login auth, project selection, stats overview, memory management, issue tracking, task management, tag management, settings, data maintenance โ full feature coverage
- ๐ฆ Auto-published installers via GitHub Releases, download and use
New: CI/CD Auto Build
- ๐ GitHub Actions auto-builds desktop installers for all 3 platforms
- ๐ Push a tag to trigger the full compile, package, and release pipeline
Fixes
- ๐ Windows platform compatibility fixes
- ๐ sqlite-vec extension download URL fix
Optimization: Token Usage Reduction
- โก Steering rules changed from per-message dynamic injection to static loading, reducing repeated token consumption
- โก Greatest impact for Claude Code users โ ~2K fewer tokens per message
New: Full-Stack i18n (7 Languages)
- ๐ Web dashboard + desktop UI fully supports 7 languages: ็ฎไฝไธญๆ / ็น้ซไธญๆ / English / Espaรฑol / Deutsch / Franรงais / ๆฅๆฌ่ช
- ๐ One-click language switch in settings page, takes effect immediately
- ๐ MCP tool responses follow language setting, AI replies automatically use the corresponding language
- ๐ Switching language auto-regenerates steering rules for all installed projects
New: Web Dashboard Settings Page
- โ๏ธ Language switch, theme settings, system info display
- โ๏ธ Database health check, repair, backup and other maintenance tools
Optimization: Memory Search
- ๐
recallsearch supports OR/AND tag matching modes, fixing missed results with multi-tag searches - ๐ Semantic search + tag filter defaults to OR matching (broader), tags-only browsing keeps AND matching (more precise)
๐ v0.2.x and earlier changelog
Apache-2.0





