
server-nexe

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.



server.nexe

Local AI server with persistent memory. Zero cloud. Full control.

I've reached a minimum viable product for the real world — the missing piece now is feedback. 🚀


Documentation · Install · Architecture · Releases

Català · Español



The Story

Server Nexe started as a learning-by-doing experiment: "What would it take to have your own local AI with persistent memory?" Since I wasn't going to build an LLM, I started picking up pieces to assemble a useful Lego set for myself and my day-to-day work. One thing led to another — inference backends, RAG pipelines, vector search, plugin systems, security layers, a web UI, an installer with hardware detection.

This entire project — code, tests, audits, documentation — has been built by one person orchestrating different AI models, both local (MLX, Ollama) and cloud (Claude, GPT, Gemini, DeepSeek, Qwen, Grok...), as collaborators. The human decides what to build, designs the architecture, reviews the code, and runs the tests. The AIs write, audit, and stress-test under human direction.

What began as a prototype has turned into a genuinely useful product: 4842 tests, security audits, encryption at rest, a macOS installer with hardware detection, and a plugin system. It's not done — there's a roadmap full of ideas — but it already does what it set out to do: run an AI server on your machine, with memory that persists, and zero data leaving your device.

This is not trying to compete with ChatGPT or Claude, but it can complement them for less demanding tasks. It's an open-source tool for people who want to own their AI infrastructure. Built by one person in Barcelona, with AI as co-pilot, music, and stubbornness.

More technically: what was a giant spaghetti monster was distilled, refactor after refactor, into a minimal, agnostic, modular core — with security and memory solved at the base so that building on top is fast and comfortable, in human-AI collaboration. Whether that worked is for the community to say (the AI says yes, but what did you expect 🤪).

Screenshots

Web UI — light mode
Web UI — dark mode
System tray menu (NexeTray.app)
SwiftUI installer wizard (DMG)

Why Server Nexe?

Your conversations, documents, embeddings, and model weights stay on your machine. Always. Server Nexe combines LLM inference with a persistent RAG memory system — your AI remembers context across sessions, indexes your documents, and never phones home.

Local & Private

Every conversation, document, and embedding stays on your device. No telemetry, no external calls, no cloud dependency. Not even a server to spy on you.

Persistent RAG Memory

Remembers context across sessions using Qdrant vector search with 768-dimensional embeddings across 3 specialized collections. Ingest documents, recall knowledge.

Automatic Memory (MEM_SAVE)

The model extracts facts from conversations automatically — names, jobs, preferences, projects — and stores them to memory inside the same LLM call, with zero extra latency. Trilingual intent detection (ca/es/en), semantic deduplication, and deletion by voice ("forget that...").
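The semantic-deduplication idea can be sketched as a cosine-similarity check over embedded facts. This is a minimal illustration, not the project's implementation; the 0.92 threshold and function names are assumptions.

```python
import math

def cosine(a: list, b: list) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def is_duplicate(new_fact_vec: list, stored_vecs: list, threshold: float = 0.92) -> bool:
    """True if an embedded fact is close enough to an already-stored one to skip."""
    return any(cosine(new_fact_vec, v) >= threshold for v in stored_vecs)
```

A new fact is embedded, compared against what's already stored, and only written if nothing sufficiently similar exists.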

Multi-Backend Inference

Switch between MLX (Apple Silicon native), llama.cpp (GGUF, universal), or Ollama — one config change, same OpenAI-compatible API.

Modular Plugin System

Auto-discovered plugins with independent manifests. Security, web UI, RAG, backends — everything is a plugin. Add capabilities without touching the core. NexeModule protocol with duck typing, no inheritance.

macOS Installer

DMG with guided wizard that detects your hardware, picks the right backend, recommends models for your RAM, and gets you running in minutes.

Document Upload with Session Isolation

Upload .txt, .md or .pdf and they're automatically indexed for RAG. Each document is only visible within the session it was uploaded in — no cross-contamination between sessions.
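Session isolation of this kind usually comes down to filtering retrieved chunks by a session identifier before they ever reach the prompt. A minimal sketch, assuming a `session_id` field in each chunk's payload (the field name and hit shape are assumptions, not the project's schema):

```python
def filter_by_session(hits: list, session_id: str) -> list:
    """Drop retrieved chunks that belong to other sessions.

    `hits` are (payload, score) pairs as a vector store might return them.
    Only chunks uploaded in the caller's own session survive.
    """
    return [(payload, score) for payload, score in hits
            if payload.get("session_id") == session_id]
```

In practice the vector store can apply the same predicate server-side so foreign chunks are never even retrieved.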

Built to Grow

4842 tests (~85% coverage), security audits, i18n in 3 languages, comprehensive API. What started as an experiment is being built with production practices.

Quick Start

Option A: DMG Installer (macOS)

Download the latest Install Nexe.dmg from Releases. The wizard handles everything: hardware detection, backend selection, model download, and configuration.

Option B: Command Line

git clone https://github.com/jgoy-labs/server-nexe.git
cd server-nexe
./setup.sh      # guided installation (detects hardware, picks backend & model)
nexe go         # start server on port 9119

Once running:

nexe chat               # interactive chat
nexe chat --rag         # chat with RAG memory
nexe memory store "Barcelona is the capital of Catalonia"
nexe memory recall "capital Catalonia"
nexe status             # system status

Option C: Headless (servers, scripts, CI)

python -m installer.install_headless --backend ollama --model qwen3.5:latest
nexe go

Endpoints at http://localhost:9119:

Endpoint              Description
/v1/chat/completions  OpenAI-compatible chat API
/ui                   Web UI (chat, file upload, sessions)
/health               Health check
/docs                 Interactive API documentation (Swagger)
Authentication via X-API-Key header. Key is generated during installation and stored in .env.
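As a sketch, here is how a client might call the OpenAI-compatible endpoint with the standard library only. The `model` value and the exact set of accepted body fields are assumptions; /docs on a running server shows the real schema.

```python
import json
import urllib.request

BASE_URL = "http://localhost:9119"

def build_chat_request(api_key: str, messages: list, model: str = "default"):
    """Build a POST request for /v1/chat/completions with the X-API-Key header."""
    body = json.dumps({"model": model, "messages": messages}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json", "X-API-Key": api_key},
        method="POST",
    )

# With the server running (key taken from .env):
# req = build_chat_request(api_key, [{"role": "user", "content": "Hello"}])
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))
```

Because the API is OpenAI-compatible, existing OpenAI client libraries pointed at http://localhost:9119 should also work, with the API key supplied per the header above.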

Backends

Backend    Platform               Best for
MLX        macOS (Apple Silicon)  Recommended for Mac — native Metal GPU acceleration, fastest on M-series
llama.cpp  macOS / Linux          Universal — GGUF format, Metal on Mac, CPU/CUDA on Linux
Ollama     macOS / Linux          Bridge to existing Ollama installations, easiest model management

The installer auto-detects your hardware and recommends the best backend. You can switch anytime in personality/server.toml.
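A minimal sketch of what that switch might look like in personality/server.toml. The key names below are illustrative assumptions, not the real schema; check the file the installer generates.

```toml
# personality/server.toml — backend selection (illustrative keys)
[backend]
name  = "mlx"          # "mlx" | "llama_cpp" | "ollama"
model = "gemma-3-4b"   # a catalog model, an Ollama tag, or a GGUF path
```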

Available Models by RAM Tier

The installer organizes the 16 catalog models by the RAM available on your machine (4 tiers):

Tier   Models                                                                 Origin
8 GB   Gemma 3 4B, Qwen3.5 4B, Qwen3 4B                                       Google, Alibaba
16 GB  Gemma 4 E4B, Salamandra 7B, Qwen3.5 9B, Gemma 3 12B                    Google, BSC/AINA, Alibaba
24 GB  Gemma 4 31B, Qwen3 14B, GPT-OSS 20B                                    Google, Alibaba, OpenAI
32 GB  Qwen3.5 27B, Gemma 3 27B, DeepSeek R1 32B, Qwen3.5 35B-A3B, ALIA-40B   Alibaba, Google, DeepSeek, Spanish Government

In addition, you can use any Ollama model by name or any GGUF model from Hugging Face.

Architecture

server-nexe/
├── core/                 # FastAPI server, endpoints, CLI, config, metrics, resilience
│   ├── endpoints/        # REST API (v1 chat, health, status, system)
│   ├── cli/              # CLI commands & i18n (ca/es/en)
│   └── resilience/       # Circuit breaker, rate limiting
├── personality/          # Module manager, plugin discovery, server.toml
│   ├── loading/          # Plugin loading pipeline (find, validate, import, lifecycle)
│   └── module_manager/   # Discovery, registry, config, sync
├── memory/               # Embeddings, RAG engine, vector memory, document ingestion
│   ├── embeddings/       # Chunking, embedding generation
│   ├── rag/              # Retrieval-augmented generation pipeline
│   └── memory/           # Persistent vector store (Qdrant)
├── plugins/              # Auto-discovered plugin modules
│   ├── mlx_module/       # MLX backend (Apple Silicon)
│   ├── llama_cpp_module/ # llama.cpp backend (GGUF)
│   ├── ollama_module/    # Ollama bridge
│   ├── security/         # Auth, injection detection, CSRF, rate limiting, input sanitization
│   └── web_ui_module/    # Browser-based chat UI with file upload
├── installer/            # Guided installer, headless mode, hardware detection, model catalog
├── knowledge/            # Indexed documentation for RAG (ca/es/en)
└── tests/                # Integration & e2e test suites

Request processing pipeline

flowchart LR
    A[Request] --> B[Auth<br/>X-API-Key]
    B --> C[Rate Limit<br/>slowapi]
    C --> D[validate_string_input<br/>context parameter]
    D --> E[RAG Recall<br/>3 collections]
    E --> F[_sanitize_rag_context<br/>injection filter]
    F --> G[LLM Inference<br/>MLX/Ollama/llama.cpp]
    G --> H[Stream Response<br/>SSE markers]
    H --> I[MEM_SAVE Parsing<br/>fact extraction]
    I --> J[Response<br/>to client]

Plugin System

Server Nexe uses a duck typing protocol (NexeModule Protocol) — no class inheritance, no BasePlugin. Each plugin is a directory under plugins/ with a manifest.toml and a module.py.

5 active plugins:

Plugin Type Key features
mlx_module LLM Backend Apple Silicon native, prefix caching (trie), Metal GPU
llama_cpp_module LLM Backend Universal GGUF, LRU ModelPool, CPU/GPU
ollama_module LLM Backend HTTP bridge to Ollama, auto-start, VRAM cleanup
security Core Dual-key auth, 6 injection detectors + NFKC, 47 jailbreak patterns, rate limiting, RFC5424 audit logging
web_ui_module Interface Web chat, sessions, document upload, MEM_SAVE, RAG sanitization, i18n
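The duck-typing contract can be sketched with `typing.Protocol`. The member names below are illustrative assumptions; the real NexeModule protocol in the repo declares its own interface. The point is that a plugin matches by shape alone, with no base class.

```python
from typing import Protocol, runtime_checkable

@runtime_checkable
class NexeModule(Protocol):
    """Structural interface a plugin satisfies by shape (illustrative members)."""
    name: str

    def start(self) -> None: ...
    def stop(self) -> None: ...

class EchoPlugin:
    """No inheritance from any base class — matching the shape is enough."""
    name = "echo"
    running = False

    def start(self) -> None:
        self.running = True

    def stop(self) -> None:
        self.running = False
```

Because the protocol is `@runtime_checkable`, a loader can verify a freshly imported module.py with a plain `isinstance()` check before wiring it in.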

AI-Ready Documentation

The knowledge/ folder contains 13 thematic documents × 3 languages = 39 files, structured with YAML frontmatter for RAG ingestion:

API, Architecture, Use Cases, Errors, Identity, Installation, Limitations, Plugins, RAG, README, Security, Testing, Usage.

Point any AI assistant at this repo and it can understand the complete architecture.

Language Link
English knowledge/en/README.md
Catalan knowledge/ca/README.md
Spanish knowledge/es/README.md

Security

Server Nexe includes a security module enabled by default:

  • API key authentication on all endpoints
  • CSP headers (script-src 'self', no unsafe-inline)
  • CSRF protection with token validation
  • Rate limiting per IP
  • Input sanitization — 6 injection detectors + Unicode normalization
  • Jailbreak detection — 47-pattern speed-bump detector
  • Upload denylist — blocks accidental upload of API keys, PEM keys
  • Memory injection protection — tag stripping on all input paths
  • RAG injection sanitization — [MEM_SAVE:], [MEM_DELETE:], [OLVIDA|OBLIT|FORGET:], [MEMORIA:] neutralized at ingest and retrieval (v0.9.9)
  • Pipeline enforcement — all chat goes through the canonical endpoints only
  • Encryption at rest — AES-256-GCM, SQLCipher, auto by default, fail-closed (v0.9.2+)
  • Trusted host middleware
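The RAG-injection defense can be illustrated with a small tag defanger. This is a sketch of the idea, not the project's actual `_sanitize_rag_context`; the case-insensitive matching and the bracket-breaking strategy are assumptions.

```python
import re

# Memory-control tags listed above; break their opening bracket so the
# LLM never sees a well-formed command inside retrieved text.
MEM_TAG = re.compile(
    r"\[(MEM_SAVE|MEM_DELETE|OLVIDA|OBLIT|FORGET|MEMORIA):",
    re.IGNORECASE,
)

def sanitize_rag_context(text: str) -> str:
    """Defang memory-control tags in a retrieved chunk."""
    return MEM_TAG.sub(r"(\1:", text)
```

Applying this both at ingest and at retrieval (as the bullet above describes) means a poisoned document can neither store nor delete memories on the user's behalf.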

Note: This project has not been tested in production with real users. Security testing has been performed by AI, not by professional auditors. See SECURITY.md for full disclosure and vulnerability reporting.

Platform Support

Platform                     Status                                                         Backends
macOS Apple Silicon (M1+)    Supported — all 3 backends                                     MLX, llama.cpp, Ollama
macOS Intel                  Not supported since v0.9.9                                     —
macOS 13 Ventura or earlier  Not supported since v0.9.9 (requires macOS 14 Sonoma+)         —
Linux x86_64                 Partial — unit tests pass, CI green, NOT tested in production  llama.cpp, Ollama
Linux ARM64                  Not directly tested                                            llama.cpp, Ollama (theoretical)
Windows                      Not supported                                                  —

Since v0.9.9, server-nexe requires macOS 14 Sonoma+ with Apple Silicon (M1 or later). The pre-built wheels in the DMG are arm64 exclusive. Linux with the llama.cpp and Ollama backends should work, but the full compatibility audit is on the roadmap.

Requirements

        Minimum                               Recommended
OS      macOS 14 Sonoma (Apple Silicon only)  macOS 14+ (Apple Silicon)
CPU     Apple Silicon M1                      Apple Silicon M2 / M3 / M4
Python  3.11+                                 3.12+
RAM     8 GB                                  16 GB+ (for larger models)
Disk    10 GB free                            20 GB+ free

Intel Macs and macOS 13 Ventura are no longer supported. Apple Silicon only (arm64). Linux: Works with llama.cpp and Ollama backends. Full Linux compatibility audit is on the roadmap.

Testing

4842 tests collected (of 4990 total, 148 deselected by default markers) with ~85% code coverage. CI runs the full suite on every push.

# Unit tests
pytest core memory personality plugins -m "not integration and not e2e and not slow" \
  --cov=core --cov=memory --cov=personality --cov=plugins \
  --cov-report=term --tb=short -q

# Integration tests (requires Ollama running)
NEXE_AUTOSTART_OLLAMA=true pytest -m "integration" -q

Roadmap

Server Nexe is actively developed. Here's what has shipped and what's coming:

  • Persistent memory with RAG (v0.9.0)
  • Encryption at rest — AES-256-GCM (v0.9.0)
  • macOS code signing & notarization (v0.9.0)
  • Security hardening — jailbreak detection, upload denylist, pipeline enforcement (v0.9.1)
  • Encryption default auto, fail-closed (v0.9.2)
  • Embeddings on ONNX (fastembed), PyTorch removed (v0.9.3)
  • Multimodal VLM — 4 backends (Ollama, MLX, llama.cpp, Web UI) (v0.9.7)
  • Precomputed KB embeddings (~10.7x faster startup) (v0.9.8)
  • RAG injection sanitization (MEM tags neutralized at ingest and retrieval) (v0.9.9)
  • Offline install bundle — all wheels + embedding model in DMG (~1.2 GB, post-v0.9.9)
  • Thinking toggle endpoint — PATCH /session/{id}/thinking (post-v0.9.9)
  • Native macOS app (SwiftUI, replaces Python tray)
  • Configurable inference parameters via UI
  • Community forum

See CHANGELOG.md for version history.

Limitations

Honest disclosure of what Server Nexe does not do, or does not do well:

  • Local models < cloud — local models are less capable than GPT-4 or Claude. That's the trade-off for privacy.
  • RAG is not perfect — struggles with homonymy, negations, cold start (empty memory), and contradictory information across time periods.
  • Partially OpenAI-compatible API — /v1/chat/completions works. Missing: /v1/embeddings, /v1/models, function calling, and multimodal.
  • Single user — mono-user by design. No multi-device sync, no accounts.
  • No fine-tuning — you cannot train or fine-tune models.
  • New encryption — added in v0.9.0 (default auto since v0.9.2, fail-closed). Not battle-tested. If you lose the master key, data cannot be recovered (see MEK fallback: file → keyring → env → generate).
  • Single developer, single real user — a personal open-source project, not an enterprise product.

See knowledge/en/LIMITATIONS.md for full detail.
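The MEK fallback chain mentioned above (file → keyring → env → generate) can be sketched like this. The file location, env-var name, and hex encoding are illustrative assumptions, and the keyring step is stubbed out to keep the sketch stdlib-only.

```python
import os
import secrets
from pathlib import Path

def resolve_mek(key_file: Path, env_var: str = "NEXE_MEK") -> tuple:
    """Walk the documented fallback chain: file -> keyring -> env -> generate."""
    if key_file.exists():                      # 1. key file on disk
        return key_file.read_bytes(), "file"
    # 2. OS keyring lookup would go here (third-party `keyring` package)
    if env_var in os.environ:                  # 3. environment variable (hex)
        return bytes.fromhex(os.environ[env_var]), "env"
    key = secrets.token_bytes(32)              # 4. generate a fresh 256-bit key...
    key_file.write_bytes(key)                  #    ...and persist it for next start
    return key, "generate"
```

Fail-closed means exactly what the limitation says: if every step of this chain comes up empty after data was encrypted, the data stays unrecoverable.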

Contributing

See CONTRIBUTING.md for setup instructions and guidelines.

Acknowledgments

server-nexe is built on the shoulders of these amazing open-source projects:

AI & Inference

  • MLX — Apple Silicon native ML framework
  • llama.cpp — Efficient GGUF model inference
  • Ollama — Local model management and serving
  • fastembed — ONNX-based text embeddings (replaced sentence-transformers in v0.9.3, saving ~600 MB)
  • sentence-transformers — historical: the original embedding backend, replaced by fastembed in v0.9.3
  • Hugging Face — Model hub and transformers library

Infrastructure

  • Qdrant — Vector search engine powering RAG memory
  • FastAPI — High-performance async web framework
  • Uvicorn — Lightning-fast ASGI server
  • Pydantic — Data validation

Tools & Libraries

  • Rich — Beautiful terminal formatting
  • marked.js — Markdown rendering in the web UI
  • PyPDF — PDF text extraction for RAG
  • rumps — macOS menu bar integration

Security & Monitoring

Also built with: Python, NumPy, httpx, tenacity, Click, Typer, Colorama, python-dotenv, PyYAML, toml, structlog, starlette-csrf, python-multipart, psutil, PyObjC, and Linux.

20% of Enterprise sponsorships go directly to supporting these projects.

Built with AI collaboration · Barcelona

Disclaimer

This software is provided "as is", without warranty of any kind. Use it at your own risk. The author is not responsible for any damage, data loss, security incidents, or misuse arising from the use of this software.

See LICENSE for details.


Version 1.0.2-beta · Apache 2.0 · Made by Jordi Goy in Barcelona

Release History

v1.0.2-beta — High — 4/21/2026
Small but meaningful fixes after v1.0.1-beta; no breaking changes. Fixed: security-filter SQL-detector false positives on natural text. The pattern `r'--\s'` triggered on legitimate user input such as email visual separators (`----------`), RFC 3676 signature delimiters (`-- \n`), em-dashes in prose, and dash-separated enumerations; chat messages containing any of these returned HTTP 400 "SQL detected" at /ui/chat. Replaced with `r'[\'"]\s*--`, which only matches the…

v1.0.1-beta — High — 4/19/2026
Added: memory-delete confirmation flow (MEM_DELETE now requires user confirmation: PENDING_DELETE token + POST /memory/confirm-delete); atomic fact splitter (compound MEM_SAVE facts automatically split into atomic facts, LLM + regex); fallback extractor for models that omit MEM_SAVE tags (e.g. Gemma-3 VLM); tray icon documentation added to the knowledge base (ca/en/es); Linux ARM64 tested via UTM (Ubuntu 24.04, Apple Silicon VM). Fixed: UI…

v1.0.0-beta — High — 4/17/2026
First public pre-1.0 release. Confidence bump from 0.9.9 after the final documentation coherence audit; no functional code changes beyond what 0.9.9 already shipped. The project is now considered a minimum viable product for the real world, open to community feedback. Changed: version metadata bumped to 1.0.0-beta across the codebase (pyproject, plugins, installer, knowledge base); knowledge base consolidated (13 thematic documents × 3 languages = 39 files), wit…

v0.9.2 — Medium — 4/12/2026
Security fixes (P1, after the 2026-04-11 mega consulting review). P1-A: rate-limit UI auth failures per IP (in-memory dict, 60 s window, max 20 attempts → 429); prevents brute force on /ui/chat. P1-B: auth-failure logging from web UI routes to the security log (failures on /ui/* now appear there; previously only /v1/* were logged). P1-C: symlink upload rejection — an os.path.realpath() check post-save rejects uploads whose resolved path falls outside t…

v0.9.1 — Medium — 4/12/2026
Security hardening (mega consulting review): jailbreak speed-bump detector (47 patterns, multilingual); upload content denylist (API tokens, PEM keys); memory injection protection (tag stripping on all paths); pipeline enforcement (removed bypass endpoints); SQLCipher fail-closed behavior; Ollama timeout split (connect=5 s, read=600 s). Quality: 4572 tests, 0 failures; knowledge base coherence audit (36 files updated); Docker files removed (untested, bare-metal…


Similar Packages

  • ai-real-estate-assistant (dev@2026-04-13) — Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
  • txtai (v9.7.0) — 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
  • TV-Show-Recommender-AI (main@2026-04-21) — 🤖 Recommend TV shows by matching favorites, averaging embeddings, and finding similar titles using fuzzy search and vector similarity.
  • Awesome-RAG-Production (main@2026-04-21) — 🚀 Build and scale reliable Retrieval-Augmented Generation (RAG) systems with this curated collection of tools, frameworks, and best practices.
  • OmniLearnAI (main@2026-04-21) — 📚 Learn from diverse sources with OmniLearnAI, an intelligent platform that combines documents, videos, and more, all with reliable citations.