
orbit

One API for 20+ LLM providers, your databases, and your files: a self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ORBIT: Open Retrieval-Based Inference Toolkit

One API for 29 LLM providers, 17 data sources, and your files.




Get running in 60 seconds

git clone https://github.com/schmitech/orbit.git && cd orbit/docker
docker compose up -d

Then test it:

curl -X POST http://localhost:3000/v1/chat \
  -H 'Content-Type: application/json' \
  -H 'X-API-Key: default-key' \
  -H 'X-Session-ID: local-test' \
  -d '{
    "messages": [{"role": "user", "content": "Summarize ORBIT in one sentence."}],
    "stream": false
  }'

That's it. ORBIT is listening on port 3000 with an admin panel at localhost:3000/admin (default login: admin / admin123).
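
For scripted checks, the same request can be sent from Python. This is a minimal sketch using only the standard library; the URL, headers, and body mirror the curl example above, while the function names `build_chat_request` and `send_chat` are hypothetical helpers introduced here. The response schema is not described in this README, so the sketch returns the decoded JSON untouched rather than assuming any field names.

```python
import json
import urllib.request

ORBIT_URL = "http://localhost:3000/v1/chat"  # endpoint taken from the curl example


def build_chat_request(prompt, api_key="default-key", session_id="local-test"):
    """Build (but do not send) the same POST request as the curl example."""
    payload = {
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
    return urllib.request.Request(
        ORBIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "X-API-Key": api_key,
            "X-Session-ID": session_id,
        },
        method="POST",
    )


def send_chat(prompt, timeout=30.0):
    """Send the request and return the decoded JSON body as-is."""
    request = build_chat_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# With the Docker stack running:
# print(send_chat("Summarize ORBIT in one sentence."))
```

Keeping request construction separate from sending makes it possible to inspect the headers and payload without a running server.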

For GPU acceleration: docker compose -f docker-compose.yml -f docker-compose.gpu.yml up -d

Adapter wiring and sample domains live in config/adapters/ and examples/intent-templates/.


What's in the box

Layer | Coverage
LLM & inference | 29 providers: OpenAI, Anthropic, Gemini, Cohere, Groq, DeepSeek, Mistral, xAI, AWS Bedrock, Azure, Vertex, Together, Fireworks, Perplexity, Replicate, OpenRouter, Watson, NVIDIA, Hugging Face, Ollama (local/cloud/remote), vLLM, TensorRT-LLM, llama.cpp, Shimmy, BitNet (1.58-bit), Transformers, Z.ai
Data sources | 17: Postgres, MySQL, MariaDB, SQL Server, Oracle, SQLite, MongoDB, Redis, Cassandra, DuckDB, Athena, Elasticsearch, Supabase + HTTP/REST, GraphQL, Firecrawl
Vector stores | Chroma, Qdrant, Pinecone, Milvus, Weaviate, Elasticsearch
Embeddings | 10 providers: OpenAI, Cohere, Jina, Voyage, Mistral, Gemini, OpenRouter, Ollama, llama.cpp, Sentence-Transformers
Rerankers | 6 providers: Cohere, Jina, Voyage, OpenAI, Anthropic, Ollama
Moderation / guardrails | OpenAI, Anthropic, Llama Guard (local), pluggable chain
Voice | Full-duplex speech-to-speech via PersonaPlex; STT (Whisper, Google, Gemini), TTS (OpenAI, ElevenLabs, Coqui)
Protocols | OpenAI-compatible chat API + MCP: drop-in tool server for OpenClaw, Claude Desktop, Cursor, or any MCP client

Why ORBIT stands out

Most AI gateways stop at provider routing. ORBIT is built for the messy parts of production RAG.

  • Intent-based retrieval, not just vector search: ship real queries, not vector guesses. Users ask in natural language; ORBIT picks the right template and runs the query against your data.
  • Cross-adapter RAG across mixed databases + APIs: one question, many sources. Fan a query out to SQL, MongoDB, HTTP, and more in parallel and let the LLM merge the answers.
  • Template diagnostics: iterate on intent templates without burning LLM tokens.
  • Conversation threading with cached datasets: branch off any turn; follow-ups reuse the retrieved data instead of re-querying the DB.
  • Circuit breakers + parallel fan-out: resilient adapter orchestration that survives provider hiccups.
  • Autocomplete that knows your data: fuzzy-matched suggestions sourced from your intent templates.
  • Two-layer rate limiting: IP limits plus per-API-key quotas with progressive throttling.
  • Multilingual by default: 100+ languages with conversation stickiness so the model doesn't flap between turns.
  • OpenClaw / MCP integration: drop ORBIT into any OpenClaw agent as a tool server with a single config entry.
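
To make the "circuit breakers + parallel fan-out" idea concrete, here is a generic sketch of the pattern in Python with asyncio. It is not ORBIT's implementation: the class `CircuitBreaker`, the functions `call_adapter` and `fan_out`, and the thresholds are all hypothetical, and each adapter is modeled as a plain async function.

```python
import asyncio
import time


class CircuitBreaker:
    """Opens after `max_failures` consecutive errors; allows a retry after `reset_after` seconds."""

    def __init__(self, max_failures=3, reset_after=30.0):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def allow(self):
        if self.opened_at is None:
            return True
        # Half-open: let a trial call through once the cool-down has passed.
        return time.monotonic() - self.opened_at >= self.reset_after

    def record(self, ok):
        if ok:
            self.failures, self.opened_at = 0, None
        else:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.monotonic()


async def call_adapter(name, fn, breaker, question):
    """Call one adapter unless its circuit is open; failures yield None instead of raising."""
    if not breaker.allow():
        return name, None
    try:
        result = await fn(question)
    except Exception:
        breaker.record(False)
        return name, None
    breaker.record(True)
    return name, result


async def fan_out(adapters, breakers, question):
    """Query every adapter concurrently and keep only the answers that arrived."""
    tasks = [call_adapter(n, fn, breakers[n], question) for n, fn in adapters.items()]
    pairs = await asyncio.gather(*tasks)
    return {name: result for name, result in pairs if result is not None}
```

Because each adapter call swallows its own failure, one unavailable source degrades the merged answer instead of failing the whole request, and the breaker stops repeated calls to a source that keeps failing.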

What can you build with ORBIT?

  • Ask your database questions in any language: connect Postgres, MySQL, MongoDB, DuckDB, Elasticsearch, or any of the other 12 sources and query them with natural language.
  • Query across Postgres + MongoDB + a REST API in one prompt: true multi-source RAG, no pipeline glue.
  • Switch LLM providers without changing code: swap between 29 providers with a single config line.
  • Build full-duplex voice agents: speech-to-speech with interruption handling via PersonaPlex.
  • Plug ORBIT into OpenClaw in minutes: one config entry turns ORBIT into a tool server for any OpenClaw agent.
  • Power agentic workflows: MCP-compatible with Claude Desktop, Cursor, and custom agents.
  • Upload files and get answers: RAG over PDFs, images, and documents out of the box.
  • Add guardrails and moderation: chain OpenAI, Anthropic, or local Llama Guard moderators.
  • Keep everything private: self-host on your own infrastructure with RBAC, audit logs, and two-layer rate limiting.
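
The two-layer rate limiting mentioned above (IP limits plus per-API-key quotas with progressive throttling) can be illustrated with a small sliding-window sketch. Everything here is an assumption made for illustration: the class name `TwoLayerLimiter`, its parameters, and the linear delay curve are hypothetical and do not describe ORBIT's actual limiter or configuration.

```python
import time
from collections import defaultdict


class TwoLayerLimiter:
    """Sliding-window limiter: a per-IP cap plus a per-API-key quota with a soft threshold."""

    def __init__(self, ip_limit, key_quota, window=60.0, soft_ratio=0.8, clock=time.monotonic):
        self.ip_limit = ip_limit
        self.key_quota = key_quota
        self.window = window
        self.soft_ratio = soft_ratio
        self.clock = clock
        self.ip_hits = defaultdict(list)
        self.key_hits = defaultdict(list)

    def _recent(self, hits):
        # Drop timestamps that fell out of the window, then count what remains.
        cutoff = self.clock() - self.window
        hits[:] = [t for t in hits if t > cutoff]
        return len(hits)

    def check(self, ip, api_key):
        """Return (allowed, delay_seconds) for one incoming request."""
        ip_used = self._recent(self.ip_hits[ip])
        key_used = self._recent(self.key_hits[api_key])
        if ip_used >= self.ip_limit or key_used >= self.key_quota:
            return False, 0.0
        now = self.clock()
        self.ip_hits[ip].append(now)
        self.key_hits[api_key].append(now)
        # Progressive throttling: past the soft threshold, the suggested delay
        # grows linearly and reaches 1 second at the full quota.
        soft = self.soft_ratio * self.key_quota
        used = key_used + 1
        delay = 0.0 if used <= soft else (used - soft) / (self.key_quota - soft)
        return True, delay
```

A caller would sleep for `delay_seconds` before serving an allowed request and return an HTTP 429 when `allowed` is false; the injectable `clock` exists only to make the limiter easy to test.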

Real-world example: PoliceStats.ca

ORBIT is used in production at PoliceStats.ca, a public-facing AI search and analytics site for Canadian municipal police open data.

PoliceStats uses ORBIT to:

  • Route users across many dataset-specific adapters for cities like Toronto, Ottawa, Montreal, Edmonton, Hamilton, Winnipeg, Saskatoon, Vancouver, and Canada-wide statistics
  • Query structured public-safety datasets using natural language
  • Return grounded answers with source citations back to the relevant open data portal
  • Support both broad city assistants and narrow subdomain assistants
  • Power a production web chat experience with typed and voice interaction

PoliceStats is a useful reference if you want to see ORBIT applied to a real vertical product instead of only toy examples: one OpenAI-compatible API, many adapters, structured retrieval over public data, and answers designed for end users rather than internal analysts.


Build a business on ORBIT

ORBIT is licensed under Apache 2.0, so you can build and sell commercial products on top of it without royalties or per-seat licensing back to the project. Below are product patterns that map cleanly to ORBIT's primitives (intent adapters, composite routing, voice, MCP, guardrails). Each row links to a cookbook recipe with a working configuration you can fork as a starting point.

Product pattern | What you're actually selling | Starting point
Vertical database copilot | A natural-language query layer over one industry's schemas (healthcare claims, legal discovery, logistics, retail ops, public safety), sold per-seat to analysts who don't write SQL | Database copilot · NL-BI
Private AI gateway for regulated data | On-prem or VPC-only deployment for banks, hospitals, insurers, and government, with RBAC, audit logs, guardrails, and data residency baked in | Private gateway
Full-duplex voice assistant | Voice-first copilots for call centers, field service, drive-throughs, and lobby kiosks, priced per concurrent channel or per minute | Field service voice · PersonaPlex overview
Enterprise system chat connectors | Chat-native access to ServiceNow, Salesforce, Jira, SAP, or internal APIs, sold per connector or as an integration bundle | ServiceNow connector · REST / GraphQL
Document Q&A for teams | Knowledge-base SaaS over PDFs, contracts, policies, and manuals: semantic search with citations, team subscriptions | File-upload RAG · Vector stores
Managed ORBIT hosting | Run ORBIT-as-a-Service for customers who don't want to operate it: SLA-backed, failover, metered usage, multi-tenant API keys | Production deployment · Resilient gateway · Rate limits & quotas
MCP tools for agent platforms | Package domain-specific ORBIT adapters as MCP tools and sell them to Claude, Cursor, OpenClaw, and custom-agent users | MCP / OpenClaw
Market-intel chat over the public web | Subscription products that answer questions about competitor pricing, regulatory filings, and news by chat-indexing live web content | Firecrawl web queries

Already shipped in production: see PoliceStats.ca above for a vertical public-data product built on ORBIT. If you build something on top, open an issue and we'll feature it here.


Why ORBIT?

Without ORBIT | With ORBIT
One SDK per provider, rewrites when you switch | One OpenAI-compatible API across 29 providers
Separate pipelines for retrieval and inference | Unified model + retrieval + tooling gateway
Fragile glue scripts between data sources and LLMs | 9 intent-adapter archetypes with template diagnostics
Separate tools for each database, with no way to combine them | Composite adapters fan one prompt across SQL + NoSQL + HTTP, merged by the LLM
Cascading failures when a provider hiccups | Circuit breakers, parallel fan-out, progressive throttling
No visibility into what models are doing | Built-in RBAC, quota-aware rate limiting, and audit logging

Clients

Client | Description
Web Chat | React UI
CLI | pip install schmitech-orbit-client
Mobile | iOS & Android (Expo)
Node SDK | Or use any OpenAI-compatible SDK

Deployment options

Docker Compose (fastest path)
git clone https://github.com/schmitech/orbit.git && cd orbit/docker
docker compose up -d

Starts ORBIT + Ollama with SmolLM2, auto-pulls models, and exposes the API on port 3000. The web admin UI is at /admin on the same host. Connect orbitchat from your host:

ORBIT_ADAPTER_KEYS='{"simple-chat":"default-key"}' npx orbitchat
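
When more than one adapter key is involved, hand-quoting the JSON value in a shell gets error-prone. The sketch below builds the same `ORBIT_ADAPTER_KEYS` value from a Python dict; the helper name `orbitchat_env` is hypothetical, and the only facts taken from this README are the variable name, the `simple-chat` / `default-key` pair, and the `npx orbitchat` command.

```python
import json
import os


def orbitchat_env(adapter_keys, base_env=None):
    """Return a copy of the environment with ORBIT_ADAPTER_KEYS set to compact JSON."""
    env = dict(os.environ if base_env is None else base_env)
    env["ORBIT_ADAPTER_KEYS"] = json.dumps(adapter_keys, separators=(",", ":"))
    return env


env = orbitchat_env({"simple-chat": "default-key"}, base_env={})
print(env["ORBIT_ADAPTER_KEYS"])  # {"simple-chat":"default-key"}

# To launch the client with this environment (requires Node.js and npx):
# import subprocess
# subprocess.run(["npx", "orbitchat"], env=orbitchat_env({"simple-chat": "default-key"}), check=True)
```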

See the full Docker Guide for GPU mode, volumes, and configuration.

Pre-built image (server only)
docker pull schmitech/orbit:basic
docker run -d --name orbit-basic -p 3000:3000 schmitech/orbit:basic

If Ollama runs on your host, add -e OLLAMA_HOST=host.docker.internal:11434 so the container can reach it. This image includes the simple-chat adapter only.

From release tarball (production)
curl -L https://github.com/schmitech/orbit/releases/download/v2.6.6/orbit-2.6.6.tar.gz -o orbit-2.6.6.tar.gz
tar -xzf orbit-2.6.6.tar.gz && cd orbit-2.6.6

cp env.example .env && ./install/setup.sh
source venv/bin/activate
./bin/orbit.sh start && cat ./logs/orbit.log

Resources

  • Step-by-Step Tutorial: Chat with your own data in minutes
  • Cookbook: 20+ recipes covering database copilots, voice assistants, fault tolerance, MCP agents, private gateways
  • Documentation: Full architecture and setup guides
  • GitHub Issues: Bug reports and feature requests

Contributing

Contributions are welcome! Check the issues for good first tasks, or open a new one to discuss your idea.

If you find ORBIT useful, a star helps others discover the project.


License

Apache 2.0. See LICENSE.

Release History

v2.7.0 [2.7.0] - 2026-05-31 (Urgency: High, Date: 5/31/2026)
Core System Updates
  • MCP Client: Added ORBIT-as-an-MCP-client support with the `mcp-agent` skill, external MCP server discovery, namespaced tool execution, and bounded multi-step tool-calling.
  • Skills: Added the web-search skill for provider-native web search through the conversational adapter path.
  • Providers: Fixed Mistral SDK import compatibility and updated the xAI Grok image model name.
  • Adapters: Updated intent-adapter examples for the current structu...

v2.6.8 [2.6.8] - 2026-05-24 (Urgency: High, Date: 5/24/2026)
Core System Updates
  • Document Generation: Introduced document generation skill and adapter supporting PDF, DOCX, and other formats.
  • Video Generation: Added xAI (Grok) video generation provider with support for duration, resolution, and aspect ratio settings.
  • Adapter Management: Fixed capability re-registration during adapter hot-reloads (#167); updated sample adapter configurations with new settings.
Chat-app & UI Improvements
  • orbitchat v3.8.5: Pu...

v2.6.7 [2.6.7] - 2026-05-08 (Urgency: High, Date: 5/9/2026)
Core System Updates
  • Image Generation: End-to-end support for image generation across multiple providers (OpenAI, Gemini/Imagen 4, Ollama, xAI); pipeline integration with image-specific generation steps and file-backed persistence in SQLite; integrated into Chat thread renderer
  • Skills System: Skill discovery and invocation framework (slash-command based picker), persistence in composer, and runtime enrichment of image prompts with conversation/thread context

v2.6.6 [2.6.6] - 2026-04-19 (Urgency: High, Date: 4/19/2026)
Core System Updates
  • Composite adapter: Enabled composite adapter end-to-end, reorganized prompt examples, and added composite cross-adapter template hot reload so the admin reload flow rebuilds cross-adapter template embeddings and vector collections (with tests for the reload and disabled/no-op paths)
  • Cross-adapter templates: Added cross-domain intent templates layered on top of individual adapters
  • OpenAI Realtime voice: Added `openai_realtime` adapter (`...

v2.6.5 [2.6.5] - 2026-04-06 (Urgency: High, Date: 4/6/2026)
Core System Updates
  • PostgreSQL: Migrated from `psycopg2-binary` to `psycopg[binary,pool]` 3.3.3 across datasources, retrievers, vector stores, and examples (`ConnectionPool`, `dict_row`, `row_factory`); isolated the postgres sample adapter into its own module; fixed customer-order SQL templates and sql-intent parsing; added customer-orders sample data utilities
  • MCP: Replaced unmaintained `fastapi-mcp` with `fastmcp` (`FastMCP.from_fastapi()` + mount); `/mcp` ...

v2.6.4 [2.6.4] - 2026-03-26 (Urgency: Medium, Date: 3/26/2026)
Core System Updates
  • Admin: Adapters tab with searchable list, per-adapter Ace YAML editor, enable/disable toggles, and read/write/list endpoints aligned with reload-adapters and reload-templates
  • Chat feedback: Thumbs up/down wired to feedback API and database with idempotent create/remove/switch on main replies and thread replies; responses always return `assistant_message_id` for feedback
  • Config: `adapters[].inputPlaceholder` from `orbitchat.yaml` through...
