Search results for "inference"
Plano is an AI-native proxy and data plane for agentic apps β with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.
EdgeCrab π¦ A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw β Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters
The worldβs fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generati
A functional programming language optimized for LLM code generation. Compiles to Rust and WebAssembly.
Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,
π¦ Open-source alternative to Claude Code, built from scratch in Rust. Agentic coding CLI β thinks, plans, and executes with any LLM. Compatible with Claude Code workflows.
OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u
NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Local AI anywhere, for everyone β LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.
Self-hosted AI coding assistant
Lightweight, modular AI agent runtime β thinks (Hrafn) and remembers (MuninnDB) π¦ββ¬
Open-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.
LSP server leveraging LLMs for code completion (and more?)
