freshcrate

Search results for "inference"

Clear filters
16 results found (Rust)
planoπŸ“0.4.20🌿 Growing⭐6,241

Plano is an AI-native proxy and data plane for agentic apps β€” with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

edgecrabπŸ“v0.7.0🌱 Seedling⭐21

EdgeCrab πŸ¦€ A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw β€” Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters

control-layerπŸ“v8.41.0🌿 Growing⭐62

The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generati

almideπŸ“v0.15.0🌱 Seedling⭐15

A functional programming language optimized for LLM code generation. Compiles to Rust and WebAssembly.

smgπŸ“v1.4.1🌿 Growing⭐156

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,

crab-codeπŸ“main@2026-04-21🌱 Seedling⭐25

πŸ¦€ Open-source alternative to Claude Code, built from scratch in Rust. Agentic coding CLI β€” thinks, plans, and executes with any LLM. Compatible with Claude Code workflows.

oramacoreπŸ“v1.2.38🌱 Seedling⭐249

OramaCore is the complete runtime you need for your projects, answer engines, copilots, and search. It includes a fully-fledged full-text search engine, vector database, LLM interface, and many more u

next-plaidπŸ“v1.2.0🌿 Growing⭐331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

spiceaiπŸ“v1.11.5🌱 Seedling⭐2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

tensorzeroπŸ“2026.4.0🌱 Seedling⭐11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

DreamServerπŸ“v2.0.0🌱 Seedling⭐478

Local AI anywhere, for everyone β€” LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

sandboxed.shπŸ“v0.10.0🌱 Seedling⭐371

Self-hosted orchestrator for AI autonomous agents. Run Claude Code & Open Code in isolated linux workspaces. Manage your skills, configs and encrypted secrets with a git repo.

hrafnπŸ“master@2026-04-18🌱 Seedling⭐2

Lightweight, modular AI agent runtime β€” thinks (Hrafn) and remembers (MuninnDB) πŸ¦β€β¬›

ryvosπŸ“v0.9.0🌱 Seedling⭐2

Open-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.

llm-lsπŸ“0.5.3⚰️ Archived⭐865

LSP server leveraging LLMs for code completion (and more?)