freshcrate

Tag: #llm-inference

7 packages • ⭐ 12,106 total stars

plano • 0.4.20 • 🌿 Growing • ⭐ 6,241

Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing, so you stay focused on your agents' core logic.

spiceai • v1.11.5 • 🌱 Seedling • ⭐ 2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

neuron-ai • 3.3.9 • 🌿 Growing • ⭐ 1,834

The PHP Agentic Framework to build production-ready, AI-driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. With its modular architecture it's

MiniSearch • main@2026-04-20 • 🌿 Growing • ⭐ 553

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

vllm-cli • v0.2.5 • 💤 Dormant • ⭐ 487

A command-line interface tool for serving LLMs using vLLM.

monocle • v0.7.8 • 🌿 Growing • ⭐ 72

Monocle is a framework for tracing GenAI app code. This repo contains the implementation of Monocle for GenAI apps written in Python.

sample-genai-on-eks-starter-kit • v1.1.0 • 🌿 Growing • ⭐ 51

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama