freshcrate

Search results for "llm-inference"

7 results found
neuron-ai📁3.3.9🌿 Growing1,834

The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. With its modular architecture it's

plano📁0.4.20🌿 Growing6,241

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

MiniSearch📁main@2026-04-20🌿 Growing553

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

monocle📁v0.7.8🌿 Growing72

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama

spiceai📁v1.11.5🌱 Seedling2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

vllm-cli📁v0.2.5💤 Dormant487

A command-line interface tool for serving LLM using vLLM.