freshcrate

Search results for "vllm"

Clear filters
5 results found (JavaScript)
houtini-lm📁v2.8.0🌿 Growing71

MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama

pickle-rick-claude📁v1.44.3🌱 Seedling21

🥒 Pickle Rick for Claude Code — autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.

xiaozhi-esp32-server📁v0.9.2🏛️ Flagship9,342

本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.

omnishapeagent1.0.12🌱 Seedling

Local AI agent with chat, tool use, persistent memory, and vision