freshcrate — Search

Search results for "ai-cache"

1 result found (Python)

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router pythonby peva3Python