Search results for "benchmarking"
3 results found (Rust)
LeanKG: Stop Burning Tokens. Start Coding Lean.
The worldβs fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generati
β‘πΎ Vectro β Compress LLM embeddings π§ π Save memory, speed up retrieval, and keep semantic accuracy π―β¨ Lightning-fast quantization for Python + Mojo, vector DB friendly ποΈ, and perfect for RAG pip
