freshcrate

Search results for "quantization"

3 results found (C++)
ctranslate2 📁 4.7.1 🌳 Mature 4,444

Fast inference engine for Transformer models

llama.cpp 📁 b8871 🏛️ Flagship 105,537

LLM inference in C/C++

zvec 📁 v0.3.1 🌳 Mature 9,474

A lightweight, lightning-fast, in-process vector database