Search results for "quantization"
3 results found (C++)
Fast inference engine for Transformer models
A lightweight, lightning-fast, in-process vector database