freshcrate

Search results for "triton"

Clear filters
6 results found (Python)
litellm📁v1.83.7-stable🏛️ Flagship44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

monocle📁v0.7.8🌿 Growing79

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

fast-plaid📁1.4.5🌿 Growing245

High-Performance Engine for Multi-Vector Search

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

tritonclient2.67.0🌱 Seedling

Python client library and utilities for communicating with Triton Inference Server