freshcrate

Search results for "triton"

8 results found
triton📁3.6.0🌳 Mature19,010

A language and compiler for custom Deep Learning operations

litellm📁v1.83.7-stable🏛️ Flagship44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

monocle📁v0.7.8🌿 Growing79

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

fast-plaid📁1.4.5🌿 Growing245

High-Performance Engine for Multi-Vector Search

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

dbg📁0.0.0🌱 Seedling12

One CLI. Every debugger. Give your AI agent eyes into runtime state instead of guessing from source code.

tritonclient2.67.0🌱 Seedling

Python client library and utilities for communicating with Triton Inference Server