freshcrate

Search results for "decoding"

35 results found
flashinfer-python📁0.6.8.post1🏛️ Flagship5,467

FlashInfer: Kernel Library for LLM Serving

e2b📁2.20.0🏛️ Flagship11,835

E2B SDK that give agents cloud environments

scapy📁2.7.0🏛️ Flagship12,202

Scapy: interactive packet manipulation tool

xgrammar📁0.1.33🌳 Mature1,637

Efficient, Flexible and Portable Structured Generation

ctranslate2📁4.7.1🌳 Mature4,444

Fast inference engine for Transformer models

librosa📁0.11.0🏛️ Flagship8,341

Python module for audio and music processing

sglang📁0.5.10.post1🏛️ Flagship26,220

SGLang is a fast serving framework for large language models and vision language models.

asyncpg📁0.31.0🏛️ Flagship7,999

An asyncio PostgreSQL driver

tokenizers📁0.22.2🏛️ Flagship10,652

No description

httpcore📁1.0.9🌳 Mature538

A minimal low-level HTTP client.

httpx📁0.28.1🏛️ Flagship15,213

The next generation HTTP client.

llama.cpp📁b8871🏛️ Flagship105,537

LLM inference in C/C++

Agenvoy📁v0.19.4🌿 Growing61

Agentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution

mesh-llm📁v0.64.0🌳 Mature834

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

GhostDesk📁v7.1.0🌱 Seedling44

Give any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

vmlx📁v1.3.34🌿 Growing348

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

QuickDesk📁v2.8.0.0🌿 Growing150

QuickDesk is the first AI-native remote desktop — an open-source, free application with a built-in MCP (Model Context Protocol) Server that lets any AI agent see and control remote computers.

teleton-agent📁v0.8.6🌿 Growing70

Teleton: Autonomous AI Agent for Telegram & TON Blockchain

cyllama📁0.2.11🌱 Seedling25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

coding-proxy📁v0.3.0🌱 Seedling13

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

pycocotools📁2.0.11🌱 Seedling169

Official APIs for the MS-COCO dataset

awesome-code-agents📁main@2026-04-20🌿 Growing98

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

Awesome-Agent-Memory📁main@2026-04-16🌿 Growing363

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

Awesome-Repo-Level-Code-Generation📁main@2026-04-10🌿 Growing280

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

it-tools-mcp📁v5.10.7🌱 Seedling21

A comprehensive Model Context Protocol (MCP) server that provides access to over 100 IT tools and utilities commonly used by developers, system administrators, and IT professionals. Inspired by https:

moralstack📁v0.3.1🌱 Seedling8

MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe

jsonconversion📁1.2.1🌱 Seedling9

This python module helps converting arbitrary Python objects into JSON strings and back.

longbow📁0.1.8🌱 Seedling8

Apache Arrow Flight clustered vector cache for high throughput Agent memory sharing

ds_ex📁main@2026-04-09🌱 Seedling17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

geon-decoder📁main@2026-04-11🌱 Seedling3

GEON: Structure-first decoding via equivalence classes and field closure

Context-Engineering📁0.0.0💤 Dormant81

🚀 A framework for Context Engineering using Google Gemini. Move beyond simple prompting and learn to systematically provide context to your AI coding assistant for more reliable, consistent, and comp

pyannote-audio4.0.4🌱 Seedling

State-of-the-art speaker diarization toolkit

CognitiveLattice📁0.0.0💤 Dormant11

A stateful AI agent framework powered by the Cognitive Lattice to solve complex tasks with persistent memory and reliable tool orchestration.