Search results for "decoding"
FlashInfer: Kernel Library for LLM Serving
Scapy: interactive packet manipulation tool
Efficient, Flexible and Portable Structured Generation
Fast inference engine for Transformer models
Python module for audio and music processing
SGLang is a fast serving framework for large language models and vision language models.
No description
Agentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution
Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.
Give any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm
A high-throughput and memory-efficient inference and serving engine for LLMs
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
QuickDesk is the first AI-native remote desktop — an open-source, free application with a built-in MCP (Model Context Protocol) Server that lets any AI agent see and control remote computers.
Teleton: Autonomous AI Agent for Telegram & TON Blockchain
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
Must-read papers on Repository-level Code Generation & Issue Resolution 🔥
A comprehensive Model Context Protocol (MCP) server that provides access to over 100 IT tools and utilities commonly used by developers, system administrators, and IT professionals. Inspired by https:
MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe
This python module helps converting arbitrary Python objects into JSON strings and back.
Apache Arrow Flight clustered vector cache for high throughput Agent memory sharing
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
GEON: Structure-first decoding via equivalence classes and field closure
🚀 A framework for Context Engineering using Google Gemini. Move beyond simple prompting and learn to systematically provide context to your AI coding assistant for more reliable, consistent, and comp
A stateful AI agent framework powered by the Cognitive Lattice to solve complex tasks with persistent memory and reliable tool orchestration.
