freshcrate

Search results for "serving"

Clear filters
15 results found (Python)
npcpy📁v1.4.21🌳 Mature1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

llm-rl-environments-lil-course📁main@2026-04-17🌿 Growing57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

vllm📁v0.19.1🌿 Growing76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

coding-proxy📁v0.3.0🌱 Seedling6

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

deer-flow📁main@2026-04-21🌿 Growing60,446

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

ag2📁v0.12.0🌿 Growing4,383

AG2 (formerly AutoGen): The Open-Source AgentOS.Join us at: https://discord.gg/sNGSwQME3x

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

vllm-mlx📁v0.2.8🌿 Growing798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

datagouv-mcp📁v0.2.23🌿 Growing1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

memora📁v0.2.27🌱 Seedling386

Give your AI agents persistent memory.

MCP---Agent-Starter-Kit📁main@2026-04-21🌱 Seedling4

🚀 Build and explore multi-agent AI workflows with ready-to-use projects for document serving, Q/A bots, and orchestration.

ragflow📁v0.24.0🌱 Seedling77,784

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Modular multi-agent orchestration framework powered by LangGraph and FastAPI.

Government-Citizen-Services-Voice-Agent📁main@2026-04-15🌱 Seedling1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

vllm-cli📁v0.2.5💤 Dormant487

A command-line interface tool for serving LLM using vLLM.