freshcrate

Search results for "serving"

35 results found
llama.cpp📁b8864🌳 Mature103,119

LLM inference in C/C++

npcpy📁v1.4.21🌳 Mature1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

LeanKG📁v0.16.5🌱 Seedling32

LeanKG: Stop Burning Tokens. Start Coding Lean.

mentisdb📁0.9.3.39🌿 Growing56

Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett

llm-rl-environments-lil-course📁main@2026-04-17🌿 Growing57

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

vllm📁v0.19.1🌿 Growing76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama

promptfoo📁code-scan-action-0.1.5🌿 Growing19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

coding-proxy📁v0.3.0🌱 Seedling6

A High-Availability, Transparent, and Smart Multi-Vendor Proxy for Claude Code. Support Claude Plans, GitHub Copilot, Google Antigravity, ZAI/GLM, MiniMax, Qwen, Xiaomi, Kimi, Doubao...

awesome-prompts📁main@2026-04-21🌿 Growing7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

deer-flow📁main@2026-04-21🌿 Growing60,446

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

claude-code-guide📁main@2026-04-21🌿 Growing3,908

Claude Code Guide - Setup, commands, workflows, agents, skills & tips-n-tricks to go from beginner to power user!

ag2📁v0.12.0🌿 Growing4,383

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x

endee📁v1.3.4🌿 Growing933

Endee.io – A high-performance vector database, designed to handle up to 1B vectors on a single node, delivering significant performance gains through optimized indexing and execution. Also available i

langgraphjs📁@langchain/langgraph-sdk@1.8.9🌿 Growing2,775

Framework to build resilient language agents as graphs.

Awesome-Agent-Memory📁main@2026-04-16🌿 Growing333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

mcp-go📁v0.48.0🌿 Growing8,573

A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

vllm-mlx📁v0.2.8🌿 Growing798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

AgenticGoKit📁v0.5.9🌿 Growing134

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o

next-plaid📁v1.2.0🌿 Growing331

NextPlaid, ColGREP: Multi-vector search, from database to coding agents.

datagouv-mcp📁v0.2.23🌿 Growing1,216

Official data.gouv.fr Model Context Protocol (MCP) server that allows AI chatbots to search, explore, and analyze datasets from the French national Open Data platform, directly through conversation.

matrixone📁v3.0.9🌱 Seedling1,834

AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.

memora📁v0.2.27🌱 Seedling386

Give your AI agents persistent memory.

spiceai📁v1.11.5🌱 Seedling2,868

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

teleton-agent📁v0.8.6🌱 Seedling66

Teleton: Autonomous AI Agent for Telegram & TON Blockchain

MCP---Agent-Starter-Kit📁main@2026-04-21🌱 Seedling4

🚀 Build and explore multi-agent AI workflows with ready-to-use projects for document serving, Q/A bots, and orchestration.

ragflow📁v0.24.0🌱 Seedling77,784

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

agentic-news-generator📁main@2026-04-20🌱 Seedling1

Generate a custom newspaper with an AI agent based on your favorite YouTube channels.

Modular multi-agent orchestration framework powered by LangGraph and FastAPI.

coai📁v4.0.0💤 Dormant9,059

🚀 Next Generation Multi-tenant AI One-Stop Solution. Builtin Admin & Billing System. Enterprise-Grade Unified LLM Gateway Support for 200+ Models And 35+ Providers, Load Balancing w/ Priority-base Rou

Government-Citizen-Services-Voice-Agent📁main@2026-04-15🌱 Seedling1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

vllm-cli📁v0.2.5💤 Dormant487

A command-line interface tool for serving LLMs using vLLM.

mcp-servers📁monorepo-latest-placeholder@0.0.0💤 Dormant63

MCP (Model Context Protocol) Servers authored and maintained by the PulseMCP team. We build reliable servers thoughtfully designed specifically for MCP Client-powered workflows.

dingo📁v0.9.0⚰️ Archived1,699

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and