freshcrate — Search

Search results for "vllm"

40 results found (Python)

torchao 📁0.17.0🌳 Mature⭐2,790

Package for applying ao techniques to GPU models

transformers 📁5.5.4🏛️ Flagship⭐159,705

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

deep-learning llm machine-learning nlp pypi python pytorch transformer vlmby The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huPython

restai 📁v6.1.45🌿 Growing⭐485

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

blocky embeddings fastapi langchain llama llamaindex llm ollama python ragby apocasPython

litellm 📁v1.83.7-stable🏛️ Flagship⭐44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm pythonby BerriAIPython

vllm 📁v0.19.1🏛️ Flagship⭐77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference pythonby vllm-projectPython

ContextPilot 📁v0.4.1🌿 Growing⭐79

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

ai-agents context-api context-engineering hermes-agent inference-optimization openclaw prompt-engineering pythonby EfficientContextPython

vllm-mlx 📁v0.2.8🌳 Mature⭐917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

anthropic apple-silicon audio-processing claude-code computer-vision image-understanding inference llm pythonby waybarriosPython

open-responses-server 📁v0.4.3🌿 Growing⭐167

Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.

ai codex generative-ai mcp mcp-client openai openai-api openai-codex pythonby teabranchPython

SurfSense 📁v0.0.19🏛️ Flagship⭐13,883

An open source, privacy focused alternative to NotebookLM for teams with no data limit's. Join our Discord: https://discord.gg/ejRNvftDp9

agent agents ai chrome-extension extension fastapi langchain langgraph python ragby MODSetterPython

onyx 📁v3.2.6🏛️ Flagship⭐27,905

Open Source AI Platform - AI Chat with advanced features that works with every LLM

ai ai-chat chatgpt chatui enterprise-search gen-ai information-retrieval llm python ragby onyx-dot-appPython

cognithor 📁v0.92.3🌿 Growing⭐115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

agent-os ai-agent anthropic autonomous-agent discord-bot document-analysis gdpr-compliant gemini pythonby Alex8791-cyberPython

PraisonAI 📁v4.6.27🏛️ Flagship⭐6,969

PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

agents ai ai-agent-framework ai-agent-sdk ai-agents ai-agents-framework ai-agents-sdk ai-framwork pythonby MervinPraisonPython

llamafarm 📁v0.0.31🌳 Mature⭐819

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

ai aiproject chatgpt claude edge edge-computing finetuning-llms gemma prompt-engineering pythonby llama-farmPython

synaptic-memory 📁v0.16.0🌱 Seedling⭐27

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

ai-agent embedding graph-database hebbian-learning knowledge-graph llm mcp mcp-server pythonby PlateerLabPython

llm-rl-environments-lil-course 📁main@2026-04-17🌿 Growing⭐140

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

course grpo language-models llm llm-agent python reinforcement-learning reinforcement-learning-environments rlvrby anakin87Python

SimpleLLMFunc 📁v0.7.8🌿 Growing⭐77

A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code. 一个简单的恰到

agent agent-development-framework agent-framework agentframeworks agentic agentic-ai agents ai llm-agent pythonby NiJingzhePython

AReaL 📁v1.0.3🏛️ Flagship⭐5,075

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rlby inclusionAIPython

ArcReel 📁v0.9.0🌳 Mature⭐1,871

AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频，跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI

ai-agent ai-video-generator claude-agent-sdk docker gemini grok image-to-video nano-banana-2 pythonby ArcReelPython

chak-ai 📁v0.3.1🌿 Growing⭐212

A simple, yet handy, LLM gateway.

pythonby zhixiangxuePython

LRAT 📁0.0.0🌱 Seedling⭐39

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

animaworks 📁v0.6.2🌿 Growing⭐230

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

agent-framework ai-agents autonomous-agents brain-inspired claude forgetting llm memory pythonby xuiltulPython

RAGLight 📁3.4.7🌳 Mature⭐658

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

agentic-ai agentic-rag agentic-workflow artificial-intelligence data-science framework huggingface lmstudio pythonby Bessouat40Python

outlines 📁1.2.12🏛️ Flagship⭐13,705

Structured Outputs

cfg generative-ai json llms prompt-engineering python regex structured-generation symbolic-aiby dottxt-aiPython

orbit 📁v2.6.6🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ai-assistant ai-gateway ai-safety anthropic chatbot developer-tools elasticsearch llm pythonby schmitechPython

Gito 📁v4.0.3🌿 Growing⭐211

An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issues—such as security vulnerabilities, bugs, and maintainability concerns.

ai ai-code-analysis ai-code-review ai-code-reviewer ai-coding ai-coding-assistant code-analysis code-audit pythonby NayjestPython

PAI-RAG 📁v0.4.3🌿 Growing⭐455

An easy-to-use framework for modular RAG

pythonby aigc-appsPython

deer-flow 📁main@2026-04-21🌿 Growing⭐63,234

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

agent agentic agentic-framework agentic-workflow ai ai-agents deep-research harness pythonby bytedancePython

awesome-opensource-ai 📁main@2026-04-20🌿 Growing⭐2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

agents ai artificial-intelligence awesome awesome-list generative-ai llm machine-learning python ragby alvinrealPython

ai-agents-from-zero 📁main@2026-04-20🌿 Growing⭐416

🚀 2026 最系统的 AI Agent 速成指南｜智能体实战教程 · 完整学习路径 + 实战项目 + 面试题库 · 对标大模型应用开发工程师岗位 · 覆盖LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / 提示词 · 企业级部署与微调 · 从0到企业级落地 + 从学习到上线项目 + 面试准备一体化

agent agent-framework ai-agent aigc coze dify gpt langchain pythonby didililiPython

ToolAgents 📁0.3.0🌱 Seedling⭐35

ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.

agent agents function-calling llm llm-agent llm-agents llms local-llm pythonby Maximilian-WinterPython

deep-research-mcp 📁main@2026-04-13🌿 Growing⭐77

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

pythonby pminerviniPython

Micro-Agent 📁v2.0.0🌿 Growing⭐106

A lightweight AI agent framework for vertical domain applications | 面向垂域应用的轻量级 AI Agent 框架

ai-agent fastapi litellm llm mcp python rag react-agent vertical-domainby fdueblabPython

llm_context_benchmarks 📁0.0.0🌱 Seedling⭐59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

ai benchmarking llms pythonby ivanfioravantiPython

dory 📁v0.1.0🌱 Seedling⭐14

One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.

agents ai-agents claude-code codex docker fastapi knowledge-graph llm model-context-protocol pythonby deeflectPython

vllm-cli 📁v0.2.5💤 Dormant⭐491

A command-line interface tool for serving LLM using vLLM.

llm llm-inference llm-tools python vllmby Chen-zexiPython

daiv 📁v2.0.0🌱 Seedling⭐18

Your AI-powered SWE teammate, built into your git workflow

ai-agent anthropic deepagents genai git github gitlab google-gemini-ai pythonby srtabPython

autopoe 📁v0.2.12🌱 Seedling⭐2

A structured multi-agent framework for coordinated AI collaboration

ai ai-agents artificial-intelligence assistant code-generation fastapi full-stack llm pythonby ImFeH2Python

llm-in-sandbox 📁v0.2.0🌱 Seedling⭐221

Computer Environments Elicit General Agentic Intelligence in LLMs

coding-agent computer-use-agent general-agent pythonby llm-in-sandboxPython

Qwen-Agent 📁v0.0.26💤 Dormant⭐16,132

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

pythonby QwenLMPython

rjobs 📁0.0.0🌱 Seedling⭐1

CLI tool to search and rank remote job opportunities

job-search job-search-automation jobsearch jobsearch-automation llm-tool pythonby pboechatPython