freshcrate

Search results for "vllm"

Clear filters
40 results found (Python)
torchao๐Ÿ“0.17.0๐ŸŒณ Matureโญ2,790

Package for applying ao techniques to GPU models

transformers๐Ÿ“5.5.4๐Ÿ›๏ธ Flagshipโญ159,705

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

restai๐Ÿ“v6.1.45๐ŸŒฟ Growingโญ485

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

litellm๐Ÿ“v1.83.7-stable๐Ÿ›๏ธ Flagshipโญ44,168

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

vllm๐Ÿ“v0.19.1๐Ÿ›๏ธ Flagshipโญ77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

ContextPilot๐Ÿ“v0.4.1๐ŸŒฟ Growingโญ79

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

vllm-mlx๐Ÿ“v0.2.8๐ŸŒณ Matureโญ917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

open-responses-server๐Ÿ“v0.4.3๐ŸŒฟ Growingโญ167

Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.

SurfSense๐Ÿ“v0.0.19๐Ÿ›๏ธ Flagshipโญ13,883

An open source, privacy focused alternative to NotebookLM for teams with no data limit's. Join our Discord: https://discord.gg/ejRNvftDp9

onyx๐Ÿ“v3.2.6๐Ÿ›๏ธ Flagshipโญ27,905

Open Source AI Platform - AI Chat with advanced features that works with every LLM

cognithor๐Ÿ“v0.92.3๐ŸŒฟ Growingโญ115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

PraisonAI๐Ÿ“v4.6.27๐Ÿ›๏ธ Flagshipโญ6,969

PraisonAI ๐Ÿฆž โ€” Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

llamafarm๐Ÿ“v0.0.31๐ŸŒณ Matureโญ819

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

synaptic-memory๐Ÿ“v0.16.0๐ŸŒฑ Seedlingโญ27

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

llm-rl-environments-lil-course๐Ÿ“main@2026-04-17๐ŸŒฟ Growingโญ140

๐ŸŒฑ A little course on Reinforcement Learning Environments for evaluating and training Language Models

SimpleLLMFunc๐Ÿ“v0.7.8๐ŸŒฟ Growingโญ77

A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code. ไธ€ไธช็ฎ€ๅ•็š„ๆฐๅˆฐ

AReaL๐Ÿ“v1.0.3๐Ÿ›๏ธ Flagshipโญ5,075

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

ArcReel๐Ÿ“v0.9.0๐ŸŒณ Matureโญ1,871

AI Agent ้ฉฑๅŠจ็š„ๅผ€ๆบ่ง†้ข‘็”Ÿๆˆๅทฅไฝœๅฐ โ€” ๅฐ่ฏดโ†’่ง’่‰ฒ/ๅœบๆ™ฏ/้“ๅ…ท่ฎพ่ฎกโ†’ๅ‰งๆœฌโ†’ๅˆ†้•œๅ›พโ†’่ง†้ข‘๏ผŒ่ทจ้•œๅคด่ง’่‰ฒไธŽๅœบๆ™ฏไธ€่‡ด | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI

chak-ai๐Ÿ“v0.3.1๐ŸŒฟ Growingโญ212

A simple, yet handy, LLM gateway.

LRAT๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ39

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

animaworks๐Ÿ“v0.6.2๐ŸŒฟ Growingโญ230

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

RAGLight๐Ÿ“3.4.7๐ŸŒณ Matureโญ658

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

orbit๐Ÿ“v2.6.6๐ŸŒฟ Growingโญ250

One API for 20+ LLM providers, your databases, and your files โ€” self-hosted, open-source AI gateway with RAG, voice, and guardrails.

Gito๐Ÿ“v4.0.3๐ŸŒฟ Growingโญ211

An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issuesโ€”such as security vulnerabilities, bugs, and maintainability concerns.

PAI-RAG๐Ÿ“v0.4.3๐ŸŒฟ Growingโญ455

An easy-to-use framework for modular RAG

deer-flow๐Ÿ“main@2026-04-21๐ŸŒฟ Growingโญ63,234

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

awesome-opensource-ai๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

ai-agents-from-zero๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ416

๐Ÿš€ 2026 ๆœ€็ณป็ปŸ็š„ AI Agent ้€ŸๆˆๆŒ‡ๅ—๏ฝœๆ™บ่ƒฝไฝ“ๅฎžๆˆ˜ๆ•™็จ‹ ยท ๅฎŒๆ•ดๅญฆไน ่ทฏๅพ„ + ๅฎžๆˆ˜้กน็›ฎ + ้ข่ฏ•้ข˜ๅบ“ ยท ๅฏนๆ ‡ๅคงๆจกๅž‹ๅบ”็”จๅผ€ๅ‘ๅทฅ็จ‹ๅธˆๅฒ—ไฝ ยท ่ฆ†็›–LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธš็บง้ƒจ็ฝฒไธŽๅพฎ่ฐƒ ยท ไปŽ0ๅˆฐไผไธš็บง่ฝๅœฐ + ไปŽๅญฆไน ๅˆฐไธŠ็บฟ้กน็›ฎ + ้ข่ฏ•ๅ‡†ๅค‡ไธ€ไฝ“ๅŒ–

ToolAgents๐Ÿ“0.3.0๐ŸŒฑ Seedlingโญ35

ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.

deep-research-mcp๐Ÿ“main@2026-04-13๐ŸŒฟ Growingโญ77

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

Micro-Agent๐Ÿ“v2.0.0๐ŸŒฟ Growingโญ106

A lightweight AI agent framework for vertical domain applications | ้ขๅ‘ๅž‚ๅŸŸๅบ”็”จ็š„่ฝป้‡็บง AI Agent ๆก†ๆžถ

llm_context_benchmarks๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ59

๐Ÿ“Š LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

dory๐Ÿ“v0.1.0๐ŸŒฑ Seedlingโญ14

One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.

vllm-cli๐Ÿ“v0.2.5๐Ÿ’ค Dormantโญ491

A command-line interface tool for serving LLM using vLLM.

daiv๐Ÿ“v2.0.0๐ŸŒฑ Seedlingโญ18

Your AI-powered SWE teammate, built into your git workflow

autopoe๐Ÿ“v0.2.12๐ŸŒฑ Seedlingโญ2

A structured multi-agent framework for coordinated AI collaboration

llm-in-sandbox๐Ÿ“v0.2.0๐ŸŒฑ Seedlingโญ221

Computer Environments Elicit General Agentic Intelligence in LLMs

Qwen-Agent๐Ÿ“v0.0.26๐Ÿ’ค Dormantโญ16,132

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

rjobs๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ1

CLI tool to search and rank remote job opportunities