freshcrate

Search results for "vllm"

52 results found
pi-mono๐Ÿ“v0.68.0๐ŸŒณ Matureโญ34,430

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

restai๐Ÿ“v6.1.45๐ŸŒฟ Growingโญ483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

litellm๐Ÿ“v1.83.7-stable๐ŸŒณ Matureโญ42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

PraisonAI๐Ÿ“v4.6.25๐ŸŒณ Matureโญ6,900

PraisonAI ๐Ÿฆž โ€” Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

skales๐Ÿ“v10.0.4๐ŸŒณ Matureโญ769

Your local AI Desktop Agent for Windows, macOS & Linux. Agent Skills (SKILL.md), autonomous coding (Codework), multi-agent teams, desktop automation, 15+ AI providers, Desktop Buddy. No Docker, no ter

pickle-rick-claude๐Ÿ“v1.44.3๐ŸŒฑ Seedlingโญ19

๐Ÿฅ’ Pickle Rick for Claude Code โ€” autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.

Agent๐Ÿ“1.0.75.164๐ŸŒฑ Seedlingโญ30

Agent! connects any AI to your Mac. 13 LLM providers โ€” cloud, local, or on-device. It writes code, builds Xcode projects, manages git, organizes files, automates Safari, controls any app, and handl

llm-rl-environments-lil-course๐Ÿ“main@2026-04-17๐ŸŒฟ Growingโญ57

๐ŸŒฑ A little course on Reinforcement Learning Environments for evaluating and training Language Models

obsidian-local-llm-hub๐Ÿ“0.12.2๐ŸŒฑ Seedlingโญ27

All-in-one local AI hub for Obsidian โ€” LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.

WeKnora๐Ÿ“v0.4.0๐ŸŒณ Matureโญ13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

LRAT๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

cactus๐Ÿ“0.0.0๐ŸŒฟ Growingโญ50

LLM Agent that leverages cheminformatics tools to provide informed responses.

aitools_client๐Ÿ“0.0.0๐ŸŒฟ Growingโญ182

Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters

openlit๐Ÿ“openlit-1.18.1๐ŸŒฟ Growingโญ2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. ๐Ÿš€๐Ÿ’ป Integrates with 50+ LLM Providers,

cognithor๐Ÿ“v0.92.2๐ŸŒฟ Growingโญ94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

ArcReel๐Ÿ“v0.9.0๐ŸŒฟ Growingโญ1,650

AI Agent ้ฉฑๅŠจ็š„ๅผ€ๆบ่ง†้ข‘็”Ÿๆˆๅทฅไฝœๅฐ โ€” ๅฐ่ฏดโ†’่ง’่‰ฒ/ๅœบๆ™ฏ/้“ๅ…ท่ฎพ่ฎกโ†’ๅ‰งๆœฌโ†’ๅˆ†้•œๅ›พโ†’่ง†้ข‘๏ผŒ่ทจ้•œๅคด่ง’่‰ฒไธŽๅœบๆ™ฏไธ€่‡ด | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI

vllm๐Ÿ“v0.19.1๐ŸŒฟ Growingโญ76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

sample-genai-on-eks-starter-kit๐Ÿ“v1.1.0๐ŸŒฟ Growingโญ51

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: ๐Ÿš€ AI Gateway (LiteLLM) ๐Ÿค– LLM Serving (vLLM, SGLang, Ollama

hermes-workspace๐Ÿ“v2.0.0๐ŸŒฟ Growingโญ1,124

Native web workspace for Hermes Agent โ€” chat, terminal, memory, skills, inspector.

vllm-mlx๐Ÿ“v0.2.8๐ŸŒฟ Growingโญ798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

synaptic-memory๐Ÿ“v0.16.0๐ŸŒฑ Seedlingโญ25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

smg๐Ÿ“v1.4.1๐ŸŒฟ Growingโญ156

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,

oh-my-pi๐Ÿ“v14.1.2๐ŸŒฟ Growingโญ2,872

โŒฅ AI Coding agent for the terminal โ€” hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

crab-code๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ25

๐Ÿฆ€ Open-source alternative to Claude Code, built from scratch in Rust. Agentic coding CLI โ€” thinks, plans, and executes with any LLM. Compatible with Claude Code workflows.

awesome-prompts๐Ÿ“main@2026-04-21๐ŸŒฟ Growingโญ7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

deer-flow๐Ÿ“main@2026-04-21๐ŸŒฟ Growingโญ60,446

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

ai-agents-from-zero๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ264

๐Ÿš€ 2026 ๆœ€็ณป็ปŸ็š„ AI Agent ้€ŸๆˆๆŒ‡ๅ—๏ฝœๆ™บ่ƒฝไฝ“ๅฎžๆˆ˜ๆ•™็จ‹ ยท ๅฎŒๆ•ดๅญฆไน ่ทฏๅพ„ + ๅฎžๆˆ˜้กน็›ฎ + ้ข่ฏ•้ข˜ๅบ“ ยท ๅฏนๆ ‡ๅคงๆจกๅž‹ๅบ”็”จๅผ€ๅ‘ๅทฅ็จ‹ๅธˆๅฒ—ไฝ ยท ่ฆ†็›–LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธš็บง้ƒจ็ฝฒไธŽๅพฎ่ฐƒ ยท ไปŽ0ๅˆฐไผไธš็บง่ฝๅœฐ + ไปŽๅญฆไน ๅˆฐไธŠ็บฟ้กน็›ฎ + ้ข่ฏ•ๅ‡†ๅค‡ไธ€ไฝ“ๅŒ–

AIGC-Interview-Book๐Ÿ“main@2026-04-19๐ŸŒฟ Growingโญ3,447

ใ€ไธ‰ๅนด้ข่ฏ•ไบ”ๅนดๆจกๆ‹Ÿใ€‘AIGC็ฎ—ๆณ•ๅทฅ็จ‹ๅธˆ้ข่ฏ•็ง˜็ฑใ€‚ๆถต็›–AIGCใ€LLMๅคงๆจกๅž‹ใ€AI Agentใ€ไผ ็ปŸๆทฑๅบฆๅญฆไน ใ€่‡ชๅŠจ้ฉพ้ฉถใ€ๆœบๅ™จๅญฆไน ใ€่ฎก็ฎ—ๆœบ่ง†่ง‰ใ€่‡ช็„ถ่ฏญ่จ€ๅค„็†ใ€ๅผบๅŒ–ๅญฆไน ใ€ๅคงๆ•ฐๆฎๆŒ–ๆŽ˜ใ€ๅ…ท่บซๆ™บ่ƒฝใ€ๅ…ƒๅฎ‡ๅฎ™ใ€AGI็ญ‰AI่กŒไธš้ข่ฏ•็ฌ”่ฏ•ๅนฒ่ดง็ป้ชŒไธŽๆ ธๅฟƒ็Ÿฅ่ฏ†ใ€‚

orbit๐Ÿ“v2.6.6๐ŸŒฟ Growingโญ250

One API for 20+ LLM providers, your databases, and your files โ€” self-hosted, open-source AI gateway with RAG, voice, and guardrails.

AReaL๐Ÿ“v1.0.3๐ŸŒฟ Growingโญ5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

baml๐Ÿ“0.221.0๐ŸŒฟ Growingโญ7,955

The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

deep-research-mcp๐Ÿ“main@2026-04-13๐ŸŒฟ Growingโญ58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

AgenticGoKit๐Ÿ“v0.5.9๐ŸŒฟ Growingโญ134

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o

chak-ai๐Ÿ“v0.3.1๐ŸŒฟ Growingโญ211

A simple, yet handy, LLM gateway.

ds_ex๐Ÿ“main@2026-04-09๐ŸŒฑ Seedlingโญ17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

open-responses-server๐Ÿ“v0.4.3๐ŸŒฑ Seedlingโญ161

Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.

LocalAI๐Ÿ“v4.1.3๐ŸŒฑ Seedlingโญ45,254

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

VecturaKit๐Ÿ“5.3.0๐ŸŒฑ Seedlingโญ280

Swift-based vector database for on-device RAG using MLTensor and MLX Embedders

tensorzero๐Ÿ“2026.4.0๐ŸŒฑ Seedlingโญ11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

animaworks๐Ÿ“v0.6.2๐ŸŒฑ Seedlingโญ225

Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).

TomoriBot๐Ÿ“v0.7.904๐ŸŒฑ Seedlingโญ33

A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! ๏ฝœ ้•ทๆœŸ่จ˜ๆ†ถใ‚„ใƒšใƒซใ‚ฝใƒŠใ€ใƒ„ใƒผใƒซ้€ฃๆบใ‚’ๅฎŒๅ‚™ใ€‚ ๆฌกไธ–ไปฃใฎใ€Œ่‡ชๅพ‹ๅž‹AIใ‚จใƒผใ‚ธใ‚งใƒณใƒˆใ€Discordใƒœใƒƒใƒˆ๏ผ

teleton-agent๐Ÿ“v0.8.6๐ŸŒฑ Seedlingโญ66

Teleton: Autonomous AI Agent for Telegram & TON Blockchain

RAGLight๐Ÿ“3.4.7๐ŸŒฑ Seedlingโญ656

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

mattermost-plugin-agents๐Ÿ“v1.14.0๐ŸŒฑ Seedlingโญ217

Mattermost Agents plugin supporting multiple LLMs

daiv๐Ÿ“v2.0.0๐ŸŒฑ Seedlingโญ18

Your AI-powered SWE teammate, built into your git workflow

Gito๐Ÿ“v4.0.3๐ŸŒฑ Seedlingโญ207

An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issuesโ€”such as security vulnerabilities, bugs, and maintainability concerns.

llm-in-sandbox๐Ÿ“v0.2.0๐ŸŒฑ Seedlingโญ216

Computer Environments Elicit General Agentic Intelligence in LLMs

PAI-RAG๐Ÿ“v0.4.3๐ŸŒฑ Seedlingโญ450

An easy-to-use framework for modular RAG

Minimum-viable-autonomous-entity๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ1

A self-operating entity with $50+ in real USDC that sells article summaries for $0.03, pays $0.018 in Ollama compute costs, and autonomously raises its price when running low all while tracking itsel

vllm-cli๐Ÿ“v0.2.5๐Ÿ’ค Dormantโญ487

A command-line interface tool for serving LLM using vLLM.

Qwen-Agent๐Ÿ“v0.0.26๐Ÿ’ค Dormantโญ15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.