Search results for "vllm"
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
PraisonAI ๐ฆ โ Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
Your local AI Desktop Agent for Windows, macOS & Linux. Agent Skills (SKILL.md), autonomous coding (Codework), multi-agent teams, desktop automation, 15+ AI providers, Desktop Buddy. No Docker, no ter
๐ฅ Pickle Rick for Claude Code โ autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.
Agent! connects any AI to your Mac. 13 LLM providers โ cloud, local, or on-device. It writes code, builds Xcode projects, manages git, organizes files, automates Safari, controls any app, and handl
๐ฑ A little course on Reinforcement Learning Environments for evaluating and training Language Models
All-in-one local AI hub for Obsidian โ LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
LLM Agent that leverages cheminformatics tools to provide informed responses.
Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. ๐๐ป Integrates with 50+ LLM Providers,
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
AI Agent ้ฉฑๅจ็ๅผๆบ่ง้ข็ๆๅทฅไฝๅฐ โ ๅฐ่ฏดโ่ง่ฒ/ๅบๆฏ/้ๅ ท่ฎพ่ฎกโๅงๆฌโๅ้ๅพโ่ง้ข๏ผ่ทจ้ๅคด่ง่ฒไธๅบๆฏไธ่ด | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
A high-throughput and memory-efficient inference and serving engine for LLMs
A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: ๐ AI Gateway (LiteLLM) ๐ค LLM Serving (vLLM, SGLang, Ollama
Native web workspace for Hermes Agent โ chat, terminal, memory, skills, inspector.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,
โฅ AI Coding agent for the terminal โ hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
๐ฆ Open-source alternative to Claude Code, built from scratch in Rust. Agentic coding CLI โ thinks, plans, and executes with any LLM. Compatible with Claude Code workflows.
Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
๐ 2026 ๆ็ณป็ป็ AI Agent ้ๆๆๅ๏ฝๆบ่ฝไฝๅฎๆๆ็จ ยท ๅฎๆดๅญฆไน ่ทฏๅพ + ๅฎๆ้กน็ฎ + ้ข่ฏ้ขๅบ ยท ๅฏนๆ ๅคงๆจกๅๅบ็จๅผๅๅทฅ็จๅธๅฒไฝ ยท ่ฆ็LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธ็บง้จ็ฝฒไธๅพฎ่ฐ ยท ไป0ๅฐไผไธ็บง่ฝๅฐ + ไปๅญฆไน ๅฐไธ็บฟ้กน็ฎ + ้ข่ฏๅๅคไธไฝๅ
ใไธๅนด้ข่ฏไบๅนดๆจกๆใAIGC็ฎๆณๅทฅ็จๅธ้ข่ฏ็ง็ฑใๆถต็AIGCใLLMๅคงๆจกๅใAI Agentใไผ ็ปๆทฑๅบฆๅญฆไน ใ่ชๅจ้ฉพ้ฉถใๆบๅจๅญฆไน ใ่ฎก็ฎๆบ่ง่งใ่ช็ถ่ฏญ่จๅค็ใๅผบๅๅญฆไน ใๅคงๆฐๆฎๆๆใๅ ท่บซๆบ่ฝใๅ ๅฎๅฎใAGI็ญAI่กไธ้ข่ฏ็ฌ่ฏๅนฒ่ดง็ป้ชไธๆ ธๅฟ็ฅ่ฏใ
One API for 20+ LLM providers, your databases, and your files โ self-hosted, open-source AI gateway with RAG, voice, and guardrails.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Swift-based vector database for on-device RAG using MLTensor and MLX Embedders
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).
A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! ๏ฝ ้ทๆ่จๆถใใใซใฝใใใใผใซ้ฃๆบใๅฎๅใ ๆฌกไธไปฃใฎใ่ชๅพๅAIใจใผใธใงใณใใDiscordใใใ๏ผ
Teleton: Autonomous AI Agent for Telegram & TON Blockchain
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
Mattermost Agents plugin supporting multiple LLMs
Your AI-powered SWE teammate, built into your git workflow
Structured Outputs
An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issuesโsuch as security vulnerabilities, bugs, and maintainability concerns.
Computer Environments Elicit General Agentic Intelligence in LLMs
An easy-to-use framework for modular RAG
A self-operating entity with $50+ in real USDC that sells article summaries for $0.03, pays $0.018 in Ollama compute costs, and autonomously raises its price when running low all while tracking itsel
A command-line interface tool for serving LLM using vLLM.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
