Search results for "vllm"
Package for applying ao techniques to GPU models
Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
A high-throughput and memory-efficient inference and serving engine for LLMs
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.
An open source, privacy focused alternative to NotebookLM for teams with no data limit's. Join our Discord: https://discord.gg/ejRNvftDp9
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
PraisonAI ๐ฆ โ Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
๐ฑ A little course on Reinforcement Learning Environments for evaluating and training Language Models
A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code. ไธไธช็ฎๅ็ๆฐๅฐ
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
AI Agent ้ฉฑๅจ็ๅผๆบ่ง้ข็ๆๅทฅไฝๅฐ โ ๅฐ่ฏดโ่ง่ฒ/ๅบๆฏ/้ๅ ท่ฎพ่ฎกโๅงๆฌโๅ้ๅพโ่ง้ข๏ผ่ทจ้ๅคด่ง่ฒไธๅบๆฏไธ่ด | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
Organization-as-Code for autonomous AI agents. Brain-inspired memory that grows, consolidates, and forgets. Multi-model (Claude/Codex/Gemini/Cursor/Ollama).
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
Structured Outputs
One API for 20+ LLM providers, your databases, and your files โ self-hosted, open-source AI gateway with RAG, voice, and guardrails.
An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issuesโsuch as security vulnerabilities, bugs, and maintainability concerns.
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
๐ 2026 ๆ็ณป็ป็ AI Agent ้ๆๆๅ๏ฝๆบ่ฝไฝๅฎๆๆ็จ ยท ๅฎๆดๅญฆไน ่ทฏๅพ + ๅฎๆ้กน็ฎ + ้ข่ฏ้ขๅบ ยท ๅฏนๆ ๅคงๆจกๅๅบ็จๅผๅๅทฅ็จๅธๅฒไฝ ยท ่ฆ็LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธ็บง้จ็ฝฒไธๅพฎ่ฐ ยท ไป0ๅฐไผไธ็บง่ฝๅฐ + ไปๅญฆไน ๅฐไธ็บฟ้กน็ฎ + ้ข่ฏๅๅคไธไฝๅ
ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
A lightweight AI agent framework for vertical domain applications | ้ขๅๅๅๅบ็จ็่ฝป้็บง AI Agent ๆกๆถ
๐ LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz
One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.
A command-line interface tool for serving LLM using vLLM.
Your AI-powered SWE teammate, built into your git workflow
A structured multi-agent framework for coordinated AI collaboration
Computer Environments Elicit General Agentic Intelligence in LLMs
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
CLI tool to search and rank remote job opportunities
