Search results for "caching"
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for
PraisonAI - Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
MCP server for Kaseya Autotask PSA - 39 tools for companies, tickets, projects, time entries, and more
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200 LLMs.
Agent-CI is local GitHub Actions for your agents.
Pickle Rick for Claude Code - autonomous PRD-driven coding loops + relentless code review. Ralph Loop toolkit.
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
Agent! connects any AI to your Mac. 13 LLM providers - cloud, local, or on-device. It writes code, builds Xcode projects, manages git, organizes files, automates Safari, controls any app, and handl
#1 Terminal Benchmark 2.0 - AI that ships your tickets.
Curated directory of terminal-native AI coding agents and the harnesses that orchestrate them. Covers open-source tools (Pi, OpenCode, Aider, Goose), platform agents (Claude Code, Codex, Gemini CLI),
MCP server for Copilot Money App - AI powered personal finance queries using local data
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
MCP Server for Turkish & American Stock Exchange and Fund Data
Agent Interface Discovery
This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gateway based on OpenAI API standards
Custom plugins for hermes-agent - goal management, inter-agent bridge, model selection, cost control
The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. With its modular architecture it's
SDL-MCP (Symbol Delta Ledger MCP Server) is a cards-first context system for coding agents that saves tokens and improves context.
Knowledge Engine for AI Agent Memory in 6 lines of code
Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect
Procedural memory for AI coding agents: transforms scattered session history into persistent, cross-agent memory so every agent learns from every other
AgenticX is a unified, production-ready multi-agent platform - Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history,
One API for 25+ LLMs, OpenAI, Anthropic, Bedrock, Azure. Caching, guardrails & cost controls. Go-native LiteLLM & Kong AI Gateway alternative.
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and
AI Coding agent for the terminal - hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
Official MCP Servers for AWS
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website
A tool-use-focused LLM plugin for neovim.
Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.
The engine powering hundreds of thousands of MCP connections
A Model Context Protocol (MCP) server that gives Claude direct control over Strudel.cc for AI-assisted music generation and live coding.
Excalibase GraphQL instantly turns your database into a GraphQL API. Built with Spring Boot, it supports schema discovery, subscriptions, and type handling - no manual resolvers needed.
An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
A high-throughput and memory-efficient inference and serving engine for LLMs
Self-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves i
MaverickMCP - Personal Stock Analysis MCP Server
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool
Autonomous knowledge base plugin for Claude Code - captures research, ideas, and decisions into an interlinked wiki with research-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou
Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
A curated list of awesome works related to high dimensional structure/vector search & database
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Codingbuddy orchestrates 29 specialized AI agents to deliver code quality comparable to a team of human experts through a PLAN → ACT → EVAL workflow.
Open-source, self-improving autonomous agent swarm
DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework
Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.
Security-first AI agent orchestration system. Built-in agents with predefined capabilities, strict guardrails on what they can and cannot do, and a four-layer defense system that enforces security at
A collection of Agent Skills standards and best practices for programming languages and frameworks that helps AI agents follow best practices for those frameworks and programming languages
trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
A comprehensive Model Context Protocol (MCP) server that enables AI assistants to control Unreal Engine through the native C++ Automation Bridge plugin. Built with TypeScript and C++.
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI; use as a library or standalone service.
Serialize your functions with tools-rs!
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
The SDK For Browser Agents
LLM-driven debugger server β give your AI agents step-through debugging superpowers
Exposes internet search tools for use by LLM-backed Assist in Home Assistant
Teleton: Autonomous AI Agent for Telegram & TON Blockchain
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Local AI anywhere, for everyone - LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Optimize websites for AI search engines with a universal TypeScript engine supporting Next.js, NestJS, WordPress, and Shopify integration.
Route, manage, and analyze your LLM requests across multiple providers with a unified API interface
The official TanStack CLI - Project Scaffolding, MCP Server, Agent Skills Installation, etc
CloneMe is an advanced AI platform that builds your digital twin, an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th
MCP plugin that intercepts AI agent edits in RAM, validates them (TypeScript compiler + gopls + pyright), auto-heals missing imports, and commits atomically. If anything breaks, disk stays untouched
Computer Environments Elicit General Agentic Intelligence in LLMs
A Model Context Protocol (MCP) server for Autodesk ShotGrid/Flow Production Tracking (FPT) with comprehensive CRUD operations and data management capabilities.
Autonomous local AI assistant in Go - 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving
No description
Generate fully-typed, lint-clean language bindings for Rust libraries across 11 languages
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
Broken RAG For The Broken Souls
AI co-pilot for ComfyUI - 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.
OpenGrok MCP Server is a native Model Context Protocol server that bridges your organization's OpenGrok search engine with AI assistants - enabling instant, natural language search and deep repository
A Slack bot and MCP client that acts as a bridge between Slack and Model Context Protocol (MCP) servers. Using Slack as the interface, it enables large language models (LLMs) to connect and interact with v
Open source local sandboxing for running AI generated code.
Generate a custom newspaper with an AI agent based on your favorite YouTube channels.
Enhance your terminal interactions by allowing AI to see and control your session for real-time debugging and automation.
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
MCP server for searching and retrieving Claude Agent Skills using vector search
Next Generation Multi-tenant AI One-Stop Solution. Builtin Admin & Billing System. Enterprise-Grade Unified LLM Gateway Support for 200+ Models And 35+ Providers, Load Balancing w/ Priority-base Rou
Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.
Simulate antenna designs instantly in your browser using NEC2-powered, open-source software with WebAssembly and Docker support.
Connect Kimi Code's large-context reading with Claude Code's reasoning for efficient, cost-effective codebase analysis and edits.
The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
MCP (Model Context Protocol) Servers authored and maintained by the PulseMCP team. We build reliable servers thoughtfully designed specifically for MCP Client-powered workflows.
Prevents outdated Rust code suggestions from AI assistants. This MCP server fetches current crate docs, uses embeddings/LLMs, and provides accurate context via a tool call.
An open-source SSPM tool written in Go
Roslyn Source Code Generators with Examples
