Search results for "cache"
FlashInfer: Kernel Library for LLM Serving
Automated generation of real Swagger/OpenAPI 2.0 schemas from Django Rest Framework code.
Celery result backends for Django.
Scapy: interactive packet manipulation tool
A package that allows you to utilize 12factor inspired environment variables to configure your Django application.
A modern Python package and dependency manager supporting the latest PEP standards
A tool for scanning Python environments for known vulnerabilities
Radically simplified static file serving for WSGI applications
Calculate prices for calling LLM inference APIs.
Powertools for AWS Lambda (Python) is a developer toolkit to implement Serverless best practices and increase developer velocity.
Simple LRU cache for asyncio
The comprehensive WSGI web application library.
Extensible memoizing collections and decorators
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
Lightning toolbox for across the our ecosystem.
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
pytest plugin for test session metadata
Accelerated property cache
Expand standard functools to methods
LLM-powered knowledge base from your Claude Code, Codex CLI, Copilot, Cursor & Gemini sessions. Karpathy's LLM Wiki pattern โ implemented and shipped.
The Context Optimization Layer for LLM Applications
MCP-NixOS - Model Context Protocol Server for NixOS resources
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
LLM้ฉฑๅจ็ A/H/็พ่กๆบ่ฝๅๆๅจ๏ผๅคๆฐๆฎๆบ่กๆ + ๅฎๆถๆฐ้ป + LLMๅณ็ญไปช่กจ็ + ๅคๆธ ้ๆจ้๏ผ้ถๆๆฌๅฎๆถ่ฟ่ก๏ผ็บฏ็ฝๅซ. LLM-powered stock analysis system for A/H/US markets.
Official MCP Servers for AWS
The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
Knowledge Engine for AI Agent Memory in 6 lines of code
LlamaIndex is the leading document agent and OCR platform
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Declarative Agent Orchestration. Ship while you sleep.
Security intelligence API and MCP server for AI agents. 25 tools, 35+ endpoints: CVE/EPSS/KEV, domain recon, SSL, IP reputation, threat intel, email security, code scanning. Free, no signup.
Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and
ๅฐ็ฃๅธๆณ้ขๅคๆฑบ + ๅ จๅๆณ่ฆ่ณๆๅบซ MCP server ยท Query Taiwan legal data from any MCP AI agent
Open-source MCP server for LinkedIn. Give Claude and any MCP-compatible AI assistant access to profiles, companies, jobs, and messages.
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
MCP Server for Turkish & American Stock Exchange and Fund Data
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
AINL helps turn AI from "a smart conversation" into "a structured worker." It is designed for teams building AI workflows that need multiple steps, state and memory, tool use, repeatable execution, v
Give your AI agents persistent memory.
OSCAL tools for AI agents
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPIโuse as library or standalone service.
NEXO Brain โ Shared brain for AI agents. Persistent memory, semantic RAG, natural forgetting, metacognitive guard, trust scoring, 150+ MCP tools. Works with Claude Code, Codex, Claude Desktop & any MC
Full featured redis cache backend for Django.
Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm compliant.
WHartTest ๆฏๅบไบ Django REST Framework ไธ็ฐไปฃๅคงๆจกๅๆๆฏๆ้ ็ AI ้ฉฑๅจๆต่ฏ่ชๅจๅๅนณๅฐใๅนณๅฐ่ๅ่ช็ถ่ฏญ่จ็่งฃใ็ฅ่ฏๅบๆฃ็ดขไธๅตๅ ฅๆ็ดข่ฝๅ๏ผ็ปๅ LangChain ไธ MCP๏ผModel Context Protocol๏ผ ๅทฅๅ ท่ฐ็จ๏ผๅฎ็ฐไป้ๆฑๅฐๅฏๆง่กๆต่ฏ็จไพ็่ชๅจๅ็ๆไธ็ฎก็๏ผๅธฎๅฉๆต่ฏๅข้ๆๅๆ็ไธ่ฆ็็ใ
MCP Server for Computer Use in Windows
Harness LLMs with Multi-Agent Programming
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
Custom plugins for hermes-agent โ goal management, inter-agent bridge, model selection, cost control
"RAG-Anything: All-in-One RAG Framework"
Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js โ 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.
MCP server that saves 97% of AI coding tokens โ your AI reads code structurally, not file-by-file. Faster, cheaper, smarter.
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.
Buddhist Digital Text Platform โ 9,200+ texts, 500+ sources, 8 UI languages, AI Q&A (RAG), knowledge graph, full-text search
CASSIA: A Multi-Agent LLM-Based Single-Cell Cell Type Annotation Framework
Published in CNCF Landscape: A MCP server for Kubernetes.
๐กโ๏ธAI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting๐กโ๏ธ
Describe it or draw it. Kiln makes it real. โ 461 MCP tools for AI-agent-controlled 3D printing. OctoPrint, Moonraker, Bambu Lab, Prusa Link, and Elegoo.
AI-powered bug bounty hunting from your terminal - recon, 20 vuln classes, autonomous hunting, and report generation. All inside Claude Code.
๐ฌ AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.
A Model Context Protocol (MCP) server for Autodesk ShotGrid/Flow Production Tracking (FPT) with comprehensive CRUD operations and data management capabilities.
Shell and coding agent on mcp clients
Persistent cache for Python cachetools.
Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write โ before data enters your pipeline.
Production-ready RAG Framework (Python/FastAPI). 1-line config swaps: 6 Vector DBs (Weaviate, Pinecone, Qdrant, ChromaDB, pgvector, MongoDB), 5 LLMs (Gemini, OpenAI, Claude, Ollama, OpenRouter). OpenA
Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.
Prompt Driven Development Command Line Interface
Memory library for building stateful agents
Create a plan from a description in minutes
๐ฐ PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
The highest-scoring AI memory system ever benchmarked that isn't reliant on LLM reranking. And it's free & burns less tokens.
METAโAGENTIC ฮฑโAGI ๐๏ธโจ โ Mission ๐ฏ Endโtoโend: Identify ๐ โ OutโLearn ๐ โ OutโThink ๐ง โ OutโDesign ๐จ โ OutโStrategise โ๏ธ โ OutโExecute โก
MaverickMCP - Personal Stock Analysis MCP Server
Open-Source Intelligent Command Layer
No description
Lightweight semantic code search engine โ 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
Claude Code skills, architectural principles, and alternative approaches for AI-assisted development
A simple Python sandbox for helpful LLM data agents
Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing
Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-
MCP Server for Simplenote integration with Claude Desktop
๐ LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz
One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Local First AI SEO Software on Nix, FastHTML & HTMX
Claude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq development
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Ship customer-facing AI with isolation, spend controls, and provenance.
Continuous prompt optimization for AI applications. Collect feedback, auto-optimize with DSPy, deliver as reviewable PRs.
Connect any LLM to OpenClaw โ production-tested middleware for Qwen3-235B and beyond
AI co-pilot for ComfyUI โ 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
AI-powered web app builder โ describe it, build it, ship it. 2-agent LangGraph system (Sonnet 4.5 + o4-mini) generates React apps from natural language with live preview and one-click deploy.
๐งญ PromptDrifter โ oneโcommand CI guardrail that catches prompt drift and fails the build when your LLM answers change.
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
Broken RAG For The Broken Souls
Self-hosted autonomous AI agent โ 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests
Local-first AI assistant โ 9 specialized agents (code, web, debug, securityโฆ), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your
๐ช Intelligent orchestration system that coordinates multiple AI coding assistants (Claude, Codex, Gemini CLI, Copilot CLI) to collaborate on complex software development tasks via REPL or a Vue/Nuxt
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
CLI tool to search and rank remote job opportunities
AI-powered group finance assistant using MCP architecture, Gemini LLM and Streamlit.
A MCP server to use StatCAN data
โก Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.
A collection of Summoner clients and agents featuring example implementations and reusable templates
Modular multi-agent orchestration framework powered by LangGraph and FastAPI.
ACR Control Plane: runtime control & governance for agentic AI (six-pillar enforcement).
๐ฆพ A productionโready research outreach AI agent that plans, discovers, reasons, uses tools, autoโbuilds cited briefings, and drafts tailored emails with toolโchaining, memory, tests, and turnkey Dock
llama-index indices llama-cloud integration
Intelligent Model Context Protocol (MCP) server for AI-assisted API development. Generate mock servers from OpenAPI specs with advanced logging, performance analytics, and server discovery. Optimized
