freshcrate

Search results for "caching"

Clear filters
50 results found (Python)
llm-wikiπŸ“v1.1.0-rc8🌿 Growing⭐139

LLM-powered knowledge base from your Claude Code, Codex CLI, Copilot, Cursor & Gemini sessions. Karpathy's LLM Wiki pattern β€” implemented and shipped.

PraisonAIπŸ“v4.6.25🌳 Mature⭐6,900

PraisonAI 🦞 β€” Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

claude-code-plugins-plus-skillsπŸ“v4.26.0🌳 Mature⭐1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

mcp-memory-serviceπŸ“v10.39.1🌳 Mature⭐1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

cyllamaπŸ“0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

litellmπŸ“v1.83.7-stable🌳 Mature⭐42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

RAPTORπŸ“0.0.0🌱 Seedling⭐13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

borsa-mcpπŸ“0.0.0🌳 Mature⭐548

MCP Server for Turkish & American Stock Exchange and Fund Data

lm-proxyπŸ“v3.2.2🌿 Growing⭐114

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPIβ€”use as library or standalone service.

llm_intentsπŸ“1.7.1🌿 Growing⭐122

Exposes internet search tools for use by LLM-backed Assist in Home Assistant

hermes-pluginsπŸ“0.0.0🌱 Seedling⭐21

Custom plugins for hermes-agent β€” goal management, inter-agent bridge, model selection, cost control

claude-codex-settingsπŸ“v2.3.0🌳 Mature⭐623

My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.

LLM-API-Key-ProxyπŸ“dev/build-20260301-1-b62f6e4🌿 Growing⭐465

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

SmarterRouterπŸ“2.2.5🌿 Growing⭐105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

AgenticXπŸ“v0.3.7🌿 Growing⭐105

AgenticX is a unified, production-ready multi-agent platform β€” Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

mcpπŸ“2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

shotgrid-mcp-serverπŸ“v0.15.4🌿 Growing⭐56

A Model Context Protocol (MCP) server for Autodesk ShotGrid/Flow Production Tracking (FPT) with comprehensive CRUD operations and data management capabilities.

auto-deep-researcher-24x7πŸ“main@2026-04-19🌿 Growing⭐261

πŸ”₯ An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.

medusaπŸ“v2026.5.5🌿 Growing⭐252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

vllmπŸ“v0.19.1🌿 Growing⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

opentulpaπŸ“main@2026-04-17🌱 Seedling⭐26

Self-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves i

maverick-mcpπŸ“main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

google_workspace_mcpπŸ“v1.19.0🌿 Growing⭐2,087

Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool

ai-real-estate-assistantπŸ“dev@2026-04-13🌿 Growing⭐159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

LLM-WikiπŸ“main@2026-04-18🌱 Seedling⭐7

Autonomous knowledge base plugin for Claude Code - captures reserch, ideas, and decisions into an interlinked wiki with reserch-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou

vllm-mlxπŸ“v0.2.8🌿 Growing⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

kuzu-memoryπŸ“v1.12.9🌱 Seedling⭐22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

server-nexeπŸ“v1.0.0-beta🌱 Seedling⭐9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine β€” zero data sent to external services.

claude-ruby-grape-railsπŸ“v1.13.4🌱 Seedling⭐5

Claude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq development

claude-skills-mcpπŸ“v1.0.6🌱 Seedling⭐378

MCP server for searching and retrieving Claude Agent Skills using vector search

edslπŸ“wasm-wheel🌿 Growing⭐454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

deltallmπŸ“v0.1.20-rc2🌱 Seedling⭐3

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface

clonemeπŸ“0.0.0πŸ’€ Dormant⭐38

CloneMe is an advanced AI platform that builds your digital twinβ€”an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

llm-in-sandboxπŸ“v0.2.0🌱 Seedling⭐221

Computer Environments Elicit General Agentic Intelligence in LLMs

DOXπŸ“main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

Comfy-CozyπŸ“v4.0.0🌱 Seedling⭐3

AI co-pilot for ComfyUI β€” 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.

PromptManagerπŸ“master@2026-04-12🌱 Seedling⭐3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

vector-cache-optimizerπŸ“base-setup@2026-04-21🌱 Seedling⭐1

⚑ Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.

qa-agentπŸ“v0.2.1🌱 Seedling⭐1

An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga

Agentic-AI-PipelineπŸ“v1.0.0πŸ’€ Dormant⭐63

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

newrelicπŸ“12.1.0🌱 Seedling

New Relic Python Agent

banksπŸ“2.4.1🌱 Seedling

A prompt programming language

hishelπŸ“1.1.10🌱 Seedling

Elegant HTTP Caching for Python

ctranslate2πŸ“4.7.1🌱 Seedling

Fast inference engine for Transformer models

prefectπŸ“3.6.27🌱 Seedling

Workflow orchestration and management.

ppftπŸ“1.7.8🌱 Seedling

distributed and parallel Python

pathosπŸ“0.3.5🌱 Seedling

parallel graph management and execution in heterogeneous computing

ipythonπŸ“9.12.0🌱 Seedling

IPython: Productive Interactive Computing

httpcoreπŸ“1.0.9🌱 Seedling

A minimal low-level HTTP client.