freshcrate

Search results for "efficient"

108 results found (Python)
flashinfer-python📁0.6.8.post1🏛️ Flagship5,467

FlashInfer: Kernel Library for LLM Serving

hishel📁1.1.10🌿 Growing379

Elegant HTTP Caching for Python

trafilatura📁2.0.0🏛️ Flagship5,758

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.

gensim📁4.4.0🏛️ Flagship16,395

Python framework for fast Vector Space Modelling

crewai📁1.14.2🏛️ Flagship49,445

Cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

aiolimiter📁1.2.1🌳 Mature755

asyncio rate limiter, a leaky bucket implementation

msgraph-core📁1.3.8🌿 Growing284

Core component of the Microsoft Graph Python SDK

diff-cover📁10.2.0🌳 Mature828

Run coverage and linting reports on diffs

optuna📁4.8.0🏛️ Flagship14,019

A hyperparameter optimization framework

progressbar2📁4.5.0🌳 Mature879

A Python Progressbar library to provide visual (yet text based) progress to long running operations.

holidays📁0.95🌳 Mature1,878

Open World Holidays Framework

sglang📁0.5.10.post1🏛️ Flagship26,220

SGLang is a fast serving framework for large language models and vision language models.

fastapi-mcp📁0.4.0🏛️ Flagship11,816

Automatic MCP server generator for FastAPI applications - converts FastAPI endpoints to MCP tools for LLM integration

asyncpg📁0.31.0🏛️ Flagship7,999

An asyncio PostgreSQL driver

prompt-toolkit📁3.0.52🏛️ Flagship10,412

Library for building powerful interactive command lines in Python

jcodemunch-mcp📁v1.71.0🌳 Mature1,636

The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing

jdocmunch-mcp📁v1.9.0🌿 Growing147

The leading, most token-efficient MCP server for documentation exploration and retrieval via structured section indexing

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

jdatamunch-mcp📁v0.8.4🌱 Seedling36

Token-efficient MCP server for tabular data retrieval. Index CSV/Excel files, query rows, aggregate — 99%+ token savings vs raw file reads.

AGiXT📁v1.9.4🌳 Mature3,179

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, a

ua-parser📁1.0.2🌿 Growing642

Python port of Browserscope's user agent parser

crewAI📁1.14.3a2🏛️ Flagship49,446

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

agentscope-runtime📁v1.1.5🌳 Mature744

A production-ready runtime framework for agent apps with secure tool sandboxing, Agent-as-a-Service APIs, scalable deployment, full-stack observability, and broad framework compatibility.

mcp📁2026.04.20260421081720🏛️ Flagship8,833

Official MCP Servers for AWS

mcp-client-for-ollama📁v0.28.0🌳 Mature655

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

agentscope📁v1.0.19🏛️ Flagship24,189

Build and run agents you can see, understand and trust.

Vibe-Skills📁v3.0.4🌳 Mature1,645

Vibe-Skills is an all-in-one AI skills package. It seamlessly integrates expert-level capabilities and context management into a general-purpose skills package, enabling any AI agent to instantly upgr

llm-rl-environments-lil-course📁main@2026-04-17🌿 Growing140

🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models

SimpleLLMFunc📁v0.7.8🌿 Growing77

A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric" manner. LLM As Function, Prompt As Code.

AReaL📁v1.0.3🏛️ Flagship5,075

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

fabric-rti-mcp📁0.5.3🌿 Growing110

MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx), and other RTI services (co

caveman📁v1.6.0🏛️ Flagship42,198

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

camel📁v0.2.91a1🏛️ Flagship16,753

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven system that manages a group of AI agents to accomplish tasks based on user input. Powered by Microsoft Agent Framework, Az

ContextPilot📁v0.4.1🌿 Growing79

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

vllm-mlx📁v0.2.8🌳 Mature917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

openakita📁v1.27.9🌳 Mature1,655

An open-source AI assistant framework with skills and agent architecture

simplechat📁v0.241.006🌿 Growing129

Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features

ainativelang📁v1.4.6🌿 Growing72

AINL helps turn AI from "a smart conversation" into "a structured worker." It is designed for teams building AI workflows that need multiple steps, state and memory, tool use, repeatable execution, v

mcp-gateway-registry📁v1.0.18🌳 Mature599

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

lm-proxy📁v3.2.2🌿 Growing114

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

Windows-MCP📁v0.7.1🏛️ Flagship5,258

MCP Server for Computer Use in Windows

RAGLight📁3.4.7🌳 Mature658

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

fast-plaid📁1.4.5🌿 Growing245

High-Performance Engine for Multi-Vector Search

adeu📁v1.1.0🌿 Growing63

Agentic DOCX Redlining Engine. Enables LLMs to read Word documents and inject native Track Changes (w:ins, w:del) and Comments without breaking formatting. Includes Model Context Protocol (MCP) Server

lad_mcp_server📁main@2026-04-20🌱 Seedling22

Lad MCP Server: Autonomous code & system design review for AI coding agents (Claude Code, Cursor, Codex, etc.). Features multi-model consensus via OpenRouter and context-aware reviews via Serena.

cyllama📁0.2.11🌱 Seedling25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

LycheeMem📁master@2026-04-19🌿 Growing232

Compact, efficient, and extensible long-term memory for LLM agents.

LightAgent📁v0.5.0🌳 Mature876

LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE prot

OpenRA-RL📁v0.4.1🌿 Growing120

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

Agentic-RAG-R1📁0.0.0🌿 Growing413

Agentic RAG R1 Framework via Reinforcement Learning

DeepCode📁v1.2.0🏛️ Flagship15,244

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

llmware📁v0.4.6🌿 Growing14,862

Unified framework for building enterprise RAG pipelines with small, specialized models

zai-shell📁v9.0.3🌱 Seedling40

Command Line telepathy. An Autonomous AI Agent for your Terminal that turns intent into Execution (Windows/Linux/Mac)

arag📁v0.1.0🌿 Growing252

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

ragas📁v0.4.3🌳 Mature13,570

Supercharge Your LLM Application Evaluations 🚀

pdd📁main@2026-04-21🌿 Growing656

Prompt Driven Development Command Line Interface

GenericAgent📁main@2026-04-21🌿 Growing5,482

Self-evolving agent: grows skill tree from 3.3K-line seed, achieving full system control with 6x less token consumption

awesome-code-agents📁main@2026-04-20🌿 Growing98

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

security-investigator📁main@2026-04-18🌿 Growing175

Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.

prompt-os📁v1.0.0🌱 Seedling6

A desktop AI agent that controls your local machine — runs commands, manages files, executes code, browses the web autonomously etc. Supports Claude, GPT, Gemini, Llama, DeepSeek, and more. .exe avail

ollamafreeapi📁main@2026-04-15🌿 Growing172

OllamaFreeAPI: Free Distributed API for Ollama LLMs. Public gateway to our managed Ollama servers with: zero-configuration access to 50+ models; auto load-balanced across global nodes; free tier w

dependency-groups📁1.3.1🌱 Seedling14

A tool for resolving PEP 735 Dependency Group data

cdpilot📁v0.3.0🌱 Seedling25

Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text — no LLM needed). MCP server for AI agents with 500x fewer tokens. Extract, observe, script

hermes-gate📁0.0.0🌱 Seedling18

🏛️ Hermes Gate — Terminal TUI for managing remote Hermes Agent sessions with auto-reconnect, detach support, and zero config

synthadoc📁v0.1.0🌱 Seedling66

Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-

llm_context_benchmarks📁0.0.0🌱 Seedling59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

ffcx📁v0.10.1.post0🌱 Seedling189

Next generation FEniCS Form Compiler for finite element forms

VibeCode-Protocol-Suite📁main@2026-04-10🌱 Seedling17

A comprehensive suite of protocols, meta-prompts, and orchestration tools designed to streamline software development workflows, project management, and team collaboration. Includes the VibeCode Proto

server-nexe📁v1.0.2-beta🌱 Seedling9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.

SploitGPT📁main@2026-04-21🌱 Seedling9

🛠️ Automate penetration testing with SploitGPT, an AI agent using Kali Linux tools for efficient security assessments and minimal user input.

kdcube-ai-app📁2026.4.21.1656🌱 Seedling8

Ship customer-facing AI with isolation, spend controls, and provenance.

vllm-cli📁v0.2.5💤 Dormant491

A command-line interface tool for serving LLM using vLLM.

LLM-Agent-Paper-daily📁main@2026-04-21🌱 Seedling20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

cloneme📁0.0.0💤 Dormant38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

claude-api-cost-optimization📁main@2026-04-21🌱 Seedling3

💰 Optimize your Claude API usage to save 50-90% on costs with batching techniques and efficient request management.

rag-agent📁master@2026-04-21🌱 Seedling7

Python LLM-RAG deep agent using LangChain, LangGraph and LangSmith built on Quart web microframework and served using Hypercorn ASGI and WSGI web server.

KAG📁v0.8.0💤 Dormant8,688

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base

Deepagent-research-context-engineering📁main@2026-04-21🌱 Seedling2

🔍 Accelerate research using a Multi Agent System for efficient context engineering with DeepAgent and LangChain's library.

Qwen-Agent📁v0.0.26💤 Dormant16,132

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

argus-mcp📁main@2026-04-21🌱 Seedling1

🔍 Enhance code quality with Argus MCP, an AI-driven code review server using a Zero-Trust model for safe and efficient development.

ReNovel-AI📁0.0.0🌱 Seedling9

✍️ Revise and enhance novels with ReNovel-AI, your smart tool for story reimagining and memory-driven writing assistance.

local-rag-server📁main@2026-04-21🌱 Seedling2

Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.

search📁main@2026-04-21🌱 Seedling1

🔍 Implement hybrid search using Vespa and FastAPI, blending BM25 and dense semantic retrieval for efficient, accurate information retrieval.

loopy📁v2025.2💤 Dormant630

A code generator for array-based code on CPUs and GPUs

vector-cache-optimizer📁base-setup@2026-04-21🌱 Seedling1

⚡ Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.

mcp-agent-framework📁master@2026-04-21🌱 Seedling1

🤖 Orchestrate AI agents at scale using the MCP framework, enabling seamless context sharing, communication, and integration for enhanced collaboration.

ImC📁master@2026-04-21🌱 Seedling1

🖼️ Convert images quickly between formats with ImC, a fast and simple CLI tool built on Pillow for efficient batch processing and clean command usage.

ai-lead-qualifier📁main@2026-04-21🌱 Seedling3

🧠 Qualify leads with an AI-driven system that understands intent, asks key questions, and structures quality leads without hardcoding processes.

a-mem-mcp-server📁main@2026-04-21🌱 Seedling1

🧠 Enhance LLM agents with an agentic memory system, featuring automatic note construction, dynamic memory updates, and intelligent semantic retrieval.

Legion📁v0.1.3💤 Dormant115

A Python-based framework for building multi-agent systems with LLMs. Currently in pre-launch alpha.

Flipkart-Product-Recommender-RAG📁main@2026-04-21🌱 Seedling2

🛒 Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.

Awesome-RAG-Production📁main@2026-04-21🌱 Seedling2

🚀 Build and scale reliable Retrieval-Augmented Generation (RAG) systems with this curated collection of tools, frameworks, and best practices.

andy-universal-agent-rules📁main@2026-04-21🌱 Seedling2

🧠 Enhance your AI coding assistant with a universal knowledge base and rules system, compatible with any project and editor.

Web-Use📁v0.2💤 Dormant246

Web-Use is a CDP powered Browser Agent

RAG📁main@2026-04-21🌱 Seedling1

🧠 Build an offline RAG chatbot to answer questions from PDFs, adapting responses based on user experience levels with a smooth chat interface.

fastRAG📁v3.1.2💤 Dormant1,776

Efficient Retrieval Augmentation and Generation Framework

django-treebeard 5.0.5🌱 Seedling

Efficient tree implementations for Django

ag-ui-protocol 0.1.17🌱 Seedling

No description

fireworks-ai 0.19.20🌱 Seedling

Python client library for the Fireworks AI Platform

mockloop-mcp📁v2.2.9💤 Dormant15

Intelligent Model Context Protocol (MCP) server for AI-assisted API development. Generate mock servers from OpenAPI specs with advanced logging, performance analytics, and server discovery. Optimized

ai-news-scraper📁2.9.7💤 Dormant8

AI News Scraper & Semantic Search: A Python application that scrapes news articles, uses GenAI to generate summaries and identify topics, and provides semantic search capabilities through vector embed

neuraldocs📁3.2.2💤 Dormant1

Demo RAG API (FastAPI, OpenAI, ChromaDB, Docker) automatically generated using the OpenAI Codex CLI tool. Highlights Codex's capability for rapid, complex application development.