freshcrate — Search

Search results for "benchmark"

155 results found

pilot 📁v2.99.1🌿 Growing⭐395

#1 Terminal Benchmark 2.0 — AI that ships your tickets.

agentic agentic-workflow ai-agent ai-bots ai-tools autonomous-coding claude claude-code goby qf-studioGo

ai-agents-reality-check 📁0.0.0🌿 Growing⭐57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

agent-architecture agent-benchmark agent-evaluation agent-performance agentic-ai agentic-workflow ai-benchmarking architectural-evaluation llm-agent pythonby Cre4T3Tiv3Python

openclaw-engram 📁v9.3.142🌿 Growing⭐54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

ai-agent ai-memory conversational-ai engram knowledge-graph llm local-first long-term-memory typescriptby joshuaswarrenTypeScript

PraisonAI 📁v4.6.25🌳 Mature⭐6,900

PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

agents ai ai-agent-framework ai-agent-sdk ai-agents ai-agents-framework ai-agents-sdk ai-framwork pythonby MervinPraisonPython

llama.cpp 📁b8864🌳 Mature⭐103,119

LLM inference in C/C++

c++ggmlby ggerganovC++

agentmemory 📁v0.9.1🌳 Mature⭐738

Persistent memory for AI coding agents

typescriptby rohitg00TypeScript

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

mcp-memory-service 📁v10.39.1🌳 Mature⭐1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

agent-memory agentic-ai ai-agents autogen claude crewai knowledge-graph langgraph pythonby doobidooPython

LeanKG 📁v0.16.5🌱 Seedling⭐32

LeanKG: Stop Burning Tokens. Start Coding Lean.

ai-agent claude claude-code claude-code-plugin concise-context cursor gemini graph-database rustby FreePeakRust

byterover-cli 📁v3.7.1🌳 Mature⭐4,422

ByteRover CLI (brv) - The portable memory layer for autonomous coding agents (formerly Cipher)

agent ai autonomous-agents cli coding-assistant context-memory developer-tools knowledge-management typescriptby campfireinTypeScript

Auto-claude-code-research-in-sleep 📁v0.4.4🌳 Mature⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex pythonby wanshuiyinPython

OmniRoute 📁v3.6.9🌳 Mature⭐2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

typescriptby diegosouzapwTypeScript

jcodemunch-mcp 📁v1.70.0🌳 Mature⭐1,523

The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing

claude claude-code python serena token token-savings tokensby jgravellePython

mentisdb 📁0.9.3.39🌿 Growing⭐56

Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett

ai ai-agents claude codex copilot infinite-memory openai openrouter rustby CloudLLM-aiRust

osaurus 📁0.16.16🌳 Mature⭐4,912

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

anthropic apple-foundation-models apple-intelligence apple-neural-engine llm mcp mcp-server mlx swiftby osaurus-aiSwift

agentic-memory 📁0.0.0🌿 Growing⭐162

No description

by lhl

LRAT 📁0.0.0🌱 Seedling⭐34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

agent agentic llm python searchby Yuqi-ZhouPython

cactus 📁0.0.0🌿 Growing⭐50

LLM Agent that leverages cheminformatics tools to provide informed responses.

cheminformatics chemistry foundation-models jupyter notebook llm llm-agent nlp scienceby pnnlJupyter Notebook

GEA 📁0.0.0🌱 Seedling⭐23

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

code-generation group-evolving-agents open-ended-evolution open-endedness python research-agents self-evolving-agentsby eric-ai-labPython

chinese-llm-benchmark 📁v5.9🌿 Growing⭐5,841

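ReLE Evaluation: capability evaluation of Chinese AI large models (continuously updated). Currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, Min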

agentic-ai artificial-intelligence llm-agent llm-evaluationby jeinlee1991

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for Everyone - Monitoring and governing for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring pythonby arthur-aiPython

ISC-Bench 📁v0.0.5🌿 Growing⭐786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

adversarial-attacks agent-safety ai-safety benchmark frontier-models jailbreak large-language-models llm-safety pythonby wuyoscarPython

SocratiCode 📁v1.6.1🌿 Growing⭐810

Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra

ai ai-assistant ast claude code-graph codebase-analysis codebase-intelligence docker typescript vector-databaseby giancarloerraTypeScript

ClawRouter 📁v0.12.158🌿 Growing⭐6,186

The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.

ai ai-agents anthropic cost-optimization deepseek gemini llm llm-router typescriptby BlockRunAITypeScript

mem0 📁openclaw-v1.0.7🌿 Growing⭐52,660

Universal memory layer for AI Agents

agents ai ai-agents application chatbots chatgpt genai llm pythonby mem0aiPython

sdl-mcp 📁v0.10.7🌿 Growing⭐121

SDL-MCP (Symbol Delta Ledger MCP Server) is a cards-first context system for coding agents that saves tokens and improves context.

agent-context agent-tools agentic-coding agentic-workflow agents ai-agents code-analysis code-context typescriptby GlitterKillTypeScript

SmolVM 📁v0.0.10🌿 Growing⭐233

Open-source sandboxes for code execution, browser use, and AI agents.

agent-runtime browser-agent browser-use computer-use openclaw python sandboxby CelestoAIPython

cognithor 📁v0.92.2🌿 Growing⭐94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

agent-os ai-agent anthropic autonomous-agent discord-bot document-analysis gdpr-compliant gemini pythonby Alex8791-cyberPython

claude-mem-lite 📁v2.34.4🌱 Seedling⭐32

Lightweight persistent memory system for Claude Code — FTS5 search, episode batching, error-triggered recall

claude-code fts5 hooks javascript mcp memory persistence sqliteby sdsrssJavaScript

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

almide 📁v0.15.0🌱 Seedling⭐15

A functional programming language optimized for LLM code generation. Compiles to Rust and WebAssembly.

code-generation compiler functional-programming llm programming-language rust webassemblyby almideRust

CodeGen 📁0.0.0🌳 Mature⭐773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

pythonby facebookresearchPython

Awesome-Context-Engineering 📁0.0.0🌳 Mature⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

agent agentic-ai agi awesome-list cognitive-science context-engineering llm ragby Meirtz

models 📁main@2026-04-21🌿 Growing⭐72

This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.

ai javascript llms llms-benchmarking modelsby Portkey-AIJavaScript

awesome-code-agents 📁main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

pythonby EuniAIPython

DeepClaude 📁v1.0.1🌳 Mature⭐2,788

Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible.

ai claude-3-7-sonnet deepseek gemini pythonby ErlichLiuPython

vector-db-benchmark 📁master@2026-04-17🌿 Growing⭐356

Framework for benchmarking vector search engines

benchmark python vector-database vector-search vector-search-engineby qdrantPython

claude-flows 📁0.0.0🌿 Growing⭐93

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect

shellby xyzthiagoShell

Awesome-Agent-Memory 📁main@2026-04-16🌿 Growing⭐333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

agent-memory ai-agent ai-agent-memory awesome-agent-memory llm-memory memory memory-management multimodal-llm-memoryby TeleAI-UAGI

OpenClawProBench 📁main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw pythonby suyoumoPython

FastExpressionCompiler 📁v5.4.1🌿 Growing⭐1,355

Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.

benchmark c#closure code-generation compiler delegate delegates dryioc expression-treeby dadhiC#

Memori 📁v3.3.0🌿 Growing⭐13,290

Memori is agent-native memory infrastructure. A SQL-native, LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.

agent agent-memory ai ai-memory aiagent awesome chatgpt llm pythonby MemoriLabsPython

mcp-devtools 📁v0.59.53🌿 Growing⭐133

A modular MCP server that provides commonly used developer tools for AI coding agents

agentic ai cline coding devtools go llm mcp sammcjby sammcjGo

AgenticX 📁v0.3.7🌿 Growing⭐105

AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

agent-framework agentic-workflows ai-agent ai-orchestration chatbot desktop-app electron fastapi pythonby DemonDamonPython

synaptic-memory 📁v0.16.0🌱 Seedling⭐25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

ai-agent embedding graph-database hebbian-learning knowledge-graph llm mcp mcp-server pythonby PlateerLabPython

context-mode 📁v1.0.89🌿 Growing⭐7,020

Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms

antigravity claude claude-code claude-code-hooks claude-code-plugins claude-code-skill codex codex-cli typescriptby mksgluTypeScript

arag 📁v0.1.0🌿 Growing⭐247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

agent agentic-ai agenticrag deepresearch evaluation graphrag llm llmagents pythonby Ayanami0730Python

memind 📁main@2026-04-21🌿 Growing⭐360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

ai ai-agent ai-agents ai-memory context-engineering java memory openclawby openmemindJava

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-modelby leofan90

awesome-prompts 📁main@2026-04-21🌿 Growing⭐7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome awesome-list chatgpt gpt4 gpts gptstore papers prompt prompt-engineeringby ai-boost

aura 📁main@2026-04-21🌱 Seedling⭐47

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running loca

active-inference affective-computing apple-silicon artificial-consciousness autonomous-agent cognitive-architecture cognitive-science consciousness pythonby youngbryan97Python

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

llm llm-agent pythonby Lyz103Python

NeuronFS 📁main@2026-04-21🌿 Growing⭐136

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI — ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

ai-agent ai-safety cursor-rules file-system go guardrails llm model-agnostic multi-agentby rhino-acousticGo

multi-agent-ralph-loop 📁main@2026-04-20🌿 Growing⭐113

Autonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), parallel-first Agent Teams (6 teammates), Aristotle First Principles methodology,

ai-orchestration automation bats-testing claude-code code-quality codex codex-cli dynamic-contexts shellby alfredolopez80Shell

llm_context_benchmarks 📁0.0.0🌱 Seedling⭐59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

ai benchmarking llms pythonby ivanfioravantiPython

skills-vote 📁main@2026-04-19🌱 Seedling⭐31

The Next-Gen Agent-Native Skill Recommendation Engine

agent-skill agent-skills llm llm-agent pythonby MemTensorPython

zeroclaw 📁v0.7.3🌿 Growing⭐29,983

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀

agent agentic ai infra ml openclaw os rust zeroclawby zeroclaw-labsRust

SmarterRouter 📁2.2.5🌿 Growing⭐105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router pythonby peva3Python

medusa 📁v2026.5.5🌿 Growing⭐252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

agent-security ai-security code-analysis cve-detection devsecops llm-security mcp nextjs pythonby Pantheon-SecurityPython

zvec 📁v0.3.1🌿 Growing⭐9,287

A lightweight, lightning-fast, in-process vector database

agent-memory ann-search c++embedded-database local nodejs python rag vector-searchby alibabaC++

SciAgent-Skills 📁main@2026-04-17🌿 Growing⭐93

Life sciences computational skills for scientific AI agents

ai-agent bioinformatics claude-code claude-plugin claude-skills drug-discovery genomics life-sciences pythonby jaechang-hitsPython

milvus 📁v2.6.15🌿 Growing⭐43,734

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss goby milvus-ioGo

maverick-mcp 📁main@2026-04-17🌿 Growing⭐479

MaverickMCP - Personal Stock Analysis MCP Server

anthropic artificial-intelligence claude equities fastmcp finance financial-analysis fintech pythonby wshobsonPython

biomcp 📁v0.8.21🌿 Growing⭐488

BioMCP: Biomedical Model Context Protocol

ai bioinformatics clinical-trials genomics llm mcp mcp-server medical rustby genomoncologyRust

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rlby inclusionAIPython

olb 📁v1.0.0🌱 Seedling⭐18

High-performance zero-dependency L4/L7 load balancer written in Go. Single binary with Web UI, clustering, MCP/AI integration. 8.5K RPS, 39 E2E tests.

acme clustering go golang l4 l7 load-balancer mcpby OpenLoadBalancerGo

AutoGPT 📁autogpt-platform-beta-v0.6.56🌿 Growing⭐183,319

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

agentic-ai agents ai artificial-intelligence autonomous-agents claude gpt llama-api pythonby Significant-GravitasPython

paiml-mcp-agent-toolkit 📁v3.14.0🌿 Growing⭐148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

agentic c deno kotlin mcp mcp-server paiml paiml-active-tool rustby paimlRust

claw-eval 📁main@2026-04-15🌿 Growing⭐394

Claw-Eval is an evaluation harness for evaluating LLMs as agents. All tasks verified by humans.

agent harness llm openclaw pythonby claw-evalPython

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generationby llmware-aiPython

cognitive-dissonance-dspy 📁main@2026-04-14🌿 Growing⭐276

A multi-agent LLM system for detecting and resolving cognitive dissonance.

pythonby evalopsPython

rag-chatbot 📁main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

skill 📁v1.2.1🌱 Seedling⭐978

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

pythonby pinchbenchPython

octocode 📁0.14.0🌿 Growing⭐319

Semantic code searcher and codebase utility

ai ai-tools cli cli-app code-search developer-tool developer-tools doc-search mcp-server rustby MuvonRust

awesome-vector-database 📁main@2026-04-13🌿 Growing⭐341

A curated list of awesome works related to high dimensional structure/vector search & database

approximate-nearest-neighbor-search embedding-similarity embeddings-similarity nearest-neighbor-search search-engine similarity-search vector-database vector-searchby dangkhoasdc

vllm-mlx 📁v0.2.8🌿 Growing⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

anthropic apple-silicon audio-processing claude-code computer-vision image-understanding inference llm pythonby waybarriosPython

prism-mcp 📁v9.3.0🌿 Growing⭐116

The Mind Palace for AI Agents — Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-memory ai-agent anti-sycophancy claude-desktop cognitive-architecture google-gemini hebbian-learning llm-tools typescriptby dcostencoTypeScript

EvoScientist 📁v0.0.7🌿 Growing⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

ai-agent ai4science multi-agent-system python vibe-researchby EvoScientistPython

AutoRAG 📁v0.3.22🌱 Seedling⭐4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

analysis automl benchmarking document-parser embeddings evaluation llm llm-evaluation pythonby Marker-Inc-KoreaPython

Awesome-Repo-Level-Code-Generation 📁main@2026-04-10🌿 Growing⭐274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

ai4se automated-software-engineering code-generation large-language-models llm software-engineeringby YerbaPage

claude-code-skills 📁v2026.04.10🌿 Growing⭐374

Plugin suite + bundled MCP servers for Claude Code. Full delivery lifecycle: Agile pipeline with multi-model AI review, project bootstrap, documentation generation, codebase audits, performance optimi

agile-workflows ai-agents anthropic claude-ai claude-code claude-code-skills code-analysis code-review javascriptby levnikolaevichJavaScript

echos 📁v0.19.1🌱 Seedling⭐49

Your personal AI knowledge system — self-hosted, agent-driven, and always private.

agent agentic ai personal-assistant second-brain typescriptby albinotonninaTypeScript

charlotte 📁v0.6.1🌿 Growing⭐120

Token-efficient browser MCP server — structured web pages for AI agents, not raw accessibility dumps

ai-agents mcp mcp-server model-context-protocol typescript web-browsing web-scrapingby TickTockBentTypeScript

UltraRAG 📁v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

deepseek demo easy embedding flask gpt huggingface-transformers llm pythonby OpenBMBPython

kuzu-memory 📁v1.12.9🌱 Seedling⭐22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

pythonby bobmatnycPython

DSPex 📁main@2026-04-09🌱 Seedling⭐17

Declarative Self Improving Elixir - DSPy Orchestration in Elixir

ai ai-framework autonomous-systems beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

Suganthans-BigQuery-MCP-Server 📁0.0.0🌱 Seedling⭐25

BigQuery MCP server for Claude — query any BigQuery dataset in natural language, with built-in SEO analysis tools for GSC bulk export data

typescriptby Suganthan-MohanadasanTypeScript

signetai 📁v0.103.3🌱 Seedling⭐113

Local-first identity, memory, and secrets for AI agents. Portable state across models and harnesses.

agent-identity agent-infrastructure agent-memory agent-orchestration agent-state ai-agents ai-memory bun typescriptby Signet-AITypeScript

mcp-google-map 📁v0.0.52🌱 Seedling⭐270

A powerful Model Context Protocol (MCP) server providing comprehensive Google Maps API integration with LLM processing capabilities.

agent-skill ai ai-agent dive geocoding geospatial google-map google-maps typescriptby cablateTypeScript

bv-mcp 📁v2.9.2🌱 Seedling⭐5

Open-source DNS & email security scanner. One MCP endpoint, 57 checks, zero install. Cloudflare Workers.

agentic ai ai-tools cloudflare-workers cybersecurity dkim dmarc dns-security typescriptby MadaBurnsTypeScript

mesh-llm 📁v0.64.0🌱 Seedling⭐834

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

agents ai decentralized distributed llm rustby Mesh-LLMRust

zettelforge 📁v2.4.0🌱 Seedling⭐25

Agentic memory for CTI in Python — STIX knowledge graphs, threat-actor alias resolution, offline-first RAG, MCP server for Claude Code and LangChain agents

agentic-memory ai-agent claude-code cti cybersecurity knowledge-graph langchain llm pythonby rolandpgPython

Ultimate-Agent-Directory 📁0.0.0🌱 Seedling⭐51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

agent agentic agentic-ai agents boilerplate boilerplate-application boilerplate-template pythonby moshehbenavrahamPython

synthadoc 📁v0.1.0🌱 Seedling⭐66

Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-

agent-skills agentic-ai cli-tool domain-adaptation enterprise enterprise-solutions knowledge-graph local-llm pythonby axoviq-aiPython

deepeval 📁v3.9.5🌳 Mature⭐14,701

The LLM Evaluation Framework

evaluation-framework evaluation-metrics llm-evaluation llm-evaluation-framework llm-evaluation-metrics pythonby confident-aiPython

AutoViralAI 📁0.0.0🌱 Seedling⭐11

Autonomous AI agent that researches viral content, generates posts, publishes them, measures engagement — and rewrites its own strategy based on what worked. Self-learning loop powered by LangGraph +

ai-agent autonomous-agent claude content-generation growth-hacking langgraph python self-learning social-mediaby kgarbacinskiPython

agent-skills-standard 📁php-v1.3.2🌱 Seedling⭐391

A collection of Agent Skills standards and best practices for programming languages and frameworks that help AI agents follow best practices for those frameworks and programming languages

agent-agentic-ai android angular best-practices coding-standards cursor-rules flutter typescriptby HoangNguyen0403TypeScript

cortex-hub 📁v0.7.0🌱 Seedling⭐48

Self-hosted AI Agent Memory + Code Intelligence Platform — one MCP endpoint for persistent memory, AST-aware code search, shared knowledge, and quality enforcement across all your AI coding agents.

ai-agents claude-code code-intelligence cursor developer-tools docker knowledge-base mcp typescriptby lktiepTypeScript

Open-Sable 📁v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source pythonby IdeoaLabsPython

typeahead-kmp 📁2.0.4🌱 Seedling⭐9

A lock-free, in-memory fuzzy search engine for Kotlin Multiplatform. L2-normalized sparse vector embeddings with O(1) cosine similarity — handles typos, transpositions, and blind continuation. Zero-al

autocomplete concurrent coroutines cosine-similarity embeddings fuzzy-matching fuzzy-search in-memory kotlin vector-databaseby karlotiKotlin

openakita 📁v1.25.18🌱 Seedling⭐1,613

An open-source AI assistant framework with skills and agent architecture

agent ai assistant automation claw clawd clawdbot openclaw pythonby openakitaPython

plur 📁v0.8.0🌱 Seedling⭐46

Shared memory for AI agents

typescriptby plur-aiTypeScript

DBreeze 📁v1.136🌱 Seedling⭐569

C# .NET NoSQL (key value, object store, embedded TextSearch, SemanticSearch, Vector layer) ACID multi-paradigm database management system.

acid android c#c-sharp clustering database dotnet embedded keyby hhblazeC#

tensorzero 📁2026.4.0🌱 Seedling⭐11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt rustby tensorzeroRust

vikramaditya 📁main@2026-04-20🌱 Seedling⭐5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.

ai-security autonomous-agent bash bug-bounty penetration-testing python recon securityby venkatasPython

infinity 📁v0.7.0-dev5🌱 Seedling⭐4,476

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

ai-native approximate-nearest-neighbor-search bm25 c++cpp20 cpp20-modules embedding full-text-search hnswby infiniflowC++

Wax 📁waxmcp-v0.1.19🌱 Seedling⭐700

Single-file memory layer for AI agents, sub-millisecond RAG on Apple Silicon. Metal Optimized On-Device. No Server. No API. One File. Pure Swift

ai-agents cli coreml coreml-framework data-science machine-learning mcp mcp-server swiftby christopherkaraniSwift

claude-codex-settings 📁v2.3.0🌱 Seedling⭐587

My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.

ai-agents ai-tools claude-ai claude-code claude-code-plugin claude-skills claudecode claudecode-config pythonby fcakyonPython

remembra 📁v0.13.1🌱 Seedling⭐12

Universal memory layer for AI applications. Self-host in minutes. Open source.

ai ai-agents ai-memory claude developer-tools embeddings html knowledge-graph llm ragby remembra-aiHTML

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordbby zilliztechPython

mattermost-plugin-agents 📁v1.14.0🌱 Seedling⭐217

Mattermost Agents plugin supporting multiple LLMs

ai go llm mattermost mattermost-pluginby mattermostGo

skillfoundry 📁v2.0.61🌱 Seedling⭐6

AI engineering framework with quality gates, persistent memory, and multi-platform support. Works inside Claude Code, Cursor, Copilot, Codex, and Gemini.

ai-agents ai-coding ai-framework claude-code code-quality copilot cursor developer-tools typescriptby samibsTypeScript

rex-cli 📁v0.17.0🌱 Seedling⭐27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

ai-agent automation browser-automation claude-code cli codex-cli contextdb gemini-cli javascriptby rexleimoJavaScript

devkit 📁v2.1.29🌱 Seedling⭐2

A deterministic development harness for Claude Code — MCP workflow engine, enforcement hooks, YAML workflows, and multi-agent consensus (Claude + Codex + Gemini)

ai-agents claude-code code-quality developer-tools devops go mcp mcp-server multi-agentby 5uck1essGo

Somi 📁Mineralization🌱 Seedling⭐21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

ai-agents ai-framework arti automation cli gui homeb local pythonby Somi-ProjectPython

DeepCamera 📁v2026.3🌱 Seedling⭐2,689

Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent — watches, understand

ai ai-camera ai-nvr camera cctv computer-vision deep-learning face-recognition javascriptby SharpAIJavaScript

DreamServer 📁v2.0.0🌱 Seedling⭐478

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

ai-agents amd comfyui docker llama-cpp llm local-ai n8n rustby Light-Heart-LabsRust

OpenRA-RL 📁v0.4.1🌱 Seedling⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

pythonby yxc20089Python

Zen-Ai-Pentest 📁v3.0.0🌱 Seedling⭐279

🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️

ai automation compliance cybersecurity ethical-hacking framework penetration-testing pentesting pythonby SHAdd0WTAkaPython

Geneclaw 📁v0.1.0🌱 Seedling⭐34

Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.

ai-agent autonomous-agent evolution llm nanobot python safety self-evolving-aiby Clawland-AIPython

awesome-agent-benchmarks 📁master@2026-04-21🌱 Seedling⭐3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

agent-based-modeling agent-benchmark agentic agentic-ai ai ai-agent ai-models awesomeby axxafo

Nreki 📁v10.5.1🌱 Seedling⭐2

MCP plugin that intercepts AI agent edits in RAM, validates them (TypeScript compiler + gopls + pyright), auto-heals missing imports, and commits atomically. If anything breaks, disk stays untouched

acid-transactions ai-code-review ai-safety auto-healing claude-code code-safety gopls lsp mcp typescriptby Ruso-0TypeScript

OriginDL 📁v1.0.0🌱 Seedling⭐245

Implement a PyTorch-like DL library in C++ from scratch, step by step

ai-framework ai-infra c++cuda deeplearning pytorch yoloby jinbooooomC++

llm-in-sandbox 📁v0.2.0🌱 Seedling⭐216

Computer Environments Elicit General Agentic Intelligence in LLMs

coding-agent computer-use-agent general-agent pythonby llm-in-sandboxPython

m3-memory 📁v2026.4.20🌱 Seedling⭐4

Local-first Agentic Memory Layer for MCP Agents • 25 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • 100% local

agentic-memory ai-agents aider claude-code gdpr gemini-cli hybrid-search local-llm mcp pythonby skynetcmdPython

sofia 📁main@2026-04-11🌱 Seedling⭐2

Autonomous local AI assistant in Go — 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving

ai ai-agent anthropic artificial-intelligence assistant automation autonomous-agent cli goby grasbergGo

coordinode 📁v0.4.1🌱 Seedling⭐1

The graph-native hybrid retrieval engine for AI and GraphRAG. Graph + Vector + Full-Text in a single transactional engine.

ai database embedded-database full-text-search graph-database graphrag hnsw knowledge-graph rust vector-databaseby structured-worldRust

ragas 📁v0.4.3🌱 Seedling⭐13,329

Supercharge Your LLM Application Evaluations 🚀

evaluation llm llmops pythonby explodinggradientsPython

goskills 📁v0.6.0🌱 Seedling⭐176

A tool that supports OpenAI and other LLMs with Claude Skills; you can also use it as a subagent

goby smallnestGo

PromptManager 📁master@2026-04-12🌱 Seedling⭐3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

prompt-engineering pythonby voytas75Python

octobench 📁main@2026-04-21🌱 Seedling⭐1

Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.

agentic agents ai ai-workflow anthropic automation benchmark codexby xInfer123

ASAN-Architecture 📁0.0.0🌱 Seedling⭐6

ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.

agent-caching ai-architecture ai-efficiency ai-framework ai-governance ai-safety asan autopoietic-aiby Variable-Fox

unchained-infra 📁main@2026-04-14🌱 Seedling⭐1

Open infrastructure/control plane for Unchained

browser-agent browser-automation cdp chrome-devtools-protocol claude mcp model-context-protocol open-core pythonby protostatisPython

seraph 📁develop@2026-04-13🌱 Seedling⭐1

An AI guardian that remembers, watches, and acts.

agentic-ai ai-agent fastapi llm macos mcp memory model-context-protocol pythonby seraph-questPython

HealthFlow 📁datasets💤 Dormant⭐40

HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research

ai-for-healthcare ai-for-science ehr llm llm-agent multi-agent pythonby yhzhu99Python

cogames 0.25.7🌱 Seedling

Multi-agent cooperative games

pypiby pypiPython

gepa 📁0.1.1🌱 Seedling

A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.

pypiby pypiPython

pyannote-audio 4.0.4🌱 Seedling

State-of-the-art speaker diarization toolkit

pypiby pypiPython

trafilatura 📁2.0.0🌱 Seedling

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.

corpus html2text natural-language-processing news-crawler pypi scraper tei-xml text-extraction webscrapingby pypiPython

nanobind 📁2.12.0🌱 Seedling

nanobind: tiny and efficient C++/Python bindings

pypiby pypiPython

faster-whisper 📁1.2.1🌱 Seedling

Faster Whisper transcription with CTranslate2

ctranslate2 inference openai pypi quantization speech transformer whisperby Guillaume KleinPython

browser-use 📁0.12.6🌱 Seedling

Make websites accessible for AI agents

pypiby Gregor ZunicPython

timm 📁1.0.26🌱 Seedling

PyTorch Image Models

image-classification pypi pytorchby pypiPython

keras 📁3.14.0🌱 Seedling

Multi-backend Keras

pypiby pypiPython

sentence-transformers 📁5.4.1🌱 Seedling

Embeddings, Retrieval, and Reranking

bert embedding networks nlp pypi pytorch sentence transformer xlnetby pypiPython

graphene 📁3.4.3🌱 Seedling

GraphQL Framework for Python

api graphene graphql protocol pypi relay restby Syrus AkbaryPython

langsmith 📁0.7.33🌱 Seedling

Client library to connect to the LangSmith Observability and Evaluation Platform.

evaluation langchain langsmith language llm nlp platform pypi tracingby pypiPython

uvloop 📁0.22.1🌱 Seedling

Fast implementation of asyncio event loop on top of libuv

asyncio networking pypiby pypiPython

@gaia-agent/sdk 📁0.1.26🌱 Seedling

Production-ready AI agent library using AI SDK v6 ToolLoopAgent for GAIA benchmarks with swappable providers

agent ai ai-sdk autonomous benchmark gaia llm npm toolsby JavaScript

crypto-skill-bench 📁0.1.7🌱 Seedling

Benchmark framework for evaluating crypto skills in AI agent ecosystems

npmby GitHub ActionsJavaScript

KAG 📁v0.8.0💤 Dormant⭐8,668

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base

knowledge-graph large-language-model logical-reasoning multi-hop-question-answering python trustfulnessby OpenSPGPython

SimpleInfer 📁0.0.0⚰️ Archived⭐25

A simple neural network inference framework

ai-framework c++cpp deep-learning inference-engine neural-network xmakeby zpyeC++

FlexRAG 📁0.3.0💤 Dormant⭐235

FlexRAG: A RAG Framework for Information Retrieval and Generation.

llms nlp python ragby ictnlpPython

Qwen-Agent 📁v0.0.26💤 Dormant⭐15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

pythonby QwenLMPython

fastRAG 📁v3.1.2💤 Dormant⭐1,772

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal pythonby IntelLabsPython