freshcrate

Search results for "benchmark"

155 results found
pilot v2.99.1 🌿 Growing ⭐ 395

#1 Terminal Benchmark 2.0 - AI that ships your tickets.

ai-agents-reality-check 0.0.0 🌿 Growing ⭐ 57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

openclaw-engram v9.3.142 🌿 Growing ⭐ 54

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction, plain markdown storage, hybrid search via QMD. Gives agents persistent long-term memory across conversations.

PraisonAI v4.6.25 🌳 Mature ⭐ 6,900

PraisonAI 🦞 - Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R

llama.cpp b8864 🌳 Mature ⭐ 103,119

LLM inference in C/C++

agentmemory v0.9.1 🌳 Mature ⭐ 738

Persistent memory for AI coding agents

opik 2.0.6 🌳 Mature ⭐ 18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

mcp-memory-service v10.39.1 🌳 Mature ⭐ 1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

LeanKG v0.16.5 🌱 Seedling ⭐ 32

LeanKG: Stop Burning Tokens. Start Coding Lean.

byterover-cli v3.7.1 🌳 Mature ⭐ 4,422

ByteRover CLI (brv) - The portable memory layer for autonomous coding agents (formerly Cipher)

Auto-claude-code-research-in-sleep v0.4.4 🌳 Mature ⭐ 6,182

ARIS ⚔️ (Auto-Research-In-Sleep) - Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in - works wi

OmniRoute v3.6.9 🌳 Mature ⭐ 2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

jcodemunch-mcp v1.70.0 🌳 Mature ⭐ 1,523

The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing

mentisdb 0.9.3.39 🌿 Growing ⭐ 56

Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett

osaurus 0.16.16 🌳 Mature ⭐ 4,912

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

agentic-memory 0.0.0 🌿 Growing ⭐ 162

No description

by lhl
LRAT 0.0.0 🌱 Seedling ⭐ 34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

cactus 0.0.0 🌿 Growing ⭐ 50

LLM Agent that leverages cheminformatics tools to provide informed responses.

GEA 0.0.0 🌱 Seedling ⭐ 23

Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

chinese-llm-benchmark v5.9 🌿 Growing ⭐ 5,841

ReLE evaluation: a capability benchmark for Chinese AI large models (continuously updated). It currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, Min

arthur-engine 2.1.529 🌿 Growing ⭐ 75

Make AI work for Everyone - Monitoring and governing for your AI/ML

ISC-Bench v0.0.5 🌿 Growing ⭐ 786

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

SocratiCode v1.6.1 🌿 Growing ⭐ 810

Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra

ClawRouter v0.12.158 🌿 Growing ⭐ 6,186

The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.

mem0 openclaw-v1.0.7 🌿 Growing ⭐ 52,660

Universal memory layer for AI Agents

sdl-mcp v0.10.7 🌿 Growing ⭐ 121

SDL-MCP (Symbol Delta Ledger MCP Server) is a cards-first context system for coding agents that saves tokens and improves context.

SmolVM v0.0.10 🌿 Growing ⭐ 233

Open-source sandboxes for code execution, browser use, and AI agents.

cognithor v0.92.2 🌿 Growing ⭐ 94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

claude-mem-lite v2.34.4 🌱 Seedling ⭐ 32

Lightweight persistent memory system for Claude Code - FTS5 search, episode batching, error-triggered recall

Autonomous-Agents main@2026-04-16 🌿 Growing ⭐ 1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

almide v0.15.0 🌱 Seedling ⭐ 15

A functional programming language optimized for LLM code generation. Compiles to Rust and WebAssembly.

CodeGen 0.0.0 🌳 Mature ⭐ 773

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pr

Awesome-Context-Engineering 0.0.0 🌳 Mature ⭐ 3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

models main@2026-04-21 🌿 Growing ⭐ 72

This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.

awesome-code-agents main@2026-04-20 🌿 Growing ⭐ 94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding - they're redefining how software changes the world.

DeepClaude v1.0.1 🌳 Mature ⭐ 2,788

Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible.

vector-db-benchmark master@2026-04-17 🌿 Growing ⭐ 356

Framework for benchmarking vector search engines

claude-flows 0.0.0 🌿 Growing ⭐ 93

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architect

Awesome-Agent-Memory main@2026-04-16 🌿 Growing ⭐ 333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

OpenClawProBench main@2026-04-15 🌿 Growing ⭐ 340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

FastExpressionCompiler v5.4.1 🌿 Growing ⭐ 1,355

Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.

Memori v3.3.0 🌿 Growing ⭐ 13,290

Memori is agent-native memory infrastructure. A SQL-native, LLM-agnostic layer that turns agent execution and conversation into structured, persistent state for production systems.

mcp-devtools v0.59.53 🌿 Growing ⭐ 133

A modular MCP server that provides commonly used developer tools for AI coding agents

AgenticX v0.3.7 🌿 Growing ⭐ 105

AgenticX is a unified, production-ready multi-agent platform - Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

synaptic-memory v0.16.0 🌱 Seedling ⭐ 25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

context-mode v1.0.89 🌿 Growing ⭐ 7,020

Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms

arag v0.1.0 🌿 Growing ⭐ 247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

memind main@2026-04-21 🌿 Growing ⭐ 360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

Awesome-World-Models main@2026-04-21 🌿 Growing ⭐ 1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

awesome-prompts main@2026-04-21 🌿 Growing ⭐ 7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

aura main@2026-04-21 🌱 Seedling ⭐ 47

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules - running loca

LLM-Agent-Paper-daily main@2026-04-21 🌱 Seedling ⭐ 20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

NeuronFS main@2026-04-21 🌿 Growing ⭐ 136

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI - ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

multi-agent-ralph-loop main@2026-04-20 🌿 Growing ⭐ 113

Autonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), parallel-first Agent Teams (6 teammates), Aristotle First Principles methodology,

llm_context_benchmarks 0.0.0 🌱 Seedling ⭐ 59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

skills-vote main@2026-04-19 🌱 Seedling ⭐ 31

The Next-Gen Agent-Native Skill Recommendation Engine

zeroclaw v0.7.3 🌿 Growing ⭐ 29,983

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM - deploy anywhere, swap anything 🦀

SmarterRouter 2.2.5 🌿 Growing ⭐ 105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

medusa v2026.5.5 🌿 Growing ⭐ 252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

zvec v0.3.1 🌿 Growing ⭐ 9,287

A lightweight, lightning-fast, in-process vector database

SciAgent-Skills main@2026-04-17 🌿 Growing ⭐ 93

Life sciences computational skills for scientific AI agents

milvus v2.6.15 🌿 Growing ⭐ 43,734

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

maverick-mcp main@2026-04-17 🌿 Growing ⭐ 479

MaverickMCP - Personal Stock Analysis MCP Server

biomcp v0.8.21 🌿 Growing ⭐ 488

BioMCP: Biomedical Model Context Protocol

AReaL v1.0.3 🌿 Growing ⭐ 5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

olb v1.0.0 🌱 Seedling ⭐ 18

High-performance zero-dependency L4/L7 load balancer written in Go. Single binary with Web UI, clustering, MCP/AI integration. 8.5K RPS, 39 E2E tests.

AutoGPT autogpt-platform-beta-v0.6.56 🌿 Growing ⭐ 183,319

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

paiml-mcp-agent-toolkit v3.14.0 🌿 Growing ⭐ 148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

claw-eval main@2026-04-15 🌿 Growing ⭐ 394

Claw-Eval is a harness for evaluating LLMs as agents. All tasks verified by humans.

llmware v0.4.6 🌿 Growing ⭐ 14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

cognitive-dissonance-dspy main@2026-04-14 🌿 Growing ⭐ 276

A multi-agent LLM system for detecting and resolving cognitive dissonance.

rag-chatbot main@2026-04-14 🌿 Growing ⭐ 402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

skill v1.2.1 🌱 Seedling ⭐ 978

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

octocode 0.14.0 🌿 Growing ⭐ 319

Semantic code searcher and codebase utility

awesome-vector-database main@2026-04-13 🌿 Growing ⭐ 341

A curated list of awesome works related to high dimensional structure/vector search & database

vllm-mlx v0.2.8 🌿 Growing ⭐ 798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

prism-mcp v9.3.0 🌿 Growing ⭐ 116

The Mind Palace for AI Agents - Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

EvoScientist v0.0.7 🌿 Growing ⭐ 2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

AutoRAG v0.3.22 🌱 Seedling ⭐ 4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Awesome-Repo-Level-Code-Generation main@2026-04-10 🌿 Growing ⭐ 274

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

claude-code-skills v2026.04.10 🌿 Growing ⭐ 374

Plugin suite + bundled MCP servers for Claude Code. Full delivery lifecycle: Agile pipeline with multi-model AI review, project bootstrap, documentation generation, codebase audits, performance optimi

echos v0.19.1 🌱 Seedling ⭐ 49

Your personal AI knowledge system - self-hosted, agent-driven, and always private.

charlotte v0.6.1 🌿 Growing ⭐ 120

Token-efficient browser MCP server - structured web pages for AI agents, not raw accessibility dumps

UltraRAG v0.3.0.2 🌿 Growing ⭐ 5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

kuzu-memory v1.12.9 🌱 Seedling ⭐ 22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

DSPex main@2026-04-09 🌱 Seedling ⭐ 17

Declarative Self Improving Elixir - DSPy Orchestration in Elixir

Suganthans-BigQuery-MCP-Server 0.0.0 🌱 Seedling ⭐ 25

BigQuery MCP server for Claude - query any BigQuery dataset in natural language, with built-in SEO analysis tools for GSC bulk export data

signetai v0.103.3 🌱 Seedling ⭐ 113

Local-first identity, memory, and secrets for AI agents. Portable state across models and harnesses.

mcp-google-map v0.0.52 🌱 Seedling ⭐ 270

A powerful Model Context Protocol (MCP) server providing comprehensive Google Maps API integration with LLM processing capabilities.

bv-mcp v2.9.2 🌱 Seedling ⭐ 5

Open-source DNS & email security scanner. One MCP endpoint, 57 checks, zero install. Cloudflare Workers.

mesh-llm v0.64.0 🌱 Seedling ⭐ 834

Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat.

zettelforge v2.4.0 🌱 Seedling ⭐ 25

Agentic memory for CTI in Python - STIX knowledge graphs, threat-actor alias resolution, offline-first RAG, MCP server for Claude Code and LangChain agents

Ultimate-Agent-Directory 0.0.0 🌱 Seedling ⭐ 51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

synthadoc v0.1.0 🌱 Seedling ⭐ 66

Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-

AutoViralAI 0.0.0 🌱 Seedling ⭐ 11

Autonomous AI agent that researches viral content, generates posts, publishes them, measures engagement, and rewrites its own strategy based on what worked. Self-learning loop powered by LangGraph +

agent-skills-standard php-v1.3.2 🌱 Seedling ⭐ 391

A collection of Agent Skills standards and best practices for programming languages and frameworks that help AI agents follow best practices for those frameworks and languages

cortex-hub v0.7.0 🌱 Seedling ⭐ 48

Self-hosted AI Agent Memory + Code Intelligence Platform - one MCP endpoint for persistent memory, AST-aware code search, shared knowledge, and quality enforcement across all your AI coding agents.

Open-Sable v1.7.0 🌱 Seedling ⭐ 18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

typeahead-kmp 2.0.4 🌱 Seedling ⭐ 9

A lock-free, in-memory fuzzy search engine for Kotlin Multiplatform. L2-normalized sparse vector embeddings with O(1) cosine similarity - handles typos, transpositions, and blind continuation. Zero-al

openakita v1.25.18 🌱 Seedling ⭐ 1,613

An open-source AI assistant framework with skills and agent architecture

plur v0.8.0 🌱 Seedling ⭐ 46

Shared memory for AI agents

DBreeze v1.136 🌱 Seedling ⭐ 569

C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID multi-paradigm database management system.

tensorzero 2026.4.0 🌱 Seedling ⭐ 11,204

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

vikramaditya main@2026-04-20 🌱 Seedling ⭐ 5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) - it hunts, it reports. Inspired by the Obsidian Order.

infinity v0.7.0-dev5 🌱 Seedling ⭐ 4,476

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

Wax waxmcp-v0.1.19 🌱 Seedling ⭐ 700

Single-file memory layer for AI agents, sub-millisecond RAG on Apple Silicon. Metal Optimized On-Device. No Server. No API. One File. Pure Swift

claude-codex-settings v2.3.0 🌱 Seedling ⭐ 587

My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.

remembra v0.13.1 🌱 Seedling ⭐ 12

Universal memory layer for AI applications. Self-host in minutes. Open source.

mattermost-plugin-agents v1.14.0 🌱 Seedling ⭐ 217

Mattermost Agents plugin supporting multiple LLMs

skillfoundry v2.0.61 🌱 Seedling ⭐ 6

AI engineering framework with quality gates, persistent memory, and multi-platform support. Works inside Claude Code, Cursor, Copilot, Codex, and Gemini.

rex-cli v0.17.0 🌱 Seedling ⭐ 27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

devkit v2.1.29 🌱 Seedling ⭐ 2

A deterministic development harness for Claude Code - MCP workflow engine, enforcement hooks, YAML workflows, and multi-agent consensus (Claude + Codex + Gemini)

Somi Mineralization 🌱 Seedling ⭐ 21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features - fully self-hosted via Ollama.

DeepCamera v2026.3 🌱 Seedling ⭐ 2,689

Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent - watches, understand

DreamServer v2.0.0 🌱 Seedling ⭐ 478

Local AI anywhere, for everyone - LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

OpenRA-RL v0.4.1 🌱 Seedling ⭐ 118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

Zen-Ai-Pentest v3.0.0 🌱 Seedling ⭐ 279

🛡⚔️ AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting 🛡⚔️

Geneclaw v0.1.0 🌱 Seedling ⭐ 34

Self-evolving AI agent framework with 5-layer safety gatekeeper. Agents observe failures, propose fixes, and safely apply them. Built on HKUDS/nanobot.

awesome-agent-benchmarks master@2026-04-21 🌱 Seedling ⭐ 3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Nreki v10.5.1 🌱 Seedling ⭐ 2

MCP plugin that intercepts AI agent edits in RAM, validates them (TypeScript compiler + gopls + pyright), auto-heals missing imports, and commits atomically. If anything breaks, disk stays untouched

OriginDL v1.0.0 🌱 Seedling ⭐ 245

Implement a Pytorch-like DL library in C++ from scratch, step by step

llm-in-sandbox v0.2.0 🌱 Seedling ⭐ 216

Computer Environments Elicit General Agentic Intelligence in LLMs

m3-memory v2026.4.20 🌱 Seedling ⭐ 4

Local-first Agentic Memory Layer for MCP Agents • 25 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • 100% local

sofia main@2026-04-11 🌱 Seedling ⭐ 2

Autonomous local AI assistant in Go - 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving

coordinode v0.4.1 🌱 Seedling ⭐ 1

The graph-native hybrid retrieval engine for AI and GraphRAG. Graph + Vector + Full-Text in a single transactional engine.

ragas v0.4.3 🌱 Seedling ⭐ 13,329

Supercharge Your LLM Application Evaluations 🚀

goskills v0.6.0 🌱 Seedling ⭐ 176

A tool that supports OpenAI and other LLMs with Claude Skills; you can also use it as a subagent

PromptManager master@2026-04-12 🌱 Seedling ⭐ 3

PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.

octobench main@2026-04-21 🌱 Seedling ⭐ 1

Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.

ASAN-Architecture 0.0.0 🌱 Seedling ⭐ 6

ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.

seraph develop@2026-04-13 🌱 Seedling ⭐ 1

An AI guardian that remembers, watches, and acts.

HealthFlow datasets 💤 Dormant ⭐ 40

HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research

cogames 0.25.7 🌱 Seedling

Multi-agent cooperative games

gepa 0.1.1 🌱 Seedling

A framework for optimizing textual system components (AI prompts, code snippets, etc.) using LLM-based reflection and Pareto-efficient evolutionary search.

pyannote-audio 4.0.4 🌱 Seedling

State-of-the-art speaker diarization toolkit

trafilatura 2.0.0 🌱 Seedling

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.

nanobind 2.12.0 🌱 Seedling

nanobind: tiny and efficient C++/Python bindings

browser-use 0.12.6 🌱 Seedling

Make websites accessible for AI agents

timm 1.0.26 🌱 Seedling

PyTorch Image Models

keras 3.14.0 🌱 Seedling

Multi-backend Keras

graphene 3.4.3 🌱 Seedling

GraphQL Framework for Python

langsmith 0.7.33 🌱 Seedling

Client library to connect to the LangSmith Observability and Evaluation Platform.

uvloop 0.22.1 🌱 Seedling

Fast implementation of asyncio event loop on top of libuv

@gaia-agent/sdk 0.1.26 🌱 Seedling

Production-ready AI agent library using AI SDK v6 ToolLoopAgent for GAIA benchmarks with swappable providers

crypto-skill-bench 0.1.7 🌱 Seedling

Benchmark framework for evaluating crypto skills in AI agent ecosystems

KAG v0.8.0 💤 Dormant ⭐ 8,668

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base

SimpleInfer 0.0.0 ⚰️ Archived ⭐ 25

A simple neural network inference framework

FlexRAG 0.3.0 💤 Dormant ⭐ 235

FlexRAG: A RAG Framework for Information Retrieval and Generation.

Qwen-Agent v0.0.26 💤 Dormant ⭐ 15,963

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

fastRAG v3.1.2 💤 Dormant ⭐ 1,772

Efficient Retrieval Augmentation and Generation Framework