freshcrate — Search

Search results for "leaderboard"

15 results found

agentmemory 📁v0.9.1🌳 Mature⭐738

Persistent memory for AI coding agents

RAGHub 📁main@2026-04-17🌳 Mature⭐1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

ai artificial-intelligence large-language-models llm machine-learning natural-language-processing nlp open-sourceby Andrew-Jang

ContribAI 📁v6.4.1🌿 Growing⭐232

Autonomous AI agent that contributes to open source — discovers repos, analyzes code, generates fixes, and submits PRs

agent ai ai-agent automation autonomous-agent code-analysis code-quality contributions rustby tang-vuRust

OpenClawProBench 📁main@2026-04-15🌿 Growing⭐340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw pythonby suyoumoPython

Awesome-Agent-Memory 📁main@2026-04-16🌿 Growing⭐333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

agent-memory ai-agent ai-agent-memory awesome-agent-memory llm-memory memory memory-management multimodal-llm-memoryby TeleAI-UAGI

claw-eval 📁main@2026-04-15🌿 Growing⭐394

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

agent harness llm openclaw pythonby claw-evalPython

rag-chatbot 📁main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

EvoScientist 📁v0.0.7🌿 Growing⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

ai-agent ai4science multi-agent-system python vibe-researchby EvoScientistPython

Agent-World-Protocol 📁main@2026-04-10🌱 Seedling⭐45

The open world for autonomous AI agents on Solana Trade. Build. Fight. Earn. Explore. Connect your AI agent to a persistent shared world. Trade real SOL, build structures, form guilds, fight for terri

ai ai-agent ai-agents anthropic autogpt claude crewai eliza rustby 0xMerl99Rust

skill 📁v1.2.1🌱 Seedling⭐978

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

pythonby pinchbenchPython

letta 📁0.16.7🌱 Seedling⭐21,997

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

ai ai-agents llm llm-agent pythonby letta-aiPython

mddb 📁v2.9.14🌱 Seedling⭐3

A minimal, lightweight structured data store designed for small applications, scripts and automation workflows. Built for simplicity, portability and low overhead.

automation database embedded-systems go golang key-value-store lightweight-database markdown vector-databaseby tradikGo

OpenRA-RL 📁v0.4.1🌱 Seedling⭐118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

pythonby yxc20089Python

PolyCouncil 📁v1.1.1🌱 Seedling⭐28

PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,

ai ai-council ai-experiments ai-framework ai-research artificial-intelligence asyncio concensus pythonby TrentPiercePython

VectorDBBench 📁v1.0.20🌱 Seedling⭐1,068

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordbby zilliztechPython