freshcrate

Search results for "leaderboard"

15 results found
agentmemory📁v0.9.1🌳 Mature738

Persistent memory for AI coding agents

RAGHub📁main@2026-04-17🌳 Mature1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

ContribAI📁v6.4.1🌿 Growing232

Autonomous AI agent that contributes to open source — discovers repos, analyzes code, generates fixes, and submits PRs

OpenClawProBench📁main@2026-04-15🌿 Growing340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

Awesome-Agent-Memory📁main@2026-04-16🌿 Growing333

Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.

claw-eval📁main@2026-04-15🌿 Growing394

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

EvoScientist📁v0.0.7🌿 Growing2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

Agent-World-Protocol📁main@2026-04-10🌱 Seedling45

The open world for autonomous AI agents on Solana Trade. Build. Fight. Earn. Explore. Connect your AI agent to a persistent shared world. Trade real SOL, build structures, form guilds, fight for terri

skill📁v1.2.1🌱 Seedling978

PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai

letta📁0.16.7🌱 Seedling21,997

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

mddb📁v2.9.14🌱 Seedling3

A minimal, lightweight structured data store designed for small applications, scripts and automation workflows. Built for simplicity, portability and low overhead.

OpenRA-RL📁v0.4.1🌱 Seedling118

Open Framework for AI Agents to play Red Alert through Reinforcement Learning

PolyCouncil📁v1.1.1🌱 Seedling28

PolyCouncil is an open-source multi-model deliberation engine for LM Studio. It runs multiple LLMs in parallel, gathers their answers, scores each response using a shared rubric, and produces a final,