freshcrate
Home > Testing > awesome-agent-benchmarks

awesome-agent-benchmarks

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Description

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Release History

VersionChangesUrgencyDate
master@2026-04-21Latest activity on master branchHigh4/21/2026
0.0.0No release found — using repo HEADHigh4/9/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

opentulpaSelf-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves imain@2026-04-17
agentic-rag📄 Enable smart document and data search with AI-powered chat, vector search, and SQL querying across multiple file formats.main@2026-04-21
openclaw-gateway🔗 Connect your phone directly to AI agents with OpenClaw Gateway, an open-source WebSocket solution free from third-party oversight.main@2026-04-21
mcp-mifosxModel Context Protocol - MCP for Mifos Xmain@2026-04-21
ios-agentic-skills🔍 Discover and utilize agentic iOS/watchOS audit skills and playbooks for consistent quality assurance in your applications.master@2026-04-21