awesome-agent-benchmarks
🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.
Description
🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.
Release History
| Version | Changes | Urgency | Date |
|---|---|---|---|
| master@2026-04-21 | Latest activity on master branch | High | 4/21/2026 |
| 0.0.0 | No release found — using repo HEAD | High | 4/9/2026 |
Dependencies & License Audit
Loading dependencies...
Similar Packages
opentulpaSelf-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves imain@2026-04-17
agentic-rag📄 Enable smart document and data search with AI-powered chat, vector search, and SQL querying across multiple file formats.main@2026-04-21
openclaw-gateway🔗 Connect your phone directly to AI agents with OpenClaw Gateway, an open-source WebSocket solution free from third-party oversight.main@2026-04-21
ios-agentic-skills🔍 Discover and utilize agentic iOS/watchOS audit skills and playbooks for consistent quality assurance in your applications.master@2026-04-21
