freshcrate

Tag: #agent-benchmark

2 packages • ⭐ 60 total stars

ai-agents-reality-check • 0.0.0 • 🌿 Growing • ⭐ 57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

awesome-agent-benchmarks • master@2026-04-21 • 🌱 Seedling • ⭐ 3

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.