1 package • ⭐ 394 total stars
Claw-Eval is an evaluation harness for LLMs acting as agents. All tasks are verified by humans.