1 package • 465 total stars
Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.