freshcrate

Tag: #llm-eval

3 packages • ⭐ 28,932 total stars

promptfoo 0.121.14 🏛️ Flagship ⭐ 20,382

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

giskard-oss giskard-checks/v1.0.2b3 🏛️ Flagship ⭐ 5,289

🐢 Open-Source Evaluation & Testing library for LLM Agents

trulens trulens-2.8.1 🌳 Mature ⭐ 3,261

Evaluation and Tracking for LLM Experiments and AI Agents