freshcrate — #llm-eval

Home > #llm-eval

Tag: #llm-eval

3 packages • ⭐ 28,932 total stars

promptfoo0.121.19🏛️ Flagship⭐20,382

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation typescriptby promptfoo

giskard-ossgiskard-scan/v1.0.0b3🏛️ Flagship⭐5,289

🐢 Open-Source Evaluation & Testing library for LLM Agents

agent-evaluation ai-red-team ai-security ai-testing fairness-ai llm llm-eval llm-evaluation pythonby Giskard-AI

trulenstrulens-2.9.0🌳 Mature⭐3,261

Evaluation and Tracking for LLM Experiments and AI Agents

agent-evaluation agentops ai-agents ai-monitoring ai-observability evals explainable-ml llm-eval pythonby truera

Tag: #llm-eval

Trending in #llm-eval