freshcrate

Search results for "llm-as-a-judge"

Clear filters
6 results found (Python)
opik📁2.0.6🌳 Mature18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

openlit📁openlit-1.18.1🌿 Growing2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

LLM-Agent-Paper-daily📁main@2026-04-21🌱 Seedling20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

agentscope📁v1.0.19🌿 Growing23,421

Build and run agents you can see, understand and trust.

evals📁v0.1.15🌿 Growing103

A comprehensive evaluation framework for AI agents and LLM applications.