freshcrate

Search results for "llm-as-a-judge"

9 results found
agenta 📁 v0.96.7 🌳 Mature 4,011

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

opik 📁 2.0.6 🌳 Mature 18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

openlit 📁 openlit-1.18.1 🌿 Growing 2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

Autonomous-Agents 📁 main@2026-04-16 🌿 Growing 1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

memind 📁 main@2026-04-21 🌿 Growing 360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

LLM-Agent-Paper-daily 📁 main@2026-04-21 🌱 Seedling 20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours).

agentscope 📁 v1.0.19 🌿 Growing 23,421

Build and run agents you can see, understand and trust.

evals 📁 v0.1.15 🌿 Growing 103

A comprehensive evaluation framework for AI agents and LLM applications.