Search results for "hallucination"
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
ARIS (Auto-Research-In-Sleep) - Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in - works wi
Custom plugins for hermes-agent - goal management, inter-agent bridge, model selection, cost control
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. Integrates with 50+ LLM Providers,
Make AI work for everyone - monitoring and governance for your AI/ML
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
OpenBrep: Create, modify, and compile ArchiCAD GDL library objects using natural language
Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)
A comprehensive evaluation framework for AI agents and LLM applications.
Open-Source Intelligent Command Layer
The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.
The LLM Evaluation Framework
Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).
Broken RAG For The Broken Souls
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate - built to help studen
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
Lightweight hallucination detection framework for RAG applications
A structured reasoning and decision architecture for stable, interpretable, and hallucination-resistant AI systems. An open standard for human-AI collaboration and autonomous systems.
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, LLM, and tool tracing, debugging of multi-agent systems, a self-hosted dashboard, and advanced anal
