freshcrate — Search

Search results for "hallucination"

19 results found (Python)

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

Auto-claude-code-research-in-sleep 📁v0.4.4🌳 Mature⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex pythonby wanshuiyinPython

hermes-plugins 📁0.0.0🌱 Seedling⭐21

Custom plugins for hermes-agent — goal management, inter-agent bridge, model selection, cost control

ai-agent autonomous-agent hermes-agent open-source plugins pythonby 42-eveyPython

openlit 📁openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain pythonby openlitPython

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for Everyone - Monitoring and governing for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring pythonby arthur-aiPython

tsunami 📁main@2026-04-21🌱 Seedling⭐13

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

agentic-ai ai-agent ai-coding-assistant app-builder autonomous-agent code-generation coding-agent developer-tools pythonby gobbleyourdongPython

openbrep 📁main@2026-04-21🌱 Seedling⭐16

OpenBrep: 用自然语言驱动 ArchiCAD GDL 库对象的创建、修改与编译

ai-agent archicad bim code-generation llm openbrep parametric parametric-design pythonby byewind1Python

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

llm llm-agent pythonby Lyz103Python

evals 📁v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

agentic agentic-ai ai evaluation machine-learning python strands-agentsby strands-agentsPython

codec 📁main@2026-04-16🌿 Growing⭐89

Open-Source Intelligent Command Layer

llm-agent llm-agent-framework local-ai local-ai-agents local-ai-development local-ai-llm mac-os mlx pythonby AVADSA25Python

parlant 📁v3.3.1🌿 Growing⭐17,899

The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.

ai-agents ai-alignment customer-service customer-success gemini genai hacktoberfest llama3 pythonby emcie-coPython

deepeval 📁v3.9.5🌳 Mature⭐14,701

The LLM Evaluation Framework

evaluation-framework evaluation-metrics llm-evaluation llm-evaluation-framework llm-evaluation-metrics pythonby confident-aiPython

droid-llm-hunter 📁v1.0.0🌱 Seedling⭐95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

android python scanning-tool vulnerability-scannersby roomkangaliPython

DOX 📁main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

hallucination llm python rag retrieval-augmented-generation vibecodingby AmMoPyPython

uniAI 📁0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

ai chromadb django genai information-retrieval llm local-llm ollama python vector-databaseby git-pratap-shreyPython

LLM-API-Key-Proxy 📁main/build-20260123-1-bf7ab7e🌱 Seedling⭐448

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

api-key gemini-api large-language-model large-language-models llm pythonby MirrowelPython

LettuceDetect 📁0.1.8💤 Dormant⭐545

Lightweight hallucination detection framework for RAG applications

bert hallucination-detection hallucination-evaluation information-extraction nlp python pytorch token-classificationby KRLabsOrgPython

algorithm-11 📁v1.0.0🌱 Seedling⭐2

A structured reasoning and decision architecture for stable, interpretable, and hallucination‑resistant AI systems. An open standard for human–AI collaboration and autonomous systems.

ai ai-collaboration ai-framework ai-safety alignment architecture artificial-intelligence autonomous-systems pythonby gormenz-svgPython

RagaAI-Catalyst 📁v2.2.4💤 Dormant⭐16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

agentic-ai agentic-ai-development agentneo agents ai-agent-monitoring ai-application-debugging ai-evaluation-tools ai-performance-optimization pythonby raga-ai-hubPython