freshcrate — Search

Search results for "hallucination"

32 results found

opik 📁2.0.6🌳 Mature⭐18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability pythonby comet-mlPython

Auto-claude-code-research-in-sleep 📁v0.4.4🌳 Mature⭐6,182

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works wi

ai-research ai-tools aris autonomous-agent claude claude-code claude-code-skills codex pythonby wanshuiyinPython

Agent 📁1.0.75.164🌱 Seedling⭐30

Agent! connects any AI to your Mac. 13 LLM providers — cloud, local, or on-device. It writes code, builds Xcode projects, manages git, organizes files, automates Safari, controls any app, and handl

accessibility ae agentic agentic-ai agentic-framework agentic-workflow agenticai ai swiftby macOS26Swift

RAGHub 📁main@2026-04-17🌳 Mature⭐1,712

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

ai artificial-intelligence large-language-models llm machine-learning natural-language-processing nlp open-sourceby Andrew-Jang

WeKnora 📁v0.4.0🌳 Mature⭐13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agent agentic ai chatbot chatbots embeddings evaluation generative-ai goby TencentGo

hermes-plugins 📁0.0.0🌱 Seedling⭐21

Custom plugins for hermes-agent — goal management, inter-agent bridge, model selection, cost control

ai-agent autonomous-agent hermes-agent open-source plugins pythonby 42-eveyPython

openlit 📁openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain pythonby openlitPython

Autonomous-Agents 📁main@2026-04-16🌿 Growing⭐1,211

Autonomous Agents (LLMs) research papers. Updated Daily.

agent agentic agentic-ai agents ai ai-agents aiagent aiagentsby tmgthb

Awesome-Context-Engineering 📁0.0.0🌳 Mature⭐3,045

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

agent agentic-ai agi awesome-list cognitive-science context-engineering llm ragby Meirtz

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for Everyone - Monitoring and governing for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring pythonby arthur-aiPython

Awesome-World-Models 📁main@2026-04-21🌿 Growing⭐1,473

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website

artificial-intelligence autonomous-driving awesome deep-learning embodied-ai future-prediction video-prediction world-modelby leofan90

tsunami 📁main@2026-04-21🌱 Seedling⭐13

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

agentic-ai ai-agent ai-coding-assistant app-builder autonomous-agent code-generation coding-agent developer-tools pythonby gobbleyourdongPython

awesome-prompts 📁main@2026-04-21🌿 Growing⭐7,572

Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.

awesome awesome-list chatgpt gpt4 gpts gptstore papers prompt prompt-engineeringby ai-boost

openbrep 📁main@2026-04-21🌱 Seedling⭐16

OpenBrep: 用自然语言驱动 ArchiCAD GDL 库对象的创建、修改与编译

ai-agent archicad bim code-generation llm openbrep parametric parametric-design pythonby byewind1Python

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

llm llm-agent pythonby Lyz103Python

evals 📁v0.1.15🌿 Growing⭐103

A comprehensive evaluation framework for AI agents and LLM applications.

agentic agentic-ai ai evaluation machine-learning python strands-agentsby strands-agentsPython

codec 📁main@2026-04-16🌿 Growing⭐89

Open-Source Intelligent Command Layer

llm-agent llm-agent-framework local-ai local-ai-agents local-ai-development local-ai-llm mac-os mlx pythonby AVADSA25Python

parlant 📁v3.3.1🌿 Growing⭐17,899

The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.

ai-agents ai-alignment customer-service customer-success gemini genai hacktoberfest llama3 pythonby emcie-coPython

prism-mcp 📁v9.3.0🌿 Growing⭐116

The Mind Palace for AI Agents — Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

agent-memory ai-agent anti-sycophancy claude-desktop cognitive-architecture google-gemini hebbian-learning llm-tools typescriptby dcostencoTypeScript

ds_ex 📁main@2026-04-09🌱 Seedling⭐17

DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program Optimization Framework

ai ai-framework automated-optimization beam declarative-programming dspy elixir erlang-vmby nshkrdotcomElixir

deepeval 📁v3.9.5🌳 Mature⭐14,701

The LLM Evaluation Framework

evaluation-framework evaluation-metrics llm-evaluation llm-evaluation-framework llm-evaluation-metrics pythonby confident-aiPython

droid-llm-hunter 📁v1.0.0🌱 Seedling⭐95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

android python scanning-tool vulnerability-scannersby roomkangaliPython

seekdb 📁v1.2.0🌱 Seedling⭐2,505

The AI-Native Search Database. Unifies vector, text, structured and semi-structured data in a single engine, enabling hybrid search and in-database AI workflows.

ai-search ai-search-engine c++column-storage cpp database embedded-database fulltext fulltext-searchby oceanbaseC++

DOX 📁main@2026-04-15🌱 Seedling⭐1

Broken RAG For The Broken Souls

hallucination llm python rag retrieval-augmented-generation vibecodingby AmMoPyPython

uniAI 📁0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

ai chromadb django genai information-retrieval llm local-llm ollama python vector-databaseby git-pratap-shreyPython

LLM-API-Key-Proxy 📁main/build-20260123-1-bf7ab7e🌱 Seedling⭐448

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

api-key gemini-api large-language-model large-language-models llm pythonby MirrowelPython

agentic-news-generator 📁main@2026-04-20🌱 Seedling⭐1

Generate a custom newspaper with an AI agent based on your favorite YouTube channels.

agentic generative-ai jupyter notebook news videoby florianbuetowJupyter Notebook

EliteAgent 📁main@2026-04-17🌱 Seedling⭐1

The ultimate native macOS AI Agent. Blends local MLX SLMs with 3D cognitive Metal rendering and autonomous system integrations.

apple-silicon autonomous-agents hybrid-intelligence llm-agent local-llm macos metal mlx swiftby trgysvcSwift

LettuceDetect 📁0.1.8💤 Dormant⭐545

Lightweight hallucination detection framework for RAG applications

bert hallucination-detection hallucination-evaluation information-extraction nlp python pytorch token-classificationby KRLabsOrgPython

algorithm-11 📁v1.0.0🌱 Seedling⭐2

A structured reasoning and decision architecture for stable, interpretable, and hallucination‑resistant AI systems. An open standard for human–AI collaboration and autonomous systems.

ai ai-collaboration ai-framework ai-safety alignment architecture artificial-intelligence autonomous-systems pythonby gormenz-svgPython

TSUKUYOMI 📁2.6.0💤 Dormant⭐86

TSUKUYOMI is an advanced modular intelligence framework designed for the democratization of Intelligence Analysis via systematic analysis, processing, and reporting across multiple domains. Built on a

ai ai-agent ai-framework js json osint osint-toolby savannah-i-g

RagaAI-Catalyst 📁v2.2.4💤 Dormant⭐16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

agentic-ai agentic-ai-development agentneo agents ai-agent-monitoring ai-application-debugging ai-evaluation-tools ai-performance-optimization pythonby raga-ai-hubPython