freshcrate

Search results for "lua"

40 results found
opik๐Ÿ“2.0.6๐ŸŒณ Matureโญ18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

agenta๐Ÿ“v0.96.7๐ŸŒณ Matureโญ4,011

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

WeKnora๐Ÿ“v0.4.0๐ŸŒณ Matureโญ13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

ai-agents-reality-check๐Ÿ“0.0.0๐ŸŒฟ Growingโญ57

Benchmarking the gap between AI agent hype and architecture. Three agent archetypes, 73-point performance spread, stress testing, network resilience, and ensemble coordination analysis with statistica

chinese-llm-benchmark๐Ÿ“v5.9๐ŸŒฟ Growingโญ5,841

ReLE่ฏ„ๆต‹๏ผšไธญๆ–‡AIๅคงๆจกๅž‹่ƒฝๅŠ›่ฏ„ๆต‹๏ผˆๆŒ็ปญๆ›ดๆ–ฐ๏ผ‰๏ผš็›ฎๅ‰ๅทฒๅ›Šๆ‹ฌ359ไธชๅคงๆจกๅž‹๏ผŒ่ฆ†็›–chatgptใ€gpt-5.2ใ€o4-miniใ€่ฐทๆญŒgemini-3-proใ€Claude-4.6ใ€ๆ–‡ๅฟƒERNIE-X1.1ใ€ERNIE-5.0ใ€qwen3-maxใ€qwen3.5-plusใ€็™พๅทใ€่ฎฏ้ฃžๆ˜Ÿ็ซใ€ๅ•†ๆฑคsenseChat็ญ‰ๅ•†็”จๆจกๅž‹๏ผŒ ไปฅๅŠstep3.5-flashใ€kimi-k2.5ใ€ernie4.5ใ€Min

arthur-engine๐Ÿ“2.1.529๐ŸŒฟ Growingโญ75

Make AI work for Everyone - Monitoring and governing for your AI/ML

langfuse๐Ÿ“v3.169.0๐ŸŒฟ Growingโญ24,578

๐Ÿชข Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. ๐ŸŠYC W23

promptfoo๐Ÿ“code-scan-action-0.1.5๐ŸŒฟ Growingโญ19,943

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and

arag๐Ÿ“v0.1.0๐ŸŒฟ Growingโญ247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

evals๐Ÿ“v0.1.15๐ŸŒฟ Growingโญ103

A comprehensive evaluation framework for AI agents and LLM applications.

langwatch๐Ÿ“skills@v0.3.0๐ŸŒฟ Growingโญ3,193

The platform for LLM evaluations and AI agent testing

AI-Infra-Guard๐Ÿ“v4.1.4๐ŸŒฟ Growingโญ3,428

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

OpenClawProBench๐Ÿ“main@2026-04-15๐ŸŒฟ Growingโญ340

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

oh-my-pi๐Ÿ“v14.1.2๐ŸŒฟ Growingโญ2,872

โŒฅ AI Coding agent for the terminal โ€” hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

magenta.nvim๐Ÿ“main@2026-04-21๐ŸŒฟ Growingโญ435

A tool-use-focused LLM plugin for neovim.

medusa๐Ÿ“v2026.5.5๐ŸŒฟ Growingโญ252

AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo

Matryoshka๐Ÿ“main@2026-04-18๐ŸŒฟ Growingโญ119

MCP server for token-efficient large document analysis via the use of REPL state

giskard-oss๐Ÿ“giskard-checks/v1.0.2b1๐ŸŒฑ Seedlingโญ5,225

๐Ÿข Open-Source Evaluation & Testing library for LLM Agents

trulens๐Ÿ“trulens-2.7.2๐ŸŒฑ Seedlingโญ3,237

Evaluation and Tracking for LLM Experiments and AI Agents

paiml-mcp-agent-toolkit๐Ÿ“v3.14.0๐ŸŒฟ Growingโญ148

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

mlflow๐Ÿ“v3.11.1๐ŸŒฑ Seedlingโญ25,285

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllin

octocode๐Ÿ“0.14.0๐ŸŒฟ Growingโญ319

Semantic code searcher and codebase utility

AutoRAG๐Ÿ“v0.3.22๐ŸŒฑ Seedlingโญ4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

ai-gateway๐Ÿ“v1.0.4๐ŸŒฟ Growingโญ59

One API for 25+ LLMs, OpenAI, Anthropic, Bedrock, Azure. Caching, guardrails & cost controls. Go-native LiteLLM & Kong AI Gateway alternative.

awesome-ai-research-writing๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ9

๐Ÿ“ Enhance your academic writing with tailored AI prompt templates and practical agent skills to boost efficiency and reduce repetitive tasks.

memora๐Ÿ“v0.2.27๐ŸŒฑ Seedlingโญ386

Give your AI agents persistent memory.

GuardianWAF๐Ÿ“v0.1.0๐ŸŒฑ Seedlingโญ18

Zero-dependency Web Application Firewall in Go. Single binary. Three deployment modes. Tokenizer-based detection.

codexlens-search๐Ÿ“v0.8.0๐ŸŒฑ Seedlingโญ44

Lightweight semantic code search engine โ€” 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

any-agent๐Ÿ“1.18.0๐ŸŒฑ Seedlingโญ1,141

A single interface to use and evaluate different agent frameworks

ragas๐Ÿ“v0.4.3๐ŸŒฑ Seedlingโญ13,329

Supercharge Your LLM Application Evaluations ๐Ÿš€

smart-coding-mcp๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ2

๐Ÿ” Enhance code search accuracy with Smart Coding MCP, an AI-driven server that uses intelligent embeddings for quick, relevant results.

ryvos๐Ÿ“v0.9.0๐ŸŒฑ Seedlingโญ2

Open-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.

CodeRAG๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.

harness๐Ÿ“master@2026-04-21๐ŸŒฑ Seedlingโญ1

Define and control AI agents in markdown with full prompt transparency, persistent memory, and integrated tools via the Claude Agent SDK.

LettuceDetect๐Ÿ“0.1.8๐Ÿ’ค Dormantโญ545

Lightweight hallucination detection framework for RAG applications

RagaAI-Catalyst๐Ÿ“v2.2.4๐Ÿ’ค Dormantโญ16,130

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced anal

Dota2AIFramework๐Ÿ“0.0.0โšฐ๏ธ Archivedโญ75

General Framework for Dota 2 AI Competitions