Tag: #benchmark
7 packages • ⭐ 5,678 total stars
Efficient Retrieval Augmentation and Generation Framework
Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.
Benchmark for vector databases.
Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.
Framework for benchmarking vector search engines.
OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.
Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.
