freshcrate
Skin:/
Home > #benchmark

Tag: #benchmark

10 packages â€ĸ ⭐ 5,875 total stars

fastRAGv3.1.2âš°ī¸ Archived⭐1,776

Efficient Retrieval Augmentation and Generation Framework

FastExpressionCompilerv5.4.1đŸŒŗ Mature⭐1,359

Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.

ISC-Benchv0.0.6đŸŒŗ Mature⭐799

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

OpenClawProBenchmain@2026-05-19đŸŒŋ Growing⭐453

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

vector-db-benchmarkmaster@2026-06-05đŸŒŋ Growing⭐356

Framework for benchmarking vector search engines

@gaia-agent/sdk0.1.26🌱 Seedling⭐15

Production-ready AI agent library using AI SDK v6 ToolLoopAgent for GAIA benchmarks with swappable providers

flywheel-memoryflywheel-memory-v2.12.12🌱 Seedling⭐7

MCP server giving AI a knowledge graph over Obsidian vaults. 13-layer scoring that learns. Local-first, zero cloud.

octobenchmain@2026-06-02🌱 Seedling⭐1

Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.