freshcrate — #benchmark

Tag: #benchmark

10 packages • ⭐ 5,875 total stars

fastRAG v3.1.2 • ⚰️ Archived • ⭐ 1,776

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal python • by IntelLabs

FastExpressionCompiler v5.4.1 • 🌳 Mature • ⭐ 1,359

Fast Compiler for C# Expression Trees and the lightweight LightExpression alternative. Diagnostic and code generation tools for the expressions.

benchmark c# closure code-generation compiler delegate delegates dryioc expression-tree • by dadhi

VectorDBBench v1.0.22 • 🌳 Mature • ⭐ 1,078

Benchmark for vector databases.

benchmark cost-effectiveness performance python vector-database vector-search vectordb • by zilliztech

ISC-Bench main@2026-07-13 • 🌳 Mature • ⭐ 799

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

adversarial-attacks agent-safety ai-safety benchmark frontier-models jailbreak large-language-models llm-safety python • by wuyoscar

OpenClawProBench main@2026-06-28 • 🌿 Growing • ⭐ 453

OpenClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime with deterministic grading and repeated-trial reliability.

agent benchmark evaluation harness leaderboard llm openclaw python • by suyoumo

vector-db-benchmark master@2026-07-16 • 🌿 Growing • ⭐ 356

Framework for benchmarking vector search engines

benchmark python vector-database vector-search vector-search-engine • by qdrant

little-coder v1.11.0 • 🌱 Seedling • ⭐ 31

A coding agent optimized for smaller LLMs

ai-coding-assistant aider-polygot benchmark code-generation coding-agent coding-agents local-llm ollama python • by itayinbarr

@gaia-agent/sdk 0.1.26 • 🌱 Seedling • ⭐ 15

Production-ready AI agent library using AI SDK v6 ToolLoopAgent for GAIA benchmarks with swappable providers

agent ai ai-sdk autonomous benchmark gaia llm npm tools

flywheel-memory flywheel-memory-v2.12.19 • 🌱 Seedling • ⭐ 7

MCP server giving AI a knowledge graph over Obsidian vaults. 13-layer scoring that learns. Local-first, zero cloud.

ai-tools backlinks benchmark claude claude-desktop hotpotqa knowledge-graph local-first model-context-protocol typescript • by velvetmonkey

octobench main@2026-07-19 • 🌱 Seedling • ⭐ 1

Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.

agentic agents ai ai-workflow anthropic automation benchmark codex • by xInfer123
