octobench
Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.
Description
Benchmark and compare LLM tool, configuration, and prompt setups using a shared case framework with automated scoring and telemetry.
README
Release History
| Version | Changes | Urgency | Date |
|---|---|---|---|
| main@2026-04-21 | Latest activity on main branch | High | 4/21/2026 |
| 0.0.0 | No release found — using repo HEAD | High | 4/9/2026 |
Dependencies & License Audit
Loading dependencies...
Similar Packages
claude-container🐳 Run Claude Code safely in isolated Docker containers with persistent projects and easy setup on macOS using Justfile automation.master@2026-04-21
hatch3rInstall an agentic coding setup that adds multiple AI agents, skills, and rules to enhance automation across GitHub, Azure DevOps, or GitLab repositories.main@2026-04-21
simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.v0.6.51
