freshcrate
Skin:/
Home > Frameworks > inspect-ai

inspect-ai

Framework for large language model evaluations

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

[<img width="295" src="https://inspect.aisi.org.uk/images/aisi-logo.svg" />](https://aisi.gov.uk/) Welcome to Inspect, a framework for large language model evaluations created by the [UK AI Security Institute](https://aisi.gov.uk/). Inspect provides many built-in components, including facilities for prompt engineering, tool usage, multi-turn dialog, and model graded evaluations. Extensions to Inspect (e.g. to support new elicitation and scoring techniques) can be provided by other Python packages. To get started with Inspect, please see the documentation at <https://inspect.aisi.org.uk/>. Inspect also includes a collection of over 200 pre-built evaluations ready to run on any model (learn more at <https://inspect.aisi.org.uk/evals/>). *** To work on development of Inspect, clone the repository and install with the `-e` flag and `[dev]` optional dependencies: ```bash git clone https://github.com/UKGovernmentBEIS/inspect_ai.git cd inspect_ai pip install -e ".[dev]" ``` Optionally install pre-commit hooks via ```bash make hooks ``` Run linting, formatting, and tests via ```bash make check make test ``` If you use VS Code, you should be sure to have installed the recommended extensions (Python, Ruff, and MyPy). Note that you'll be prompted to install these when you open the project in VS Code. ### Frontend development (TypeScript) The web UI lives in a git submodule at `src/inspect_ai/_view/ts-mono/`. **These steps are only needed if you plan to work on the TypeScript/React frontend** — Python-only contributors can skip this entirely. Initialize the submodule and install dependencies — see the [one-time setup guide](src/inspect_ai/_view/ts-mono/docs/submodule-guide.md#one-time-setup). ### Documentation To work on the Inspect documentation, install the optional `[doc]` dependencies with the `-e` flag and build the docs: ``` pip install -e ".[doc]" cd docs quarto render # or 'quarto preview' ``` If you intend to work on the docs iteratively, you'll want to install the Quarto extension in VS Code.

Release History

VersionChangesUrgencyDate
main@2026-06-05Latest activity on main branchHigh6/5/2026
0.3.209Imported from PyPI (0.3.209)Low4/21/2026
0.3.208Tag 0.3.208High4/19/2026
0.3.207Tag 0.3.207High4/16/2026
0.3.206Tag 0.3.206High4/15/2026
0.3.205Tag 0.3.205Medium4/4/2026
0.3.204Tag 0.3.204Medium4/2/2026
0.3.203Tag 0.3.203Medium4/1/2026
0.3.202Tag 0.3.202Medium3/31/2026
0.3.201Tag 0.3.201Medium3/25/2026
0.3.200Tag 0.3.200Low3/20/2026
0.3.199Tag 0.3.199Low3/17/2026
0.3.197Tag 0.3.197Low3/17/2026
0.3.196Tag 0.3.196Low3/16/2026
0.3.195Tag 0.3.195Low3/15/2026
0.3.193Tag 0.3.193Low3/13/2026
release/2025-11-28Tag release/2025-11-28Low11/28/2025
inspect-tool-support-1.2.0Tag inspect-tool-support-1.2.0Low8/11/2025
inspect-tool-support-1.1.2Tag inspect-tool-support-1.1.2Low7/9/2025
inspect-tool-support-1.1.0Tag inspect-tool-support-1.1.0Low5/2/2025
STASH_ANYIO_PASS1Tag STASH_ANYIO_PASS1Low4/10/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

tqdmFast, Extensible Progress Meterv4.68.1
hypothesisThe property-based testing library for Pythonv6.155.2
bleachAn easy safelist-based HTML-sanitizing tool.main@2026-06-05
jupyter-clientJupyter protocol implementation and client librariesv8.9.0
flask-appbuilderSimple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more.v5.2.1

More in Frameworks

langchainThe agent engineering platform
deer-flowAn open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
tqdmFast, Extensible Progress Meter
simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.