freshcrate
Home > Testing > Observal

Observal

Observal is an AI agent registry with first in class observabilty and eval framework

Description

Observal is an AI agent registry with first in class observabilty and eval framework

README

Observal

Discover, share, and monitor AI coding agents with full observability built in.

LicensePythonStatusStarsIf you find Observal useful, please consider giving it a star. It helps others discover the project and keeps development going.


Observal is a self-hosted AI agent registry with built-in observability. Think Docker Hub, but for AI coding agents.

Browse agents created by others, publish your own, and pull complete agent configurations — all defined in a portable YAML format that templates out to Claude Code, Kiro CLI, Cursor, Gemini CLI, and more. Every agent bundles its MCP servers, skills, hooks, prompts, and sandboxes into a single installable package. One command to install, zero manual config.

Every interaction generates traces, spans, and sessions that flow into a telemetry pipeline. The built-in eval engine scores agent sessions so you can measure performance and make your agents better over time.

Full docs live at observal.gitbook.io (sourced from /docs in this repo).

Start here Go to
5-minute install and first trace Quickstart
Understand the data model Core Concepts
Instrument your existing MCP servers Observe MCP traffic
Run Observal on your infrastructure Self-Hosting
Look up a CLI command CLI Reference

See CHANGELOG.md for recent updates.

Quick start

git clone https://github.com/BlazeUp-AI/Observal.git
cd Observal
cp .env.example .env

docker compose -f docker/docker-compose.yml up --build -d
uv tool install --editable .
observal auth login            # auto-creates admin on fresh server

Eight services start (API, web UI, Postgres, ClickHouse, Redis, worker, OTEL collector, Grafana). Full walkthrough in Quickstart; operator guide in Self-Hosting → Docker Compose setup.

Already have MCP servers in your IDE? Instrument them in one command:

observal scan                       # auto-detect, register, and instrument everything
observal pull <agent> --ide cursor  # install a complete agent

This detects MCP servers from your IDE config files and wraps them with observal-shim for telemetry without breaking your existing setup. A timestamped backup is created automatically. Everything happens locally — nothing is uploaded to the server.

Supported IDEs

IDE Support
Claude Code Full — skills, hooks, MCP, rules, OTLP telemetry
Kiro CLI Full — superpowers, hooks, MCP, steering files, OTLP telemetry
Gemini CLI Native OTEL + shim telemetry
Codex CLI Native OTEL + shim telemetry
GitHub Copilot Shim telemetry
OpenCode Shim telemetry
Cursor MCP + shim telemetry

Compatibility matrix and per-IDE setup: Integrations.

Tech stack

Component Technology
Frontend Next.js 16, React 19, Tailwind CSS 4, shadcn/ui, Recharts
Backend Python 3.11+, FastAPI, Strawberry GraphQL, Uvicorn
Databases PostgreSQL 16 (registry), ClickHouse (telemetry)
Queue Redis + arq
CLI Python, Typer, Rich
Eval engine AWS Bedrock / OpenAI-compatible LLMs
Telemetry OpenTelemetry Collector
Deployment Docker Compose (8 services)

Contributing

See CONTRIBUTING.md. The short version:

  1. Fork and clone
  2. make hooks to install pre-commit hooks
  3. Create a feature branch
  4. Run make lint and make test
  5. Open a PR

See AGENTS.md for internal codebase context.

Running tests

make test      # quick
make test-v    # verbose

All tests mock external services. No Docker needed.

Community

Have a question, idea, or want to share what you've built? Head to GitHub Discussions. Please use Discussions for questions; open Issues for confirmed bugs and concrete feature requests.

Join the Observal Discord to chat directly with the maintainers and other community members.

Security

To report a vulnerability, please use GitHub Private Vulnerability Reporting or email contact@blazeup.app. Do not open a public issue. See SECURITY.md.

License

Apache License 2.0. See LICENSE.

Star history

Star History Chart

Release History

VersionChangesUrgencyDate
v0.2.0## What's Changed * docs + refactor: READMEs, eval subpackage, repo cleanup by @Haz3-jolt in https://github.com/BlazeUp-AI/Observal/pull/322 * feat: live session updates via GraphQL subscriptions by @Haz3-jolt in https://github.com/BlazeUp-AI/Observal/pull/323 * fix(web): protect registry routes with auth guard by @Kaushik-Kumar-CEG in https://github.com/BlazeUp-AI/Observal/pull/324 * feat: unify telemetry — merge hooks, shims, and OTLP by @Haz3-jolt in https://github.com/BlazeUp-AI/ObservalHigh4/21/2026
v0.0.1Tag v0.0.1High4/15/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

llm-wikiLLM-powered knowledge base from your Claude Code, Codex CLI, Copilot, Cursor & Gemini sessions. Karpathy's LLM Wiki pattern — implemented and shipped.v1.1.0-rc8
agentic-configProject-agnostic, composable AI workflow automation via pi packages and Claude Code plugins.v0.3.0-alpha
cc-sddTurn approved specs into long-running autonomous implementation. A minimal, adaptable SDD harness with Agent Skills for Claude Code, Codex, Cursor, Copilot, Windsurf, OpenCode, Gemini CLI, and Antigrav3.0.2
mlflowThe open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controllints/v0.2.0-rc.1
JobForgeAI-powered job search automation — evaluate offers, generate ATS-optimized CVs, scan portals, negotiate, and track applications. One config drives OpenCode, Cursor, Claude Code, and Codex. Free end-tov2.14.4