ouroboros

Home > MCP Servers > ouroboros

Stop prompting. Start specifying.

ai-agent claude-code codex-cli devtools evaluation llm mcp multi-agent prompt-engineering python

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

Stop prompting. Start specifying.

README

English | 한국어

◯ ─────────── ◯

O U R O B O R O S

◯ ─────────── ◯

Stop prompting. Start specifying.
_{Specification-first workflow engine for AI coding agents}

Quick Start · Why · Results · How It Works · Commands · Philosophy

Turn a vague idea into a verified, working codebase -- with any AI coding agent.

Ouroboros sits between you and your AI runtime (Claude Code, Codex CLI, Hermes, or others). It replaces ad-hoc prompting with a structured specification-first workflow: interview, crystallize, execute, evaluate, evolve.

Why Ouroboros?

Most AI coding fails at the input, not the output. The bottleneck is not AI capability -- it is human clarity.

Problem	What Happens	Ouroboros Fix
Vague prompts	AI guesses, you rework	Socratic interview exposes hidden assumptions
No spec	Architecture drifts mid-build	Immutable seed spec locks intent before code
Manual QA	"Looks good" is not verification	3-stage automated evaluation gate

Quick Start

Install — one command, everything auto-detected:

curl -fsSL https://raw.githubusercontent.com/Q00/ouroboros/main/scripts/install.sh | bash

Build — open your AI coding agent and go:

> ooo interview "I want to build a task management CLI"

Works with Claude Code, Codex CLI, Hermes, and OpenCode. The installer detects Claude Code, Codex CLI, and Hermes CLI automatically and registers the MCP server. For OpenCode, run ouroboros setup --runtime opencode after installation.

Other install methods

Claude Code plugin only (no system package):

claude plugin marketplace add Q00/ouroboros && claude plugin install ouroboros@ouroboros

Then run ooo setup inside a Claude Code session.

pip / uv / pipx:

pip install ouroboros-ai                # base
pip install ouroboros-ai[claude]        # + Claude Code deps
pip install ouroboros-ai[litellm]       # + LiteLLM multi-provider
pip install ouroboros-ai[mcp]           # + MCP server/client support
pip install ouroboros-ai[tui]           # + Textual terminal UI
pip install ouroboros-ai[all]           # everything (claude + litellm + mcp + tui + dashboard)
ouroboros setup                         # configure runtime

Legacy compatibility: ouroboros-ai[dashboard] is still accepted as a compatibility alias while extras migrate.

See runtime guides: Claude Code · Codex CLI · Hermes · OpenCode

Uninstall

ouroboros uninstall

Removes all configuration, MCP registration, and data. See UNINSTALL.md for details.

Python >= 3.12 required. See pyproject.toml for the full dependency list.

What You Get

After one loop of the Ouroboros cycle, a vague idea becomes a verified codebase:

Step	Before	After
Interview	"Build me a task CLI"	12 hidden assumptions exposed, ambiguity scored to 0.19
Seed	No spec	Immutable specification with acceptance criteria, ontology, constraints
Evaluate	Manual review	3-stage gate: Mechanical (free) -> Semantic -> Multi-Model Consensus

What just happened?

interview  ->  Socratic questioning exposed 12 hidden assumptions
seed       ->  Crystallized answers into an immutable spec (Ambiguity: 0.15)
run        ->  Executed via Double Diamond decomposition
evaluate   ->  3-stage verification: Mechanical -> Semantic -> Consensus

Use ooo <cmd> inside your AI coding agent session, or ouroboros init start, ouroboros run seed.yaml, etc. from the terminal.

The serpent completed one loop. Each loop, it knows more than the last.

How It Compares

AI coding tools are powerful -- but they solve the wrong problem when the input is unclear.

	Vanilla AI Coding	Ouroboros
Vague prompt	AI guesses intent, builds on assumptions	Socratic interview forces clarity before code
Spec validation	No spec -- architecture drifts mid-build	Immutable seed spec locks intent; Ambiguity gate (<= 0.2) blocks premature code
Evaluation	"Looks good" / manual QA	3-stage automated gate: Mechanical -> Semantic -> Multi-Model Consensus
Rework rate	High -- wrong assumptions surface late	Low -- assumptions surface in the interview, not in the PR review

The Loop

The ouroboros -- a serpent devouring its own tail -- is not decoration. It IS the architecture:

    Interview -> Seed -> Execute -> Evaluate
        ^                           |
        +---- Evolutionary Loop ----+

Each cycle does not repeat -- it evolves. The output of evaluation feeds back as input for the next generation, until the system truly knows what it is building.

Phase	What Happens
Interview	Socratic questioning exposes hidden assumptions
Seed	Answers crystallize into an immutable specification
Execute	Double Diamond: Discover -> Define -> Design -> Deliver
Evaluate	3-stage gate: Mechanical ($0) -> Semantic -> Multi-Model Consensus
Evolve	Wonder ("What do we still not know?") -> Reflect -> next generation

"This is where the Ouroboros eats its tail: the output of evaluation becomes the input for the next generation's seed specification." -- reflect.py

Convergence is reached when ontology similarity >= 0.95 -- when the system has questioned itself into clarity.

Ralph: The Loop That Never Stops

ooo ralph runs the evolutionary loop persistently -- across session boundaries -- until convergence is reached. Each step is stateless: the EventStore reconstructs the full lineage, so even if your machine restarts, the serpent picks up where it left off.

Ralph Cycle 1: evolve_step(lineage, seed) -> Gen 1 -> action=CONTINUE
Ralph Cycle 2: evolve_step(lineage)       -> Gen 2 -> action=CONTINUE
Ralph Cycle 3: evolve_step(lineage)       -> Gen 3 -> action=CONVERGED
                                                +-- Ralph stops.
                                                    The ontology has stabilized.

Commands

Inside AI coding agent sessions, use ooo <cmd> skills. From the terminal, use the ouroboros CLI.

Skill (`ooo`)	CLI equivalent	What It Does
`ooo setup`	`ouroboros setup`	Register runtime and configure project (one-time)
`ooo interview`	`ouroboros init start`	Socratic questioning -- expose hidden assumptions
`ooo seed`	(generated by interview)	Crystallize into immutable spec
`ooo run`	`ouroboros run seed.yaml`	Execute via Double Diamond decomposition
`ooo evaluate`	(via MCP)	3-stage verification gate
`ooo evolve`	(via MCP)	Evolutionary loop until ontology converges
`ooo unstuck`	(via MCP)	5 lateral thinking personas when you are stuck
`ooo status`	`ouroboros status executions` / `ouroboros status execution <id>`	Session tracking + (MCP-only) drift detection
`ooo cancel`	`ouroboros cancel execution [<id>\|--all]`	Cancel stuck or orphaned executions
`ooo ralph`	(via MCP)	Persistent loop until verified
`ooo tutorial`	(interactive)	Interactive hands-on learning
`ooo help`	`ouroboros --help`	Full reference
`ooo pm`	(via MCP)	PM-focused interview + PRD generation
`ooo qa`	(via skill)	General-purpose QA verdict for any artifact
`ooo update`	`ouroboros update`	Check for updates + upgrade to latest
`ooo brownfield`	(via skill)	Scan and manage brownfield repo defaults
`ooo publish`	(skill/runtime surface; uses `gh` CLI)	Publish a Seed as GitHub Epic/Task issues for team workflows

Not all skills have direct CLI equivalents. Some (evaluate, evolve, unstuck, ralph, publish) are available through agent skills, runtime rules, or MCP tools rather than a direct ouroboros <subcommand> shell command.

See the CLI reference for full details.

The Nine Minds

Nine agents, each a different mode of thinking. Loaded on-demand, never preloaded:

Agent	Role	Core Question
Socratic Interviewer	Questions-only. Never builds.	"What are you assuming?"
Ontologist	Finds essence, not symptoms	"What IS this, really?"
Seed Architect	Crystallizes specs from dialogue	"Is this complete and unambiguous?"
Evaluator	3-stage verification	"Did we build the right thing?"
Contrarian	Challenges every assumption	"What if the opposite were true?"
Hacker	Finds unconventional paths	"What constraints are actually real?"
Simplifier	Removes complexity	"What's the simplest thing that could work?"
Researcher	Stops coding, starts investigating	"What evidence do we actually have?"
Architect	Identifies structural causes	"If we started over, would we build it this way?"

Under the Hood

Architecture overview -- Python >= 3.12

src/ouroboros/
+-- bigbang/        Interview, ambiguity scoring, brownfield explorer
+-- routing/        PAL Router -- 3-tier cost optimization (1x / 10x / 30x)
+-- execution/      Double Diamond, hierarchical AC decomposition
+-- evaluation/     Mechanical -> Semantic -> Multi-Model Consensus
+-- evolution/      Wonder / Reflect cycle, convergence detection
+-- resilience/     4-pattern stagnation detection, 5 lateral personas
+-- observability/  3-component drift measurement, auto-retrospective
+-- persistence/    Event sourcing (SQLAlchemy + aiosqlite), checkpoints
+-- orchestrator/   Runtime abstraction layer (Claude Code, Codex CLI)
+-- core/           Types, errors, seed, ontology, security
+-- providers/      LiteLLM adapter (100+ models)
+-- mcp/            MCP client/server integration
+-- plugin/         Plugin system (skill/agent auto-discovery)
+-- tui/            Terminal UI dashboard
+-- cli/            Typer-based CLI

Key internals:

PAL Router -- Frugal (1x) -> Standard (10x) -> Frontier (30x) with auto-escalation on failure, auto-downgrade on success
Drift -- Goal (50%) + Constraint (30%) + Ontology (20%) weighted measurement, threshold <= 0.3
Brownfield -- Auto-detects config files across multiple language ecosystems
Evolution -- Up to 30 generations, convergence at ontology similarity >= 0.95
Stagnation -- Detects spinning, oscillation, no-drift, and diminishing returns patterns
Runtime backends -- Pluggable abstraction layer (orchestrator.runtime_backend config) with first-class support for Claude Code, Codex CLI, and Hermes; same workflow spec, different execution engines

See Architecture for the full design document.

From Wonder to Ontology

The philosophical engine behind Ouroboros

Wonder -> "How should I live?" -> "What IS 'live'?" -> Ontology -- Socrates

Every great question leads to a deeper question -- and that deeper question is always ontological: not "how do I do this?" but "what IS this, really?"

   Wonder                          Ontology
"What do I want?"    ->    "What IS the thing I want?"
"Build a task CLI"   ->    "What IS a task? What IS priority?"
"Fix the auth bug"   ->    "Is this the root cause, or a symptom?"

This is not abstraction for its own sake. When you answer "What IS a task?" -- deletable or archivable? solo or team? -- you eliminate an entire class of rework. The ontological question is the most practical question.

Ouroboros embeds this into its architecture through the Double Diamond:

    * Wonder          * Design
   /  (diverge)      /  (diverge)
  /    explore      /    create
 /                 /
* ------------ * ------------ *
 \                 \
  \    define       \    deliver
   \  (converge)     \  (converge)
    * Ontology        * Evaluation

The first diamond is Socratic: diverge into questions, converge into ontological clarity. The second diamond is pragmatic: diverge into design options, converge into verified delivery. Each diamond requires the one before it -- you cannot design what you have not understood.

Ambiguity Score: The Gate Between Wonder and Code

The Interview does not end when you feel ready -- it ends when the math says you are ready. Ouroboros quantifies ambiguity as the inverse of weighted clarity:

Ambiguity = 1 - Sum(clarity_i * weight_i)

Each dimension is scored 0.0-1.0 by the LLM (temperature 0.1 for reproducibility), then weighted:

Dimension	Greenfield	Brownfield
Goal Clarity -- Is the goal specific?	40%	35%
Constraint Clarity -- Are limitations defined?	30%	25%
Success Criteria -- Are outcomes measurable?	30%	25%
Context Clarity -- Is the existing codebase understood?	--	15%

Threshold: Ambiguity <= 0.2 -- only then can a Seed be generated.

Example (Greenfield):

  Goal: 0.9 * 0.4  = 0.36
  Constraint: 0.8 * 0.3  = 0.24
  Success: 0.7 * 0.3  = 0.21
                        ------
  Clarity             = 0.81
  Ambiguity = 1 - 0.81 = 0.19  <= 0.2 -> Ready for Seed

Why 0.2? Because at 80% weighted clarity, the remaining unknowns are small enough that code-level decisions can resolve them. Above that threshold, you are still guessing at architecture.

Ontology Convergence: When the Serpent Stops

The evolutionary loop does not run forever. It stops when consecutive generations produce ontologically identical schemas. Similarity is measured as a weighted comparison of schema fields:

Similarity = 0.5 * name_overlap + 0.3 * type_match + 0.2 * exact_match

Component	Weight	What It Measures
Name overlap	50%	Do the same field names exist in both generations?
Type match	30%	Do shared fields have the same types?
Exact match	20%	Are name, type, AND description all identical?

Threshold: Similarity >= 0.95 -- the loop converges and stops evolving.

But raw similarity is not the only signal. The system also detects pathological patterns:

Signal	Condition	What It Means
Stagnation	Similarity >= 0.95 for 3 consecutive generations	Ontology has stabilized
Oscillation	Gen N ~ Gen N-2 (period-2 cycle)	Stuck bouncing between two designs
Repetitive feedback	>= 70% question overlap across 3 generations	Wonder is asking the same things
Hard cap	30 generations reached	Safety valve

Gen 1: {Task, Priority, Status}
Gen 2: {Task, Priority, Status, DueDate}     -> similarity 0.78 -> CONTINUE
Gen 3: {Task, Priority, Status, DueDate}     -> similarity 1.00 -> CONVERGED

Two mathematical gates, one philosophy: do not build until you are clear (Ambiguity <= 0.2), do not stop evolving until you are stable (Similarity >= 0.95).

Contributing

git clone https://github.com/Q00/ouroboros
cd ouroboros
uv sync --all-groups && uv run pytest

Issues · Discussions · Contributing Guide

Star History

"The beginning is the end, and the end is the beginning."

The serpent does not repeat -- it evolves.

MIT License

Release History

Version	Changes	Urgency	Date
v0.50.5	Patch release: the zcode runtime backend arrives, PM interviews get sharper — brownfield snapshot exploration and delivered-behavior success criteria — and orchestration goes fail-closed. ## Highlights ### New Runtime: zcode - zcode runtime backend (#1568) — Ouroboros now drives zcode as a first-class runtime backend. - zcode LLM adapter for interview/run (#1595) — interviews and seed execution can route through zcode end-to-end. ### PM: Brownfield Snapshots & Delivered-Behavior Crite	High	7/19/2026
v0.50.3	## Frugality-first execution: model-tier routing The headline: `ooo run` now routes work down the model ladder instead of running everything on one model. Decomposed child ACs execute on the frugal tier (Claude Haiku / Codex mini-class), top-level ACs stay on standard (Sonnet), and a failing AC escalates one tier per retry — so you get real work out of Ouroboros without a Max-tier plan. RLM-style: split the goal well, and each piece becomes cheap enough for a small model to stay gro	High	7/11/2026
v0.44.0	## v0.44.0 — Sturdier plugin dispatch & MCP tool discovery This release hardens two seams in the plugin path: dispatch exits are now preserved end-to-end (no more swallowed exit codes or lost Seed extras), and the tool-discovery convention is delivered through MCP server instructions so capabilities surface reliably to the host. ### Bug Fixes - Preserve dispatch exits and Seed extras (#1533) — plugin dispatch keeps exit signals and round-trips Seed extras intact. - **Deliver tool-discove	High	7/1/2026
v0.42.5	> v0.42.5 — a focused release adding intent provenance tracking to auto-mode interviews. ## Highlights ### 🎯 Intent provenance guard for interviews (#1493) Auto mode now tracks the origin of every interview answer and decision, distinguishing user-grounded intent from model-inferred content during convergence. This keeps the interview anchored to what the user actually said — reducing drift where the refiner could previously let generated phrasing leak into committed contracts. - New	High	6/20/2026
v0.42.0	![Ouroboros × gajae-code — Together, we build the future. Intention in. Software out.](https://github.com/Q00/ouroboros/releases/download/v0.42.0/gjc-release-banner.jpeg) # v0.42.0 — A new kernel, frugal-or-frontier, and Claude without the SDK > v0.40 closed the loop, v0.41 let it run anywhere and trust what it ships. v0.42 > adds another runtime kernel, lets you choose how hard it thinks — frugal or > frontier, per stage — and runs Claude with no SDK at all. Much of this release's > frugali	High	6/15/2026
v0.41.0	# v0.41.0 — Run it anywhere, and trust what it ships > A week ago, `ooo auto` learned to finish the job on its own. This release makes > that autonomy something you can actually rely on: it runs on one more runtime, > it refuses to start building until the goal is unambiguous, and the verdict that > decides "is this actually done?" can no longer be gamed. ## The headline Autonomy is only worth as much as the trust behind it. v0.40.0 closed the loop — goal in, product out. v0.41.0 spends	High	6/7/2026
v0.40.1	## What's Changed ### Bug Fixes - Include `click` as an installer runtime dependency (#1299) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.40.0...v0.40.1 ## What's Changed * fix(installer): include click runtime dependency by @Q00 in https://github.com/Q00/ouroboros/pull/1299 Full Changelog: https://github.com/Q00/ouroboros/compare/v0.40.0...v0.40.1	High	5/30/2026
v0.40.0	# v0.40.0 — `ooo auto` crosses the line > This is the release where `ooo auto` stops being a demo and becomes a machine that finishes your work. You drop in a vague intention. The Socratic interview pins it down into a precise, machine-checkable goal — and then the engine refuses to stop until that goal is actually built, verified, and shipped. No babysitting. No "it drew up a plan and gave up halfway." The loop owns the outcome, end to end. This is not "generate a plan." This is **go	High	5/30/2026
v0.39.1	## What's Changed ### Features - Add `ouroboros status run --json` projection surface (#1133) - Record durable workflow lifecycle events in orchestrator (#1134) - Add `on_error`/`on_cancel` plugin observability hooks (PR E) (#1137) - Expose MCP interview reasoning metadata (#1140) - Prompt for required trust grants on plugin install (#1141) - Expose Ralph-start alias while preserving runtime ownership - Dispatch lifecycle hooks within plugin trust boundaries - Make plugin permission waits share	High	5/20/2026
v0.39.0	## Ouroboros v0.39.0 This release lands a high-severity security fix, flips `ooo run` to the fat-harness execution path by default, and completes the AgentOS roadmap wiring/baseline milestone tracked in [#961](https://github.com/Q00/ouroboros/issues/961). ## 🔒 Security ### RCE via untrusted project-directory `.env` (high severity) Ouroboros is run inside cloned repositories. `config/loader.py` loaded `./.env` from the working directory into `os.environ` at import time with the **same	High	5/18/2026
v0.38.2	## What's Changed ### Bug Fixes - Close residual `allowed_tools=[]` leak in sub-CLI envelope for interview ### Testing - Lock empty `allowedTools` passthrough - Cover strict empty allowed-tools envelope (#975) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.38.1...v0.38.2 ## What's Changed * fix(interview): close residual allowed_tools=[] leak in sub-CLI envelope by @Q00 in https://github.com/Q00/ouroboros/pull/974 Full Changelog: https://github.com/Q00/ouroboros/compa	High	5/13/2026
v0.37.0	## What's Changed ### Features #### `ooo auto` Pipeline - `DomainProfile` and `VerifiablePredicate` contracts (#849, #809 P3 PR 1/6) - `UNSTUCK_LATERAL` persona advisor on EVALUATE fail (#829) - EVALUATE phase verifies run output against seed AC (#825) - Formalize run-handoff idempotency contract (#843) - Chain RUN→RALPH automatically with `--complete-product` (#791) - `user_preference` source + deterministic ambiguity floor (#811) - Top-level `pipeline_timeout_seconds` deadline (#790) - Steer	High	5/11/2026
v0.34.0	## What's Changed in v0.34.0 ### Features - Copilot CLI runtime (#1f07a9c8) — first-class Copilot agent runtime - Kiro runtime backend (#2fc7a3c8) — Kiro as a first-class runtime - Stage enum + runtime_profile.stages config (#538) — slice 1 of #519 - Agent OS runtime_profile (Codex backend) (#505) — runtime profile foundation - AgentProcess lifecycle projection (#628) — project lifecycle state from events - Directive emission at StepAction sites (#477464ef) — slice 1 of	High	5/6/2026
v0.31.1	## What's Changed * fix(parallel): honor configured worker caps by @andrew-adamson in https://github.com/Q00/ouroboros/pull/489 * feat(orchestrator): Gemini CLI runtime backend (follow-up to #312) by @shaun0927 in https://github.com/Q00/ouroboros/pull/504 * fix(router): preserve Windows literal path payloads through skill dispatch by @shaun0927 in https://github.com/Q00/ouroboros/pull/481 * fix(brownfield): scan repos with any origin remote by @andrew-adamson in https://github.com/Q00/ouroboros/	High	4/29/2026
v0.30.0	## What's Changed ### Features - CLI: add `ooo resume` to recover in-flight sessions after MCP disconnect (#433) - MCP: add streamable HTTP serve transport - MCP: add compact monitoring relay views - MCP: suggest lateral persona from stagnation - Resilience: add in-run lateral recovery - Events: add `control.directive.emitted` event factory - Core: introduce `Directive` StrEnum for control-plane decisions ### Bug Fixes MCP - clarify job wait execution progress	High	4/26/2026
v0.29.2	## What's Changed ### Bug Fixes - fix(mcp): protect startup stdin during shell env loading (#482) - fix(interview): guard seed-ready closure (#484) - fix(evaluate): replace language presets with AI-driven mechanical.toml (#454) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.29.1...v0.29.2 ## What's Changed * fix(evaluate): replace language presets with AI-driven mechanical.toml by @Q00 in https://github.com/Q00/ouroboros/pull/454 * fix(interview): guard seed-ready closure by	High	4/22/2026
v0.29.1	## What's Changed ### Bug Fixes - mcp: prevent `start_execute_seed` UUID scope crash (#470) - mcp: avoid replaying failed tool calls - mcp: avoid retrying timed-out tool calls - mcp: reconnect stale client transports (#467) - runtime: drop `initial_context` on interview resume so InterviewHandler continues the session - router: `extract_first_argument` returns full argument payload instead of just the first token - tests: make OpenCode config tests platform-agnostic	High	4/21/2026
v0.29.0	## What's Changed This release introduces the Hermes Agent runtime, capability-aware orchestration (SandboxClass + policy audits), OpenCode bridge plugin runtime, and hardens the MCP transport URL validator. It also ships a large batch of setup/PM fail-closed fixes and tool-envelope enforcement across Gemini and OpenCode backends. ### Features - Hermes runtime: integrate Hermes Agent as a new execution runtime backend, conform to `AgentRuntime` resume contract, expose `llm_back	High	4/21/2026
v0.28.8	## What's Changed ### Features - feat(tui): improve event subscription, activity display, and tree sync (#440) - feat(cli): add `ouroboros mcp doctor` diagnostic command (#445) - feat(evaluate): add `ouroboros_checklist_verify` MCP tool (#366 part 2) ### Bug Fixes - fix(hud): merge subtask events into AC tree snapshot (#439) - fix(evaluate): honour 1-item `acceptance_criteria` in single-AC path - fix(mcp): block localhost hostname in SSRF validation - fix(mcp): add real SSRF hardening beyond s	High	4/18/2026
v0.28.7	## What's Changed ### Features - events: add `event_version` to `BaseEvent` payload (#435) - evaluate: wire checklist into `EvaluateHandler` for multi-AC support (#366) ### Bug Fixes - install: enforce Python >=3.12 for uv and pipx installers (#429) - providers: respect `OUROBOROS_SKIP_VERSION_CHECK` env var override - orchestrator: graceful fallback for legacy adapters without `llm_backend` - orchestrator: narrow exception scope in `_build_dependency_analyzer` - **orch	High	4/15/2026
v0.28.6	## What's Changed ### Features - add auto-confirm routing for high-confidence code answers (#357) by @shaun0927 in https://github.com/Q00/ouroboros/pull/382 - wire ambiguity milestones into prompt and MCP response (#363) by @shaun0927 in https://github.com/Q00/ouroboros/pull/380 - add milestone definitions for ambiguity score ranges (#363) by @shaun0927 in https://github.com/Q00/ouroboros/pull/379 - expose Socratic questions and evidence in Stage 2 output (#367) by @shaun0927 in	High	4/14/2026
v0.28.5	## What's Changed ### Bug Fixes - evaluator must not reuse max_turns=1 shared adapter (#368, fixes #305) - cap recursive decomposition depth (#369) - remove dead llm_adapter wire + file-based artifact evaluation (4f756c2) - satisfy lint and mypy for depth-cap PR (f17d511) ### Features - skip externally satisfied ACs (#370) - share Anthropic rate budget across workers (#371) ### Documentation - fix stale MAX_FILES reference in _scan_directory docstring (4a83c04) ##	High	4/13/2026
v0.28.4	## What's Changed ### Bug Fixes - fix: detect `opencode.jsonc` config alongside `opencode.json` (#356, #374) - fix(orchestrator): restore LLM-assisted top-level AC dependency planning (#373) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.28.3...v0.28.4 ## What's Changed * fix(orchestrator): restore LLM-assisted top-level AC dependency planning by @Q00 in https://github.com/Q00/ouroboros/pull/373 * fix: detect opencode.jsonc config alongside opencode.json (#356) by @S	Medium	4/12/2026
v0.28.3	## What's Changed ### Features - feat(mcp): add http/sse/streamable-http transport support (#339) - feat(orchestrator): wire checkpoint store + public API context (#338) ### Performance - perf(#344): reduce per-call overhead in Claude adapter (#348) - perf(#345): optimize ambiguity scorer token usage and enforce format error limit (#349) ### Bug Fixes - fix(cli): clarify TUI install guidance (#355) - fix(mcp): register ACTreeHUDHandler in composition root (#340) - **fi	High	4/10/2026
v0.28.2	## What's Changed ### Bug Fixes - Fix MCP server crash when gateway spawns server with `cwd=/` (#335 follow-up) ### Refactoring - Align PMSeed and pm.md fields for consistency (#314) ### Documentation - Add `[mcp]` extra to `uvx` MCP server setup commands in docs Full Changelog: https://github.com/Q00/ouroboros/compare/v0.28.1...v0.28.2 ## What's Changed * refactor(pm): align PMSeed and pm.md fields by @cohemm in https://github.com/Q00/ouroboros/pull/314 Full Changelog: https://g	High	4/7/2026
v0.28.1	## What's Changed ### Bug Fixes - fix(mcp): fix uvx MCP setup missing mcp dependency (#335) - fix(openclaw): harden gateway integration with env loading, repo fallback, and DB consolidation (#336) ### Documentation - docs: complete OpenCode runtime documentation across all guides (#333) (#334) - docs: add openclaw update note for mcp set command (#332) - docs: replace PM Mode banner with OpenClaw integration (#330) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.28.0...v0.28.1	Medium	4/7/2026
v0.28.0	## What's Changed ### Features - OpenCode runtime adapter for LLM completions (#324) - Reduce third-party runtime dependencies (#316) - OpenClaw channel workflow scaffold (#323) ### Bug Fixes - Prevent MCP startup timeout on large SQLite databases (#326) - Prevent SessionStart stdout from polluting Claude context (#322) - Guard against None optional params in measure_drift handler (#325) - Harden OpenCode stdin close and skill_path context manager (#329) ### Documentation - Clarify how users	Medium	4/6/2026
v0.27.2	## What's Changed ### Features - Recursive AC decomposition + `ac_tree_hud` MCP tool (#319) ### Bug Fixes - Improve evaluate error diagnostics for Claude SDK failures (#313) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.27.1...v0.27.2 ## What's Changed * feat: recursive AC decomposition + ac_tree_hud MCP tool by @Q00 in https://github.com/Q00/ouroboros/pull/319 * Improve evaluate error diagnostics for Claude SDK failures by @cohemm in https://github.com/Q00/ouroboros/pull/	Medium	4/6/2026
v0.27.1	## What's Changed ### Features - Add PATH 4 research interlude for mid-interview exploration (#294) - Gemini CLI adapter for LLM completions (#315) — by @kostakost2 ### Bug Fixes - Wire trigger_consensus to pipeline + expand project root detection (#317) - Resolve double-backgrounding in job tracking and differentiate polling options (#318) - Preserve CompletionResponse immutability during JSON normalization (#309) - Include tracebacks in QA/evaluate errors (#308) - Remove dead OUROBOROS_WEB_S	Medium	4/4/2026
v0.27.0	## What's Changed ### Features - MCP Bridge — Server-to-server MCP communication enabling external tools in `ooo run` seeds (#278, #279, #280, #281) - Auto-discovers `~/.ouroboros/mcp_servers.yaml` for upstream MCP server configuration - Injects `mcp_manager` through `ExecuteSeedHandler` → `OrchestratorRunner` pipeline - Wires bridge into evolution executor for `ooo evolve` with external tools - Transport lifecycle hardening with proper rollback and symmetric cleanup - **`ooo publis	Medium	4/1/2026
v0.26.6	## What's Changed ### Bug Fixes - TUI session completion: Emit `execution.terminal` event so TUI correctly sees session completion (#272) - Subprocess safety: Prevent subprocess leak, fork bomb, and env poisoning (#271) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.5...v0.26.6 ## What's Changed * fix(#269): prevent subprocess leak, fork bomb, and env poisoning by @Q00 in https://github.com/Q00/ouroboros/pull/271 * fix(tui): emit execution.terminal so TUI sees ses	Medium	3/30/2026
v0.26.5	## What's Changed ### Features - PM interview now returns DECIDE_LATER questions to user instead of auto-skipping (#238) ### Bug Fixes - Tolerate non-JSON QA verdict responses (#236) - Fix adapter timeout, tool policy, and factory max_turns in providers (#259) - Fix PM handoff into the runnable Seed workflow (#252) - Honor configured interview adapter backend in PM (#245) - Fix resume retries for Codex sessions (#249) - Align CLI completion scoring with MCP flow in PM (#256) - Use configured c	Medium	3/30/2026
v0.26.4	## What's Changed ### Bug Fixes - Show install guidance when `litellm` extra is missing (#246, fixes #239) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.3...v0.26.4 ## What's Changed * fix(#239): show install guidance when litellm extra is missing by @srausser in https://github.com/Q00/ouroboros/pull/246 Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.3...v0.26.4	Medium	3/29/2026
v0.26.3	## What's Changed ### Features - Add `ouroboros uninstall` and `ouroboros config backend` CLI commands (#220) ### Bug Fixes - Handle null `failed_attempts` in lateral think (#234) - Degrade gracefully on `claude_code` JSON output (#232) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.2...v0.26.3 ## What's Changed * feat(cli): add ouroboros uninstall and config backend commands by @Q00 in https://github.com/Q00/ouroboros/pull/220 * fix(#212): degrade gracefully on claude_c	Medium	3/27/2026
v0.26.2	## What's Changed * feat(evolution): graceful shutdown on SIGINT (#169) by @hackertaco in https://github.com/Q00/ouroboros/pull/172 * Improve GitHub issue intake and repository safety guidance by @hackertaco in https://github.com/Q00/ouroboros/pull/81 * fix(install): add [claude] extra to ouroboros-ai references (#217) by @hackertaco in https://github.com/Q00/ouroboros/pull/218 * fix(#214): prefer ./mvnw for Maven wrapper projects by @ouroboros-agent[bot] in https://github.com/Q00/ouroboros/pull	Medium	3/26/2026
v0.26.1	## What's Changed ### Bug Fixes - PM handler: Use LLM adapter factory instead of hardcoded ClaudeAgentAdapter - Install: Let user choose runtime instead of defaulting to Codex - Install: Add marketplace update before plugin install ### Maintenance - Align import paths with InterviewHandler convention - Fix import sorting in pm_handler - Point install.sh URL to main branch Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.0...v0.26.1 ## What's Changed * fix(pm):	Medium	3/25/2026
v0.26.0	## What's Changed in v0.26.0 ### 🚀 Features - PM Interview Engine & Brownfield Management — Socratic PM interview flow with brownfield codebase analysis (#191) by @cohemm - Codex CLI Runtime Support — Runtime-agnostic execution with Codex CLI adapter, parallel executor, and heartbeat monitoring - Interview Redesign — MCP question generator + main session router with breadth/closure personas (#178, #136) - Evaluation Enhancement — Reward hacking risk detection in Stage 2 semanti	Medium	3/25/2026
v0.26.0b7	Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.0b6...v0.26.0b7	Medium	3/24/2026
v0.26.0b6	## What's Changed * feat(version): automate version sync, install UX, and update flow by @Q00 in https://github.com/Q00/ouroboros/pull/194 Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.0b5...v0.26.0b6	Medium	3/24/2026
v0.26.0b5	> This is a beta release. Install with: `pip install ouroboros-ai==0.26.0b5` ## What's Changed ### Security - Remove litellm dependency — PyPI supply chain attack mitigation (merged from main v0.25.2) ### Features - PM interview engine + brownfield management (#191) - Beta version update automation ### Bug Fixes - Fix missing `Result` import in PM CLI commands - Fix ruff lint issues Full Changelog: https://github.com/Q00/ouroboros/compare/v0.26.0b4...v0.26.0b5 ## What's Change	Medium	3/24/2026
v0.25.2	## What's Changed ### Security - Remove litellm dependency — PyPI supply chain attack mitigation (#195) ### Build - hatch-vcs dev builds on every push (#183) - Strip local version identifiers for PyPI compatibility - Show dev version in GitHub Actions annotations Full Changelog: https://github.com/Q00/ouroboros/compare/v0.25.1...v0.25.2 ## What's Changed * feat(build): hatch-vcs dev builds on every push by @Q00 in https://github.com/Q00/ouroboros/pull/183 * fix(security): make litel	Medium	3/24/2026
v0.26.0b4	> This is a beta release. Install with: `pip install ouroboros-ai==0.26.0b4` ## What's Changed ### Features - Agent runtime abstraction with Codex CLI runtime support - Runtime-agnostic packaging with optional extras and standalone setup - Reward hacking risk detection in Stage 2 semantic evaluation (#174) - Runtime-agnostic orphan detection with WAL mode and retry - Interview breadth and closure personas (#136) - Packaged agents as source of truth (#136) - Codex artifact installation and	Medium	3/23/2026
v0.25.1	## What's Changed * fix(consensus): honest model names in Stage 3 fallback (#161) by @Q00 in https://github.com/Q00/ouroboros/pull/163 * fix(#168): ignore pending generations in convergence by @ouroboros-agent[bot] in https://github.com/Q00/ouroboros/pull/171 * fix(#176): inherit parent MCP tools in delegated runs by @Q00 in https://github.com/Q00/ouroboros/pull/180 ## New Contributors * @ouroboros-agent[bot] made their first contribution in https://github.com/Q00/ouroboros/pull/171 **Full Cha	Low	3/22/2026
v0.26.0b3	> This is a beta release. Install with: `pip install ouroboros-ai==0.26.0b3` ## What's Changed since v0.26.0b2 ### Features - Codex MCP auto-registration: `ouroboros setup --runtime codex` now registers the Ouroboros MCP server in `~/.codex/config.toml` with correct env vars (`OUROBOROS_AGENT_RUNTIME=codex`, `OUROBOROS_LLM_BACKEND=codex`) ### Bug Fixes - install.sh: Skip Python version check when `uv` or `pipx` is available (uv manages its own Python toolchain) - install.sh: P	Low	3/21/2026
v0.26.0b2	> This is a beta release. Install with: `pip install ouroboros-ai==0.26.0b2` ## What's Changed since v0.26.0b1 ### Features - Codex artifact auto-install: `ouroboros setup --runtime codex` now installs 15 skills and routing rules into `~/.codex/` automatically ### Bug Fixes - install.sh: Added `--upgrade` flag so existing installs get updated instead of silently skipping - install.sh: Fixed PEP 508 specifier ordering (`name[extras]==version`) and pinned to `0.26.0b1` - *docs	Low	3/21/2026
v0.25.0	## What's Changed ### Features - Standalone QA Skill — `ooo qa` now works as a fast single-pass quality verdict for any artifact, with or without MCP setup (#160) - Stall Detection & Supervisor Recovery — Orchestrator detects stuck executions and automatically retries with checkpoint recovery (#155) - Sequential Sub-AC Execution — Sub-acceptance criteria run sequentially with memory-pressure backoff for stability (#153) - Context Protection Guide — Interactive guide during `ooo	Low	3/20/2026
v0.26.0b1	> This is a beta release. Install with: `pip install ouroboros-ai==0.26.0b1` ## What's Changed ### Features - Runtime-agnostic orphan detection with WAL mode and retry - Add prompt parameter to `_build_command` and `_feeds_prompt_via_stdin` hook - Packaged agents as the source of truth (#136) - Multi-runtime support groundwork (claude, codex backends) - Streamlit analytics dashboard with plotly/pandas ### Bug Fixes - Comprehensive AC tree and runtime reliability fixes - AC tree not render	Low	3/20/2026
v0.24.0	## What's Changed ### Features - Overhaul TUI visual quality with Double Diamond visualization and performance improvements (#147) ### Bug Fixes - Add Sessions tab and improve session list readability - Load full session events including AC tree - Match Python TUI session display format - Re-sign macOS binary after copy + lazy session loading - Detect non-TTY and show helpful error message for uvx users Full Changelog: https://github.com/Q00/ouroboros/compare/v0.23.2...v0.24.0 ## What's	Low	3/15/2026
v0.23.2	## What's Changed ### Features - Add Windows x86_64 build target (`x86_64-pc-windows-msvc`) to CI release matrix - Handle `.exe` extension in CI artifact rename step ### Bug Fixes - Use `USERPROFILE` fallback for Windows home directory resolution in Rust TUI - Add Windows `subprocess.call` fallback in Python CLI (since `os.execv` behaves differently on Windows) ### Performance - Batch drain `execution_events` after event loop instead of per-event O(n) drain — eliminates ~30M element shifts on	Low	3/14/2026
v0.23.1	## What's Changed * fix(tui): SLT binary CI build + TUI bug fixes (v0.23.1) by @Q00 in https://github.com/Q00/ouroboros/pull/145 Full Changelog: https://github.com/Q00/ouroboros/compare/v0.23.0...v0.23.1	Low	3/14/2026
v0.23.0	## What's Changed ### Features - Native SLT backend for TUI monitor — A beautifully crafted alternative TUI backend built on the [SuperLightTUI](https://github.com/subinium/SuperLightTUI) library. The rendering quality and API design are genuinely impressive; it's exciting to see SLT powering Ouroboros's terminal UI (#141) - Sync CLAUDE.md version marker automatically after `ooo update` upgrade (#139) - Add Java Maven language preset for evaluation (#135) ### Bug Fixes - Add marketplace re	Low	3/14/2026
v0.22.0	## What's Changed ### Features - Add async MCP tools for background job execution (#119) ### Bug Fixes - Fix skills to load deferred MCP tools before checking availability (#126, #127) - Fix CI to prevent test hang from QA invoking real Claude CLI (#133) - Fix evolve to add validation gate to prevent blind convergence (#118) ### Maintenance - Upgrade all dependencies to latest versions (#75) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.21.1...v0.22.0 ## What's Changed *	Low	3/13/2026
v0.21.1	## What's Changed ### Bug Fixes - interview: Resolve MCP interview tool unresponsiveness (#122) - core: Handle Windows OSError in `_check_stale_instance()` (#121) - UX: Clarify false convergence gate message (#114) (#115) Full Changelog: https://github.com/Q00/ouroboros/compare/v0.21.0...v0.21.1 ## What's Changed * fix: clarify false convergence gate message (#114) by @harrymunro in https://github.com/Q00/ouroboros/pull/115 * fix: handle Windows OSError in _check_stale_instan	Low	3/11/2026
v0.21.0	## What's Changed ### Features - Add `ouroboros_cancel_execution` MCP handler for graceful session cancellation (#111) - Add `ooo update` command with auto version check (#110) - Detect CLAUDE.md git workflow for ralph/run (#109) ### Bug Fixes - Add SSL fallback and accurate messaging for version check - Add PID file management for stale MCP instance recovery (#59) - Default ambiguity_score to 0.15 and document it (#104) - Break Wonder→Reflect stagnation loop in evolution (#103) - Distinguish	Low	3/9/2026
v0.20.0	## What's Changed ### Features - Language-agnostic mechanical verification — Stage 1 auto-detects project language (Python, Rust, Go, Zig, Node.js) and runs appropriate lint/build/test commands. Supports `.ouroboros/mechanical.toml` for custom overrides. (#84) - Codebase-aware interview — Interview reads the codebase before asking open questions for more targeted Socratic questioning. (#67) - Interview version check — Prompts user to update when a newer Ouroboros version is availabl	Low	3/8/2026
v0.19.1	## What's Changed ### Bug Fixes - Register QAHandler in MCP server with DI + add post-run QA (#78) - Add next-step suggestions to all ooo skill outputs - Address adversarial review findings in next-step suggestions ### Documentation - Add Star History chart to README Full Changelog: https://github.com/Q00/ouroboros/compare/v0.19.0...v0.19.1 ## What's Changed * fix(qa): register QAHandler in MCP server with DI + add post-run QA by @Q00 in https://github.com/Q00/ouroboros/pull/78 * Add ne	Low	3/7/2026
v0.19.0	## What's Changed ### New Features - feat(qa): Add `ouroboros_qa` tool — general-purpose QA verdict for any artifact type - Evaluates code, test output, documents, API responses against a quality bar - Returns structured JSON verdict with score, dimensions, differences, and suggestions - Supports iterative loop with session tracking and iteration history - Inspired by [oh-my-codex `$visual-verdict`](https://github.com/Yeachan-Heo/oh-my-codex/commit/6fd5471) by [@Yeachan-Heo](https:/	Low	3/5/2026
v0.18.1	## What's Changed ### Bug Fixes - fix(seed): Robust extraction prompt with preprocessing and retry logic (#54, closes #53) - Strengthened user prompt with inline format template and explicit "no prose" instructions - Added `_preprocess_response()` to strip markdown code blocks and conversational preamble - Added retry logic (1 retry) with clarified correction prompt on parse failure - Improved error messages with found fields and response preview ### Enhancements - **feat(interview	Low	3/4/2026
v0.18.0	## What's Changed ### Features - Rewind history tracking: Discarded generations preserved in `rewind_history` with full context, visible in TUI as collapsible subtrees - Failure error recording: `failure_error` field captures error messages from failed generations (e.g., MCP transport disconnect, API limit exceeded) - Enhanced lineage status: MCP `lineage_status` output now includes rewind history section with timestamps and discarded generation details - TUI improvements: Gener	Low	3/2/2026
v0.17.0	## What's Changed ### Features - Validation Phase: Add post-execute validation between Execute and Evaluate phases — automatically detects and fixes cascading import errors from parallel AC code generation (pytest --collect-only + Claude auto-fix, max 3 attempts) - Sequential Execution Mode: Add `--no-parallel` flag to ralph.sh/ralph.py and `parallel` parameter to evolve_step MCP tool for stable, sequential AC execution - Max Plan Model Routing: Route entire pipeline to Claude Sonne	Low	2/28/2026
v0.16.0	## What's Changed ### Features - Evolution loop improvements: Add oscillation detection, evaluation gate, auto-commit, and rollback support (#51) ### Bug Fixes - Brownfield explorer: Address code review findings from dialectical review ### Documentation - README redesign: New hero image, philosophical framing (Socratic reasoning → Ontology → Double Diamond), Ralph convergence example, and "The Nine Minds" agent showcase ### Testing & Maintenance - Update ontology question tests f	Low	2/27/2026
v0.15.0	## What's Changed ### Features - Add rewind feature for evolutionary lineages — roll back to any previous generation - Add TUI lineage viewer with generation detail panel, delta visualization, and rewind confirmation - Add ralph.sh / ralph.py scripts for standalone MCP-based evolution loops ### Bug Fixes - Fix FastMCP schema inference: build proper `inspect.Signature` on tool wrappers so clients can send flat argument dicts instead of `{"kwargs": {...}}` - Fix MCP stdio stdout pollution: use `	Low	2/27/2026
v0.14.1	## What's Changed ### Bug Fixes - interview: Fix empty response bypass in ClaudeCodeAdapter — empty content now always triggers error regardless of session_id - interview: Fix sub-agent turn exhaustion — increase `max_turns` from 1 to 3 so the agent can use tools and still generate the question ### Maintenance - Apply ruff format to 4 files - Resolve ruff and mypy CI failures Full Changelog: https://github.com/Q00/ouroboros/compare/v0.14.0...v0.14.1 Full Changelog: https://gi	Low	2/26/2026
v0.14.0	## What's Changed ### Features - Add `--no-execute` mode to evolve step, allowing dry-run evolution without executing seeds - Track and surface execution output metadata (duration, messages processed, success status) ### Bug Fixes - Use head+tail truncation for execution output — keeps first 500 + last 2000 chars instead of only the first 2000, preserving stack traces and test results that appear at the end - Guard against empty `generation_results` in evolutionary loop to prevent misleading s	Low	2/25/2026
v0.13.7	## What's Changed ### Bug Fixes - fix(parallel): Re-raise anyio `Cancelled` instead of suppressing it in parallel AC execution. Catching `BaseException` without re-raising `Cancelled` broke anyio's cancel-scope propagation inside task groups, causing the entire parallel execution to hang indefinitely when a completed AC's SDK generator cleanup triggered cancellation of sibling tasks. Full Changelog: https://github.com/Q00/ouroboros/compare/v0.13.6...v0.13.7 Full Changelog: https:	Low	2/25/2026
v0.13.6	## What's Changed ### Bug Fixes - Skip unknown SDK message types instead of crashing: The Claude Agent SDK throws `MessageParseError` for unrecognized streaming events like `rate_limit_event`, killing the entire async generator. Now wraps `query()` to catch and skip these parse errors gracefully, allowing evolve_step and other LLM-dependent operations to succeed. Full Changelog: https://github.com/Q00/ouroboros/compare/v0.13.5...v0.13.6 Full Changelog: https://github.com/Q00/ouro	Low	2/23/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

sawzhang_skillsClaude Code skills collection — CCA study guides, Twitter research, MCP review, auto-iteration tools0.0.0

claude-code-configClaude Code skills, architectural principles, and alternative approaches for AI-assisted developmentmain@2026-07-22

clumsiesPersistent, deterministic, and user-managed project memory for vibe coding.v0.19.2-alpha

Micro-AgentA lightweight AI agent framework for vertical domain applications | 面向垂域应用的轻量级 AI Agent 框架v2026.07.23.1

jdocmunch-mcpThe leading, most token-efficient MCP server for documentation exploration and retrieval via structured section indexingv1.105.0

More in MCP Servers

supersetCode Editor for the AI Agents Era - Run an army of Claude Code, Codex, etc. on your machine

kreuzbergA polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python

ai-engineering-from-scratchLearn it. Build it. Ship it for others.

CodeGraphContextAn MCP server plus a CLI tool that indexes local code into a graph database to provide context to AI assistants.