freshcrate
Skin:/
Home > MCP Servers > ouroboros

ouroboros

Stop prompting. Start specifying.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

Stop prompting. Start specifying.

README

English | ํ•œ๊ตญ์–ด


โ—ฏ โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€ โ—ฏ

Ouroboros

O U R O B O R O S

โ—ฏ โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€ โ—ฏ

Stop prompting. Start specifying.
Specification-first workflow engine for AI coding agents

PyPI Tests License

Quick Start ยท Why ยท Results ยท How It Works ยท Commands ยท Philosophy

Turn a vague idea into a verified, working codebase -- with any AI coding agent.

Ouroboros sits between you and your AI runtime (Claude Code, Codex CLI, Hermes, or others). It replaces ad-hoc prompting with a structured specification-first workflow: interview, crystallize, execute, evaluate, evolve.


Why Ouroboros?

Most AI coding fails at the input, not the output. The bottleneck is not AI capability -- it is human clarity.

Problem What Happens Ouroboros Fix
Vague prompts AI guesses, you rework Socratic interview exposes hidden assumptions
No spec Architecture drifts mid-build Immutable seed spec locks intent before code
Manual QA "Looks good" is not verification 3-stage automated evaluation gate

Quick Start

Install โ€” one command, everything auto-detected:

curl -fsSL https://raw.githubusercontent.com/Q00/ouroboros/main/scripts/install.sh | bash

Build โ€” open your AI coding agent and go:

> ooo interview "I want to build a task management CLI"

Works with Claude Code, Codex CLI, Hermes, and OpenCode. The installer detects Claude Code, Codex CLI, and Hermes CLI automatically and registers the MCP server. For OpenCode, run ouroboros setup --runtime opencode after installation.

Other install methods

Claude Code plugin only (no system package):

claude plugin marketplace add Q00/ouroboros && claude plugin install ouroboros@ouroboros

Then run ooo setup inside a Claude Code session.

pip / uv / pipx:

pip install ouroboros-ai                # base
pip install ouroboros-ai[claude]        # + Claude Code deps
pip install ouroboros-ai[litellm]       # + LiteLLM multi-provider
pip install ouroboros-ai[mcp]           # + MCP server/client support
pip install ouroboros-ai[tui]           # + Textual terminal UI
pip install ouroboros-ai[all]           # everything (claude + litellm + mcp + tui + dashboard)
ouroboros setup                         # configure runtime

Legacy compatibility: ouroboros-ai[dashboard] is still accepted as a compatibility alias while extras migrate.

See runtime guides: Claude Code ยท Codex CLI ยท Hermes ยท OpenCode

Uninstall
ouroboros uninstall

Removes all configuration, MCP registration, and data. See UNINSTALL.md for details.

Python >= 3.12 required. See pyproject.toml for the full dependency list.


What You Get

After one loop of the Ouroboros cycle, a vague idea becomes a verified codebase:

Step Before After
Interview "Build me a task CLI" 12 hidden assumptions exposed, ambiguity scored to 0.19
Seed No spec Immutable specification with acceptance criteria, ontology, constraints
Evaluate Manual review 3-stage gate: Mechanical (free) -> Semantic -> Multi-Model Consensus
What just happened?
interview  ->  Socratic questioning exposed 12 hidden assumptions
seed       ->  Crystallized answers into an immutable spec (Ambiguity: 0.15)
run        ->  Executed via Double Diamond decomposition
evaluate   ->  3-stage verification: Mechanical -> Semantic -> Consensus

Use ooo <cmd> inside your AI coding agent session, or ouroboros init start, ouroboros run seed.yaml, etc. from the terminal.

The serpent completed one loop. Each loop, it knows more than the last.


How It Compares

AI coding tools are powerful -- but they solve the wrong problem when the input is unclear.

Vanilla AI Coding Ouroboros
Vague prompt AI guesses intent, builds on assumptions Socratic interview forces clarity before code
Spec validation No spec -- architecture drifts mid-build Immutable seed spec locks intent; Ambiguity gate (<= 0.2) blocks premature code
Evaluation "Looks good" / manual QA 3-stage automated gate: Mechanical -> Semantic -> Multi-Model Consensus
Rework rate High -- wrong assumptions surface late Low -- assumptions surface in the interview, not in the PR review

The Loop

The ouroboros -- a serpent devouring its own tail -- is not decoration. It IS the architecture:

    Interview -> Seed -> Execute -> Evaluate
        ^                           |
        +---- Evolutionary Loop ----+

Each cycle does not repeat -- it evolves. The output of evaluation feeds back as input for the next generation, until the system truly knows what it is building.

Phase What Happens
Interview Socratic questioning exposes hidden assumptions
Seed Answers crystallize into an immutable specification
Execute Double Diamond: Discover -> Define -> Design -> Deliver
Evaluate 3-stage gate: Mechanical ($0) -> Semantic -> Multi-Model Consensus
Evolve Wonder ("What do we still not know?") -> Reflect -> next generation

"This is where the Ouroboros eats its tail: the output of evaluation becomes the input for the next generation's seed specification." -- reflect.py

Convergence is reached when ontology similarity >= 0.95 -- when the system has questioned itself into clarity.

Ralph: The Loop That Never Stops

ooo ralph runs the evolutionary loop persistently -- across session boundaries -- until convergence is reached. Each step is stateless: the EventStore reconstructs the full lineage, so even if your machine restarts, the serpent picks up where it left off.

Ralph Cycle 1: evolve_step(lineage, seed) -> Gen 1 -> action=CONTINUE
Ralph Cycle 2: evolve_step(lineage)       -> Gen 2 -> action=CONTINUE
Ralph Cycle 3: evolve_step(lineage)       -> Gen 3 -> action=CONVERGED
                                                +-- Ralph stops.
                                                    The ontology has stabilized.

Commands

Inside AI coding agent sessions, use ooo <cmd> skills. From the terminal, use the ouroboros CLI.

Skill (ooo) CLI equivalent What It Does
ooo setup ouroboros setup Register runtime and configure project (one-time)
ooo interview ouroboros init start Socratic questioning -- expose hidden assumptions
ooo seed (generated by interview) Crystallize into immutable spec
ooo run ouroboros run seed.yaml Execute via Double Diamond decomposition
ooo evaluate (via MCP) 3-stage verification gate
ooo evolve (via MCP) Evolutionary loop until ontology converges
ooo unstuck (via MCP) 5 lateral thinking personas when you are stuck
ooo status ouroboros status executions / ouroboros status execution <id> Session tracking + (MCP-only) drift detection
ooo cancel ouroboros cancel execution [<id>|--all] Cancel stuck or orphaned executions
ooo ralph (via MCP) Persistent loop until verified
ooo tutorial (interactive) Interactive hands-on learning
ooo help ouroboros --help Full reference
ooo pm (via MCP) PM-focused interview + PRD generation
ooo qa (via skill) General-purpose QA verdict for any artifact
ooo update ouroboros update Check for updates + upgrade to latest
ooo brownfield (via skill) Scan and manage brownfield repo defaults
ooo publish (skill/runtime surface; uses gh CLI) Publish a Seed as GitHub Epic/Task issues for team workflows

Not all skills have direct CLI equivalents. Some (evaluate, evolve, unstuck, ralph, publish) are available through agent skills, runtime rules, or MCP tools rather than a direct ouroboros <subcommand> shell command.

See the CLI reference for full details.


The Nine Minds

Nine agents, each a different mode of thinking. Loaded on-demand, never preloaded:

Agent Role Core Question
Socratic Interviewer Questions-only. Never builds. "What are you assuming?"
Ontologist Finds essence, not symptoms "What IS this, really?"
Seed Architect Crystallizes specs from dialogue "Is this complete and unambiguous?"
Evaluator 3-stage verification "Did we build the right thing?"
Contrarian Challenges every assumption "What if the opposite were true?"
Hacker Finds unconventional paths "What constraints are actually real?"
Simplifier Removes complexity "What's the simplest thing that could work?"
Researcher Stops coding, starts investigating "What evidence do we actually have?"
Architect Identifies structural causes "If we started over, would we build it this way?"

Under the Hood

Architecture overview -- Python >= 3.12
src/ouroboros/
+-- bigbang/        Interview, ambiguity scoring, brownfield explorer
+-- routing/        PAL Router -- 3-tier cost optimization (1x / 10x / 30x)
+-- execution/      Double Diamond, hierarchical AC decomposition
+-- evaluation/     Mechanical -> Semantic -> Multi-Model Consensus
+-- evolution/      Wonder / Reflect cycle, convergence detection
+-- resilience/     4-pattern stagnation detection, 5 lateral personas
+-- observability/  3-component drift measurement, auto-retrospective
+-- persistence/    Event sourcing (SQLAlchemy + aiosqlite), checkpoints
+-- orchestrator/   Runtime abstraction layer (Claude Code, Codex CLI)
+-- core/           Types, errors, seed, ontology, security
+-- providers/      LiteLLM adapter (100+ models)
+-- mcp/            MCP client/server integration
+-- plugin/         Plugin system (skill/agent auto-discovery)
+-- tui/            Terminal UI dashboard
+-- cli/            Typer-based CLI

Key internals:

  • PAL Router -- Frugal (1x) -> Standard (10x) -> Frontier (30x) with auto-escalation on failure, auto-downgrade on success
  • Drift -- Goal (50%) + Constraint (30%) + Ontology (20%) weighted measurement, threshold <= 0.3
  • Brownfield -- Auto-detects config files across multiple language ecosystems
  • Evolution -- Up to 30 generations, convergence at ontology similarity >= 0.95
  • Stagnation -- Detects spinning, oscillation, no-drift, and diminishing returns patterns
  • Runtime backends -- Pluggable abstraction layer (orchestrator.runtime_backend config) with first-class support for Claude Code, Codex CLI, and Hermes; same workflow spec, different execution engines

See Architecture for the full design document.


From Wonder to Ontology

The philosophical engine behind Ouroboros

Wonder -> "How should I live?" -> "What IS 'live'?" -> Ontology -- Socrates

Every great question leads to a deeper question -- and that deeper question is always ontological: not "how do I do this?" but "what IS this, really?"

   Wonder                          Ontology
"What do I want?"    ->    "What IS the thing I want?"
"Build a task CLI"   ->    "What IS a task? What IS priority?"
"Fix the auth bug"   ->    "Is this the root cause, or a symptom?"

This is not abstraction for its own sake. When you answer "What IS a task?" -- deletable or archivable? solo or team? -- you eliminate an entire class of rework. The ontological question is the most practical question.

Ouroboros embeds this into its architecture through the Double Diamond:

    * Wonder          * Design
   /  (diverge)      /  (diverge)
  /    explore      /    create
 /                 /
* ------------ * ------------ *
 \                 \
  \    define       \    deliver
   \  (converge)     \  (converge)
    * Ontology        * Evaluation

The first diamond is Socratic: diverge into questions, converge into ontological clarity. The second diamond is pragmatic: diverge into design options, converge into verified delivery. Each diamond requires the one before it -- you cannot design what you have not understood.

Ambiguity Score: The Gate Between Wonder and Code

The Interview does not end when you feel ready -- it ends when the math says you are ready. Ouroboros quantifies ambiguity as the inverse of weighted clarity:

Ambiguity = 1 - Sum(clarity_i * weight_i)

Each dimension is scored 0.0-1.0 by the LLM (temperature 0.1 for reproducibility), then weighted:

Dimension Greenfield Brownfield
Goal Clarity -- Is the goal specific? 40% 35%
Constraint Clarity -- Are limitations defined? 30% 25%
Success Criteria -- Are outcomes measurable? 30% 25%
Context Clarity -- Is the existing codebase understood? -- 15%

Threshold: Ambiguity <= 0.2 -- only then can a Seed be generated.

Example (Greenfield):

  Goal: 0.9 * 0.4  = 0.36
  Constraint: 0.8 * 0.3  = 0.24
  Success: 0.7 * 0.3  = 0.21
                        ------
  Clarity             = 0.81
  Ambiguity = 1 - 0.81 = 0.19  <= 0.2 -> Ready for Seed

Why 0.2? Because at 80% weighted clarity, the remaining unknowns are small enough that code-level decisions can resolve them. Above that threshold, you are still guessing at architecture.

Ontology Convergence: When the Serpent Stops

The evolutionary loop does not run forever. It stops when consecutive generations produce ontologically identical schemas. Similarity is measured as a weighted comparison of schema fields:

Similarity = 0.5 * name_overlap + 0.3 * type_match + 0.2 * exact_match
Component Weight What It Measures
Name overlap 50% Do the same field names exist in both generations?
Type match 30% Do shared fields have the same types?
Exact match 20% Are name, type, AND description all identical?

Threshold: Similarity >= 0.95 -- the loop converges and stops evolving.

But raw similarity is not the only signal. The system also detects pathological patterns:

Signal Condition What It Means
Stagnation Similarity >= 0.95 for 3 consecutive generations Ontology has stabilized
Oscillation Gen N ~ Gen N-2 (period-2 cycle) Stuck bouncing between two designs
Repetitive feedback >= 70% question overlap across 3 generations Wonder is asking the same things
Hard cap 30 generations reached Safety valve
Gen 1: {Task, Priority, Status}
Gen 2: {Task, Priority, Status, DueDate}     -> similarity 0.78 -> CONTINUE
Gen 3: {Task, Priority, Status, DueDate}     -> similarity 1.00 -> CONVERGED

Two mathematical gates, one philosophy: do not build until you are clear (Ambiguity <= 0.2), do not stop evolving until you are stable (Similarity >= 0.95).


Contributing

git clone https://github.com/Q00/ouroboros
cd ouroboros
uv sync --all-groups && uv run pytest

Issues ยท Discussions ยท Contributing Guide


Star History

Star History Chart Star History Chart

"The beginning is the end, and the end is the beginning."

The serpent does not repeat -- it evolves.

MIT License

Release History

VersionChangesUrgencyDate
v0.40.1## What's Changed ### Bug Fixes - Include `click` as an installer runtime dependency (#1299) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.40.0...v0.40.1 ## What's Changed * fix(installer): include click runtime dependency by @Q00 in https://github.com/Q00/ouroboros/pull/1299 **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.40.0...v0.40.1High5/30/2026
v0.40.0# v0.40.0 โ€” `ooo auto` crosses the line > **This is the release where `ooo auto` stops being a demo and becomes a machine that finishes your work.** You drop in a vague intention. The Socratic interview pins it down into a precise, machine-checkable goal โ€” and then the engine refuses to stop until that goal is actually **built, verified, and shipped**. No babysitting. No "it drew up a plan and gave up halfway." The loop owns the outcome, end to end. This is not "generate a plan." This is **goHigh5/30/2026
v0.39.1## What's Changed ### Features - Add `ouroboros status run --json` projection surface (#1133) - Record durable workflow lifecycle events in orchestrator (#1134) - Add `on_error`/`on_cancel` plugin observability hooks (PR E) (#1137) - Expose MCP interview reasoning metadata (#1140) - Prompt for required trust grants on plugin install (#1141) - Expose Ralph-start alias while preserving runtime ownership - Dispatch lifecycle hooks within plugin trust boundaries - Make plugin permission waits shareHigh5/20/2026
v0.39.0## Ouroboros v0.39.0 This release lands a high-severity **security fix**, flips `ooo run` to the fat-harness execution path by default, and completes the AgentOS roadmap wiring/baseline milestone tracked in [#961](https://github.com/Q00/ouroboros/issues/961). ## ๐Ÿ”’ Security ### RCE via untrusted project-directory `.env` (high severity) Ouroboros is run *inside cloned repositories*. `config/loader.py` loaded `./.env` from the working directory into `os.environ` at import time with the **same High5/18/2026
v0.38.2## What's Changed ### Bug Fixes - Close residual `allowed_tools=[]` leak in sub-CLI envelope for interview ### Testing - Lock empty `allowedTools` passthrough - Cover strict empty allowed-tools envelope (#975) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.38.1...v0.38.2 ## What's Changed * fix(interview): close residual allowed_tools=[] leak in sub-CLI envelope by @Q00 in https://github.com/Q00/ouroboros/pull/974 **Full Changelog**: https://github.com/Q00/ouroboros/compaHigh5/13/2026
v0.37.0## What's Changed ### Features #### `ooo auto` Pipeline - `DomainProfile` and `VerifiablePredicate` contracts (#849, #809 P3 PR 1/6) - `UNSTUCK_LATERAL` persona advisor on EVALUATE fail (#829) - EVALUATE phase verifies run output against seed AC (#825) - Formalize run-handoff idempotency contract (#843) - Chain RUNโ†’RALPH automatically with `--complete-product` (#791) - `user_preference` source + deterministic ambiguity floor (#811) - Top-level `pipeline_timeout_seconds` deadline (#790) - SteerHigh5/11/2026
v0.34.0## What's Changed in v0.34.0 ### Features - **Copilot CLI runtime** (#1f07a9c8) โ€” first-class Copilot agent runtime - **Kiro runtime backend** (#2fc7a3c8) โ€” Kiro as a first-class runtime - **Stage enum + runtime_profile.stages config** (#538) โ€” slice 1 of #519 - **Agent OS runtime_profile (Codex backend)** (#505) โ€” runtime profile foundation - **AgentProcess lifecycle projection** (#628) โ€” project lifecycle state from events - **Directive emission at StepAction sites** (#477464ef) โ€” slice 1 of High5/6/2026
v0.31.1## What's Changed * fix(parallel): honor configured worker caps by @andrew-adamson in https://github.com/Q00/ouroboros/pull/489 * feat(orchestrator): Gemini CLI runtime backend (follow-up to #312) by @shaun0927 in https://github.com/Q00/ouroboros/pull/504 * fix(router): preserve Windows literal path payloads through skill dispatch by @shaun0927 in https://github.com/Q00/ouroboros/pull/481 * fix(brownfield): scan repos with any origin remote by @andrew-adamson in https://github.com/Q00/ouroboros/High4/29/2026
v0.30.0## What's Changed ### Features - **CLI**: add `ooo resume` to recover in-flight sessions after MCP disconnect (#433) - **MCP**: add streamable HTTP serve transport - **MCP**: add compact monitoring relay views - **MCP**: suggest lateral persona from stagnation - **Resilience**: add in-run lateral recovery - **Events**: add `control.directive.emitted` event factory - **Core**: introduce `Directive` StrEnum for control-plane decisions ### Bug Fixes **MCP** - clarify job wait execution progress High4/26/2026
v0.29.2## What's Changed ### Bug Fixes - fix(mcp): protect startup stdin during shell env loading (#482) - fix(interview): guard seed-ready closure (#484) - fix(evaluate): replace language presets with AI-driven mechanical.toml (#454) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.29.1...v0.29.2 ## What's Changed * fix(evaluate): replace language presets with AI-driven mechanical.toml by @Q00 in https://github.com/Q00/ouroboros/pull/454 * fix(interview): guard seed-ready closure byHigh4/22/2026
v0.29.1## What's Changed ### Bug Fixes - **mcp**: prevent `start_execute_seed` UUID scope crash (#470) - **mcp**: avoid replaying failed tool calls - **mcp**: avoid retrying timed-out tool calls - **mcp**: reconnect stale client transports (#467) - **runtime**: drop `initial_context` on interview resume so InterviewHandler continues the session - **router**: `extract_first_argument` returns full argument payload instead of just the first token - **tests**: make OpenCode config tests platform-agnostic High4/21/2026
v0.29.0## What's Changed This release introduces the **Hermes Agent runtime**, **capability-aware orchestration** (SandboxClass + policy audits), **OpenCode bridge plugin runtime**, and hardens the MCP transport URL validator. It also ships a large batch of setup/PM fail-closed fixes and tool-envelope enforcement across Gemini and OpenCode backends. ### Features - **Hermes runtime**: integrate Hermes Agent as a new execution runtime backend, conform to `AgentRuntime` resume contract, expose `llm_backHigh4/21/2026
v0.28.8## What's Changed ### Features - feat(tui): improve event subscription, activity display, and tree sync (#440) - feat(cli): add `ouroboros mcp doctor` diagnostic command (#445) - feat(evaluate): add `ouroboros_checklist_verify` MCP tool (#366 part 2) ### Bug Fixes - fix(hud): merge subtask events into AC tree snapshot (#439) - fix(evaluate): honour 1-item `acceptance_criteria` in single-AC path - fix(mcp): block localhost hostname in SSRF validation - fix(mcp): add real SSRF hardening beyond sHigh4/18/2026
v0.28.7## What's Changed ### Features - **events**: add `event_version` to `BaseEvent` payload (#435) - **evaluate**: wire checklist into `EvaluateHandler` for multi-AC support (#366) ### Bug Fixes - **install**: enforce Python >=3.12 for uv and pipx installers (#429) - **providers**: respect `OUROBOROS_SKIP_VERSION_CHECK` env var override - **orchestrator**: graceful fallback for legacy adapters without `llm_backend` - **orchestrator**: narrow exception scope in `_build_dependency_analyzer` - **orchHigh4/15/2026
v0.28.6## What's Changed ### Features - **add auto-confirm routing for high-confidence code answers** (#357) by @shaun0927 in https://github.com/Q00/ouroboros/pull/382 - **wire ambiguity milestones into prompt and MCP response** (#363) by @shaun0927 in https://github.com/Q00/ouroboros/pull/380 - **add milestone definitions for ambiguity score ranges** (#363) by @shaun0927 in https://github.com/Q00/ouroboros/pull/379 - **expose Socratic questions and evidence in Stage 2 output** (#367) by @shaun0927 inHigh4/14/2026
v0.28.5## What's Changed ### Bug Fixes - **evaluator must not reuse max_turns=1 shared adapter** (#368, fixes #305) - **cap recursive decomposition depth** (#369) - **remove dead llm_adapter wire + file-based artifact evaluation** (4f756c2) - **satisfy lint and mypy for depth-cap PR** (f17d511) ### Features - **skip externally satisfied ACs** (#370) - **share Anthropic rate budget across workers** (#371) ### Documentation - **fix stale MAX_FILES reference in _scan_directory docstring** (4a83c04) ##High4/13/2026
v0.28.4## What's Changed ### Bug Fixes - **fix**: detect `opencode.jsonc` config alongside `opencode.json` (#356, #374) - **fix(orchestrator)**: restore LLM-assisted top-level AC dependency planning (#373) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.28.3...v0.28.4 ## What's Changed * fix(orchestrator): restore LLM-assisted top-level AC dependency planning by @Q00 in https://github.com/Q00/ouroboros/pull/373 * fix: detect opencode.jsonc config alongside opencode.json (#356) by @SMedium4/12/2026
v0.28.3## What's Changed ### Features - **feat(mcp):** add http/sse/streamable-http transport support (#339) - **feat(orchestrator):** wire checkpoint store + public API context (#338) ### Performance - **perf(#344):** reduce per-call overhead in Claude adapter (#348) - **perf(#345):** optimize ambiguity scorer token usage and enforce format error limit (#349) ### Bug Fixes - **fix(cli):** clarify TUI install guidance (#355) - **fix(mcp):** register ACTreeHUDHandler in composition root (#340) - **fiHigh4/10/2026
v0.28.2## What's Changed ### Bug Fixes - Fix MCP server crash when gateway spawns server with `cwd=/` (#335 follow-up) ### Refactoring - Align PMSeed and pm.md fields for consistency (#314) ### Documentation - Add `[mcp]` extra to `uvx` MCP server setup commands in docs **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.28.1...v0.28.2 ## What's Changed * refactor(pm): align PMSeed and pm.md fields by @cohemm in https://github.com/Q00/ouroboros/pull/314 **Full Changelog**: https://gHigh4/7/2026
v0.28.1## What's Changed ### Bug Fixes - fix(mcp): fix uvx MCP setup missing mcp dependency (#335) - fix(openclaw): harden gateway integration with env loading, repo fallback, and DB consolidation (#336) ### Documentation - docs: complete OpenCode runtime documentation across all guides (#333) (#334) - docs: add openclaw update note for mcp set command (#332) - docs: replace PM Mode banner with OpenClaw integration (#330) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.28.0...v0.28.1Medium4/7/2026
v0.28.0## What's Changed ### Features - OpenCode runtime adapter for LLM completions (#324) - Reduce third-party runtime dependencies (#316) - OpenClaw channel workflow scaffold (#323) ### Bug Fixes - Prevent MCP startup timeout on large SQLite databases (#326) - Prevent SessionStart stdout from polluting Claude context (#322) - Guard against None optional params in measure_drift handler (#325) - Harden OpenCode stdin close and skill_path context manager (#329) ### Documentation - Clarify how users Medium4/6/2026
v0.27.2## What's Changed ### Features - Recursive AC decomposition + `ac_tree_hud` MCP tool (#319) ### Bug Fixes - Improve evaluate error diagnostics for Claude SDK failures (#313) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.27.1...v0.27.2 ## What's Changed * feat: recursive AC decomposition + ac_tree_hud MCP tool by @Q00 in https://github.com/Q00/ouroboros/pull/319 * Improve evaluate error diagnostics for Claude SDK failures by @cohemm in https://github.com/Q00/ouroboros/pull/Medium4/6/2026
v0.27.1## What's Changed ### Features - Add PATH 4 research interlude for mid-interview exploration (#294) - Gemini CLI adapter for LLM completions (#315) โ€” by @kostakost2 ### Bug Fixes - Wire trigger_consensus to pipeline + expand project root detection (#317) - Resolve double-backgrounding in job tracking and differentiate polling options (#318) - Preserve CompletionResponse immutability during JSON normalization (#309) - Include tracebacks in QA/evaluate errors (#308) - Remove dead OUROBOROS_WEB_SMedium4/4/2026
v0.27.0## What's Changed ### Features - **MCP Bridge** โ€” Server-to-server MCP communication enabling external tools in `ooo run` seeds (#278, #279, #280, #281) - Auto-discovers `~/.ouroboros/mcp_servers.yaml` for upstream MCP server configuration - Injects `mcp_manager` through `ExecuteSeedHandler` โ†’ `OrchestratorRunner` pipeline - Wires bridge into evolution executor for `ooo evolve` with external tools - Transport lifecycle hardening with proper rollback and symmetric cleanup - **`ooo publisMedium4/1/2026
v0.26.6## What's Changed ### Bug Fixes - **TUI session completion**: Emit `execution.terminal` event so TUI correctly sees session completion (#272) - **Subprocess safety**: Prevent subprocess leak, fork bomb, and env poisoning (#271) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.5...v0.26.6 ## What's Changed * fix(#269): prevent subprocess leak, fork bomb, and env poisoning by @Q00 in https://github.com/Q00/ouroboros/pull/271 * fix(tui): emit execution.terminal so TUI sees sesMedium3/30/2026
v0.26.5## What's Changed ### Features - PM interview now returns DECIDE_LATER questions to user instead of auto-skipping (#238) ### Bug Fixes - Tolerate non-JSON QA verdict responses (#236) - Fix adapter timeout, tool policy, and factory max_turns in providers (#259) - Fix PM handoff into the runnable Seed workflow (#252) - Honor configured interview adapter backend in PM (#245) - Fix resume retries for Codex sessions (#249) - Align CLI completion scoring with MCP flow in PM (#256) - Use configured cMedium3/30/2026
v0.26.4## What's Changed ### Bug Fixes - Show install guidance when `litellm` extra is missing (#246, fixes #239) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.3...v0.26.4 ## What's Changed * fix(#239): show install guidance when litellm extra is missing by @srausser in https://github.com/Q00/ouroboros/pull/246 **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.3...v0.26.4Medium3/29/2026
v0.26.3## What's Changed ### Features - Add `ouroboros uninstall` and `ouroboros config backend` CLI commands (#220) ### Bug Fixes - Handle null `failed_attempts` in lateral think (#234) - Degrade gracefully on `claude_code` JSON output (#232) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.2...v0.26.3 ## What's Changed * feat(cli): add ouroboros uninstall and config backend commands by @Q00 in https://github.com/Q00/ouroboros/pull/220 * fix(#212): degrade gracefully on claude_cMedium3/27/2026
v0.26.2## What's Changed * feat(evolution): graceful shutdown on SIGINT (#169) by @hackertaco in https://github.com/Q00/ouroboros/pull/172 * Improve GitHub issue intake and repository safety guidance by @hackertaco in https://github.com/Q00/ouroboros/pull/81 * fix(install): add [claude] extra to ouroboros-ai references (#217) by @hackertaco in https://github.com/Q00/ouroboros/pull/218 * fix(#214): prefer ./mvnw for Maven wrapper projects by @ouroboros-agent[bot] in https://github.com/Q00/ouroboros/pullMedium3/26/2026
v0.26.1## What's Changed ### Bug Fixes - **PM handler**: Use LLM adapter factory instead of hardcoded ClaudeAgentAdapter - **Install**: Let user choose runtime instead of defaulting to Codex - **Install**: Add marketplace update before plugin install ### Maintenance - Align import paths with InterviewHandler convention - Fix import sorting in pm_handler - Point install.sh URL to main branch **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.0...v0.26.1 ## What's Changed * fix(pm): Medium3/25/2026
v0.26.0## What's Changed in v0.26.0 ### ๐Ÿš€ Features - **PM Interview Engine & Brownfield Management** โ€” Socratic PM interview flow with brownfield codebase analysis (#191) by @cohemm - **Codex CLI Runtime Support** โ€” Runtime-agnostic execution with Codex CLI adapter, parallel executor, and heartbeat monitoring - **Interview Redesign** โ€” MCP question generator + main session router with breadth/closure personas (#178, #136) - **Evaluation Enhancement** โ€” Reward hacking risk detection in Stage 2 semantiMedium3/25/2026
v0.26.0b7**Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.0b6...v0.26.0b7Medium3/24/2026
v0.26.0b6## What's Changed * feat(version): automate version sync, install UX, and update flow by @Q00 in https://github.com/Q00/ouroboros/pull/194 **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.0b5...v0.26.0b6Medium3/24/2026
v0.26.0b5> **This is a beta release.** Install with: `pip install ouroboros-ai==0.26.0b5` ## What's Changed ### Security - **Remove litellm dependency** โ€” PyPI supply chain attack mitigation (merged from main v0.25.2) ### Features - PM interview engine + brownfield management (#191) - Beta version update automation ### Bug Fixes - Fix missing `Result` import in PM CLI commands - Fix ruff lint issues **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.26.0b4...v0.26.0b5 ## What's ChangeMedium3/24/2026
v0.25.2## What's Changed ### Security - **Remove litellm dependency** โ€” PyPI supply chain attack mitigation (#195) ### Build - hatch-vcs dev builds on every push (#183) - Strip local version identifiers for PyPI compatibility - Show dev version in GitHub Actions annotations **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.25.1...v0.25.2 ## What's Changed * feat(build): hatch-vcs dev builds on every push by @Q00 in https://github.com/Q00/ouroboros/pull/183 * fix(security): make litelMedium3/24/2026
v0.26.0b4> **This is a beta release.** Install with: `pip install ouroboros-ai==0.26.0b4` ## What's Changed ### Features - Agent runtime abstraction with Codex CLI runtime support - Runtime-agnostic packaging with optional extras and standalone setup - Reward hacking risk detection in Stage 2 semantic evaluation (#174) - Runtime-agnostic orphan detection with WAL mode and retry - Interview breadth and closure personas (#136) - Packaged agents as source of truth (#136) - Codex artifact installation and Medium3/23/2026
v0.25.1## What's Changed * fix(consensus): honest model names in Stage 3 fallback (#161) by @Q00 in https://github.com/Q00/ouroboros/pull/163 * fix(#168): ignore pending generations in convergence by @ouroboros-agent[bot] in https://github.com/Q00/ouroboros/pull/171 * fix(#176): inherit parent MCP tools in delegated runs by @Q00 in https://github.com/Q00/ouroboros/pull/180 ## New Contributors * @ouroboros-agent[bot] made their first contribution in https://github.com/Q00/ouroboros/pull/171 **Full ChaLow3/22/2026
v0.26.0b3> **This is a beta release.** Install with: `pip install ouroboros-ai==0.26.0b3` ## What's Changed since v0.26.0b2 ### Features - **Codex MCP auto-registration**: `ouroboros setup --runtime codex` now registers the Ouroboros MCP server in `~/.codex/config.toml` with correct env vars (`OUROBOROS_AGENT_RUNTIME=codex`, `OUROBOROS_LLM_BACKEND=codex`) ### Bug Fixes - **install.sh**: Skip Python version check when `uv` or `pipx` is available (uv manages its own Python toolchain) - **install.sh**: PLow3/21/2026
v0.26.0b2> **This is a beta release.** Install with: `pip install ouroboros-ai==0.26.0b2` ## What's Changed since v0.26.0b1 ### Features - **Codex artifact auto-install**: `ouroboros setup --runtime codex` now installs 15 skills and routing rules into `~/.codex/` automatically ### Bug Fixes - **install.sh**: Added `--upgrade` flag so existing installs get updated instead of silently skipping - **install.sh**: Fixed PEP 508 specifier ordering (`name[extras]==version`) and pinned to `0.26.0b1` - **docs*Low3/21/2026
v0.25.0## What's Changed ### Features - **Standalone QA Skill** โ€” `ooo qa` now works as a fast single-pass quality verdict for any artifact, with or without MCP setup (#160) - **Stall Detection & Supervisor Recovery** โ€” Orchestrator detects stuck executions and automatically retries with checkpoint recovery (#155) - **Sequential Sub-AC Execution** โ€” Sub-acceptance criteria run sequentially with memory-pressure backoff for stability (#153) - **Context Protection Guide** โ€” Interactive guide during `ooo Low3/20/2026
v0.26.0b1> **This is a beta release.** Install with: `pip install ouroboros-ai==0.26.0b1` ## What's Changed ### Features - Runtime-agnostic orphan detection with WAL mode and retry - Add prompt parameter to `_build_command` and `_feeds_prompt_via_stdin` hook - Packaged agents as the source of truth (#136) - Multi-runtime support groundwork (claude, codex backends) - Streamlit analytics dashboard with plotly/pandas ### Bug Fixes - Comprehensive AC tree and runtime reliability fixes - AC tree not renderLow3/20/2026
v0.24.0## What's Changed ### Features - Overhaul TUI visual quality with Double Diamond visualization and performance improvements (#147) ### Bug Fixes - Add Sessions tab and improve session list readability - Load full session events including AC tree - Match Python TUI session display format - Re-sign macOS binary after copy + lazy session loading - Detect non-TTY and show helpful error message for uvx users **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.23.2...v0.24.0 ## What'sLow3/15/2026
v0.23.2## What's Changed ### Features - Add Windows x86_64 build target (`x86_64-pc-windows-msvc`) to CI release matrix - Handle `.exe` extension in CI artifact rename step ### Bug Fixes - Use `USERPROFILE` fallback for Windows home directory resolution in Rust TUI - Add Windows `subprocess.call` fallback in Python CLI (since `os.execv` behaves differently on Windows) ### Performance - Batch drain `execution_events` after event loop instead of per-event O(n) drain โ€” eliminates ~30M element shifts onLow3/14/2026
v0.23.1## What's Changed * fix(tui): SLT binary CI build + TUI bug fixes (v0.23.1) by @Q00 in https://github.com/Q00/ouroboros/pull/145 **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.23.0...v0.23.1Low3/14/2026
v0.23.0## What's Changed ### Features - **Native SLT backend for TUI monitor** โ€” A beautifully crafted alternative TUI backend built on the [SuperLightTUI](https://github.com/subinium/SuperLightTUI) library. The rendering quality and API design are genuinely impressive; it's exciting to see SLT powering Ouroboros's terminal UI (#141) - Sync CLAUDE.md version marker automatically after `ooo update` upgrade (#139) - Add Java Maven language preset for evaluation (#135) ### Bug Fixes - Add marketplace reLow3/14/2026
v0.22.0## What's Changed ### Features - Add async MCP tools for background job execution (#119) ### Bug Fixes - Fix skills to load deferred MCP tools before checking availability (#126, #127) - Fix CI to prevent test hang from QA invoking real Claude CLI (#133) - Fix evolve to add validation gate to prevent blind convergence (#118) ### Maintenance - Upgrade all dependencies to latest versions (#75) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.21.1...v0.22.0 ## What's Changed * Low3/13/2026
v0.21.1## What's Changed ### Bug Fixes - **interview**: Resolve MCP interview tool unresponsiveness (#122) - **core**: Handle Windows OSError in `_check_stale_instance()` (#121) - **UX**: Clarify false convergence gate message (#114) (#115) **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.21.0...v0.21.1 ## What's Changed * fix: clarify false convergence gate message (#114) by @harrymunro in https://github.com/Q00/ouroboros/pull/115 * fix: handle Windows OSError in _check_stale_instanLow3/11/2026
v0.21.0## What's Changed ### Features - Add `ouroboros_cancel_execution` MCP handler for graceful session cancellation (#111) - Add `ooo update` command with auto version check (#110) - Detect CLAUDE.md git workflow for ralph/run (#109) ### Bug Fixes - Add SSL fallback and accurate messaging for version check - Add PID file management for stale MCP instance recovery (#59) - Default ambiguity_score to 0.15 and document it (#104) - Break Wonderโ†’Reflect stagnation loop in evolution (#103) - Distinguish Low3/9/2026
v0.20.0## What's Changed ### Features - **Language-agnostic mechanical verification** โ€” Stage 1 auto-detects project language (Python, Rust, Go, Zig, Node.js) and runs appropriate lint/build/test commands. Supports `.ouroboros/mechanical.toml` for custom overrides. (#84) - **Codebase-aware interview** โ€” Interview reads the codebase before asking open questions for more targeted Socratic questioning. (#67) - **Interview version check** โ€” Prompts user to update when a newer Ouroboros version is availablLow3/8/2026
v0.19.1## What's Changed ### Bug Fixes - Register QAHandler in MCP server with DI + add post-run QA (#78) - Add next-step suggestions to all ooo skill outputs - Address adversarial review findings in next-step suggestions ### Documentation - Add Star History chart to README **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.19.0...v0.19.1 ## What's Changed * fix(qa): register QAHandler in MCP server with DI + add post-run QA by @Q00 in https://github.com/Q00/ouroboros/pull/78 * Add neLow3/7/2026
v0.19.0## What's Changed ### New Features - **feat(qa)**: Add `ouroboros_qa` tool โ€” general-purpose QA verdict for any artifact type - Evaluates code, test output, documents, API responses against a quality bar - Returns structured JSON verdict with score, dimensions, differences, and suggestions - Supports iterative loop with session tracking and iteration history - Inspired by [oh-my-codex `$visual-verdict`](https://github.com/Yeachan-Heo/oh-my-codex/commit/6fd5471) by [@Yeachan-Heo](https:/Low3/5/2026
v0.18.1## What's Changed ### Bug Fixes - **fix(seed)**: Robust extraction prompt with preprocessing and retry logic (#54, closes #53) - Strengthened user prompt with inline format template and explicit "no prose" instructions - Added `_preprocess_response()` to strip markdown code blocks and conversational preamble - Added retry logic (1 retry) with clarified correction prompt on parse failure - Improved error messages with found fields and response preview ### Enhancements - **feat(interviewLow3/4/2026
v0.18.0## What's Changed ### Features - **Rewind history tracking**: Discarded generations preserved in `rewind_history` with full context, visible in TUI as collapsible subtrees - **Failure error recording**: `failure_error` field captures error messages from failed generations (e.g., MCP transport disconnect, API limit exceeded) - **Enhanced lineage status**: MCP `lineage_status` output now includes rewind history section with timestamps and discarded generation details - **TUI improvements**: GenerLow3/2/2026
v0.17.0## What's Changed ### Features - **Validation Phase**: Add post-execute validation between Execute and Evaluate phases โ€” automatically detects and fixes cascading import errors from parallel AC code generation (pytest --collect-only + Claude auto-fix, max 3 attempts) - **Sequential Execution Mode**: Add `--no-parallel` flag to ralph.sh/ralph.py and `parallel` parameter to evolve_step MCP tool for stable, sequential AC execution - **Max Plan Model Routing**: Route entire pipeline to Claude SonneLow2/28/2026
v0.16.0## What's Changed ### Features - **Evolution loop improvements**: Add oscillation detection, evaluation gate, auto-commit, and rollback support (#51) ### Bug Fixes - **Brownfield explorer**: Address code review findings from dialectical review ### Documentation - **README redesign**: New hero image, philosophical framing (Socratic reasoning โ†’ Ontology โ†’ Double Diamond), Ralph convergence example, and "The Nine Minds" agent showcase ### Testing & Maintenance - Update ontology question tests fLow2/27/2026
v0.15.0## What's Changed ### Features - Add rewind feature for evolutionary lineages โ€” roll back to any previous generation - Add TUI lineage viewer with generation detail panel, delta visualization, and rewind confirmation - Add ralph.sh / ralph.py scripts for standalone MCP-based evolution loops ### Bug Fixes - Fix FastMCP schema inference: build proper `inspect.Signature` on tool wrappers so clients can send flat argument dicts instead of `{"kwargs": {...}}` - Fix MCP stdio stdout pollution: use `Low2/27/2026
v0.14.1## What's Changed ### Bug Fixes - **interview**: Fix empty response bypass in ClaudeCodeAdapter โ€” empty content now always triggers error regardless of session_id - **interview**: Fix sub-agent turn exhaustion โ€” increase `max_turns` from 1 to 3 so the agent can use tools and still generate the question ### Maintenance - Apply ruff format to 4 files - Resolve ruff and mypy CI failures **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.14.0...v0.14.1 **Full Changelog**: https://giLow2/26/2026
v0.14.0## What's Changed ### Features - Add `--no-execute` mode to evolve step, allowing dry-run evolution without executing seeds - Track and surface execution output metadata (duration, messages processed, success status) ### Bug Fixes - Use head+tail truncation for execution output โ€” keeps first 500 + last 2000 chars instead of only the first 2000, preserving stack traces and test results that appear at the end - Guard against empty `generation_results` in evolutionary loop to prevent misleading sLow2/25/2026
v0.13.7## What's Changed ### Bug Fixes - **fix(parallel)**: Re-raise anyio `Cancelled` instead of suppressing it in parallel AC execution. Catching `BaseException` without re-raising `Cancelled` broke anyio's cancel-scope propagation inside task groups, causing the entire parallel execution to hang indefinitely when a completed AC's SDK generator cleanup triggered cancellation of sibling tasks. **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.13.6...v0.13.7 **Full Changelog**: https:Low2/25/2026
v0.13.6## What's Changed ### Bug Fixes - **Skip unknown SDK message types instead of crashing**: The Claude Agent SDK throws `MessageParseError` for unrecognized streaming events like `rate_limit_event`, killing the entire async generator. Now wraps `query()` to catch and skip these parse errors gracefully, allowing evolve_step and other LLM-dependent operations to succeed. **Full Changelog**: https://github.com/Q00/ouroboros/compare/v0.13.5...v0.13.6 **Full Changelog**: https://github.com/Q00/ouroLow2/23/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

sawzhang_skillsClaude Code skills collection โ€” CCA study guides, Twitter research, MCP review, auto-iteration tools0.0.0
claude-code-configClaude Code skills, architectural principles, and alternative approaches for AI-assisted developmentmain@2026-05-28
clumsiesPersistent, deterministic, and user-managed project memory for vibe coding.v0.19.1-alpha
aiA productive AI coworker that learns, self-improves, and ships work.main@2026-06-06
auxiliarAI agent for cloud service recommendations โ€” Chrome-verified pricing, real gotchas, jobs, and communitymain@2026-06-05

More in MCP Servers

AstrBotAgentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. โœจ
agentscopeBuild and run agents you can see, understand and trust.
claude-plugins-officialOfficial, Anthropic-managed directory of high quality Claude Code Plugins.
langchain4jLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme