pilot

#1 Terminal Benchmark 2.0 — AI that ships your tickets.

agentic agentic-workflow ai-agent ai-bots ai-tools autonomous-coding claude claude-code go

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

#1 Terminal Benchmark 2.0 — AI that ships your tickets.

README

   ██████╗ ██╗██╗      ██████╗ ████████╗
   ██╔══██╗██║██║     ██╔═══██╗╚══██╔══╝
   ██████╔╝██║██║     ██║   ██║   ██║
   ██╔═══╝ ██║██║     ██║   ██║   ██║
   ██║     ██║███████╗╚██████╔╝   ██║
   ╚═╝     ╚═╝╚══════╝ ╚═════╝    ╚═╝

AI that ships your tickets while you sleep

Docs • Install • Desktop • Quick Start • How It Works • Features • CLI • Discord • Deploy

The Problem

You have 47 tickets in your backlog. You agonize over which to prioritize. Half are "quick fixes" that somehow take 2 hours each. Your PM asks for status updates. Sound familiar?

The Solution

Pilot picks up tickets from GitHub, Linear, Jira, or Asana—plans the implementation, writes the code, runs tests, and opens a PR. You review and merge. That's it.

┌─────────────┐      ┌─────────────┐      ┌─────────────┐      ┌─────────────┐
│   Ticket    │ ───▶ │   Pilot     │ ───▶ │   Review    │ ───▶ │   Ship      │
│  (GitHub)   │      │  (AI dev)   │      │   (You)     │      │  (Merge)    │
└─────────────┘      └─────────────┘      └─────────────┘      └─────────────┘

Install

Homebrew (recommended)

brew tap qf-studio/pilot
brew install pilot

Go Install

go install github.com/qf-studio/pilot/cmd/pilot@latest

From Source

git clone https://github.com/qf-studio/pilot
cd pilot
make build
sudo make install-global

Desktop App

Download the desktop app from the latest release:

Platform	Download
macOS (Universal)	`Pilot-Desktop-macOS-universal.dmg`
Windows (x64)	`Pilot-Desktop-Windows-amd64-setup.exe`
Linux (x64)	`Pilot-Desktop-Linux-amd64.tar.gz`

Requirements

Go 1.22+ (build only)
Claude Code CLI 2.1.17+
OpenAI API key (optional, for voice transcription)

Quick Start

# 1. Initialize config
pilot init

# 2. Start Pilot
pilot start --github              # GitHub issue polling
pilot start --telegram            # Telegram bot
pilot start --telegram --github   # Both

# 3. Create a GitHub issue with 'pilot' label, or message your Telegram bot

That's it. Go grab coffee. ☕

How It Works

You label issue "pilot"
        │
        ▼
┌───────────────────┐
│  Pilot claims it  │  ← Adds "pilot/in-progress" label
└───────┬───────────┘
        │
        ▼
┌───────────────────┐
│  Creates branch   │  ← pilot/GH-{number}
└───────┬───────────┘
        │
        ▼
┌───────────────────┐
│  Plans approach   │  ← Analyzes codebase, designs solution
└───────┬───────────┘
        │
        ▼
┌───────────────────┐
│  Implements       │  ← Writes code with Claude Code
└───────┬───────────┘
        │
        ▼
┌───────────────────┐
│  Quality gates    │  ← Test, lint, build validation
└───────┬───────────┘
        │
        ▼
┌───────────────────┐
│  Opens PR         │  ← Links to issue, adds "pilot/done"
└───────┬───────────┘
        │
        ▼
    You review
        │
        ▼
      Merge 🚀

Features

133 features implemented across execution, intelligence, integrations, and infrastructure.

Core Execution

Feature	Description
Autopilot	CI monitoring, auto-merge, feedback loop (dev/stage/prod modes)
Epic Decomposition	Complex tasks auto-split into sequential subtasks via Haiku API
Self-Review	Auto code review before PR push catches issues early
Sequential Execution	Wait for PR merge before next issue (prevents conflicts)
Quality Gates	Test/lint/build validation with auto-retry
Execution Replay	Record, playback, analyze, export (HTML/JSON/MD)

Intelligence

Feature	Description
Model Routing	Haiku (trivial) → Opus 4.6 (standard/complex), auto-detected
Effort Routing	Maps task complexity to Claude thinking depth
Research Subagents	Haiku-powered parallel codebase exploration
Navigator Integration	Auto-detected `.agent/`, skipped for trivial tasks
Cross-Project Memory	Shared patterns and context across repositories

Integrations

Feature	Description
Telegram Bot	Chat, research, planning, tasks + voice & images
GitHub Polling	Auto-pick issues with `pilot` label
GitLab / Azure DevOps	Full polling + webhook adapters
Linear/Jira/Asana	Webhooks and task sync
Daily Briefs	Scheduled reports via Slack/Email/Telegram
Alerting	Task failures, cost thresholds, stuck detection

Infrastructure

Feature	Description
Dashboard TUI	Sparkline metrics cards, queue depth, autopilot status
Persistent Metrics	Token/cost/task counts survive restarts via SQLite
Hot Upgrade	Self-update with `pilot upgrade` or `u` key in dashboard
Cost Controls	Budget limits with hard enforcement
Multiple Backends	Claude Code + OpenCode support
BYOK	Bring your own Anthropic key, Bedrock, or Vertex

Autopilot Modes

Control how much autonomy Pilot has:

# Fast iteration - skip CI, auto-merge
pilot start --autopilot=dev --github

# Balanced - wait for CI, then auto-merge
pilot start --autopilot=stage --github

# Safe - wait for CI + human approval
pilot start --autopilot=prod --github

Telegram Integration

Talk to Pilot naturally - it understands different interaction modes:

Mode	Example	What Happens
💬 Chat	"What do you think about using Redis?"	Conversational response, no code changes
🔍 Question	"What files handle authentication?"	Quick read-only answer
🔬 Research	"Research how the caching layer works"	Deep analysis sent to chat
📐 Planning	"Plan how to add rate limiting"	Shows plan with Execute/Cancel buttons
🚀 Task	"Add rate limiting to /api/users"	Confirms, then creates PR

You: "Plan how to add user authentication"
Pilot: 📐 Drafting plan...
Pilot: 📋 Implementation Plan
       1. Create auth middleware...
       2. Add JWT token validation...
       [Execute] [Cancel]

You: [clicks Execute]
Pilot: 🚀 Executing...
Pilot: ✅ PR #142 ready: https://github.com/...

Send voice messages, images, or text. Pilot understands context.

Dashboard

Real-time visibility into what Pilot is doing:

┌─ Pilot Dashboard ─────────────────────────────────────────┐
│                                                           │
│  Status: ● Running    Autopilot: stage    Queue: 3        │
│                                                           │
│  Current Task                                             │
│  ├─ GH-156: Add user authentication                       │
│  ├─ Phase: Implementing (65%)                             │
│  └─ Duration: 2m 34s                                      │
│                                                           │
│  Token Usage          Cost                                │
│  ├─ Input:  124k      Today:    $4.82                     │
│  ├─ Output:  31k      This Week: $28.40                   │
│  └─ Total:  155k      Budget:    $100.00                  │
│                                                           │
│  Recent Tasks                                             │
│  ├─ ✅ GH-155  Fix login redirect      1m 12s   $0.45     │ 
│  ├─ ✅ GH-154  Add dark mode toggle    3m 45s   $1.20     │
│  └─ ✅ GH-153  Update dependencies     0m 34s   $0.15     │
│                                                           │
└───────────────────────────────────────────────────────────┘

pilot start --dashboard --github

Environment Variables

Pilot uses Claude Code for AI execution:

Variable	Description
`ANTHROPIC_API_KEY`	Custom Anthropic API key (uses your own account)
`ANTHROPIC_BASE_URL`	Custom API endpoint (proxies, enterprise)
`CLAUDE_CODE_USE_BEDROCK`	Set to `1` for AWS Bedrock
`CLAUDE_CODE_USE_VERTEX`	Set to `1` for Google Vertex AI

Example: Using AWS Bedrock

export CLAUDE_CODE_USE_BEDROCK=1
export AWS_REGION=us-east-1
pilot start --github

Configuration

Config location: ~/.pilot/config.yaml

version: "1.0"

gateway:
  host: "127.0.0.1"
  port: 9090

adapters:
  telegram:
    enabled: true
    bot_token: "${TELEGRAM_BOT_TOKEN}"
    chat_id: "${TELEGRAM_CHAT_ID}"

  github:
    enabled: true
    token: "${GITHUB_TOKEN}"
    repo: "owner/repo"
    pilot_label: "pilot"
    polling:
      enabled: true
      interval: 30s

orchestrator:
  execution:
    mode: sequential           # "sequential" or "parallel"
    wait_for_merge: true       # Wait for PR merge before next task
    poll_interval: 30s
    pr_timeout: 1h

projects:
  - name: "my-project"
    path: "~/Projects/my-project"
    navigator: true
    default_branch: main

daily_brief:
  enabled: true
  schedule: "0 8 * * *"
  timezone: "Europe/Berlin"

alerts:
  enabled: true
  channels:
    - name: telegram-alerts
      type: telegram
      severities: [critical, error, warning]

executor:
  backend: claude-code          # "claude-code" or "opencode"

CLI Reference

Core Commands

pilot start          # Start with configured inputs
pilot stop           # Stop daemon
pilot status         # Show running tasks
pilot init           # Initialize configuration
pilot version        # Show version info

`pilot start`

pilot start                          # Config-driven
pilot start --telegram               # Enable Telegram polling
pilot start --github                 # Enable GitHub issue polling
pilot start --linear                 # Enable Linear webhooks
pilot start --telegram --github      # Enable both
pilot start --dashboard              # With TUI dashboard
pilot start --no-gateway             # Polling only (no HTTP server)
pilot start --sequential             # Sequential execution mode
pilot start --autopilot=stage        # Autopilot mode (dev/stage/prod)
pilot start -p ~/Projects/myapp      # Specify project
pilot start --replace                # Kill existing instance first

`pilot task`

pilot task "Add user authentication"                    # Run in cwd
pilot task "Fix login bug" -p ~/Projects/myapp          # Specify project
pilot task "Refactor API" --verbose                     # Stream output
pilot task "Update docs" --dry-run                      # Preview only
pilot task "Implement feature" --backend opencode       # Use OpenCode

`pilot upgrade`

pilot upgrade                    # Check and upgrade
pilot upgrade check              # Only check for updates
pilot upgrade rollback           # Restore previous version
pilot upgrade --force            # Skip task completion wait
pilot upgrade --no-restart       # Don't restart after upgrade
pilot upgrade --yes              # Skip confirmation

Analytics Commands

pilot brief                       # Show scheduler status
pilot brief --now                 # Generate and send immediately
pilot brief --weekly              # Generate weekly summary

pilot metrics summary             # Last 7 days overview
pilot metrics summary --days 30   # Last 30 days
pilot metrics daily               # Daily breakdown
pilot metrics projects            # Per-project stats

pilot usage summary               # Billable usage summary
pilot usage daily                 # Daily breakdown
pilot usage export --format json  # Export for billing

pilot patterns list               # List learned patterns
pilot patterns search "auth"      # Search by keyword

Architecture

┌─────────────────────────────────────────────────────────────┐
│                          PILOT                              │
├──────────────┬──────────────────────────────────────────────┤
│ Gateway      │ HTTP/WebSocket server, routing               │
│ Adapters     │ Telegram, Slack, GitHub, Jira, Linear, Asana │
│ Executor     │ Claude Code process management               │
│ Orchestrator │ Task planning, phase management              │
│ Memory       │ SQLite + cross-project knowledge graph       │
│ Briefs       │ Scheduled reports, multi-channel delivery    │
│ Alerts       │ Failure detection, cost monitoring           │
│ Metrics      │ Token usage, execution analytics             │
└──────────────┴──────────────────────────────────────────────┘

Development

make deps        # Install dependencies
make build       # Build binary
make test        # Run tests
make lint        # Run linter
make dev         # Development mode with hot reload

FAQ

Is this safe?

Pilot runs in your environment with your permissions. It can only access repos you configure. All changes go through PR review (unless you enable auto-merge). You stay in control.

How much does it cost?

Pilot is free. You pay for Claude API usage (~$0.50-2.00 per typical task). Set budget limits to control costs.

What tasks can it handle?

Best for: bug fixes, small features, refactoring, tests, docs, dependency updates.

Not ideal for: large architectural changes, security-critical code, tasks requiring human judgment.

Does it learn my codebase?

Yes. Pilot uses Navigator to understand your patterns, conventions, and architecture. Cross-project memory shares learnings across repositories.

License

Use Case	Allowed
Internal use	✅
Self-hosting	✅
Modification & forking	✅
Non-competing products	✅
Competing SaaS	❌ (requires license)

Converts to Apache 2.0 after 4 years.

Contributing

Contributions welcome. Please open an issue first for major changes.

git checkout -b feature/my-feature
make test
# Submit PR

Stop agonizing over tickets. Let Pilot ship them.

⭐ Star on GitHub

_{Built with Claude Code + Navigator}

Release History

Version	Changes	Urgency	Date
v2.243.0	## Changelog ### 🚀 Features * b3216f27ad70ca3fffab3f739da5de28c3cb43b9: feat(github): per-project Projects V2 board config (GH-4472) (#4473) (@alekspetrov) * 5a470b7fb5f3b7bd711d743500e362417c22e256: feat: nightly ledger backup to S3 — script + systemd units + restore SOP (#4466) (@alekspetrov) ### 🐛 Bug Fixes * 8a34490461bf437df9d270a0d5eda5a9867181b9: fix(autopilot): per-project CI-checks override, fixing pointer#108 forever-pending CI (GH-4478) (#4479) (@alekspetrov) * 4e7d9219fb9bc9d9883c9	High	7/20/2026
v2.236.0	## Changelog ### 🚀 Features * 6ad1ac0f5390a65cc071ccd0c3a9d19189b2514f: feat(cmd): fan out SDK poller per repo — default + all projects[] (M7 4d.2b/c) (#4115) (@alekspetrov) * d6ab3c6bb3ce15d4db841d501009363e80f7898c: feat(metrics): hydrate PR-family counters from execution_events at daemon start (@alekspetrov) * 955f83150d71148647cd74704df0c8ba3c990b7f: feat(upgrade): cross-process drain signal + poll handshake (GH-4106) (@alekspetrov) ### 🐛 Bug Fixes * 0387c0714bb3dde4297511bbbc328814b7a56ee	High	7/8/2026
v2.201.0	## Changelog ### 🚀 Features * d503f3a8f67719e1f176397d6bd4d43698612c33: feat(api): add /ping health check endpoint (@alekspetrov)	High	6/26/2026
v2.188.0	## Changelog ### 🚀 Features * 3c61ab596936185afd11f3b39671c481844b95e7: feat(cli): interactive pilot project add wizard with gh CLI integration (GH-3017) (#3612) (@alekspetrov)	High	6/18/2026
v2.182.0	## Changelog ### 🚀 Features * 3098d553694684864f415da3eb2ed8ba59dd6a96: feat(dashboard): scope TUI dashboard to a single project when -p is set (TASK-284, GH-3513) (@alekspetrov)	High	6/10/2026
v2.166.12	## Changelog ### 📚 Documentation * bf02df534a4aa53677fc35276781127ac0e8e9e4: docs(nav): close + archive TASK-358 (dashboard failed-count classification, v2.166.10–11); +2 memories; refresh Current State (@alekspetrov) ### Other * 67dbfbd92e9ece66dc6430b5c7e234c081b1f8b0: perf(docs): cache + compression + security headers (@alekspetrov)	High	6/3/2026
v2.155.2	## Changelog ### 🐛 Bug Fixes * 5384ddc21705e06e13063004c2e274a84d1a0454: fix(executor): ghost-SHA guard + IsTaskShipped hardening (TASK-300) (@alekspetrov)	High	5/26/2026
v2.146.7	## P0 bugfix: `syncMainBranch` no longer silently destroys local commits Replaces destructive `git reset --hard origin/main` with `git merge --ff-only origin/<currentBranch>` in the post-task main-branch sync. ### What was broken `syncMainBranch()` ran `git reset --hard origin/main` after every task. When GitHub push-propagation lagged behind the immediately-following fetch+reset (a race in the remote, not push failure), Pilot's just-committed work was silently rewound and only reflog-rec	High	5/20/2026
v2.146.6	## Changelog ### 🐛 Bug Fixes * a107194bdac0b7b6e56c73e92f8f5e8454353983: fix(observability): repair Grafana mount shadow + Approval Persist Misses panel coloring (@alekspetrov)	High	5/11/2026
v2.145.0	## Changelog ### 🚀 Features * 856d12ce0b0d58b72488df802e531e4f538de13b: feat(deploy): add token/cost/executions panels to Pilot Grafana dashboard (@alekspetrov)	High	5/8/2026
v2.107.0	## Changelog ### 🚀 Features * 2eb2a15e6870218c66b3dcad7b94e7baf0659e34: feat(github,memory): add SearchPRsForIssue and InvalidateCompletion primitives (@alekspetrov)	High	5/2/2026
v2.101.0	## Changelog ### 🚀 Features * 216b9f2bc8ad3455c5e3ed666b1e46986bfa7de2: feat(memory): wire usage_events table from dispatcher (@alekspetrov)	High	4/27/2026
v2.99.3	## Changelog ### 🐛 Bug Fixes * b992a653206475751b2b9ac14d1eb699063d26fa: fix(orchestrator): preserve priority from non-Linear adapters in processTask (#2387) (@alekspetrov) ### 📚 Documentation * 0d1f5638c29288313351855d953164dac5690365: docs(guides): add custom model providers guide (#2381) (@alekspetrov) * 51361214a34d3c42dd77103c2efbb14e31856ae6: docs(nav): bring .agent/DEVELOPMENT-README.md current to v2.99.1 (@alekspetrov)	High	4/23/2026
v2.99.1	## Changelog ### 🐛 Bug Fixes * b4f1ad57079989eaf5dccaa3ddf2472b9859c07f: fix(executor): self-review --resume fallback + sanitize marker filename path separators (@alekspetrov) ### ♻️ Refactoring * 0b4a6c828ae1df5fa244fa1ef0b0e8eb09d0e689: refactor(autopilot): remove dead prod-X.Y.Z tag auto-push (@alekspetrov)	High	4/18/2026
v2.99.0	## Changelog ### 🚀 Features * e5b96bedc63434b7f17218d8c3bdcbf4e556798c: feat(executor): inject api_base_url/default_model/api_auth_token into Claude Code subprocess env (GH-2287 follow-up) (@alekspetrov)	High	4/18/2026
v2.98.0	## Changelog ### 🚀 Features * 02d258a566c381165294629580284ddb62b1c82f: feat(release): auto-push prod-X.Y.Z tag alongside vX.Y.Z to trigger docs deploy (@alekspetrov)	High	4/18/2026
v2.97.0	## Changelog ### 🚀 Features * 303a7b31afd3ae51e97cc1ee96e2d42123fd3bc1: feat(executor): honor project.default_branch for base branch and PR target (GitLab and GitHub) (@alekspetrov)	High	4/18/2026
v2.96.0	## Changelog ### 🚀 Features * 3207559850a2c365bfc727642c7ec727b9806906: feat(autopilot): after 2nd conventional-commit rejection, post suggested rewrite + stop retrying (@alekspetrov) ### 🐛 Bug Fixes * 33d1cb9eb841173b5b59853ddcfb0b3f7e46df4f: fix: migrate alekspetrov/anthropics refs to qf-studio (GH-2175) (@alekspetrov)	High	4/18/2026
v2.95.13	## Changelog ### 🐛 Bug Fixes * 00a965e87ccc453607b127f35f56a9c59ca754b2: fix(doctor): warn when projects configured but no issue adapter enabled; fail start --github loudly (@alekspetrov)	High	4/18/2026
v2.95.12	## Changelog ### Other * af7306ad7fd864888edd31e4794b2abb634d9deb: fix stress test timeout + re-land GH-2341 poller fix (@alekspetrov)	High	4/18/2026
v2.95.11	## Changelog ### 🐛 Bug Fixes * 3b9bb9296dabee9194a1ecc1b4cd227506fa9213: fix(dashboard): queue view includes closed issues with stale pilot-in-progress label (@alekspetrov)	High	4/17/2026
v2.95.10	## Changelog ### 🐛 Bug Fixes * 64ad86c6a49dfb644ddea866b26290ca71c6946f: fix(autopilot): ScanExistingPRs clobbers RestoreState on startup (@alekspetrov)	High	4/17/2026
v2.95.9	## Changelog ### 🐛 Bug Fixes * 823a5169ead7235b17586b20c95296448d3f5aa9: fix(autopilot): idempotent merge-completion notification (dedup on re-entry) (@alekspetrov) * 30d5fb879247ab134c5dd113a274fafdb1c4beb0: fix(autopilot): skip pilot-retry-ready when pilot-done already on issue (@alekspetrov)	High	4/17/2026
v2.95.8	## Changelog ### 🐛 Bug Fixes * f6f6f963f4c659cf12253b0cdadb5a4fc8290c66: fix(dispatcher): add hasLiveWorker guard to running-task reaper (mirror GH-2331) (@alekspetrov) * 745bac5842e4e7e5b22b8f4f4d3e515c9d2aa241: fix(epic): validate sub-issue titles — reject LLM analysis-style titles in decomposition (@alekspetrov) * 5a02f79727a7ee793e692403c9db6d17e269494d: fix(executor): Claude Code subprocess OOM-killed on large Navigator context tasks (GH-2324) (@alekspetrov) * b6054b9698ee3b995a77e33463430	High	4/17/2026
v2.95.7	## Changelog ### 🐛 Bug Fixes * aa62b02caf3478d13dcd8d40c503678883137db0: fix(executor): signal executor mode to Claude + classify no-diff (GH-2328) (@alekspetrov)	High	4/17/2026
v2.95.6	## Changelog ### 🐛 Bug Fixes * bbc33cb662e1fd10829e151ba056718c2b968cf4: fix(dashboard): gate stdout prints on dashboard mode (GH-2333) (@alekspetrov) ### Other * e5df708dea9e8dcdb44ec4e7508a2ad9bbe0ac76: Fix dashboard startup log and polling-mode warning (`cmd/pilot/main.go`) (@alekspetrov)	High	4/17/2026
v2.95.5	## Changelog ### 🐛 Bug Fixes * 84c4ccb14eea3861233df5fa8d16e596bb7424c0: fix(dispatcher): don't reap queued tasks when live worker exists (GH-2331) (#2331) (@alekspetrov) * 96ff63b24a88ccba2fc1820489cb7288c94ecb1c: fix(executor): persist backend stderr + final message on failure (GH-2328) (#2330) (@alekspetrov) ### 📚 Documentation * 7180185746652173871e0f2ecf6532b94850d690: docs(claude): scope Navigator-only rules to interactive sessions (#2327) (@alekspetrov)	High	4/17/2026
v2.95.4	## Changelog ### 🐛 Bug Fixes * 13e815368a5ebcc06114243d6b3f6e3aa9b90432: fix(dashboard): adapter poller tasks invisible in gateway mode with --dashboard (GH-2291) (#2292) (@efesler) * 811e3091e77adbb30636895e540115e65b0eb543: fix(dispatcher): persist Task.Labels across queue round-trip (GH-2326) (#2326) (@alekspetrov) * 89df7fc75f6ecc1af40fb92ea91d0297ed6492ef: fix(gitlab): use string type for MergeRequest.detailed_merge_status field (#2295) (@efesler) * e12cf14940750bb7094fd05cfe280b15d7ea2c7a	High	4/17/2026
v2.95.3	## Changelog ### Other * 70c14dc580adc3e1242facb3dace5b63608d4b15: Dispatcher `recoverStaleTasks()` (line 188) already marks orphans as `"failed"`, not `"completed"`. The status appears correct in the current code. (@alekspetrov)	High	4/15/2026
v2.95.2	## Changelog ### 🐛 Bug Fixes * fd1db6e16e83b6869224ab08da589b9dd1c6bfde: GH-2312: fix(autopilot): strip GH-XXXX prefix from squash commit title for conventional commit detection (@alekspetrov)	High	4/15/2026
v2.95.1	## Changelog ### Other * f1a956de2bb4c826466f1123cc30a2a5bce41caf: GH-2304: Add GitHub config validation to `pilot doctor` (`internal/health/health.go` +... (@alekspetrov) * 97b022b71a32b7f520f6548aa725d5042b952c9a: GH-2306: Add GitHub adapter config validation to `pilot doctor` (@alekspetrov) * aa7629f225f38c4c4bfbf0ebc51dae81b33025fc: GH-2307: Fix dashboard startup log and add zero-poller warning (@alekspetrov)	High	4/14/2026
v2.94.4	## Changelog ### 🐛 Bug Fixes * d7d19ea70c5c9c82ae37ea429a534d2153d90c7f: GH-2301: fix(poller): clear stale pilot-in-progress on startup to avoid restart requirement (@alekspetrov)	Medium	4/14/2026
v2.94.3	## Changelog ### 🐛 Bug Fixes * d5c6fb3c0369ba148eb0467417dc6640dc5e813e: GH-2297: fix(autopilot): post success comment on merge close — stale failure comments (@alekspetrov)	Medium	4/14/2026
v2.95.0	## Changelog ### 🚀 Features * 934f3c196f678744b2a9cfe141eecfc46be74175: GH-2298: feat(github): add issue templates for bug reports, features, and pilot tasks (@alekspetrov)	Medium	4/14/2026
v2.94.2	## Changelog ### 🐛 Bug Fixes * 0e9db39650a35787a0c9efea692fa3218ce08025: GH-2286: fix: all bare NewRunner() callsites ignore executor config (@alekspetrov)	Medium	4/14/2026
v2.94.1	## Changelog ### 🚀 Features * 555ffb2a44ecb676e0b4fa35a2040b4bd0edc42f: GH-2287: feat(executor): non-Anthropic model provider support — default_model + api_base_url (TASK-24) (@alekspetrov)	Medium	4/14/2026
v2.94.0	## Changelog ### 🚀 Features * 2f5620cc1a539e98e0a261bdfc68249b4834612a: feat(executor): non-Anthropic model provider support — default_model + api_base_url (GH-2287) (@alekspetrov) ### 🐛 Bug Fixes * 0c19431db94ec61e423dc9fbfcbd4cafe02c7acd: fix(autopilot): update execution status to completed when PR is merged externally (GH-2279) (#2281) (@alekspetrov) * 63e6235d8856b9d1c54f5aa083e46e8e3d635af6: fix(executor): complete non-Anthropic provider support — add missing classifier methods (@alekspet	High	4/12/2026
v2.93.2	## Changelog ### 🚀 Features * 85d812245b93ea2a7c993f8f6b64242e6fd0929e: feat(github): auto-retry issues with pilot-retry-ready label (GH-2276) (#2277) (@alekspetrov) ### 🐛 Bug Fixes * 12e7b3a3a6c512a4ce4e1ea0730beddec0c80b01: GH-2280: fix(dashboard): sparkline graphs wiped after 5s DB refresh (@alekspetrov)	High	4/10/2026
v2.93.1	## Changelog ### 🐛 Bug Fixes * b33ea39d2f79fa1a17f987a5363c386a45d4080c: GH-2269: fix(bench/aws): stale task manifest — always regenerate before run (@alekspetrov)	Medium	4/10/2026
v2.93.0	## Changelog ### 🐛 Bug Fixes * c46d4e9c2019396f4566478037028e25eb97e38f: GH-2267: fix(bench/aws): orchestrator hangs forever, no signal handling, crash loses results (@alekspetrov)	Medium	4/10/2026
v2.92.1	## Changelog ### 🐛 Bug Fixes * 4896b8ff7f2b33e5ead2d122533901f340fe7c20: GH-2260: fix(install): replace fragile grep+sed version parsing with jq (@alekspetrov) * dfdffa7d65122265d0b5140405d65eafeebce290: fix(autopilot): scan for externally-merged PRs periodically, not just on startup (GH-2251) (#2253) (@alekspetrov) * de6df8681bffd7ffcb61036adad08a08c93dee27: fix(poller): skip retry of closed issues with stale pilot-failed label (GH-2252) (#2255) (@alekspetrov) ### Other * a7bfefa1864aef8b58cab	Medium	4/10/2026
v2.92.0	## Changelog ### 🐛 Bug Fixes * bc90a923dbb66b7d20fea34fe2f4c797101218a5: GH-2256: fix(executor): add dryRun guard to Runner gh CLI methods + idempotency check (@alekspetrov)	Medium	4/10/2026
v2.91.0	## Changelog ### 🚀 Features * 7cc86fea9cd1806e9bf29535e0d0826cf2483455: feat(bench): add self-verification harness to PilotAgent (#2238) (#2244) (@alekspetrov) ### 🐛 Bug Fixes * 5a7c70bf376669566634f68c6aa91994420f66a5: fix(autopilot): use PR source repo for release operations on cross-repo PRs (GH-2243) (#2249) (@alekspetrov) * 3cc191dfa658dfb058b8a6af135374e4030e18f0: fix(dashboard): refresh history and metrics from DB every 5s (GH-2248) (#2250) (@alekspetrov) * 7d73d29ee2f79e41df9752a599fd2	Medium	4/10/2026
v2.90.8	## Changelog ### Other * ab545453d12c044f461a7145d6a76fc03fd2664c: GH-2237: bench: remove oracle test access from PilotAgent (@alekspetrov)	Medium	4/10/2026
v2.90.7	## Changelog ### 🐛 Bug Fixes * 9ff6e82ba955f7bbdc86a8ba58da2ad3fc17dc95: fix(bench): remove oracle test access from PilotAgent (#2239) (@alekspetrov) * 29585eba8988f46efa5c50725f563f060fede588: fix: update broken Discord invite link (#2231) (@alekspetrov) ### Other * d0264719fa778d130dd8538c8b29937fbf452b73: GH-2211: Wire native sub-issue linking into epic decomposition (`internal/executor/`) (@alekspetrov)	Medium	4/10/2026
v2.90.6	## Changelog ### 🐛 Bug Fixes * 6d92123fa1adf79e49898b6130c211b4eca8b3e7: GH-2226: fix(executor): resolve stale threshold defaults treating 0 as unset (@alekspetrov) ### Other * cf7c513b9c41b561531a58fee28072ef2360606c: GH-2221: Add `GetStaleQueuedExecutions` to memory store (`internal/memory/`) (@alekspetrov)	High	4/7/2026
v2.90.5	## Changelog ### 🐛 Bug Fixes * 71d90ba0a2cb51bc16b61f43e074db2c592db9a6: GH-2176: fix(poller): auto-retry issues stuck with pilot-failed from execution failures (@alekspetrov)	Medium	4/7/2026
v2.90.4	## Changelog ### Other * af8dce3b27c6db6833de3980add4e1c6530c79a1: GH-2212: Replace search-based parent closing with native GraphQL + fallback (`internal... (@alekspetrov)	Medium	4/7/2026
v2.90.3	## Changelog ### Other * 589ef957a916ed0c84e5ff66486bedd922a95711: GH-2212: Replace search-based parent closing with native GraphQL + fallback (`internal/autopilot/`) (@alekspetrov)	Medium	4/7/2026
v2.90.2	## Changelog ### 🐛 Bug Fixes * 875bfe50ac12f029fe6b672cd84c8b7f764f98bc: GH-2206: fix(docs): navbar stuck at v2.87.2 — docs-version-sync workflow trigger broken (GoReleaser GITHUB_TOKEN limitation) (@alekspetrov)	Medium	4/6/2026
v2.90.1	## Changelog ### 🐛 Bug Fixes * e54908d84150a8ca20ccbb4c9b00fb66912036e9: GH-2204: fix(alerts): task_stuck floods Slack — progress never emitted, per-rule cooldown rotates, no orphan cleanup (@alekspetrov)	Medium	4/6/2026
v2.90.0	## Changelog ### Other * 80465794faf1b00efa59fae1642d43ff6fa318d2: GH-2201: Add retry grace period and TaskChecker to GitHub poller (@alekspetrov)	Medium	4/5/2026
v2.89.4	## Changelog ### 🐛 Bug Fixes * a4592a7063a7fbb98bc0033872ddffbabafc686d: GH-2198: fix(autopilot): call maybeCloseParentIssue in notifyExternalClose path (@alekspetrov)	Medium	4/4/2026
v2.89.3	## Changelog ### 🐛 Bug Fixes * 454a21e19472896e9f47b513aab501226101ec94: GH-2196: fix: update remaining alekspetrov/pilot references to qf-studio in non-Go files (@alekspetrov) * b1e31ed1b522e83e36b9262ca3bf37791c79c412: fix(brew): migrate tap from alekspetrov to qf-studio/homebrew-pilot (@alekspetrov)	Medium	4/4/2026
v2.89.2	## Changelog ### 🚀 Features * 090c5db2842e8ba77ab035298126dcc2a08c959d: GH-2188: feat(release): add GoReleaser brews section for Homebrew auto-publish (@alekspetrov) ### 🐛 Bug Fixes * ec66ba597b8127a2746f4c08967ed756c08916ea: GH-2192: fix: update llms.txt GitHub URLs from alekspetrov to qf-studio (@alekspetrov)	Medium	4/4/2026
v2.89.1	## Changelog ### 🐛 Bug Fixes * c2f55a81ebc0e379f010c01bbb9b98fafb3f6406: GH-2189: fix(docs): correct brew tap and install script repo references (@alekspetrov) ### 📚 Documentation * 47a5a09b7cab1fd732d9231feb4c922e57282861: GH-2183: docs(epic): add merge-wait section to epic-decomposition page (@alekspetrov) ### 🔧 Maintenance * e992858b3afd32b7b4733a3e51d6d15fb275731c: GH-2185: chore: update CLAUDE.md version to v2.89.0 (@alekspetrov)	Medium	4/4/2026
v2.89.0	## Feature - feat(epic): Wire merge-wait into production and test paths (GH-2179). `MergeWaiter` closure in `main.go` for both multi-repo and gateway modes, gated on `execution.wait_for_merge` config. Test harness wired with immediate-success callback. Parity test updated (18 checks). ## Changelog ### 🚀 Features * 73e1584a: Wire SubIssueMergeWaitFn in main.go and harness.go (GH-2179) (@alekspetrov)	Medium	4/3/2026
v2.88.0	## Feature - feat(epic): Add merge-wait between sequential sub-issues (GH-2177). After each sub-issue's PR is created, Pilot waits for it to merge before starting the next one. Prevents conflicts when sub-issues touch overlapping files. Includes `SubIssueMergeWaitFn` callback injection, `syncMainBranch` after each merge, graceful nil degradation, and 4 test cases. ## Changelog ### 🚀 Features * fa0e7727: Add SubIssueMergeWaitFn to Runner and wire merge-wait logic in ExecuteSubIssues (GH-21	Medium	4/3/2026
v2.87.7	## Changelog ### 🚀 Features * c94742e2741d9ba29fafd8301bc644eb1bb089c2: feat(epic): enable worktree isolation for sub-issues (GH-2178) (@alekspetrov)	Medium	4/3/2026
v2.87.6	## Bug Fix - fix(git): Pass `--head` flag to `gh pr create` to bypass dirty worktree check. Worktree-based execution could fail PR creation when the main checkout had uncommitted changes. ## Changelog ### 🐛 Bug Fixes * 178a8bec: fix(git): pass --head to gh pr create to bypass dirty worktree check (GH-2177) (@alekspetrov)	Medium	4/3/2026
v2.87.5	## Bug Fix - fix(epic): Sub-issues now branch from the real repository path instead of inheriting the parent's worktree path. Previously, `SwitchToDefaultBranchAndPull` failed inside worktrees → empty branch → push fails with "src refspec does not match". `ExecuteSubIssues` now takes explicit `repoPath` param. ## Changelog ### 🐛 Bug Fixes * 715529a2: fix(epic): sub-issues branch from real repo, not parent worktree (GH-2177) (@alekspetrov)	Medium	4/3/2026
v2.87.4	## Changelog ### 🐛 Bug Fixes * a1778eed0a64bc519fafd39eda542c4b6ea0503a: fix(poller): don't mark failed issues as processed (GH-2176) (@alekspetrov)	Medium	4/3/2026
v2.87.3	## Changelog	Medium	4/3/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

AgentLintLint your repo for AI agent compatibility.v1.1.13

claude-code-tipsProvide ready-to-use plugins, hooks, and commands to enhance Claude Code sessions with data mining, automation, and integration tools.main@2026-07-19

structured-prompt-skill✍️ Write effective AI prompts with this structured prompt engineering library and Claude Code skill, featuring 300+ curated examples for high-quality results.main@2026-07-19

story-flow🤖🧠 Agentic development workflow for AI–HI (Human Intelligence) collaborationmain@2026-07-16

claude-ruby-grape-railsClaude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq developmentv1.16.15

More in Testing

multi-agent-ralph-loopAutonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), parallel-first Agent Teams (6 teammates), Aristotle First Principles methodology,

trulensEvaluation and Tracking for LLM Experiments and AI Agents

ObservalObserval is an AI agent registry with first in class observabilty and eval framework

phoenixAI Observability & Evaluation