freshcrate
Skin:/
Home > Testing > pilot

pilot

#1 Terminal Benchmark 2.0 โ€” AI that ships your tickets.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

#1 Terminal Benchmark 2.0 โ€” AI that ships your tickets.

README

   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•— โ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•—      โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•— โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•—
   โ–ˆโ–ˆโ•”โ•โ•โ–ˆโ–ˆโ•—โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘     โ–ˆโ–ˆโ•”โ•โ•โ•โ–ˆโ–ˆโ•—โ•šโ•โ•โ–ˆโ–ˆโ•”โ•โ•โ•
   โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘     โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘
   โ–ˆโ–ˆโ•”โ•โ•โ•โ• โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ•‘     โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘   โ–ˆโ–ˆโ•‘
   โ–ˆโ–ˆโ•‘     โ–ˆโ–ˆโ•‘โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•—โ•šโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ•”โ•   โ–ˆโ–ˆโ•‘
   โ•šโ•โ•     โ•šโ•โ•โ•šโ•โ•โ•โ•โ•โ•โ• โ•šโ•โ•โ•โ•โ•โ•    โ•šโ•โ•
  

AI that ships your tickets while you sleep

ReleaseLicense: BSL 1.1 Go Report CardDiscord Docs โ€ข Install โ€ข Desktop โ€ข Quick Start โ€ข How It Works โ€ข Features โ€ข CLI โ€ข Discord โ€ข Deploy



The Problem

You have 47 tickets in your backlog. You agonize over which to prioritize. Half are "quick fixes" that somehow take 2 hours each. Your PM asks for status updates. Sound familiar?

The Solution

Pilot picks up tickets from GitHub, Linear, Jira, or Asanaโ€”plans the implementation, writes the code, runs tests, and opens a PR. You review and merge. That's it.

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”      โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”      โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”      โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   Ticket    โ”‚ โ”€โ”€โ”€โ–ถ โ”‚   Pilot     โ”‚ โ”€โ”€โ”€โ–ถ โ”‚   Review    โ”‚ โ”€โ”€โ”€โ–ถ โ”‚   Ship      โ”‚
โ”‚  (GitHub)   โ”‚      โ”‚  (AI dev)   โ”‚      โ”‚   (You)     โ”‚      โ”‚  (Merge)    โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜      โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜      โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜      โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
pilot-preview-screenshot

Install

Homebrew (recommended)

brew tap qf-studio/pilot
brew install pilot

Go Install

go install github.com/qf-studio/pilot/cmd/pilot@latest

From Source

git clone https://github.com/qf-studio/pilot
cd pilot
make build
sudo make install-global

Desktop App

Download the desktop app from the latest release:

Platform Download
macOS (Universal) Pilot-Desktop-macOS-universal.dmg
Windows (x64) Pilot-Desktop-Windows-amd64-setup.exe
Linux (x64) Pilot-Desktop-Linux-amd64.tar.gz

Requirements

  • Go 1.22+ (build only)
  • Claude Code CLI 2.1.17+
  • OpenAI API key (optional, for voice transcription)

Quick Start

# 1. Initialize config
pilot init

# 2. Start Pilot
pilot start --github              # GitHub issue polling
pilot start --telegram            # Telegram bot
pilot start --telegram --github   # Both

# 3. Create a GitHub issue with 'pilot' label, or message your Telegram bot

That's it. Go grab coffee. โ˜•

How It Works

You label issue "pilot"
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Pilot claims it  โ”‚  โ† Adds "pilot/in-progress" label
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Creates branch   โ”‚  โ† pilot/GH-{number}
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Plans approach   โ”‚  โ† Analyzes codebase, designs solution
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Implements       โ”‚  โ† Writes code with Claude Code
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Quality gates    โ”‚  โ† Test, lint, build validation
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Opens PR         โ”‚  โ† Links to issue, adds "pilot/done"
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
        โ”‚
        โ–ผ
    You review
        โ”‚
        โ–ผ
      Merge ๐Ÿš€

Features

133 features implemented across execution, intelligence, integrations, and infrastructure.

Core Execution

Feature Description
Autopilot CI monitoring, auto-merge, feedback loop (dev/stage/prod modes)
Epic Decomposition Complex tasks auto-split into sequential subtasks via Haiku API
Self-Review Auto code review before PR push catches issues early
Sequential Execution Wait for PR merge before next issue (prevents conflicts)
Quality Gates Test/lint/build validation with auto-retry
Execution Replay Record, playback, analyze, export (HTML/JSON/MD)

Intelligence

Feature Description
Model Routing Haiku (trivial) โ†’ Opus 4.6 (standard/complex), auto-detected
Effort Routing Maps task complexity to Claude thinking depth
Research Subagents Haiku-powered parallel codebase exploration
Navigator Integration Auto-detected .agent/, skipped for trivial tasks
Cross-Project Memory Shared patterns and context across repositories

Integrations

Feature Description
Telegram Bot Chat, research, planning, tasks + voice & images
GitHub Polling Auto-pick issues with pilot label
GitLab / Azure DevOps Full polling + webhook adapters
Linear/Jira/Asana Webhooks and task sync
Daily Briefs Scheduled reports via Slack/Email/Telegram
Alerting Task failures, cost thresholds, stuck detection

Infrastructure

Feature Description
Dashboard TUI Sparkline metrics cards, queue depth, autopilot status
Persistent Metrics Token/cost/task counts survive restarts via SQLite
Hot Upgrade Self-update with pilot upgrade or u key in dashboard
Cost Controls Budget limits with hard enforcement
Multiple Backends Claude Code + OpenCode support
BYOK Bring your own Anthropic key, Bedrock, or Vertex

Autopilot Modes

Control how much autonomy Pilot has:

# Fast iteration - skip CI, auto-merge
pilot start --autopilot=dev --github

# Balanced - wait for CI, then auto-merge
pilot start --autopilot=stage --github

# Safe - wait for CI + human approval
pilot start --autopilot=prod --github

Telegram Integration

Talk to Pilot naturally - it understands different interaction modes:

Mode Example What Happens
๐Ÿ’ฌ Chat "What do you think about using Redis?" Conversational response, no code changes
๐Ÿ” Question "What files handle authentication?" Quick read-only answer
๐Ÿ”ฌ Research "Research how the caching layer works" Deep analysis sent to chat
๐Ÿ“ Planning "Plan how to add rate limiting" Shows plan with Execute/Cancel buttons
๐Ÿš€ Task "Add rate limiting to /api/users" Confirms, then creates PR
You: "Plan how to add user authentication"
Pilot: ๐Ÿ“ Drafting plan...
Pilot: ๐Ÿ“‹ Implementation Plan
       1. Create auth middleware...
       2. Add JWT token validation...
       [Execute] [Cancel]

You: [clicks Execute]
Pilot: ๐Ÿš€ Executing...
Pilot: โœ… PR #142 ready: https://github.com/...

Send voice messages, images, or text. Pilot understands context.

Dashboard

Real-time visibility into what Pilot is doing:

โ”Œโ”€ Pilot Dashboard โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚                                                           โ”‚
โ”‚  Status: โ— Running    Autopilot: stage    Queue: 3        โ”‚
โ”‚                                                           โ”‚
โ”‚  Current Task                                             โ”‚
โ”‚  โ”œโ”€ GH-156: Add user authentication                       โ”‚
โ”‚  โ”œโ”€ Phase: Implementing (65%)                             โ”‚
โ”‚  โ””โ”€ Duration: 2m 34s                                      โ”‚
โ”‚                                                           โ”‚
โ”‚  Token Usage          Cost                                โ”‚
โ”‚  โ”œโ”€ Input:  124k      Today:    $4.82                     โ”‚
โ”‚  โ”œโ”€ Output:  31k      This Week: $28.40                   โ”‚
โ”‚  โ””โ”€ Total:  155k      Budget:    $100.00                  โ”‚
โ”‚                                                           โ”‚
โ”‚  Recent Tasks                                             โ”‚
โ”‚  โ”œโ”€ โœ… GH-155  Fix login redirect      1m 12s   $0.45     โ”‚ 
โ”‚  โ”œโ”€ โœ… GH-154  Add dark mode toggle    3m 45s   $1.20     โ”‚
โ”‚  โ””โ”€ โœ… GH-153  Update dependencies     0m 34s   $0.15     โ”‚
โ”‚                                                           โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
pilot start --dashboard --github

Environment Variables

Pilot uses Claude Code for AI execution:

Variable Description
ANTHROPIC_API_KEY Custom Anthropic API key (uses your own account)
ANTHROPIC_BASE_URL Custom API endpoint (proxies, enterprise)
CLAUDE_CODE_USE_BEDROCK Set to 1 for AWS Bedrock
CLAUDE_CODE_USE_VERTEX Set to 1 for Google Vertex AI

Example: Using AWS Bedrock

export CLAUDE_CODE_USE_BEDROCK=1
export AWS_REGION=us-east-1
pilot start --github

Configuration

Config location: ~/.pilot/config.yaml

version: "1.0"

gateway:
  host: "127.0.0.1"
  port: 9090

adapters:
  telegram:
    enabled: true
    bot_token: "${TELEGRAM_BOT_TOKEN}"
    chat_id: "${TELEGRAM_CHAT_ID}"

  github:
    enabled: true
    token: "${GITHUB_TOKEN}"
    repo: "owner/repo"
    pilot_label: "pilot"
    polling:
      enabled: true
      interval: 30s

orchestrator:
  execution:
    mode: sequential           # "sequential" or "parallel"
    wait_for_merge: true       # Wait for PR merge before next task
    poll_interval: 30s
    pr_timeout: 1h

projects:
  - name: "my-project"
    path: "~/Projects/my-project"
    navigator: true
    default_branch: main

daily_brief:
  enabled: true
  schedule: "0 8 * * *"
  timezone: "Europe/Berlin"

alerts:
  enabled: true
  channels:
    - name: telegram-alerts
      type: telegram
      severities: [critical, error, warning]

executor:
  backend: claude-code          # "claude-code" or "opencode"

CLI Reference

Core Commands

pilot start          # Start with configured inputs
pilot stop           # Stop daemon
pilot status         # Show running tasks
pilot init           # Initialize configuration
pilot version        # Show version info

pilot start

pilot start                          # Config-driven
pilot start --telegram               # Enable Telegram polling
pilot start --github                 # Enable GitHub issue polling
pilot start --linear                 # Enable Linear webhooks
pilot start --telegram --github      # Enable both
pilot start --dashboard              # With TUI dashboard
pilot start --no-gateway             # Polling only (no HTTP server)
pilot start --sequential             # Sequential execution mode
pilot start --autopilot=stage        # Autopilot mode (dev/stage/prod)
pilot start -p ~/Projects/myapp      # Specify project
pilot start --replace                # Kill existing instance first

pilot task

pilot task "Add user authentication"                    # Run in cwd
pilot task "Fix login bug" -p ~/Projects/myapp          # Specify project
pilot task "Refactor API" --verbose                     # Stream output
pilot task "Update docs" --dry-run                      # Preview only
pilot task "Implement feature" --backend opencode       # Use OpenCode

pilot upgrade

pilot upgrade                    # Check and upgrade
pilot upgrade check              # Only check for updates
pilot upgrade rollback           # Restore previous version
pilot upgrade --force            # Skip task completion wait
pilot upgrade --no-restart       # Don't restart after upgrade
pilot upgrade --yes              # Skip confirmation

Analytics Commands

pilot brief                       # Show scheduler status
pilot brief --now                 # Generate and send immediately
pilot brief --weekly              # Generate weekly summary

pilot metrics summary             # Last 7 days overview
pilot metrics summary --days 30   # Last 30 days
pilot metrics daily               # Daily breakdown
pilot metrics projects            # Per-project stats

pilot usage summary               # Billable usage summary
pilot usage daily                 # Daily breakdown
pilot usage export --format json  # Export for billing

pilot patterns list               # List learned patterns
pilot patterns search "auth"      # Search by keyword

Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚                          PILOT                              โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ Gateway      โ”‚ HTTP/WebSocket server, routing               โ”‚
โ”‚ Adapters     โ”‚ Telegram, Slack, GitHub, Jira, Linear, Asana โ”‚
โ”‚ Executor     โ”‚ Claude Code process management               โ”‚
โ”‚ Orchestrator โ”‚ Task planning, phase management              โ”‚
โ”‚ Memory       โ”‚ SQLite + cross-project knowledge graph       โ”‚
โ”‚ Briefs       โ”‚ Scheduled reports, multi-channel delivery    โ”‚
โ”‚ Alerts       โ”‚ Failure detection, cost monitoring           โ”‚
โ”‚ Metrics      โ”‚ Token usage, execution analytics             โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Development

make deps        # Install dependencies
make build       # Build binary
make test        # Run tests
make lint        # Run linter
make dev         # Development mode with hot reload

FAQ

Is this safe?

Pilot runs in your environment with your permissions. It can only access repos you configure. All changes go through PR review (unless you enable auto-merge). You stay in control.

How much does it cost?

Pilot is free. You pay for Claude API usage (~$0.50-2.00 per typical task). Set budget limits to control costs.

What tasks can it handle?

Best for: bug fixes, small features, refactoring, tests, docs, dependency updates.

Not ideal for: large architectural changes, security-critical code, tasks requiring human judgment.

Does it learn my codebase?

Yes. Pilot uses Navigator to understand your patterns, conventions, and architecture. Cross-project memory shares learnings across repositories.

License

Business Source License 1.1 ยฉ Aleksei Petrov

Use Case Allowed
Internal use โœ…
Self-hosting โœ…
Modification & forking โœ…
Non-competing products โœ…
Competing SaaS โŒ (requires license)

Converts to Apache 2.0 after 4 years.

Contributing

Contributions welcome. Please open an issue first for major changes.

git checkout -b feature/my-feature
make test
# Submit PR

Stop agonizing over tickets. Let Pilot ship them.

โญ Star on GitHub

Built with Claude Code + Navigator

Release History

VersionChangesUrgencyDate
v2.166.12## Changelog ### ๐Ÿ“š Documentation * bf02df534a4aa53677fc35276781127ac0e8e9e4: docs(nav): close + archive TASK-358 (dashboard failed-count classification, v2.166.10โ€“11); +2 memories; refresh Current State (@alekspetrov) ### Other * 67dbfbd92e9ece66dc6430b5c7e234c081b1f8b0: perf(docs): cache + compression + security headers (@alekspetrov) High6/3/2026
v2.155.2## Changelog ### ๐Ÿ› Bug Fixes * 5384ddc21705e06e13063004c2e274a84d1a0454: fix(executor): ghost-SHA guard + IsTaskShipped hardening (TASK-300) (@alekspetrov) High5/26/2026
v2.146.7## P0 bugfix: `syncMainBranch` no longer silently destroys local commits Replaces destructive `git reset --hard origin/main` with `git merge --ff-only origin/<currentBranch>` in the post-task main-branch sync. ### What was broken `syncMainBranch()` ran `git reset --hard origin/main` after every task. When GitHub push-propagation lagged behind the immediately-following fetch+reset (a race in the remote, **not** push failure), Pilot's just-committed work was silently rewound and only reflog-recHigh5/20/2026
v2.146.6## Changelog ### ๐Ÿ› Bug Fixes * a107194bdac0b7b6e56c73e92f8f5e8454353983: fix(observability): repair Grafana mount shadow + Approval Persist Misses panel coloring (@alekspetrov) High5/11/2026
v2.145.0## Changelog ### ๐Ÿš€ Features * 856d12ce0b0d58b72488df802e531e4f538de13b: feat(deploy): add token/cost/executions panels to Pilot Grafana dashboard (@alekspetrov) High5/8/2026
v2.107.0## Changelog ### ๐Ÿš€ Features * 2eb2a15e6870218c66b3dcad7b94e7baf0659e34: feat(github,memory): add SearchPRsForIssue and InvalidateCompletion primitives (@alekspetrov) High5/2/2026
v2.101.0## Changelog ### ๐Ÿš€ Features * 216b9f2bc8ad3455c5e3ed666b1e46986bfa7de2: feat(memory): wire usage_events table from dispatcher (@alekspetrov) High4/27/2026
v2.99.3## Changelog ### ๐Ÿ› Bug Fixes * b992a653206475751b2b9ac14d1eb699063d26fa: fix(orchestrator): preserve priority from non-Linear adapters in processTask (#2387) (@alekspetrov) ### ๐Ÿ“š Documentation * 0d1f5638c29288313351855d953164dac5690365: docs(guides): add custom model providers guide (#2381) (@alekspetrov) * 51361214a34d3c42dd77103c2efbb14e31856ae6: docs(nav): bring .agent/DEVELOPMENT-README.md current to v2.99.1 (@alekspetrov) High4/23/2026
v2.99.1## Changelog ### ๐Ÿ› Bug Fixes * b4f1ad57079989eaf5dccaa3ddf2472b9859c07f: fix(executor): self-review --resume fallback + sanitize marker filename path separators (@alekspetrov) ### โ™ป๏ธ Refactoring * 0b4a6c828ae1df5fa244fa1ef0b0e8eb09d0e689: refactor(autopilot): remove dead prod-X.Y.Z tag auto-push (@alekspetrov) High4/18/2026
v2.99.0## Changelog ### ๐Ÿš€ Features * e5b96bedc63434b7f17218d8c3bdcbf4e556798c: feat(executor): inject api_base_url/default_model/api_auth_token into Claude Code subprocess env (GH-2287 follow-up) (@alekspetrov) High4/18/2026
v2.98.0## Changelog ### ๐Ÿš€ Features * 02d258a566c381165294629580284ddb62b1c82f: feat(release): auto-push prod-X.Y.Z tag alongside vX.Y.Z to trigger docs deploy (@alekspetrov) High4/18/2026
v2.97.0## Changelog ### ๐Ÿš€ Features * 303a7b31afd3ae51e97cc1ee96e2d42123fd3bc1: feat(executor): honor project.default_branch for base branch and PR target (GitLab and GitHub) (@alekspetrov) High4/18/2026
v2.96.0## Changelog ### ๐Ÿš€ Features * 3207559850a2c365bfc727642c7ec727b9806906: feat(autopilot): after 2nd conventional-commit rejection, post suggested rewrite + stop retrying (@alekspetrov) ### ๐Ÿ› Bug Fixes * 33d1cb9eb841173b5b59853ddcfb0b3f7e46df4f: fix: migrate alekspetrov/anthropics refs to qf-studio (GH-2175) (@alekspetrov) High4/18/2026
v2.95.13## Changelog ### ๐Ÿ› Bug Fixes * 00a965e87ccc453607b127f35f56a9c59ca754b2: fix(doctor): warn when projects configured but no issue adapter enabled; fail start --github loudly (@alekspetrov) High4/18/2026
v2.95.12## Changelog ### Other * af7306ad7fd864888edd31e4794b2abb634d9deb: fix stress test timeout + re-land GH-2341 poller fix (@alekspetrov) High4/18/2026
v2.95.11## Changelog ### ๐Ÿ› Bug Fixes * 3b9bb9296dabee9194a1ecc1b4cd227506fa9213: fix(dashboard): queue view includes closed issues with stale pilot-in-progress label (@alekspetrov) High4/17/2026
v2.95.10## Changelog ### ๐Ÿ› Bug Fixes * 64ad86c6a49dfb644ddea866b26290ca71c6946f: fix(autopilot): ScanExistingPRs clobbers RestoreState on startup (@alekspetrov) High4/17/2026
v2.95.9## Changelog ### ๐Ÿ› Bug Fixes * 823a5169ead7235b17586b20c95296448d3f5aa9: fix(autopilot): idempotent merge-completion notification (dedup on re-entry) (@alekspetrov) * 30d5fb879247ab134c5dd113a274fafdb1c4beb0: fix(autopilot): skip pilot-retry-ready when pilot-done already on issue (@alekspetrov) High4/17/2026
v2.95.8## Changelog ### ๐Ÿ› Bug Fixes * f6f6f963f4c659cf12253b0cdadb5a4fc8290c66: fix(dispatcher): add hasLiveWorker guard to running-task reaper (mirror GH-2331) (@alekspetrov) * 745bac5842e4e7e5b22b8f4f4d3e515c9d2aa241: fix(epic): validate sub-issue titles โ€” reject LLM analysis-style titles in decomposition (@alekspetrov) * 5a02f79727a7ee793e692403c9db6d17e269494d: fix(executor): Claude Code subprocess OOM-killed on large Navigator context tasks (GH-2324) (@alekspetrov) * b6054b9698ee3b995a77e33463430High4/17/2026
v2.95.7## Changelog ### ๐Ÿ› Bug Fixes * aa62b02caf3478d13dcd8d40c503678883137db0: fix(executor): signal executor mode to Claude + classify no-diff (GH-2328) (@alekspetrov) High4/17/2026
v2.95.6## Changelog ### ๐Ÿ› Bug Fixes * bbc33cb662e1fd10829e151ba056718c2b968cf4: fix(dashboard): gate stdout prints on dashboard mode (GH-2333) (@alekspetrov) ### Other * e5df708dea9e8dcdb44ec4e7508a2ad9bbe0ac76: Fix dashboard startup log and polling-mode warning (`cmd/pilot/main.go`) (@alekspetrov) High4/17/2026
v2.95.5## Changelog ### ๐Ÿ› Bug Fixes * 84c4ccb14eea3861233df5fa8d16e596bb7424c0: fix(dispatcher): don't reap queued tasks when live worker exists (GH-2331) (#2331) (@alekspetrov) * 96ff63b24a88ccba2fc1820489cb7288c94ecb1c: fix(executor): persist backend stderr + final message on failure (GH-2328) (#2330) (@alekspetrov) ### ๐Ÿ“š Documentation * 7180185746652173871e0f2ecf6532b94850d690: docs(claude): scope Navigator-only rules to interactive sessions (#2327) (@alekspetrov) High4/17/2026
v2.95.4## Changelog ### ๐Ÿ› Bug Fixes * 13e815368a5ebcc06114243d6b3f6e3aa9b90432: fix(dashboard): adapter poller tasks invisible in gateway mode with --dashboard (GH-2291) (#2292) (@efesler) * 811e3091e77adbb30636895e540115e65b0eb543: fix(dispatcher): persist Task.Labels across queue round-trip (GH-2326) (#2326) (@alekspetrov) * 89df7fc75f6ecc1af40fb92ea91d0297ed6492ef: fix(gitlab): use string type for MergeRequest.detailed_merge_status field (#2295) (@efesler) * e12cf14940750bb7094fd05cfe280b15d7ea2c7aHigh4/17/2026
v2.95.3## Changelog ### Other * 70c14dc580adc3e1242facb3dace5b63608d4b15: Dispatcher `recoverStaleTasks()` (line 188) already marks orphans as `"failed"`, not `"completed"`. The status appears correct in the current code. (@alekspetrov) High4/15/2026
v2.95.2## Changelog ### ๐Ÿ› Bug Fixes * fd1db6e16e83b6869224ab08da589b9dd1c6bfde: GH-2312: fix(autopilot): strip GH-XXXX prefix from squash commit title for conventional commit detection (@alekspetrov) High4/15/2026
v2.95.1## Changelog ### Other * f1a956de2bb4c826466f1123cc30a2a5bce41caf: GH-2304: Add GitHub config validation to `pilot doctor` (`internal/health/health.go` +... (@alekspetrov) * 97b022b71a32b7f520f6548aa725d5042b952c9a: GH-2306: Add GitHub adapter config validation to `pilot doctor` (@alekspetrov) * aa7629f225f38c4c4bfbf0ebc51dae81b33025fc: GH-2307: Fix dashboard startup log and add zero-poller warning (@alekspetrov) High4/14/2026
v2.94.4## Changelog ### ๐Ÿ› Bug Fixes * d7d19ea70c5c9c82ae37ea429a534d2153d90c7f: GH-2301: fix(poller): clear stale pilot-in-progress on startup to avoid restart requirement (@alekspetrov) Medium4/14/2026
v2.94.3## Changelog ### ๐Ÿ› Bug Fixes * d5c6fb3c0369ba148eb0467417dc6640dc5e813e: GH-2297: fix(autopilot): post success comment on merge close โ€” stale failure comments (@alekspetrov) Medium4/14/2026
v2.95.0## Changelog ### ๐Ÿš€ Features * 934f3c196f678744b2a9cfe141eecfc46be74175: GH-2298: feat(github): add issue templates for bug reports, features, and pilot tasks (@alekspetrov) Medium4/14/2026
v2.94.2## Changelog ### ๐Ÿ› Bug Fixes * 0e9db39650a35787a0c9efea692fa3218ce08025: GH-2286: fix: all bare NewRunner() callsites ignore executor config (@alekspetrov) Medium4/14/2026
v2.94.1## Changelog ### ๐Ÿš€ Features * 555ffb2a44ecb676e0b4fa35a2040b4bd0edc42f: GH-2287: feat(executor): non-Anthropic model provider support โ€” default_model + api_base_url (TASK-24) (@alekspetrov) Medium4/14/2026
v2.94.0## Changelog ### ๐Ÿš€ Features * 2f5620cc1a539e98e0a261bdfc68249b4834612a: feat(executor): non-Anthropic model provider support โ€” default_model + api_base_url (GH-2287) (@alekspetrov) ### ๐Ÿ› Bug Fixes * 0c19431db94ec61e423dc9fbfcbd4cafe02c7acd: fix(autopilot): update execution status to completed when PR is merged externally (GH-2279) (#2281) (@alekspetrov) * 63e6235d8856b9d1c54f5aa083e46e8e3d635af6: fix(executor): complete non-Anthropic provider support โ€” add missing classifier methods (@alekspetHigh4/12/2026
v2.93.2## Changelog ### ๐Ÿš€ Features * 85d812245b93ea2a7c993f8f6b64242e6fd0929e: feat(github): auto-retry issues with pilot-retry-ready label (GH-2276) (#2277) (@alekspetrov) ### ๐Ÿ› Bug Fixes * 12e7b3a3a6c512a4ce4e1ea0730beddec0c80b01: GH-2280: fix(dashboard): sparkline graphs wiped after 5s DB refresh (@alekspetrov) High4/10/2026
v2.93.1## Changelog ### ๐Ÿ› Bug Fixes * b33ea39d2f79fa1a17f987a5363c386a45d4080c: GH-2269: fix(bench/aws): stale task manifest โ€” always regenerate before run (@alekspetrov) Medium4/10/2026
v2.93.0## Changelog ### ๐Ÿ› Bug Fixes * c46d4e9c2019396f4566478037028e25eb97e38f: GH-2267: fix(bench/aws): orchestrator hangs forever, no signal handling, crash loses results (@alekspetrov) Medium4/10/2026
v2.92.1## Changelog ### ๐Ÿ› Bug Fixes * 4896b8ff7f2b33e5ead2d122533901f340fe7c20: GH-2260: fix(install): replace fragile grep+sed version parsing with jq (@alekspetrov) * dfdffa7d65122265d0b5140405d65eafeebce290: fix(autopilot): scan for externally-merged PRs periodically, not just on startup (GH-2251) (#2253) (@alekspetrov) * de6df8681bffd7ffcb61036adad08a08c93dee27: fix(poller): skip retry of closed issues with stale pilot-failed label (GH-2252) (#2255) (@alekspetrov) ### Other * a7bfefa1864aef8b58cabMedium4/10/2026
v2.92.0## Changelog ### ๐Ÿ› Bug Fixes * bc90a923dbb66b7d20fea34fe2f4c797101218a5: GH-2256: fix(executor): add dryRun guard to Runner gh CLI methods + idempotency check (@alekspetrov) Medium4/10/2026
v2.91.0## Changelog ### ๐Ÿš€ Features * 7cc86fea9cd1806e9bf29535e0d0826cf2483455: feat(bench): add self-verification harness to PilotAgent (#2238) (#2244) (@alekspetrov) ### ๐Ÿ› Bug Fixes * 5a7c70bf376669566634f68c6aa91994420f66a5: fix(autopilot): use PR source repo for release operations on cross-repo PRs (GH-2243) (#2249) (@alekspetrov) * 3cc191dfa658dfb058b8a6af135374e4030e18f0: fix(dashboard): refresh history and metrics from DB every 5s (GH-2248) (#2250) (@alekspetrov) * 7d73d29ee2f79e41df9752a599fd2Medium4/10/2026
v2.90.8## Changelog ### Other * ab545453d12c044f461a7145d6a76fc03fd2664c: GH-2237: bench: remove oracle test access from PilotAgent (@alekspetrov) Medium4/10/2026
v2.90.7## Changelog ### ๐Ÿ› Bug Fixes * 9ff6e82ba955f7bbdc86a8ba58da2ad3fc17dc95: fix(bench): remove oracle test access from PilotAgent (#2239) (@alekspetrov) * 29585eba8988f46efa5c50725f563f060fede588: fix: update broken Discord invite link (#2231) (@alekspetrov) ### Other * d0264719fa778d130dd8538c8b29937fbf452b73: GH-2211: Wire native sub-issue linking into epic decomposition (`internal/executor/`) (@alekspetrov) Medium4/10/2026
v2.90.6## Changelog ### ๐Ÿ› Bug Fixes * 6d92123fa1adf79e49898b6130c211b4eca8b3e7: GH-2226: fix(executor): resolve stale threshold defaults treating 0 as unset (@alekspetrov) ### Other * cf7c513b9c41b561531a58fee28072ef2360606c: GH-2221: Add `GetStaleQueuedExecutions` to memory store (`internal/memory/`) (@alekspetrov) High4/7/2026
v2.90.5## Changelog ### ๐Ÿ› Bug Fixes * 71d90ba0a2cb51bc16b61f43e074db2c592db9a6: GH-2176: fix(poller): auto-retry issues stuck with pilot-failed from execution failures (@alekspetrov) Medium4/7/2026
v2.90.4## Changelog ### Other * af8dce3b27c6db6833de3980add4e1c6530c79a1: GH-2212: Replace search-based parent closing with native GraphQL + fallback (`internal... (@alekspetrov) Medium4/7/2026
v2.90.3## Changelog ### Other * 589ef957a916ed0c84e5ff66486bedd922a95711: GH-2212: Replace search-based parent closing with native GraphQL + fallback (`internal/autopilot/`) (@alekspetrov) Medium4/7/2026
v2.90.2## Changelog ### ๐Ÿ› Bug Fixes * 875bfe50ac12f029fe6b672cd84c8b7f764f98bc: GH-2206: fix(docs): navbar stuck at v2.87.2 โ€” docs-version-sync workflow trigger broken (GoReleaser GITHUB_TOKEN limitation) (@alekspetrov) Medium4/6/2026
v2.90.1## Changelog ### ๐Ÿ› Bug Fixes * e54908d84150a8ca20ccbb4c9b00fb66912036e9: GH-2204: fix(alerts): task_stuck floods Slack โ€” progress never emitted, per-rule cooldown rotates, no orphan cleanup (@alekspetrov) Medium4/6/2026
v2.90.0## Changelog ### Other * 80465794faf1b00efa59fae1642d43ff6fa318d2: GH-2201: Add retry grace period and TaskChecker to GitHub poller (@alekspetrov) Medium4/5/2026
v2.89.4## Changelog ### ๐Ÿ› Bug Fixes * a4592a7063a7fbb98bc0033872ddffbabafc686d: GH-2198: fix(autopilot): call maybeCloseParentIssue in notifyExternalClose path (@alekspetrov) Medium4/4/2026
v2.89.3## Changelog ### ๐Ÿ› Bug Fixes * 454a21e19472896e9f47b513aab501226101ec94: GH-2196: fix: update remaining alekspetrov/pilot references to qf-studio in non-Go files (@alekspetrov) * b1e31ed1b522e83e36b9262ca3bf37791c79c412: fix(brew): migrate tap from alekspetrov to qf-studio/homebrew-pilot (@alekspetrov) Medium4/4/2026
v2.89.2## Changelog ### ๐Ÿš€ Features * 090c5db2842e8ba77ab035298126dcc2a08c959d: GH-2188: feat(release): add GoReleaser brews section for Homebrew auto-publish (@alekspetrov) ### ๐Ÿ› Bug Fixes * ec66ba597b8127a2746f4c08967ed756c08916ea: GH-2192: fix: update llms.txt GitHub URLs from alekspetrov to qf-studio (@alekspetrov) Medium4/4/2026
v2.89.1## Changelog ### ๐Ÿ› Bug Fixes * c2f55a81ebc0e379f010c01bbb9b98fafb3f6406: GH-2189: fix(docs): correct brew tap and install script repo references (@alekspetrov) ### ๐Ÿ“š Documentation * 47a5a09b7cab1fd732d9231feb4c922e57282861: GH-2183: docs(epic): add merge-wait section to epic-decomposition page (@alekspetrov) ### ๐Ÿ”ง Maintenance * e992858b3afd32b7b4733a3e51d6d15fb275731c: GH-2185: chore: update CLAUDE.md version to v2.89.0 (@alekspetrov) Medium4/4/2026
v2.89.0## Feature - **feat(epic):** Wire merge-wait into production and test paths (GH-2179). `MergeWaiter` closure in `main.go` for both multi-repo and gateway modes, gated on `execution.wait_for_merge` config. Test harness wired with immediate-success callback. Parity test updated (18 checks). ## Changelog ### ๐Ÿš€ Features * 73e1584a: Wire SubIssueMergeWaitFn in main.go and harness.go (GH-2179) (@alekspetrov)Medium4/3/2026
v2.88.0## Feature - **feat(epic):** Add merge-wait between sequential sub-issues (GH-2177). After each sub-issue's PR is created, Pilot waits for it to merge before starting the next one. Prevents conflicts when sub-issues touch overlapping files. Includes `SubIssueMergeWaitFn` callback injection, `syncMainBranch` after each merge, graceful nil degradation, and 4 test cases. ## Changelog ### ๐Ÿš€ Features * fa0e7727: Add SubIssueMergeWaitFn to Runner and wire merge-wait logic in ExecuteSubIssues (GH-21Medium4/3/2026
v2.87.7## Changelog ### ๐Ÿš€ Features * c94742e2741d9ba29fafd8301bc644eb1bb089c2: feat(epic): enable worktree isolation for sub-issues (GH-2178) (@alekspetrov) Medium4/3/2026
v2.87.6## Bug Fix - **fix(git):** Pass `--head` flag to `gh pr create` to bypass dirty worktree check. Worktree-based execution could fail PR creation when the main checkout had uncommitted changes. ## Changelog ### ๐Ÿ› Bug Fixes * 178a8bec: fix(git): pass --head to gh pr create to bypass dirty worktree check (GH-2177) (@alekspetrov)Medium4/3/2026
v2.87.5## Bug Fix - **fix(epic):** Sub-issues now branch from the real repository path instead of inheriting the parent's worktree path. Previously, `SwitchToDefaultBranchAndPull` failed inside worktrees โ†’ empty branch โ†’ push fails with "src refspec does not match". `ExecuteSubIssues` now takes explicit `repoPath` param. ## Changelog ### ๐Ÿ› Bug Fixes * 715529a2: fix(epic): sub-issues branch from real repo, not parent worktree (GH-2177) (@alekspetrov)Medium4/3/2026
v2.87.4## Changelog ### ๐Ÿ› Bug Fixes * a1778eed0a64bc519fafd39eda542c4b6ea0503a: fix(poller): don't mark failed issues as processed (GH-2176) (@alekspetrov) Medium4/3/2026
v2.87.3## Changelog Medium4/3/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

AgentLintLint your repo for AI agent compatibility.v1.1.13
mxcliMendix cli tool, a headless way to work with Mendix projects. Enables Mendix projects for use with 3rd party agentic coding tools like Claude Code and Copilot. Includes a starlark linter for quality vv0.12.0
claude-code-tipsProvide ready-to-use plugins, hooks, and commands to enhance Claude Code sessions with data mining, automation, and integration tools.main@2026-06-04
vibe-replayTurn AI coding sessions into animated, interactive web replaysv0.2.2
structured-prompt-skillโœ๏ธ Write effective AI prompts with this structured prompt engineering library and Claude Code skill, featuring 300+ curated examples for high-quality results.main@2026-05-31

More in Testing

vector-db-benchmarkFramework for benchmarking vector search engines
GitoAn AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issuesโ€”such as security vulnerabilities, bugs, and maintainability concerns.
mxcliMendix cli tool, a headless way to work with Mendix projects. Enables Mendix projects for use with 3rd party agentic coding tools like Claude Code and Copilot. Includes a starlark linter for quality v
llm_context_benchmarks ๐Ÿ“Š LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz