Home > Frameworks > aiagentflow

aiagentflow

A local-first, CLI-driven multi-agent AI software engineering workflow orchestrator with feed specs, PRDs, and guidelines to auto-generate implementation plans and code.

ai ai-agents anthropic automation cli code-generation developer-tools gemini typescript

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

A local-first, CLI-driven multi-agent AI software engineering workflow orchestrator with feed specs, PRDs, and guidelines to auto-generate implementation plans and code.

README

aiagentflow

A local-first CLI that orchestrates multi-agent AI workflows for software development. Give it a task — or feed it your specs, PRDs, and guidelines — and it coordinates specialized agents to architect, code, review, test, and ship automatically.

No cloud dependency. Bring your own API keys. Your code stays on your machine.

How It Works

Task → Architect → Coder → Reviewer → Tester → Fixer → Ship

Each stage uses a specialized AI agent with tuned prompts and parameters. The loop repeats until quality thresholds pass — like a small AI engineering team running on your machine.

Install

npm install -g @aiagentflow/cli

Or with pnpm:

pnpm add -g @aiagentflow/cli

Quick Start

# 1. Initialize in your project
cd /path/to/your/project
aiagentflow init

# 2. Run a task
aiagentflow run "Add a login form with email/password validation"

# 3. Or run autonomously (no approval prompts)
aiagentflow run "Refactor the auth module" --auto

# 4. Feed context docs to agents
aiagentflow run "Add auth" --context docs/api-spec.md docs/security.md

# 5. Generate a task list from specs, then batch-run
aiagentflow plan docs/prd.md -o tasks.txt
aiagentflow run --batch tasks.txt --auto

The init wizard walks you through:

Auto-detect your project (language, framework, test framework, package manager)
Select your LLM providers (Anthropic, OpenAI, Groq, Gemini, Ollama)
Enter API keys
Assign models per agent role
Choose a workflow mode (fast, balanced, strict)
Import existing docs (specs, requirements, guidelines) for auto-loading

Configuration is saved locally in .aiagentflow/config.json.

Features

Multi-agent pipeline — 6 specialized agents, each with a distinct role
Context-aware — feed specs, PRDs, architecture docs, and guidelines to every agent
Plan from docs — generate batch-ready task lists from your existing documentation
Local-first — runs entirely on your machine, no code leaves your system
Provider-agnostic — Anthropic (Claude), OpenAI (GPT), Groq, Google Gemini, Ollama (local/free)
Workflow modes — fast, balanced, or strict presets for iterations, approval, and temperatures
Smart detection — auto-detects language, framework, test runner, and package manager
Configurable — tune models, temperature, and iteration limits per agent
Git-native — auto-creates branches, auto-commits on QA pass
Human-in-the-loop — approve or override at any stage, or go full auto
QA policies — configurable quality gates (max critical issues, test requirements)
Batch mode — process multiple tasks from a file
Session persistence — crash recovery with automatic session saving
Token tracking — monitor LLM usage per agent and per run
Customizable prompts — edit agent prompts in .aiagentflow/prompts/

CLI Commands

Command	Description
`aiagentflow init`	Interactive setup wizard
`aiagentflow config`	View current configuration
`aiagentflow doctor`	Health check — verify providers and setup
`aiagentflow run <task>`	Run a workflow for a task
`aiagentflow run <task> --auto`	Autonomous mode (no approval prompts)
`aiagentflow run <task> --context <files...>`	Run with reference documents
`aiagentflow run --batch tasks.txt`	Process multiple tasks from a file
`aiagentflow plan <docs...>`	Generate a task list from documentation
`aiagentflow plan <docs...> -o tasks.txt`	Write task list to file (batch-ready)

Agent Roles

Agent	Role	What it does
🧠 Architect	Plan	Analyzes the task and creates an implementation plan
💻 Coder	Implement	Writes production-ready code based on the plan
🔍 Reviewer	Review	Reviews code for bugs, security, and quality
🧪 Tester	Test	Generates tests and runs them
🐛 Fixer	Fix	Addresses review comments and test failures
✅ Judge	QA	Final quality gate — pass or fail

Supported Providers

Provider	Type	Default Model	Setup
Anthropic	Cloud API	`claude-sonnet-4-20250514`	Requires API key
OpenAI	Cloud API	`gpt-4o-mini`	Requires API key
Groq	Cloud API	`llama-3.3-70b-versatile`	Requires API key
Google Gemini	Cloud API	`gemini-2.0-flash`	Requires API key
Ollama	Local	`llama3.2:latest`	Requires Ollama running locally

You can mix providers — use cloud APIs for reasoning agents (architect, reviewer, judge) and local models for generation agents (coder, tester, fixer).

Using with Ollama (free, local)

# Install and start Ollama
ollama serve

# Pull a model
ollama pull llama3.2

# Initialize aiagentflow with Ollama
aiagentflow init
# → Select "ollama" as provider
# → Enter model name: llama3.2

Configuration

After aiagentflow init, your project has:

.aiagentflow/
├── config.json              # Main configuration
├── prompts/                 # Customizable agent prompts
│   ├── architect.md
│   ├── coder.md
│   ├── reviewer.md
│   ├── tester.md
│   ├── fixer.md
│   └── judge.md
├── policies/                # Quality standards
│   └── coding-standards.md
├── context/                 # Reference docs (auto-loaded into every run)
│   ├── api-spec.md          # Example: your API specification
│   └── requirements.md      # Example: your PRD or requirements
└── sessions/                # Saved workflow sessions

Edit the prompt files to customize how each agent behaves. Edit coding-standards.md to set project-specific rules that all agents follow. Drop .md or .txt files into context/ and they'll be automatically included as reference material for all agents.

Context Documents

Agents work best when they understand your project's requirements, API contracts, and standards. There are three ways to provide reference documents:

1. Auto-loaded (recommended) — Drop files into .aiagentflow/context/:

cp docs/api-spec.md .aiagentflow/context/
cp docs/security-guidelines.md .aiagentflow/context/
aiagentflow run "Implement user registration"
# Both docs are automatically included in every agent's context

2. Per-run via --context flag:

aiagentflow run "Add OAuth support" --context docs/oauth-spec.md docs/auth-arch.md

3. During init — The setup wizard asks if you have existing docs and copies them for you.

What to include

Document type	Example	Why it helps
API specs	`api-spec.md`	Agents generate correct endpoints and contracts
Requirements / PRDs	`requirements.md`	Architect plans match your actual requirements
Security guidelines	`security.md`	Reviewer catches violations against your policies
Architecture docs	`architecture.md`	Coder follows your patterns and conventions
Development guidelines	`dev-guidelines.md`	All agents follow your team's standards

Plan command

Turn documentation into an actionable task list, then batch-run it:

# Generate tasks from a PRD
aiagentflow plan docs/prd.md -o tasks.txt

# Review the generated tasks
cat tasks.txt

# Run them all
aiagentflow run --batch tasks.txt --auto --context docs/architecture.md

Project Structure

src/
├── cli/            # CLI entry point and commands
├── core/           # Config system, workflow engine, QA policies
├── providers/      # LLM provider adapters (Anthropic, OpenAI, Groq, Gemini, Ollama)
├── agents/         # Agent implementations and prompt library
├── git/            # Git operations wrapper
├── prompts/        # Default prompt templates
└── utils/          # Shared utilities (logger, fs, validation)

Development

# Clone and install
git clone https://github.com/aiagentflow/aiagentflow.git
cd aiagentflow
pnpm install

# Run in dev mode
pnpm dev run "your task here"

# Type check
pnpm typecheck

# Run tests
pnpm test

# Lint & format
pnpm lint
pnpm format

Contributing

Contributions are welcome! Here's how to get started:

Fork the repo and clone your fork
Create a branch for your feature: git checkout -b feature/your-feature
Follow the coding standards:
- Functions: camelCase, Classes: PascalCase, Files: kebab-case
- All public functions need JSDoc, types, and error handling
- Use custom AppError subclasses — never raw throw new Error()
Check your work: pnpm typecheck && pnpm lint && pnpm test
Open a PR against main with a description of what and why

Architecture rules

Dependency direction flows downward: cli → core → utils → types
Config types are inferred from Zod schemas, never manually defined
New providers only require one adapter file + registry entry

Roadmap

Project scaffolding, config system, LLM provider layer
Workflow engine, agent implementations, Git integration
QA policies, token tracking, session persistence
Context documents — feed specs, PRDs, and guidelines to agents
Plan command — generate task lists from documentation
Multiple providers — Anthropic, OpenAI, Groq, Gemini, Ollama
Project auto-detection — language, framework, test runner, package manager
Auto-commit on QA pass
Workflow mode presets — fast, balanced, strict
VSCode extension
Desktop GUI

License

MIT

aiagentflow.dev

Release History

Version	Changes	Urgency	Date
v1.4.0	## What's Changed * Add agent memory system by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/57 * Bump version to 1.4.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/58 Full Changelog: https://github.com/aiagentflow/aiagentflow/compare/v1.3.1...v1.4.0	High	5/24/2026
v1.2.0	## What's Changed * Add worktree isolation, plan-review checkpoint, and parallel batch execution by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/51 * Bump version to 1.2.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/52 Full Changelog: https://github.com/aiagentflow/aiagentflow/compare/v1.1.0...v1.2.0	High	5/23/2026
v1.0.2	## What's Changed * feat: lint and format enforcement after code generation by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/46 * Bump version to 1.0.2 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/47 Full Changelog: https://github.com/aiagentflow/aiagentflow/compare/v1.0.1...v1.0.2	High	4/15/2026
v1.0.1	## What's Changed * Delete stale bump branch before pushing in release.yml by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/41 * Add /update-website skill by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/42 * update README: add OpenRouter, missing commands by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/43 * fix: actionable error and warning for Groq 413 with compound models by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/44 * Bump ve	High	4/14/2026
v1.0.0	## What's Changed * Fix publish.yml not triggering after automated release creation by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/36 * Add OpenRouter provider and fix 429 retry backoff by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/37 * Use RELEASE_PAT in release.yml to trigger CI on bump PRs by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/39 * Bump version to 1.0.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/40 **Full Chan	High	4/12/2026
v0.10.0	## What's Changed * Add /release project skill by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/32 * Rename /release skill to /do-release by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/33 * Fix Groq compound model streaming returning empty content by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/35 * Bump version to 0.10.0 by @github-actions[bot] in https://github.com/aiagentflow/aiagentflow/pull/34 ## New Contributors * @github-actions[bot] made t	High	4/11/2026
v0.9.0	## What's Changed * Add Groq provider by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/28 * Fix release workflow for protected main branch by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/29 * Bump version to 0.9.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/30 * Fix tag-on-merge to detect bump PRs correctly by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/31 Full Changelog: https://github.com/aiagentflow/aiagentflow/compare/	Medium	4/11/2026
v0.8.0	## What's Changed ### New: `--dry-run` flag Preview the full workflow execution plan without calling any LLM providers: ```bash aiagentflow run "Add user authentication" --dry-run ``` Shows: agent pipeline, provider/model config, context documents, source files, workflow settings. No files modified, no API calls made. Works with batch mode too. ### Enhanced `aiagentflow doctor` command - Environment checks (Node.js version, git availability) - Prompt file and coding standards verification - Ag	Low	3/5/2026
v0.7.1	## What's Changed ### Enhanced `aiagentflow doctor` command - Environment checks: Node.js version (>=20), git availability - Configuration checks: file existence and schema validation - Prompt file checks: agent prompt templates and coding standards policy - Agent-provider mapping: verify each agent references a configured provider - Provider connectivity: test API connections with actionable error hints ### Cleanup - Remove unused `PLAN_SYSTEM_PROMPT` dead code from plan c	Low	3/5/2026
v0.7.0	## What's New ### Session Resume Resume interrupted workflows from where they left off — no more restarting from scratch after crashes, Ctrl+C, or transient errors. ```bash # List all saved sessions aiagentflow sessions # Resume the most recent interrupted session aiagentflow resume # Resume a specific session by ID aiagentflow resume fix-login-bug-1gk8x # Resume in autonomous mode with a mode override aiagentflow resume --auto --mode fast ``` ### Highlights - **`aiagentflow resume [sessio	Low	3/5/2026
v0.6.2	## What's New ### Improved Provider Error Messages All 4 providers (Anthropic, OpenAI, Gemini, Ollama) now give actionable error messages instead of generic HTTP status codes. Before: ``` ProviderError: Anthropic API error: 429 Too Many Requests ``` After: ``` ProviderError: anthropic API rate limit exceeded (429). Rate limited — wait a moment and retry, or check your plan's rate limits ``` ### Highlights - Status-aware hints — auth errors (401/403) suggest checking API keys,	Low	3/5/2026
v0.6.1	## What's New ### `--mode` CLI Flag Override the workflow mode directly from the command line without changing your config: ```bash aiagentflow run "build auth system" --mode fast aiagentflow run "refactor payment module" --mode strict ``` Available modes: - `fast` — 3 iterations, no approval gates, auto-commit enabled - `balanced` — 5 iterations, human approval, no auto-commit (default) - `strict` — 10 iterations, human approval, lowest temperatures The `--mode` flag works with both sin	Low	3/5/2026
v0.6.0	## What's New ### Workflow Mode Presets Choose a workflow mode during `aiagentflow init` that bundles iterations, approval, auto-commit, and per-agent temperatures: \| Mode \| Iterations \| Approval \| Auto-commit \| Agent Temps \| \|------\|-----------\|----------\|-------------\|-------------\| \| Fast \| 3 \| Off \| On \| Higher (0.4-0.7) \| \| Balanced \| 5 \| On \| Off \| Moderate (0.2-0.5) \| \| Strict \| 10 \| On \| Off \| Lower (0.1-0.4) \| Users can optionally customize individual settings after selec	Low	3/5/2026
v0.5.5	## What's New - Project auto-detection — The init wizard now automatically detects your project's language, framework, and test framework from config files. No more manual selection for most projects. ### What gets detected Languages: TypeScript (`tsconfig.json`), Python (`pyproject.toml`, `requirements.txt`), Go (`go.mod`), Rust (`Cargo.toml`), Java (`pom.xml`, `build.gradle`), Ruby (`Gemfile`), JavaScript (`package.json`) Frameworks: Next.js, Nuxt, Angular, SvelteKit, Astro, Re	Low	3/5/2026
v0.5.4	## What's New - Package manager auto-detection — Automatically detects bun, pnpm, yarn, or npm from lockfiles and uses the correct exec prefix for test commands. No more manually fixing `npx vitest run` to `pnpm exec vitest run`. - Init wizard improvement — Displays detected package manager in Step 1 and generates correct test command defaults in Step 5. - Cleaner codebase — Consolidated duplicated test command maps into a single reusable utility. ## Files Changed - `src/utils/pack	Low	3/4/2026
v0.5.3	## What's New - Auto-commit on QA pass — New `autoCommit` and `autoCommitMessage` workflow config options. When enabled, the runner automatically stages and commits all generated files after QA approval using a configurable message template (`{task}` placeholder supported). - Init wizard update — `aiagentflow init` Step 5 now prompts for auto-commit configuration. ## Files Changed - `src/core/config/schema.ts` — added `autoCommit` and `autoCommitMessage` fields - `src/core/config/defau	Low	3/2/2026
v0.5.2	## Fixes - Deduplicate file lists in workflow summary - Fix test runner exit code showing undefined - Add timeout to Ollama streaming requests - Improve repeated failure detection with error signature matching - Add CI and automated npm publish via Trusted Publishing	Low	3/1/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

mcp-anythingOne command to turn any codebase into an MCP serverv0.2.0

simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.v0.6.103

gemini-cliAn open-source AI agent that brings the power of Gemini directly into your terminal.v0.45.1

Athena-PublicThe Linux OS for AI Agents — Persistent memory, autonomy, and time-awareness for any LLM. Own the state. Rent the intelligence.v9.9.1

claude-code-plugins-plus-skills423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.v4.33.0

More in Frameworks

langchainThe agent engineering platform

deer-flowAn open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta

tqdmFast, Extensible Progress Meter

simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.