freshcrate
Skin:/
Home > Frameworks > aiagentflow

aiagentflow

A local-first, CLI-driven multi-agent AI software engineering workflow orchestrator with feed specs, PRDs, and guidelines to auto-generate implementation plans and code.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

A local-first, CLI-driven multi-agent AI software engineering workflow orchestrator with feed specs, PRDs, and guidelines to auto-generate implementation plans and code.

README

aiagentflow

A local-first CLI that orchestrates multi-agent AI workflows for software development. Give it a task โ€” or feed it your specs, PRDs, and guidelines โ€” and it coordinates specialized agents to architect, code, review, test, and ship automatically.

No cloud dependency. Bring your own API keys. Your code stays on your machine.

npm version License: MIT Node.js


How It Works

Task โ†’ Architect โ†’ Coder โ†’ Reviewer โ†’ Tester โ†’ Fixer โ†’ Ship

Each stage uses a specialized AI agent with tuned prompts and parameters. The loop repeats until quality thresholds pass โ€” like a small AI engineering team running on your machine.


Install

npm install -g @aiagentflow/cli

Or with pnpm:

pnpm add -g @aiagentflow/cli

Quick Start

# 1. Initialize in your project
cd /path/to/your/project
aiagentflow init

# 2. Run a task
aiagentflow run "Add a login form with email/password validation"

# 3. Or run autonomously (no approval prompts)
aiagentflow run "Refactor the auth module" --auto

# 4. Feed context docs to agents
aiagentflow run "Add auth" --context docs/api-spec.md docs/security.md

# 5. Generate a task list from specs, then batch-run
aiagentflow plan docs/prd.md -o tasks.txt
aiagentflow run --batch tasks.txt --auto

The init wizard walks you through:

  1. Auto-detect your project (language, framework, test framework, package manager)
  2. Select your LLM providers (Anthropic, OpenAI, Groq, Gemini, Ollama)
  3. Enter API keys
  4. Assign models per agent role
  5. Choose a workflow mode (fast, balanced, strict)
  6. Import existing docs (specs, requirements, guidelines) for auto-loading

Configuration is saved locally in .aiagentflow/config.json.


Features

  • Multi-agent pipeline โ€” 6 specialized agents, each with a distinct role
  • Context-aware โ€” feed specs, PRDs, architecture docs, and guidelines to every agent
  • Plan from docs โ€” generate batch-ready task lists from your existing documentation
  • Local-first โ€” runs entirely on your machine, no code leaves your system
  • Provider-agnostic โ€” Anthropic (Claude), OpenAI (GPT), Groq, Google Gemini, Ollama (local/free)
  • Workflow modes โ€” fast, balanced, or strict presets for iterations, approval, and temperatures
  • Smart detection โ€” auto-detects language, framework, test runner, and package manager
  • Configurable โ€” tune models, temperature, and iteration limits per agent
  • Git-native โ€” auto-creates branches, auto-commits on QA pass
  • Human-in-the-loop โ€” approve or override at any stage, or go full auto
  • QA policies โ€” configurable quality gates (max critical issues, test requirements)
  • Batch mode โ€” process multiple tasks from a file
  • Session persistence โ€” crash recovery with automatic session saving
  • Token tracking โ€” monitor LLM usage per agent and per run
  • Customizable prompts โ€” edit agent prompts in .aiagentflow/prompts/

CLI Commands

Command Description
aiagentflow init Interactive setup wizard
aiagentflow config View current configuration
aiagentflow doctor Health check โ€” verify providers and setup
aiagentflow run <task> Run a workflow for a task
aiagentflow run <task> --auto Autonomous mode (no approval prompts)
aiagentflow run <task> --context <files...> Run with reference documents
aiagentflow run --batch tasks.txt Process multiple tasks from a file
aiagentflow plan <docs...> Generate a task list from documentation
aiagentflow plan <docs...> -o tasks.txt Write task list to file (batch-ready)

Agent Roles

Agent Role What it does
๐Ÿง  Architect Plan Analyzes the task and creates an implementation plan
๐Ÿ’ป Coder Implement Writes production-ready code based on the plan
๐Ÿ” Reviewer Review Reviews code for bugs, security, and quality
๐Ÿงช Tester Test Generates tests and runs them
๐Ÿ› Fixer Fix Addresses review comments and test failures
โœ… Judge QA Final quality gate โ€” pass or fail

Supported Providers

Provider Type Default Model Setup
Anthropic Cloud API claude-sonnet-4-20250514 Requires API key
OpenAI Cloud API gpt-4o-mini Requires API key
Groq Cloud API llama-3.3-70b-versatile Requires API key
Google Gemini Cloud API gemini-2.0-flash Requires API key
Ollama Local llama3.2:latest Requires Ollama running locally

You can mix providers โ€” use cloud APIs for reasoning agents (architect, reviewer, judge) and local models for generation agents (coder, tester, fixer).

Using with Ollama (free, local)

# Install and start Ollama
ollama serve

# Pull a model
ollama pull llama3.2

# Initialize aiagentflow with Ollama
aiagentflow init
# โ†’ Select "ollama" as provider
# โ†’ Enter model name: llama3.2

Configuration

After aiagentflow init, your project has:

.aiagentflow/
โ”œโ”€โ”€ config.json              # Main configuration
โ”œโ”€โ”€ prompts/                 # Customizable agent prompts
โ”‚   โ”œโ”€โ”€ architect.md
โ”‚   โ”œโ”€โ”€ coder.md
โ”‚   โ”œโ”€โ”€ reviewer.md
โ”‚   โ”œโ”€โ”€ tester.md
โ”‚   โ”œโ”€โ”€ fixer.md
โ”‚   โ””โ”€โ”€ judge.md
โ”œโ”€โ”€ policies/                # Quality standards
โ”‚   โ””โ”€โ”€ coding-standards.md
โ”œโ”€โ”€ context/                 # Reference docs (auto-loaded into every run)
โ”‚   โ”œโ”€โ”€ api-spec.md          # Example: your API specification
โ”‚   โ””โ”€โ”€ requirements.md      # Example: your PRD or requirements
โ””โ”€โ”€ sessions/                # Saved workflow sessions

Edit the prompt files to customize how each agent behaves. Edit coding-standards.md to set project-specific rules that all agents follow. Drop .md or .txt files into context/ and they'll be automatically included as reference material for all agents.


Context Documents

Agents work best when they understand your project's requirements, API contracts, and standards. There are three ways to provide reference documents:

1. Auto-loaded (recommended) โ€” Drop files into .aiagentflow/context/:

cp docs/api-spec.md .aiagentflow/context/
cp docs/security-guidelines.md .aiagentflow/context/
aiagentflow run "Implement user registration"
# Both docs are automatically included in every agent's context

2. Per-run via --context flag:

aiagentflow run "Add OAuth support" --context docs/oauth-spec.md docs/auth-arch.md

3. During init โ€” The setup wizard asks if you have existing docs and copies them for you.

What to include

Document type Example Why it helps
API specs api-spec.md Agents generate correct endpoints and contracts
Requirements / PRDs requirements.md Architect plans match your actual requirements
Security guidelines security.md Reviewer catches violations against your policies
Architecture docs architecture.md Coder follows your patterns and conventions
Development guidelines dev-guidelines.md All agents follow your team's standards

Plan command

Turn documentation into an actionable task list, then batch-run it:

# Generate tasks from a PRD
aiagentflow plan docs/prd.md -o tasks.txt

# Review the generated tasks
cat tasks.txt

# Run them all
aiagentflow run --batch tasks.txt --auto --context docs/architecture.md

Project Structure

src/
โ”œโ”€โ”€ cli/            # CLI entry point and commands
โ”œโ”€โ”€ core/           # Config system, workflow engine, QA policies
โ”œโ”€โ”€ providers/      # LLM provider adapters (Anthropic, OpenAI, Groq, Gemini, Ollama)
โ”œโ”€โ”€ agents/         # Agent implementations and prompt library
โ”œโ”€โ”€ git/            # Git operations wrapper
โ”œโ”€โ”€ prompts/        # Default prompt templates
โ””โ”€โ”€ utils/          # Shared utilities (logger, fs, validation)

Development

# Clone and install
git clone https://github.com/aiagentflow/aiagentflow.git
cd aiagentflow
pnpm install

# Run in dev mode
pnpm dev run "your task here"

# Type check
pnpm typecheck

# Run tests
pnpm test

# Lint & format
pnpm lint
pnpm format

Contributing

Contributions are welcome! Here's how to get started:

  1. Fork the repo and clone your fork
  2. Create a branch for your feature: git checkout -b feature/your-feature
  3. Follow the coding standards:
    • Functions: camelCase, Classes: PascalCase, Files: kebab-case
    • All public functions need JSDoc, types, and error handling
    • Use custom AppError subclasses โ€” never raw throw new Error()
  4. Check your work: pnpm typecheck && pnpm lint && pnpm test
  5. Open a PR against main with a description of what and why

Architecture rules

  • Dependency direction flows downward: cli โ†’ core โ†’ utils โ†’ types
  • Config types are inferred from Zod schemas, never manually defined
  • New providers only require one adapter file + registry entry

Roadmap

  • Project scaffolding, config system, LLM provider layer
  • Workflow engine, agent implementations, Git integration
  • QA policies, token tracking, session persistence
  • Context documents โ€” feed specs, PRDs, and guidelines to agents
  • Plan command โ€” generate task lists from documentation
  • Multiple providers โ€” Anthropic, OpenAI, Groq, Gemini, Ollama
  • Project auto-detection โ€” language, framework, test runner, package manager
  • Auto-commit on QA pass
  • Workflow mode presets โ€” fast, balanced, strict
  • VSCode extension
  • Desktop GUI

License

MIT


aiagentflow.dev

Release History

VersionChangesUrgencyDate
v1.4.0## What's Changed * Add agent memory system by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/57 * Bump version to 1.4.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/58 **Full Changelog**: https://github.com/aiagentflow/aiagentflow/compare/v1.3.1...v1.4.0High5/24/2026
v1.2.0## What's Changed * Add worktree isolation, plan-review checkpoint, and parallel batch execution by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/51 * Bump version to 1.2.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/52 **Full Changelog**: https://github.com/aiagentflow/aiagentflow/compare/v1.1.0...v1.2.0High5/23/2026
v1.0.2## What's Changed * feat: lint and format enforcement after code generation by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/46 * Bump version to 1.0.2 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/47 **Full Changelog**: https://github.com/aiagentflow/aiagentflow/compare/v1.0.1...v1.0.2High4/15/2026
v1.0.1## What's Changed * Delete stale bump branch before pushing in release.yml by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/41 * Add /update-website skill by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/42 * update README: add OpenRouter, missing commands by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/43 * fix: actionable error and warning for Groq 413 with compound models by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/44 * Bump veHigh4/14/2026
v1.0.0## What's Changed * Fix publish.yml not triggering after automated release creation by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/36 * Add OpenRouter provider and fix 429 retry backoff by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/37 * Use RELEASE_PAT in release.yml to trigger CI on bump PRs by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/39 * Bump version to 1.0.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/40 **Full ChanHigh4/12/2026
v0.10.0## What's Changed * Add /release project skill by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/32 * Rename /release skill to /do-release by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/33 * Fix Groq compound model streaming returning empty content by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/35 * Bump version to 0.10.0 by @github-actions[bot] in https://github.com/aiagentflow/aiagentflow/pull/34 ## New Contributors * @github-actions[bot] made tHigh4/11/2026
v0.9.0## What's Changed * Add Groq provider by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/28 * Fix release workflow for protected main branch by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/29 * Bump version to 0.9.0 by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/30 * Fix tag-on-merge to detect bump PRs correctly by @raj-khan in https://github.com/aiagentflow/aiagentflow/pull/31 **Full Changelog**: https://github.com/aiagentflow/aiagentflow/compare/Medium4/11/2026
v0.8.0## What's Changed ### New: `--dry-run` flag Preview the full workflow execution plan without calling any LLM providers: ```bash aiagentflow run "Add user authentication" --dry-run ``` Shows: agent pipeline, provider/model config, context documents, source files, workflow settings. No files modified, no API calls made. Works with batch mode too. ### Enhanced `aiagentflow doctor` command - Environment checks (Node.js version, git availability) - Prompt file and coding standards verification - AgLow3/5/2026
v0.7.1## What's Changed ### Enhanced `aiagentflow doctor` command - **Environment checks**: Node.js version (>=20), git availability - **Configuration checks**: file existence and schema validation - **Prompt file checks**: agent prompt templates and coding standards policy - **Agent-provider mapping**: verify each agent references a configured provider - **Provider connectivity**: test API connections with actionable error hints ### Cleanup - Remove unused `PLAN_SYSTEM_PROMPT` dead code from plan cLow3/5/2026
v0.7.0## What's New ### Session Resume Resume interrupted workflows from where they left off โ€” no more restarting from scratch after crashes, Ctrl+C, or transient errors. ```bash # List all saved sessions aiagentflow sessions # Resume the most recent interrupted session aiagentflow resume # Resume a specific session by ID aiagentflow resume fix-login-bug-1gk8x # Resume in autonomous mode with a mode override aiagentflow resume --auto --mode fast ``` ### Highlights - **`aiagentflow resume [sessioLow3/5/2026
v0.6.2## What's New ### Improved Provider Error Messages All 4 providers (Anthropic, OpenAI, Gemini, Ollama) now give **actionable error messages** instead of generic HTTP status codes. **Before:** ``` ProviderError: Anthropic API error: 429 Too Many Requests ``` **After:** ``` ProviderError: anthropic API rate limit exceeded (429). Rate limited โ€” wait a moment and retry, or check your plan's rate limits ``` ### Highlights - **Status-aware hints** โ€” auth errors (401/403) suggest checking API keys,Low3/5/2026
v0.6.1## What's New ### `--mode` CLI Flag Override the workflow mode directly from the command line without changing your config: ```bash aiagentflow run "build auth system" --mode fast aiagentflow run "refactor payment module" --mode strict ``` **Available modes:** - `fast` โ€” 3 iterations, no approval gates, auto-commit enabled - `balanced` โ€” 5 iterations, human approval, no auto-commit (default) - `strict` โ€” 10 iterations, human approval, lowest temperatures The `--mode` flag works with both sinLow3/5/2026
v0.6.0## What's New ### Workflow Mode Presets Choose a workflow mode during `aiagentflow init` that bundles iterations, approval, auto-commit, and per-agent temperatures: | Mode | Iterations | Approval | Auto-commit | Agent Temps | |------|-----------|----------|-------------|-------------| | **Fast** | 3 | Off | On | Higher (0.4-0.7) | | **Balanced** | 5 | On | Off | Moderate (0.2-0.5) | | **Strict** | 10 | On | Off | Lower (0.1-0.4) | Users can optionally customize individual settings after selecLow3/5/2026
v0.5.5## What's New - **Project auto-detection** โ€” The init wizard now automatically detects your project's language, framework, and test framework from config files. No more manual selection for most projects. ### What gets detected **Languages:** TypeScript (`tsconfig.json`), Python (`pyproject.toml`, `requirements.txt`), Go (`go.mod`), Rust (`Cargo.toml`), Java (`pom.xml`, `build.gradle`), Ruby (`Gemfile`), JavaScript (`package.json`) **Frameworks:** Next.js, Nuxt, Angular, SvelteKit, Astro, ReLow3/5/2026
v0.5.4## What's New - **Package manager auto-detection** โ€” Automatically detects bun, pnpm, yarn, or npm from lockfiles and uses the correct exec prefix for test commands. No more manually fixing `npx vitest run` to `pnpm exec vitest run`. - **Init wizard improvement** โ€” Displays detected package manager in Step 1 and generates correct test command defaults in Step 5. - **Cleaner codebase** โ€” Consolidated duplicated test command maps into a single reusable utility. ## Files Changed - `src/utils/packLow3/4/2026
v0.5.3## What's New - **Auto-commit on QA pass** โ€” New `autoCommit` and `autoCommitMessage` workflow config options. When enabled, the runner automatically stages and commits all generated files after QA approval using a configurable message template (`{task}` placeholder supported). - **Init wizard update** โ€” `aiagentflow init` Step 5 now prompts for auto-commit configuration. ## Files Changed - `src/core/config/schema.ts` โ€” added `autoCommit` and `autoCommitMessage` fields - `src/core/config/defauLow3/2/2026
v0.5.2## Fixes - Deduplicate file lists in workflow summary - Fix test runner exit code showing undefined - Add timeout to Ollama streaming requests - Improve repeated failure detection with error signature matching - Add CI and automated npm publish via Trusted PublishingLow3/1/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

mcp-anythingOne command to turn any codebase into an MCP serverv0.2.0
simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.v0.6.103
gemini-cliAn open-source AI agent that brings the power of Gemini directly into your terminal.v0.45.1
Athena-PublicThe Linux OS for AI Agents โ€” Persistent memory, autonomy, and time-awareness for any LLM. Own the state. Rent the intelligence.v9.9.1
claude-code-plugins-plus-skills423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.v4.33.0

More in Frameworks

langchainThe agent engineering platform
deer-flowAn open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
tqdmFast, Extensible Progress Meter
simBuild, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.