freshcrate
Home > Testing > agent-actions

agent-actions

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

Description

Declarative framework for orchestrating multi-model LLM pipelines with context engineering and quality gates.

README

Agent Actions

License PyPI Downloads Python

Declarative LLM orchestration. Define workflows in YAML — each action gets its own model, context window, schema, and pre-check gate. The framework handles DAG resolution, parallel execution, batch processing, and output validation.

Warning

Experimental — Under active development. Expect breaking changes. Open an issue with feedback.

Agent Actions lifecycle: Define → Validate → Execute

actions:
  - name: extract_features
    intent: "Extract key product features from listing"
    model_vendor: anthropic              # Each action picks its own model
    model_name: claude-sonnet-4-20250514

  - name: generate_description
    dependencies: [extract_features]
    model_vendor: openai                 # Mix vendors in one pipeline
    model_name: gpt-4o-mini
    context_scope:
      observe:
        - extract_features.features      # See only what it needs
      drop:
        - source.raw_html                # Don't waste tokens on noise

Install

pip install agent-actions

Quick start

agac init my-project && cd my-project                # scaffold a project
agac init --example contract_reviewer my-project     # or start from an example
agac run -a my_workflow                              # execute

Why not just write Python?

You will, until you have 15 steps, 3 models, batch retry, and a teammate asks what your pipeline does.

Capability Agent Actions Python script n8n / Make
Per-step model selection YAML field Manual wiring Per-node config
Context isolation per step observe / drop You build it Not available
Pre-check guards (skip before LLM call) guard: If-statements Post-hoc branching
Parallel consensus (3 voters + merge) 2 lines of YAML Custom code Many nodes + JS
Schema validation + auto-reprompt Built in DIY Not available
Batch processing (1000s of records) Built in For-loops Loop nodes
The YAML is the documentation Yes No Visual graph

Examples

Example Pattern Key Features
Review Analyzer Parallel consensus 3 independent scorers, vote aggregation, guard on quality threshold
Contract Reviewer Map-reduce Split clauses, analyze each, aggregate risk summary
Product Listing Enrichment Tool + LLM hybrid LLM generates copy, tool fetches pricing, LLM optimizes
Book Catalog Enrichment Multi-step enrichment BISAC classification, marketing copy, SEO metadata, reading level
Incident Triage Parallel consensus Severity classification, impact assessment, team assignment, response plan

Providers

Provider Batch Provider Batch
OpenAI Yes Groq Yes
Anthropic Yes Mistral Yes
Google Gemini Yes Cohere Online only
Ollama (local) Online only

Switch providers per-action by changing model_vendor.

Key capabilities

  • Pre-flight validation — schemas, dependencies, templates, and credentials checked before any LLM call
  • Batch processing — route thousands of records through provider batch APIs
  • User-defined functions — Python tools for pre/post-processing and custom logic
  • Reprompting — auto-retry when LLM output doesn't match schema
  • Observability — per-action timing, token counts, and structured event logs
  • Interactive docsagac docs builds and serves a visual workflow dashboard

Documentation

Contributing

git clone https://github.com/Muizzkolapo/agent-actions.git && cd agent-actions
pip install -e ".[dev]"
pytest

See CONTRIBUTING.md. Report bugs via Issues.

License

Apache License 2.0

Release History

VersionChangesUrgencyDate
v0.1.12## v0.1.12 — 2026-04-19 ### Enhancement or New Feature * Self-reflection feedback strategy for reprompt retries * LLM critique escalation for stubborn reprompt failures ### Under the Hood * Unified online and batch reprompt logic via shared helpers ### Bug Fix * Drop directives now correctly exclude fields from passthrough wildcards * Pass compiled schemas to Groq, Cohere, and Mistral APIs * Retry transient OpenAI 400 JSON parsing errors * Cascade failures report as blocked instead of indepenHigh4/19/2026
v0.1.11## v0.1.11 ### Enhancement or New Feature * Cross-workflow chaining: `--upstream` and `--downstream` CLI flags, virtual action injection, and orchestrator support for multi-workflow pipelines * FILE-mode record identity: tools receive full records with `node_id`, framework matches outputs to inputs by identity instead of heuristics (**breaking change** for FILE tools — use `record["content"]["field"]` instead of `record["field"]`) * Step-by-step progress display for sequential execution mode *High4/13/2026
v0.1.10### Bug Fixes - fix: HITL FILE granularity + guard pre-filter for FILE mode (#229) - fix: HITL FILE mode truncates lineage to just its own node_id (#231) - fix: resolve false-positive StaticTypeError for Jinja loop variables (#232) - fix: accept cross-workflow dependency syntax, fix 4 crash sites - fix: improve VS Code data card indentation, grammar, and rendering (#223) - fix: improve docs data card indentation, contrast, and grammar (#224) ### Tests - test: deterministic lineage integrity tesHigh4/11/2026
v0.1.9### Bug Fixes - fix: resolve FileUDFResult lineage by source_mapping index, not source_guid scan (#220) - FILE-mode tools with multiple input records sharing the same source_guid (e.g. after flatten) now produce correct per-record lineage instead of all inheriting the first match - Fixes silent downstream data duplication when using context_scope.observe on pre-flatten ancestors - source_mapping is now fully consumed by the runtime — no longer validated-then-discardedHigh4/8/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

structured-prompt-skill✍️ Write effective AI prompts with this structured prompt engineering library and Claude Code skill, featuring 300+ curated examples for high-quality results.main@2026-04-21
ai-lead-qualifier🧠 Qualify leads with an AI-driven system that understands intent, asks key questions, and structures quality leads without hardcoding processes.main@2026-04-21
gatewayThe only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.v1.11.19
OpenACMSelf-hosted autonomous AI agent — runs on your PC, controls your environment, connects to any MCP server.main@2026-04-19
caveman🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like cavemanv1.6.0