freshcrate
Home > AI Agents > AgentWard

Description

AgentWard โ€“ Built for all, hardened for OpenClaw.

README

AgentWard ยท ็Ž„็”ฒOS

AgentWard (็Ž„็”ฒ) is a full-stack security operating system purpose-built for trustworthy, scalable AI agent deployment, with native code adaptation to OpenClaw. AgentWard unifies agent onboarding, secure reasoning, and trusted execution in one cohesive security architecture, with upcoming native support for other leading mainstream agent frameworks. Its heterogeneous defense-in-depth design rearchitects the agent workflow into five coordinated security layers across startup, perception, memory, decision-making, and execution, with dynamic cross-stage protections that verify foundation integrity, block adversarial deception, stop memory tampering, and validate every autonomous decision and high-risk command โ€” a complete, end-to-end closed security loop that delivers on the promise of "trustworthy at inception, controllable throughout the process, and reliable in outcomes".

Why AgentWard

  • ๐Ÿ›ก๏ธ Comprehensive Risk Coverage โ€” Heterogeneous Defense-in-Depth (DiD) architecture delivers full-scope agent security assurance, blocking diverse attack vectors across the entire agent attack surface.
  • โšก One-Click Deployment โ€” Plugin-native design weaves security natively into the full agent lifecycle. Enable comprehensive agent security with one click via non-intrusive integration, which guarantees seamless and fast version adaptation for OpenClaw.
  • ๐Ÿ”’ Deterministic System-Level Controls โ€” Delivers deterministic, fully auditable, code-enforced security that outperforms skill-based solutions depending on endogenous security, with native support for large-scale deployment and production-grade readiness.
  • ๐ŸŒ Open & Extensible Security Standard โ€” Community-driven, transparent and auditable open standard with a modular architecture designed for extensibility. Built with complete framework-algorithm decoupling for effortless integration of advanced detection algorithms, with a roadmap to extend support to general agentic systems.

Quick Start

  1. โšก Installation or Update

    # Run the setup script
    bash /path/to/agent-ward/setup.sh
  2. โœ… Verify Installation

    openclaw plugins list

    Then enjoy enhanced security for your OpenClaw!

Systematic Architecture

AgentWard is natively and deeply integrated with the OpenClaw platform and embeds native security capabilities into the full lifecycle workflow of AI agents. Its heterogeneous defense-in-depth architecture reconstructs isolated single-point security checks into a closed-loop, coordinated system-level protection system, delivering end-to-end, full-chain trustworthy assurance for AI agents from startup through to execution.

AgentWard Blueprint

Five Coordinated Defense Layers

AgentWard delivers system-level security through five tightly integrated layers that work in tandem โ€” transforming isolated security checks into a unified, end-to-end protection system for AI agents.

Layer Focus
๐Ÿ—๏ธ Foundation Scan Layer Supply chain trust and baseline integrity
๐Ÿงผ Input Sanitization Layer Prompt injection and jailbreak detection
๐Ÿง  Cognition Protection Layer Memory poisoning and context drift
๐ŸŽฏ Decision Alignment Layer Intent consistency before action
๐Ÿ”ง Execution Control Layer High-risk operation guardrails

๐Ÿšจ Threat Response and Mitigation

  • ๐Ÿ“ข Send alert messages via IM when threats are detected
  • ๐Ÿ›‘ Automatically block dangerous operations without human intervention
  • ๐Ÿ“ Clear warning descriptions to help understand risks

โš™๏ธ Flexible Configuration

  • ๐ŸŽš๏ธ Each protection layer can be enabled/disabled independently
  • ๐Ÿ‘๏ธ Supports "detection-only" mode to reduce false positive impact
  • ๐Ÿ“‹ Some layers support custom rules to meet specific scenario requirements

Defense Visualization

๐Ÿ—๏ธ Layer 1: Foundation Scan

Ensures the agent starts from a trustworthy foundation.

English Version

Foundation.Scan.Layer.mp4

Chinese Version

Foundation.Scan.Layer.ZH.mp4

๐Ÿงผ Layer 2: Input Sanitization

Identifies adversarial inputs before they propagate into the agent.

English Version

Input.Sanitization.Layer.mp4

Chinese Version

Input.Sanitization.Layer.ZH.mp4

๐Ÿง  Layer 3: Cognition Protection

Protects long-term memory and contextual continuity from poisoning.

English Version

Cognition.Protection.Layer.mp4

Chinese Version

Cognition.Protection.Layer.ZH.mp4

๐ŸŽฏ Layer 4: Decision Alignment

Keeps agent decisions aligned with authorized user intent.

English Version

Decision.Alignment.Layer.mp4

Chinese Version

Decision.Alignment.Layer.ZH.mp4

๐Ÿ”ง Layer 5: Execution Control

Enforces safety boundaries at the point of execution.

English Version

Control.Execution.Layer.mp4

Chinese Version

Execution.Control.Layer.ZH.mp4

Roadmap

๐Ÿ† End-to-End Full-Stack Security System

Our roadmap is structured around a multi-layered defense architecture designed to secure the entire agent lifecycle, from configuration and input processing to cognition, decision-making, and execution.

๐Ÿ“ System Infrastructure Framework

  • โœ… Plugin-native modular architecture
  • โœ… Base adapter suite
  • โœ… Core detection engine
    • โœ… Heuristic rule-based detection module
    • โœ… Intent risk evaluation system
    • ๐Ÿš€ Trust-aware risk assessment capabilities
  • ๐Ÿš€ Heterogeneous OS support
    • โœ… Linux
    • ๐Ÿš€ macOS
    • ๐Ÿš€ Windows

๐Ÿ—๏ธ Foundational Scanning Layer

  • โœ… Global and plugin-level configuration security checks
  • โœ… Semantic malicious skill detection
  • ๐Ÿš€ Skill source verification
  • ๐Ÿš€ Plugin dependency analysis
  • ๐Ÿš€ Hybrid natural language and code vulnerability detection

๐Ÿงผ Input Sanitization Layer

  • โœ… Rule-based injection and jailbreak detection
  • โœ… Semantic coherence analysis for user inputs
  • โœ… Fragmented malicious instruction detection
  • ๐Ÿš€ Multi-turn stealth attack detection
  • ๐Ÿš€ Secure malicious content rewriting and replacement
  • ๐Ÿš€ Multimodal injection attack detection

๐Ÿง  Cognitive Protection Layer

  • โœ… Memory consistency evaluation and calibration
  • ๐Ÿš€ Malicious memory corpus construction and threat matching
  • ๐Ÿš€ Memory vectorization and outlier detection
  • ๐Ÿš€ Checkpoint-based memory recovery
  • ๐Ÿš€ Context drift detection and correction

๐ŸŽฏ Decision Alignment Layer

  • โœ… Consistency validation between agent decisions and user intent
  • ๐Ÿš€ Static rule filtering and compliance verification
  • ๐Ÿš€ Multi-step trajectory reasoning audit
  • ๐Ÿš€ Risk-adaptive dynamic permission allocation
  • ๐Ÿš€ High-risk action identification and safe rewriting

๐Ÿ”ง Execution Control Layer

  • โœ… Real-time interception and blocking of high-risk system instructions
  • โœ… Behavioral intent analysis and risk assessment
  • ๐Ÿš€ Identity-aware dynamic permission control and access restriction
  • ๐Ÿš€ Pre-execution security validation for agent actions
  • ๐Ÿš€ Automatic rollback and recovery for abnormal execution states
  • ๐Ÿš€ eBPF-powered system-level observability
    • ๐Ÿš€ Real-time resource monitoring and adaptive restriction
    • ๐Ÿš€ Network payload auditing and anomaly detection

๐Ÿค Cross-Layer Collaboration

  • โœ… Global information aggregation and risk discovery
  • ๐Ÿš€ Historical behavior-based trust profiling
  • ๐Ÿš€ Role-aware risk scoring and dynamic permission allocation
  • ๐Ÿš€ Taint propagation and end-to-end system auditing

Legend: โœ… Completed | ๐Ÿš€ In Progress


Authors: Qi Li, Xinhao Deng, Yixiang Zhang, Jiaqing Wu, Yue Xiao, Rennai Qiu, Zhuoheng Zou, Jiaqi Bai, Jiaxing Song, and Ke Xu

Release History

VersionChangesUrgencyDate
main@2026-04-20Latest activity on main branchHigh4/20/2026
0.0.0No release found โ€” using repo HEADHigh4/7/2026
main@2026-04-07Latest activity on main branchHigh4/7/2026
main@2026-04-07Latest activity on main branchHigh4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026
main@2026-04-07Latest activity on main branchMedium4/7/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

nanoclawA lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs directmain@2026-04-21
mangostudioAI-powered image generation and chat studiomain@2026-04-21
guardian-agentSecurity-first AI agent orchestration system. Built-in agents with predefined capabilities, strict guardrails on what they can and cannot do, and a four-layer defense system that enforces security at main@2026-04-21
LocalLLMAI chat and developer platform.main@2026-04-21
cherry-studioAI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMsv1.9.2