freshcrate
Skin:/
Home > Security > Agent_Life_Space

Agent_Life_Space

Self-hosted autonomous AI agent โ€” 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests

Why this rank:Release freshnessHealthy release cadenceStrong adoption

Description

Self-hosted autonomous AI agent โ€” 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests

README

Agent Life Space

Self-hosted autonomous AI agent that lives on your server. Thinks with Claude, acts through its own modules, communicates via Telegram.

Wiki | Architecture | Security | API Reference | Roadmap

What it does

  • 9-layer cascade โ€” dispatch โ†’ cache โ†’ RAG โ†’ classify โ†’ LLM โ†’ quality escalation โ†’ learning โ†’ channel filter โ†’ explanation
  • Docker sandbox โ€” /sandbox code runs in isolated containers (256MB, no network, read-only FS)
  • Encrypted vault โ€” API keys, wallet keys (ETH/BTC) encrypted with Fernet AES-128
  • Epistemic memory โ€” 4 types + provenance model (observed/asserted/inferred/verified/stale), expiry, decay
  • Persistent conversation โ€” SQLite-backed context with FTS5 full-text search, survives restarts
  • Agent-to-Agent API โ€” HTTP endpoint for inter-agent communication
  • Structured review API โ€” POST /api/review runs deterministic review jobs through the shared runtime
  • Learning system โ€” skill outcome tracking, model escalation, prompt augmentation
  • Multi-provider LLM โ€” Claude CLI, Anthropic API, OpenAI, Ollama (any backend)
  • Automated security โ€” 129-test security audit + invariant suite
  • Tool governance โ€” capability manifest, policy engine, 4-step action pipeline with audit trail
  • Workspace persistence โ€” SQLite-backed workspaces with audit trail, limits, TTL, recovery
  • Approval queue โ€” structured propose โ†’ approve/deny โ†’ execute workflow with persistent storage and linkage
  • Delivery packages โ€” shared build/review delivery lifecycle with deterministic bundle previews, approval linkage, and explicit handoff state
  • Reviewer handoff artifacts โ€” operator summaries and copy-paste-ready PR comment packs now persist as first-class review artifacts
  • Planner handoff + traces โ€” persisted JobPlan records and durable qualification/budget/capability/delivery traces
  • Delivery lifecycle tracking โ€” prepared โ†’ awaiting approval โ†’ approved/rejected โ†’ handed off with audit events
  • Workspace joins โ€” workspaces now link to jobs, artifacts, approvals, and delivery bundles
  • Retained artifact records โ€” build/review/delivery outputs now carry policy, expiry, recoverability, and prune-state metadata
  • Persisted product jobs โ€” shared control-plane record of build/review job metadata, status, usage, and artifacts
  • Per-job cost ledger โ€” durable usage/token/cost entries with report and CLI inspection
  • Runtime budget governance โ€” hard-cap, stop-loss, and approval-gated intake execution
  • Managed repo acquisition โ€” supported git_url intake can clone/import into a controlled local mirror before runtime routing
  • Evidence export โ€” --export-evidence-job assembles internal or client-safe review packages with artifacts, traces, retention, and traceability
  • Environment profiles โ€” explicit review/build/acquisition/export execution profiles exposed through the runtime model
  • Controlled-environment deployment โ€” local-owner, operator-controlled, and enterprise-hardened runtime posture now has explicit deployment guidance
  • Multi-step approvals โ€” risky intake and delivery paths can require more than one approval deterministically
  • Shared policy registry โ€” deterministic job persistence, artifact retention, delivery, review-gate, and gateway defaults
  • Structured denials โ€” shared machine-readable blocker payloads across policy, intake, delivery, and evidence export flows
  • Control-plane queries โ€” shared inspection across build, review, task, job-runner, agent-loop, artifact, plan, delivery, and workspace state
  • Runtime model โ€” explicit coexistence rules for product jobs, planning tasks, infrastructure jobs, and conversational queue items
  • Release readiness gate โ€” deterministic CLI/CI quality and gateway posture gate before release or handoff
  • Operator dashboard โ€” authenticated /dashboard surface for jobs, settlements, retention, audit, operator metrics, and one-click LLM runtime control
  • Settlement workflow โ€” persisted 402/top-up approval flow across API, dashboard, and Telegram with retry support
  • Setup doctor โ€” python -m agent --setup-doctor audits self-host identity, LLM, gateway, and operator posture before first run
  • Operator CLI surfaces โ€” --report, --runtime-model, --llm-runtime-*, --export-evidence-job, --export-evidence-mode client_safe, --list-plans, --list-traces, --list-workspaces, --list-deliveries, --list-persisted-jobs, --list-retained-artifacts, --prune-expired-retained-artifacts, --list-cost-ledger, unified --intake-*, and explicit delivery handoff
  • Tiered structured logging โ€” long-tier (~30 days) for lifecycle/build/finance/audit events, short-tier (~6 hours) for verbose pipeline diagnostics, hourly cron prune sweep
  • Crash-safe vault format โ€” single-file v2 (ALSv2 magic + embedded random salt + Fernet token), atomic os.replace writes, automatic migration from v1
  • 1762+ tests โ€” unit + integration + e2e + security + routing evals + adversarial, $0.00 token cost

Quick Start

For first-time setup, follow docs/SETUP_LOCAL.md. It walks you through generating your own credentials, where to store them, and how to keep your personal data out of the repo. Nothing below references any specific operator's environment.

git clone https://github.com/B2JK-Industry/Agent_Life_Space.git
cd Agent_Life_Space
python3 -m venv .venv && source .venv/bin/activate
pip install -e . && pip install sentence-transformers
export AGENT_PROJECT_ROOT="$PWD"            # recommended for self-host + systemd
export AGENT_DATA_DIR="$PWD/.agent_runtime" # keeps runtime DBs/logs out of the source tree
export AGENT_PIDFILE_PATH="$PWD/.agent-life-space.pid"
export TELEGRAM_BOT_TOKEN="your_token"      # from @BotFather
export TELEGRAM_USER_ID="your_id"           # your Telegram user ID
export AGENT_NAME="MyAgent"                 # recommended
export AGENT_SERVER_NAME="my-server"        # recommended
export AGENT_VAULT_KEY="your_key"           # python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
export AGENT_API_KEY="your_api_key"         # python -c "import secrets; print(f'agent_api_{secrets.token_urlsafe(24)}')"

# choose one LLM backend:
# CLI backend (Claude Code installed and logged in on the same host)
export LLM_BACKEND="cli"
# export CLAUDE_CODE_OAUTH_TOKEN="..."

# or API backend
# export LLM_BACKEND="api"
# export LLM_PROVIDER="anthropic"
# export ANTHROPIC_API_KEY="sk-ant-..."

# optional: leave owner fields empty and let the first authorized Telegram message teach the owner profile
# export AGENT_OWNER_NAME="Your name"
# export AGENT_OWNER_FULL_NAME="Your full name"

.venv/bin/python -m agent --setup-doctor
.venv/bin/python -m agent

See Deployment guide for full setup (Docker, systemd, Cloudflare tunnel, firewall). After startup, the dashboard is available at /dashboard on the API port and uses the same AGENT_API_KEY.

Runtime LLM Control

You can detach the LLM entirely, or switch between CLI and API backends without editing .env each time. The runtime override is persisted under AGENT_DATA_DIR/control/llm_runtime.json.

.venv/bin/python -m agent --llm-runtime-status
.venv/bin/python -m agent --llm-runtime-disable --llm-runtime-note "maintenance"
.venv/bin/python -m agent --llm-runtime-enable --llm-runtime-backend cli --llm-runtime-note "back to Claude CLI"
.venv/bin/python -m agent --llm-runtime-enable --llm-runtime-backend api --llm-runtime-provider anthropic
.venv/bin/python -m agent --llm-runtime-follow-env --llm-runtime-enable

The same control surface is available via:

  • GET /api/operator/llm
  • POST /api/operator/llm
  • /dashboard LLM Runtime panel

Telegram + CLI backend caveat: programming tasks sent from Telegram on the Claude CLI backend in default sandbox-only mode (AGENT_SANDBOX_ONLY=1) cannot complete because the Claude Code permission prompt is unreachable from Telegram. The brain returns a deterministic operator message instead of hanging. Two unblock paths: switch the runtime to API backend via /api/operator/llm, or set AGENT_SANDBOX_ONLY=0 on the server (explicit host opt-in). See docs/OPERATOR_HANDBOOK.md for details.

Headless server deployments: when running the agent as a daemon (systemd / Docker / nohup), set AGENT_CLI_AUTO_APPROVE=1 in .env so the CLI gets --dangerously-skip-permissions automatically. Sandbox isolation is preserved by passing --disallowed-tools "Bash,Edit,Write,NotebookEdit" so the LLM can read/search but cannot mutate the host filesystem. If left empty, the agent auto-detects TTY (also works for daemon mode).

Architecture

Telegram message
    |
1. Input sanitization (prompt injection guard)
2. /commands -> direct response (0 API calls)
3. Dispatcher -> pattern match (0 API calls)
4. Semantic router -> embedding classification (local compute)
5. Semantic cache -> cached response (local compute)
6. RAG -> knowledge base lookup (local compute)
7. Claude (Haiku $0.001 | Sonnet $0.01 | Opus $0.05-0.20)
    |
Response -> Telegram + memory + learning

Modules

Module What Status
core/ Orchestrator, router, watchdog, job runner, sandbox Stable
brain/ Decision engine, dispatcher, semantic router, skills, learning Stable
memory/ 4-type store, persistent conversation, RAG, consolidation Stable
social/ Telegram bot, handler, Agent-to-Agent API Stable
finance/ Budget, proposals (human-in-the-loop), audit trail Stable
vault/ Encrypted secrets (Fernet AES-128, PBKDF2 480K iterations) Stable
tasks/ Task lifecycle (create -> start -> complete) Stable
projects/ Project scoping Beta
review/ Structured review runtime, evidence export, reviewer handoff Stable
build/ Build execution, delivery packages, bundle lifecycle Stable
control/ Control-plane queries, runtime model, release readiness gate Stable
work/ Isolated workspaces Beta

~39,000 lines of code. Details: Modules wiki

Security

  • Input sanitization (prompt injection guard, EN + SK)
  • Owner identification + safe mode for non-owners in groups
  • Tool governance โ€” capability manifest + deterministic policy engine
  • Host file access blocked by default (AGENT_SANDBOX_ONLY=1)
  • Docker sandbox (read-only, no-network, resource limits, image whitelist)
  • Encrypted vault (fail-fast without key)
  • Approval queue for risk-sensitive actions (finance, host access, external writes)
  • API authentication (Bearer token) + rate limiting
  • Log redaction (secrets never in logs)
  • PID lockfile (prevents duplicate instances)
  • Single-file vault v2 format with crash-safe atomic writes (no temp/sidecar files left mid-write)
  • Wrong-key vault writes fail-fast with VaultDecryptionError (no silent destruction)
  • 129 automated security + invariant tests

Details: Security wiki

Testing

.venv/bin/python -m agent --setup-doctor
.venv/bin/python -m pytest tests/ -q   # 1762+ passed, offline, $0.00
Layer Tests What
Unit ~580 Individual modules
Integration 34 Cross-module flows
E2E 44 Full agent wiring
Security 116 Audit + invariants
Routing Evals 40+ Classification accuracy + adversarial
Governance 30+ Policy enforcement + action pipeline

All tests are offline โ€” no API calls, no Docker needed. Details: Testing wiki

Telegram Commands

Command Description
/status Agent status
/health CPU, RAM, disk, modules
/tasks Task list
/memory [keyword] Search memory
/budget Financial status
/newtask [name] Create task
/web [url] Fetch webpage
/sandbox [code] Run Python in Docker
/review [file] Code review
/wallet ETH/BTC status
/usage Token costs
/help All commands

Agent-to-Agent API

curl -X POST https://your-tunnel.trycloudflare.com/api/message \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your_api_key" \
  -d '{"message": "hello", "sender": "other-agent"}'

Details: API Reference wiki

Known Limitations

This project is honest about what works and what doesn't yet.

Area Status What's missing
Memory provenance Working Conflict detection is tag-based, not semantic. No auto-consolidation pipeline yet.
Tool governance Working Review repo/diff execution and intake budgets now run under deterministic policy boundaries, but build execution is still not governed by one fully unified engine.
Workspace Working No cleanup scheduler (must call cleanup_expired() manually).
Routing Working Keyword + signal heuristics. No ML-based classification.
Learning Partial Model failure tracking resets on restart. No eval set.
Finance Partial Approval + settlement workflows exist across Telegram, API, and dashboard, but this is still owner-operated rather than a richer multi-user finance console.
Multi-channel Foundation Telegram only in production. Discord/email are interfaces, not implemented.
Dashboard Partial API-key protected operator dashboard exists, but it is still a focused owner UI, not a broader multi-user app.

See docs/SECURITY_MODEL.md for security boundaries and docs/LEARNING_MODEL.md for learning system spec.

License

MIT

Release History

VersionChangesUrgencyDate
v1.36.0**Telegram practical memory, end-to-end self-update + restart, and a comprehensive paraphrase-aware intent layer.** Closes the operator complaints from a real production session: "agent doesn't remember the previous message after a fast-path reply", "stiahni si novu verziu a nasad to fell through to a 180s timeout", "raw `Chyba: {\"type\":\"result\",\"subtype\":\"errormaxturns\",...}` JSON leaked into chat", and "I want to update + restart from Telegram without SSH-ing into the server". ## HigHigh4/9/2026
v1.35.0**Tiered Logging, Vault Crash-Safety, Runtime LLM Control, and Security Hardening** โ€” deterministic log retention, single-file atomic vault format, operator-controlled backend selection, and a deep sweep of defense-in-depth fixes across dashboard, CLI, SQL, telegram, and brain. ## Highlights - **Vault single-file v2 format** (`ALSv2` magic + 16-byte random salt + Fernet token) with embedded random salt and crash-safe atomic migration โ€” zero corrupt-state window between salt and blob writes - *High4/8/2026
v1.34.0Self-Host Onboarding Closure - closes the v1.34.0 self-host onboarding slice with safer runtime defaults and stronger setup diagnostics - fresh installs now prefer .agent_runtime while legacy installs with existing runtime data under agent/ stay compatible - setup doctor now surfaces project_root, data_dir, identity profile path, pidfile path, and stronger self-host warnings - aligns CLI --data-dir and AGENT_DATA_DIR behavior across status, readiness, and operator flows - fixes audit-discoveredMedium4/2/2026
v1.33.0## Docker-Isolated Build Execution with Auto-Fix Retry ### What's New **Docker Project Executor** (`agent/build/docker_executor.py`) - `/build` now runs generated projects entirely inside Docker containers - Phases: pip install (with network) โ†’ pytest (no network) โ†’ ruff lint - Safety: 512MB RAM, 1 CPU, 5min timeout, read-only mount **Auto-Fix Retry Loop** - When tests fail, Opus receives test output + source code - Generates fixed code โ†’ re-runs tests in Docker โ†’ up to 2 retries - Full cycleMedium4/1/2026
v1.32.0## LLM Build Pipeline โ€” Description-Driven Code Generation ### What's New **LLM Code Generation** (`agent/build/codegen.py`) - `/build . --description "..."` now generates complete implementation files via Opus - Bridges natural language descriptions to deterministic WRITE_FILE build operations - Robust JSON parser handles markdown fences, newlines, trailing commas - Safety: only WRITE_FILE ops, relative paths, max operation cap **Bilingual Task Classification** - All classifier keyword sets Medium4/1/2026
v1.31.0## Runtime Contract Closure Closes the remaining auth, coupling, and extraction-readiness gaps. ### Dashboard Authentication - `/dashboard` now requires API key (header or `?key=` query param) - Unauthenticated access โ†’ minimal login page (no HTML leak) ### Public API Discipline - Settlement service no longer accesses `._storage` private attribute - Archival API uses `get_storage_for_archival()` public method - `ControlPlaneStateService` exposes settlement + archival public methods ### ExtraMedium4/1/2026
v1.30.0## Deployment Contract Hardening Makes the agent safer and more predictable for self-host deployment. ### Deny-by-Default Enforcement - **Removed AGENT_DEV_MODE bypass** from review and build delivery approval - Policy enforcement is no longer environment-dependent โ€” delivery without approval queue is always denied - No runtime path allows bypassing approval through env vars ### Explicit Configuration - **paths.py** raises RuntimeError if no valid project root (no silent `~/.agent-life-space`Medium4/1/2026
v1.29.0## Settlement Workflow Closure Turns payment settlement from a foundation-level service into an operator-ready workflow. ### What's New **Persistence** - Settlement requests stored in SQLite โ€” survive agent restart - Automatic load-on-init, persist-on-every-state-change **Operator Workflow** - `POST /api/operator/settlements/{id}/approve` โ€” approve topup - `POST /api/operator/settlements/{id}/deny` โ€” deny payment - `POST /api/operator/settlements/{id}/execute` โ€” topup + auto-retry original cMedium4/1/2026
v1.28.1## Phase 4 Enterprise Hardening โ€” Complete Release This release consolidates all Phase 4 work (v1.25.1 through v1.28.1) into a single verified baseline. ### Highlights **Operator Control Plane** - 12 authenticated REST API endpoints under `/api/operator/` - Self-contained HTML dashboard at `/dashboard` with real-time metrics - Archive CSV export + download with path-traversal protection - Settlement Telegram commands (`/settlement`, approve, deny) **Enterprise Hardening** - 26 CI-enforced arMedium4/1/2026
v1.24.1## Phase 3 Features (v1.22.0 โ†’ v1.24.0) ### v1.22.0 โ€” Provider Delivery Workflow + Runtime Telemetry - Enriched `/deliver` with provider outcome, receipt, attention, retry, outcome filters - `/report delivery` sub-command - `TelemetrySnapshot` model with job throughput, latency, cost, delivery health - `/telemetry [hours]` command with trend detection ### v1.23.0 โ€” Seller-Side Obolos + Multi-Provider Gateway + Architecture Invariants - `seller_publish_v1` and `wallet_topup_v1` capability routeMedium3/31/2026
v1.21.1## v1.21.1 โ€” Deployment portability and security fix ### Security Fix - Hostname redaction patterns now built dynamically from `AGENT_SERVER_NAME` โ€” non-default deployments no longer leak hostnames in client-safe bundles ### Portability - Consolidation triggers derived from `get_agent_identity()` (was hardcoded "john", "b2jk") - Dockerfile fixed (source COPY before pip install) - docker-compose.yml: 7 missing env vars added - .env.example: added `AGENT_SANDBOX_ONLY`, `LLM_BACKEND`, section heaMedium3/31/2026
v1.21.0## v1.21.0 โ€” Phase 3: cost feedback + unified policy boundary ### Cost Estimation Feedback (T6-E1-S1 closed) - `get_cost_accuracy()` joins plan records with cost ledger - `/report cost` Telegram subcommand - `TraceRecordKind.COST_ACCURACY` ### Unified Policy Boundary (T5-E1-S1 closed) - `RuntimeActionRequest` + `evaluate_runtime_action()` โ€” single entry point - Existing policy functions untouched **1407 passed, 4 skipped** | 376 ins, 5 del **Full Changelog**: https://github.com/B2JK-IndustryMedium3/30/2026
v1.20.0## v1.20.0 โ€” Phase 3: runtime capability binding + delivery workflow ### Runtime Capability Binding (T4-E2-S3 โ€” P0 closed) Review/verify/deliver planner phases now bind to execution policies and delivery policies with explicit IDs in metadata โ€” no longer just planner profile labels. | Phase | Before | After | |-------|--------|-------| | Review | `source: planner_profile` | `source: execution_policy` + `execution_policy_id` | | Verify | `source: planner_profile` | `source: planner_profile` (inMedium3/30/2026
v1.19.0## v1.19.0 โ€” Phase 3 kickoff: operator Telegram surface Phase 3 begins. Existing runtime capabilities (intake, planning, reporting) are now accessible from Telegram. ### New Telegram Commands | Command | Purpose | |---------|---------| | `/intake` | Unified operator intake โ€” qualify, plan, execute review/build from chat | | `/report` | Operator report with overview, inbox, budget views | | `/build` | Shortcut for build intake | ### Usage ``` /intake . --description "security audit" /intake agMedium3/30/2026
v1.18.0## v1.18.0 โ€” Security hardening and conservative fixes Conservative release from full codebase audit. **No files deleted, no methods removed, no interface changes.** ### Security (3 fixes) | Module | Fix | |--------|-----| | `sandbox.py` | Pip package names validated against safe regex before shell interpolation | | `learning.py` | Shell-quoted `_PROJECT_ROOT` via `shlex.quote()` in skill test commands | | `risk_templates.py` | CSV export fields properly escaped (RFC 4180 + formula injection gMedium3/30/2026
v1.17.0## v1.17.0 โ€” Audit-driven quality release Quality release from full codebase audit: 7 bug fixes, documentation sync, clean Phase 3 foundation. ### Bug Fixes | Severity | Module | Fix | |----------|--------|-----| | **HIGH** | `cron.py` | Month-boundary crash in morning report โ€” `timedelta` instead of `replace(day+1)` | | **HIGH** | `telegram_handler.py` | Operator precedence bug made simple-prompt branch unreachable | | **MEDIUM** | `store.py` | `query_facts()` now filters by SEMANTIC+PROCEDURMedium3/29/2026
v1.16.1Patch release fixing fresh-install Telegram identity defaults. Highlights: - removes Daniel-specific owner and Slovak-only response defaults from runtime prompts - preserves the real Telegram display name while passing explicit owner status - makes owner and language behavior deployment-configurable via AGENT_OWNER_NAME, AGENT_OWNER_FULL_NAME, and AGENT_DEFAULT_LANGUAGE - adds regression coverage for runtime persona identity and Telegram owner resolution Verification: - ./.venv/bin/ruff check Medium3/28/2026
v1.16.0Documented buyer-side Obolos gateway release. Highlights: - add a documented buyer-side Obolos API gateway path with provider-backed marketplace catalog, wallet balance, and slug-call capabilities - persist request/response artifacts, traces, and structured payment-required denials for external API calls - align release metadata, docs, and strategy snapshots for the post-Phase-2 buyer-side gateway slice Verification: - ./.venv/bin/ruff check . - ./.venv/bin/pytest -q - PATH="/Users/danielbabjaMedium3/28/2026
v1.15.0Phase 2 closure and release-readiness release. ## Highlights - deepens the bounded builder engine with deterministic `copy_file` and `move_file` operations - adds implementation-backed acceptance summaries over changed operations, paths, operation types, and implementation mode - adds deterministic release-readiness policy helpers, provider outcome reporting, and a real CLI/CI release gate - closes the Builder v1 / Phase 2 strategy slice with synced backlog docs and controlled-environment deploMedium3/28/2026
v1.14.0Builder now supports richer deterministic insert/delete-safe mutations with capability guardrails, while the external gateway now carries provider-specific request/receipt semantics and falls back on incomplete provider receipts too. Verification: - ./.venv/bin/ruff check . - ./.venv/bin/pytest -q (1337 passed, 4 skipped) - ./.venv/bin/pytest tests/test_build_domain.py tests/test_gateway.py tests/test_control_plane_jobs.py -q (122 passed) - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.toMedium3/28/2026
v1.13.0Phase 2 provider gateway release. Highlights: - add a concrete obolos.tech provider catalog with capability routes, readiness reporting, env/vault-backed auth resolution, and route fallback - add provider-aware delivery handoff through the external gateway for build and review bundles - add review-quality trend telemetry with release labels, runtime duration, and previous-baseline regression deltas - sync strategy docs, backlog, and release metadata to the new Phase 2 state Verification: - ./.Medium3/28/2026
v1.12.0Phase 2 verification hardening release. Highlights: - hardened builder verification discovery and command resolution across Python, Node/TS, Make, and CI hints - extended structured denial payloads across social/API, web, tool execution, and finance-budget boundaries - added golden review cases plus CI coverage for clean, secret, and unsafe-pattern verdicts - exposed the first explicit external gateway contract and enterprise-oriented data-handling rules - updated strategy docs, backlog, and reMedium3/28/2026
v1.11.0Phase 2 structured acceptance release. Highlights: - structured acceptance criteria now flow through CLI, unified intake, planner output, and builder handoff - deterministic acceptance evaluators now cover structured workspace, change-set, verification-kind, and review-threshold checks - strategy docs and backlog state are updated for the larger Phase 2 structured-acceptance slice Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1303 passed, 4 skipped - .venv/bin/pytest tests/teMedium3/28/2026
v1.10.0# v1.10.0 Phase 2 builder execution release. ## Highlights - Builder now supports a bounded local implementation engine for explicit structured workspace mutations, with deterministic `write_file`, `append_text`, `replace_text`, and `json_set` operations. - Build jobs now persist implementation mode plus per-operation execution results instead of flattening the mutable build step into an audit marker only. - CLI and unified operator intake can now carry structured implementation plaMedium3/28/2026
v1.9.1Phase 2 acceptance clarity release. - Acceptance criteria now support explicit required-vs-optional semantics and evaluator hints parsed from CLI/operator intake strings. - Builder can now fail clearly on unmet required criteria with structured denial payloads and detailed unmet-criterion summaries. - Build delivery summaries now expose required/optional acceptance counts plus blocking-vs-optional unmet criteria. - Strategy docs now move the next builder work toward richer deterministic acceptaMedium3/28/2026
v1.9.0Phase 2 kickoff release. - Build execution now uses explicit source-aware execution policies and records policy traces before mutable workspace execution. - Runtime model now exposes higher-level local_owner, operator_controlled, and enterprise_hardened operating profiles. - Build delivery bundles now carry suite-level plus per-step verification artifacts and richer acceptance handoff summaries. - Strategy docs and release metadata are updated for the Phase 2 kickoff slice. Verification: .venvMedium3/28/2026
v1.8.2Reviewer phase-1 closure release. Highlights: - adds operator-summary and copy-paste PR comment review artifacts - reuses those handoff summaries in client-safe evidence export - propagates structured denial payloads across core blocked flows and reporting - adds review eval smoke coverage to CI Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q (1285 passed, 4 skipped) - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.tools/node-v24.14.0-darwin-arm64/bin:/opt/homebrew/bin:/opt/hMedium3/28/2026
v1.8.1Phase 1 delivery closure release. Highlights: - review delivery now converges on the shared delivery lifecycle with explicit approval and handoff state - retained artifacts now support an explicit prune workflow through the control plane and CLI - evidence export now supports a client-safe review mode for safer operator/external handoff - operator reporting now surfaces approval backlog detail and retention posture, including expired/pruned counts - strategy, backlog, and documentation are updaMedium3/28/2026
v1.8.0Phase 1 closure hardening release. Highlights: - managed repo acquisition/import for supported git sources before runtime routing - evidence export packages with artifact traceability, retention, traces, costs, and runtime model metadata - explicit environment profiles for review/build/acquisition/export flows - multi-step approval thresholds for risky intake and delivery paths - budget-aware model escalation and richer persisted job telemetry Verification: - .venv/bin/ruff check . - .venv/binMedium3/28/2026
v1.7.0Review Runtime Convergence + Budget Governance release. Highlights: - converged Telegram and structured API review entrypoints through the shared runtime - deterministic review execution policies with durable control-plane traces - runtime hard-cap / stop-loss budget blocking plus approval-gated intake execution - richer operator budget posture reporting - repo-local builder toolchain fallback and no-criteria acceptance proxy Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1273Medium3/27/2026
v1.6.0Unified control-plane persistence and retention release. ## Highlights - persist shared ProductJobRecord metadata for build and review jobs in the control plane - add retained artifact records with policy ids, expiry timestamps, recoverability, and derived retention status - record per-job usage, token, and cost data in a durable control-plane ledger - expose persisted jobs, retained artifacts, and cost ledger entries through the orchestrator, CLI, operator report, and strategy docs - extend shMedium3/27/2026
v1.5.0## Durable Planning and Delivery Lifecycle This release turns operator planning and builder delivery into durable control-plane state. ### Highlights - Persisted `JobPlan` handoff records with stable plan IDs and shared list/get surfaces - Durable execution traces for qualification, budget, capability, delivery, verification discovery, and review policy decisions - Workspace join queries linking workspaces to jobs, artifacts, approvals, and delivery bundles - Repo-aware builder verification diMedium3/27/2026
v1.4.5Builder delivery package and operator health release. Highlights: - deterministic builder patch/diff artifacts plus build delivery package preview - richer acceptance evaluation using review, documentation, and target-file signals - approval linkage extended across jobs, artifacts, workspaces, and bundles - operator report now includes workspace health and worker execution summaries Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1255 passed, 4 skipped - PATH="/Users/danielbabjMedium3/27/2026
v1.4.4Planner qualification and phase routing release. What shipped: - operator intake now resolves scope signals, risk factors, and a policy-backed budget envelope using BudgetPolicy plus live finance budget state when available - JobPlan now exposes explicit qualify/review/build/verify/deliver phases in preview and submit flows - planner output now assigns concrete build catalog capabilities plus planner profiles and structured budget metadata - planner-selected build catalog capability ids now floMedium3/27/2026
v1.4.3Runtime model and artifact planning release. Highlights: - explicit runtime coexistence rules via RuntimeModelService and --runtime-model - shared artifact query/recovery across build and review - JobPlan preview/submit output for unified operator intake - operator report now includes recent artifacts Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1248 passed, 4 skipped - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.tools/node-v24.14.0-darwin-arm64/bin:/opt/homebrew/binMedium3/27/2026
v1.4.2# Agent Life Space v1.4.2 - Control-Plane Expansion Control-plane expansion release for the post-`v1.4.1` main state. ## What changed ### Platform / Control Plane - `ReviewJob` now uses shared control-plane primitives directly - Shared job queries now cover build, review, task, job-runner, and agent-loop runtime records - Approval requests are now persistent and queryable with job/artifact linkage - Operator reporting now has a real runtime surface via `OperatorReportService` and `python -m aMedium3/27/2026
v1.4.0# Agent Life Space v1.4.0 โ€” Backlog Zero All items from the master backlog have been implemented. ## What's new - **Multi-step approval** โ€” required_approvals, partial approval tracking - **Routing confusion analysis** โ€” systematic confusion detection + fallback tests - **Workspace recovery tests** โ€” 4 crash-recovery scenarios - **Finance proposal lifecycle** โ€” end-to-end tests - **Product identity** โ€” decision doc: personal sovereign operator - **Release checklist** โ€” standardized process - *Medium3/26/2026
v1.3.0# Agent Life Space v1.3.0 Completeness release โ€” remaining backlog items. ## Memory - **Factual/conversational separation** โ€” query_facts() vs query_conversations() - **Consolidation pipeline** โ€” inferred โ†’ verified promotion, stale detection ## Learning - **Rollback** โ€” reset skills to UNKNOWN, clear model failures - **Learning report** โ€” avg confidence, mastered/failed metrics ## Workspace - **Ownership** โ€” owner_id on workspaces - **Immutable audit** โ€” SHA-256 hash-chained entries (tamperMedium3/26/2026
v1.2.0# Agent Life Space v1.2.0 Operator-grade visibility and control. 5 PR, 1016 tests. ## API & Communication - **API audit trail** โ€” every request logged (sender, IP, status, duration) - **Replay protection wired** โ€” nonce + timestamp check in API handler - **Rate-limit telemetry** โ€” by-sender abuse detection ## Finance - **Budget policy** โ€” hard cap (block), soft cap (warn), per-tx approval cap - **Budget forecast** โ€” remaining budget at each level ## Operator Visibility - **Memory inspection*Medium3/26/2026
v1.1.0# Agent Life Space v1.1.0 โ€” Breakthrough Architecture 19 PR, 991 tests, ~8000 lines added. Zero regressions. ## Epistemic Memory - Provenance model (observed/user_asserted/inferred/verified/stale) - MemoryKind (fact/belief/claim/procedure) - FTS5 full-text search, conflict detection, audit report - Consolidation pipeline: inferred โ†’ verified promotion, stale detection ## Tool Governance - Capability manifest with risk/side-effect/approval per tool - ActionEnvelope โ€” 4-step pipeline: request โ†’Medium3/26/2026
v1.0.0# Agent Life Space v1.0.0 Prvรฝ stabilnรฝ release. Vลกetko od 0.1-beta zjednotenรฉ do jednej produkฤnej verzie. ## ๐Ÿง  Core - **7-layer cascade** โ€” 5 vrstiev lokรกlneho spracovania pred LLM (ลกetrรญ tokeny) - **Provider-agnostic LLM** โ€” ClaudeCliProvider, AnthropicProvider, OpenAiProvider - **ModelTier system** โ€” FAST / BALANCED / POWERFUL mapovanรฉ per provider - **AgentBrain** โ€” channel-agnostic message processing, zero shared state - **Tool use** โ€” 10 nรกstrojov pre LLM function calling + ToolUseLoopMedium3/26/2026
v0.1-beta## First public beta Self-hosted autonomous AI agent with: - 7-layer cascade (minimizes API calls) - Learning system with behavioral changes (model escalation, prompt augmentation) - Docker sandbox (mandatory, isolated code execution) - Encrypted vault (ETH/BTC wallets, API keys) - 430+ tests, 8 E2E deployment scenarios ### Quick Start ```bash git clone https://github.com/B2JK-Industry/Agent_Life_Space.git cd Agent_Life_Space cp .env.example .env docker compose up -d RequirementMedium3/24/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

flow-nextPlan-first AI workflow plugin for Claude Code, OpenAI Codex, and Factory Droid. Zero-dep task tracking, worker subagents, Ralph autonomous mode, cross-model reviews.flow-next-v1.6.0
GENesis-AGIAutonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.v3.0b13
runtmOpen-source sandboxes where coding agents build and deploy. Spin up isolated environments where Claude Code, Cursor, and other agents code and deploy software.main@2026-05-25
GhostDeskGive any AI agent a full desktop โ€” it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker commv7.4.1
axonixA self-evolving AI agent that gets more useful every daymain@2026-04-13

More in Security

clineAutonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
E2BOpen-source, secure environment with real-world tools for enterprise-grade agents.
vm0the easiest way to run natural language-described workflows automatically
AgenvoyAgentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution