Home > Security > Agent_Life_Space

Agent_Life_Space

Self-hosted autonomous AI agent — 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests

ai-agent anthropic autonomous-agent claude code-review control-plane docker llm python

Why this rank:Release freshnessHealthy release cadenceStrong adoption

Description

Self-hosted autonomous AI agent — 9-layer cascade, Docker sandbox, encrypted vault, review/build/control plane, 1407+ tests

README

Agent Life Space

Self-hosted autonomous AI agent that lives on your server. Thinks with Claude, acts through its own modules, communicates via Telegram.

Wiki | Architecture | Security | API Reference | Roadmap

What it does

9-layer cascade — dispatch → cache → RAG → classify → LLM → quality escalation → learning → channel filter → explanation
Docker sandbox — /sandbox code runs in isolated containers (256MB, no network, read-only FS)
Encrypted vault — API keys, wallet keys (ETH/BTC) encrypted with Fernet AES-128
Epistemic memory — 4 types + provenance model (observed/asserted/inferred/verified/stale), expiry, decay
Persistent conversation — SQLite-backed context with FTS5 full-text search, survives restarts
Agent-to-Agent API — HTTP endpoint for inter-agent communication
Structured review API — POST /api/review runs deterministic review jobs through the shared runtime
Learning system — skill outcome tracking, model escalation, prompt augmentation
Multi-provider LLM — Claude CLI, Anthropic API, OpenAI, Ollama (any backend)
Automated security — 129-test security audit + invariant suite
Tool governance — capability manifest, policy engine, 4-step action pipeline with audit trail
Workspace persistence — SQLite-backed workspaces with audit trail, limits, TTL, recovery
Approval queue — structured propose → approve/deny → execute workflow with persistent storage and linkage
Delivery packages — shared build/review delivery lifecycle with deterministic bundle previews, approval linkage, and explicit handoff state
Reviewer handoff artifacts — operator summaries and copy-paste-ready PR comment packs now persist as first-class review artifacts
Planner handoff + traces — persisted JobPlan records and durable qualification/budget/capability/delivery traces
Delivery lifecycle tracking — prepared → awaiting approval → approved/rejected → handed off with audit events
Workspace joins — workspaces now link to jobs, artifacts, approvals, and delivery bundles
Retained artifact records — build/review/delivery outputs now carry policy, expiry, recoverability, and prune-state metadata
Persisted product jobs — shared control-plane record of build/review job metadata, status, usage, and artifacts
Per-job cost ledger — durable usage/token/cost entries with report and CLI inspection
Runtime budget governance — hard-cap, stop-loss, and approval-gated intake execution
Managed repo acquisition — supported git_url intake can clone/import into a controlled local mirror before runtime routing
Evidence export — --export-evidence-job assembles internal or client-safe review packages with artifacts, traces, retention, and traceability
Environment profiles — explicit review/build/acquisition/export execution profiles exposed through the runtime model
Controlled-environment deployment — local-owner, operator-controlled, and enterprise-hardened runtime posture now has explicit deployment guidance
Multi-step approvals — risky intake and delivery paths can require more than one approval deterministically
Shared policy registry — deterministic job persistence, artifact retention, delivery, review-gate, and gateway defaults
Structured denials — shared machine-readable blocker payloads across policy, intake, delivery, and evidence export flows
Control-plane queries — shared inspection across build, review, task, job-runner, agent-loop, artifact, plan, delivery, and workspace state
Runtime model — explicit coexistence rules for product jobs, planning tasks, infrastructure jobs, and conversational queue items
Release readiness gate — deterministic CLI/CI quality and gateway posture gate before release or handoff
Operator dashboard — authenticated /dashboard surface for jobs, settlements, retention, audit, operator metrics, and one-click LLM runtime control
Settlement workflow — persisted 402/top-up approval flow across API, dashboard, and Telegram with retry support
Setup doctor — python -m agent --setup-doctor audits self-host identity, LLM, gateway, and operator posture before first run
Operator CLI surfaces — --report, --runtime-model, --llm-runtime-*, --export-evidence-job, --export-evidence-mode client_safe, --list-plans, --list-traces, --list-workspaces, --list-deliveries, --list-persisted-jobs, --list-retained-artifacts, --prune-expired-retained-artifacts, --list-cost-ledger, unified --intake-*, and explicit delivery handoff
Tiered structured logging — long-tier (~30 days) for lifecycle/build/finance/audit events, short-tier (~6 hours) for verbose pipeline diagnostics, hourly cron prune sweep
Crash-safe vault format — single-file v2 (ALSv2 magic + embedded random salt + Fernet token), atomic os.replace writes, automatic migration from v1
1762+ tests — unit + integration + e2e + security + routing evals + adversarial, $0.00 token cost

Quick Start

For first-time setup, follow docs/SETUP_LOCAL.md. It walks you through generating your own credentials, where to store them, and how to keep your personal data out of the repo. Nothing below references any specific operator's environment.

git clone https://github.com/B2JK-Industry/Agent_Life_Space.git
cd Agent_Life_Space
python3 -m venv .venv && source .venv/bin/activate
pip install -e . && pip install sentence-transformers

export AGENT_PROJECT_ROOT="$PWD"            # recommended for self-host + systemd
export AGENT_DATA_DIR="$PWD/.agent_runtime" # keeps runtime DBs/logs out of the source tree
export AGENT_PIDFILE_PATH="$PWD/.agent-life-space.pid"
export TELEGRAM_BOT_TOKEN="your_token"      # from @BotFather
export TELEGRAM_USER_ID="your_id"           # your Telegram user ID
export AGENT_NAME="MyAgent"                 # recommended
export AGENT_SERVER_NAME="my-server"        # recommended
export AGENT_VAULT_KEY="your_key"           # python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
export AGENT_API_KEY="your_api_key"         # python -c "import secrets; print(f'agent_api_{secrets.token_urlsafe(24)}')"

# choose one LLM backend:
# CLI backend (Claude Code installed and logged in on the same host)
export LLM_BACKEND="cli"
# export CLAUDE_CODE_OAUTH_TOKEN="..."

# or API backend
# export LLM_BACKEND="api"
# export LLM_PROVIDER="anthropic"
# export ANTHROPIC_API_KEY="sk-ant-..."

# optional: leave owner fields empty and let the first authorized Telegram message teach the owner profile
# export AGENT_OWNER_NAME="Your name"
# export AGENT_OWNER_FULL_NAME="Your full name"

.venv/bin/python -m agent --setup-doctor
.venv/bin/python -m agent

See Deployment guide for full setup (Docker, systemd, Cloudflare tunnel, firewall). After startup, the dashboard is available at /dashboard on the API port and uses the same AGENT_API_KEY.

Runtime LLM Control

You can detach the LLM entirely, or switch between CLI and API backends without editing .env each time. The runtime override is persisted under AGENT_DATA_DIR/control/llm_runtime.json.

.venv/bin/python -m agent --llm-runtime-status
.venv/bin/python -m agent --llm-runtime-disable --llm-runtime-note "maintenance"
.venv/bin/python -m agent --llm-runtime-enable --llm-runtime-backend cli --llm-runtime-note "back to Claude CLI"
.venv/bin/python -m agent --llm-runtime-enable --llm-runtime-backend api --llm-runtime-provider anthropic
.venv/bin/python -m agent --llm-runtime-follow-env --llm-runtime-enable

The same control surface is available via:

GET /api/operator/llm
POST /api/operator/llm
/dashboard LLM Runtime panel

Telegram + CLI backend caveat: programming tasks sent from Telegram on the Claude CLI backend in default sandbox-only mode (AGENT_SANDBOX_ONLY=1) cannot complete because the Claude Code permission prompt is unreachable from Telegram. The brain returns a deterministic operator message instead of hanging. Two unblock paths: switch the runtime to API backend via /api/operator/llm, or set AGENT_SANDBOX_ONLY=0 on the server (explicit host opt-in). See docs/OPERATOR_HANDBOOK.md for details.

Headless server deployments: when running the agent as a daemon (systemd / Docker / nohup), set AGENT_CLI_AUTO_APPROVE=1 in .env so the CLI gets --dangerously-skip-permissions automatically. Sandbox isolation is preserved by passing --disallowed-tools "Bash,Edit,Write,NotebookEdit" so the LLM can read/search but cannot mutate the host filesystem. If left empty, the agent auto-detects TTY (also works for daemon mode).

Architecture

Telegram message
    |
1. Input sanitization (prompt injection guard)
2. /commands -> direct response (0 API calls)
3. Dispatcher -> pattern match (0 API calls)
4. Semantic router -> embedding classification (local compute)
5. Semantic cache -> cached response (local compute)
6. RAG -> knowledge base lookup (local compute)
7. Claude (Haiku $0.001 | Sonnet $0.01 | Opus $0.05-0.20)
    |
Response -> Telegram + memory + learning

Modules

Module	What	Status
`core/`	Orchestrator, router, watchdog, job runner, sandbox	Stable
`brain/`	Decision engine, dispatcher, semantic router, skills, learning	Stable
`memory/`	4-type store, persistent conversation, RAG, consolidation	Stable
`social/`	Telegram bot, handler, Agent-to-Agent API	Stable
`finance/`	Budget, proposals (human-in-the-loop), audit trail	Stable
`vault/`	Encrypted secrets (Fernet AES-128, PBKDF2 480K iterations)	Stable
`tasks/`	Task lifecycle (create -> start -> complete)	Stable
`projects/`	Project scoping	Beta
`review/`	Structured review runtime, evidence export, reviewer handoff	Stable
`build/`	Build execution, delivery packages, bundle lifecycle	Stable
`control/`	Control-plane queries, runtime model, release readiness gate	Stable
`work/`	Isolated workspaces	Beta

~39,000 lines of code. Details: Modules wiki

Security

Input sanitization (prompt injection guard, EN + SK)
Owner identification + safe mode for non-owners in groups
Tool governance — capability manifest + deterministic policy engine
Host file access blocked by default (AGENT_SANDBOX_ONLY=1)
Docker sandbox (read-only, no-network, resource limits, image whitelist)
Encrypted vault (fail-fast without key)
Approval queue for risk-sensitive actions (finance, host access, external writes)
API authentication (Bearer token) + rate limiting
Log redaction (secrets never in logs)
PID lockfile (prevents duplicate instances)
Single-file vault v2 format with crash-safe atomic writes (no temp/sidecar files left mid-write)
Wrong-key vault writes fail-fast with VaultDecryptionError (no silent destruction)
129 automated security + invariant tests

Details: Security wiki

Testing

.venv/bin/python -m agent --setup-doctor
.venv/bin/python -m pytest tests/ -q   # 1762+ passed, offline, $0.00

Layer	Tests	What
Unit	~580	Individual modules
Integration	34	Cross-module flows
E2E	44	Full agent wiring
Security	116	Audit + invariants
Routing Evals	40+	Classification accuracy + adversarial
Governance	30+	Policy enforcement + action pipeline

All tests are offline — no API calls, no Docker needed. Details: Testing wiki

Telegram Commands

Command	Description
`/status`	Agent status
`/health`	CPU, RAM, disk, modules
`/tasks`	Task list
`/memory [keyword]`	Search memory
`/budget`	Financial status
`/newtask [name]`	Create task
`/web [url]`	Fetch webpage
`/sandbox [code]`	Run Python in Docker
`/review [file]`	Code review
`/wallet`	ETH/BTC status
`/usage`	Token costs
`/help`	All commands

Agent-to-Agent API

curl -X POST https://your-tunnel.trycloudflare.com/api/message \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your_api_key" \
  -d '{"message": "hello", "sender": "other-agent"}'

Details: API Reference wiki

Known Limitations

This project is honest about what works and what doesn't yet.

Area	Status	What's missing
Memory provenance	Working	Conflict detection is tag-based, not semantic. No auto-consolidation pipeline yet.
Tool governance	Working	Review repo/diff execution and intake budgets now run under deterministic policy boundaries, but build execution is still not governed by one fully unified engine.
Workspace	Working	No cleanup scheduler (must call `cleanup_expired()` manually).
Routing	Working	Keyword + signal heuristics. No ML-based classification.
Learning	Partial	Model failure tracking resets on restart. No eval set.
Finance	Partial	Approval + settlement workflows exist across Telegram, API, and dashboard, but this is still owner-operated rather than a richer multi-user finance console.
Multi-channel	Foundation	Telegram only in production. Discord/email are interfaces, not implemented.
Dashboard	Partial	API-key protected operator dashboard exists, but it is still a focused owner UI, not a broader multi-user app.

See docs/SECURITY_MODEL.md for security boundaries and docs/LEARNING_MODEL.md for learning system spec.

License

MIT

Release History

Version	Changes	Urgency	Date
v1.36.0	Telegram practical memory, end-to-end self-update + restart, and a comprehensive paraphrase-aware intent layer. Closes the operator complaints from a real production session: "agent doesn't remember the previous message after a fast-path reply", "stiahni si novu verziu a nasad to fell through to a 180s timeout", "raw `Chyba: {\"type\":\"result\",\"subtype\":\"errormaxturns\",...}` JSON leaked into chat", and "I want to update + restart from Telegram without SSH-ing into the server". ## Hig	High	4/9/2026
v1.35.0	Tiered Logging, Vault Crash-Safety, Runtime LLM Control, and Security Hardening — deterministic log retention, single-file atomic vault format, operator-controlled backend selection, and a deep sweep of defense-in-depth fixes across dashboard, CLI, SQL, telegram, and brain. ## Highlights - Vault single-file v2 format (`ALSv2` magic + 16-byte random salt + Fernet token) with embedded random salt and crash-safe atomic migration — zero corrupt-state window between salt and blob writes - *	High	4/8/2026
v1.34.0	Self-Host Onboarding Closure - closes the v1.34.0 self-host onboarding slice with safer runtime defaults and stronger setup diagnostics - fresh installs now prefer .agent_runtime while legacy installs with existing runtime data under agent/ stay compatible - setup doctor now surfaces project_root, data_dir, identity profile path, pidfile path, and stronger self-host warnings - aligns CLI --data-dir and AGENT_DATA_DIR behavior across status, readiness, and operator flows - fixes audit-discovered	Medium	4/2/2026
v1.33.0	## Docker-Isolated Build Execution with Auto-Fix Retry ### What's New Docker Project Executor (`agent/build/docker_executor.py`) - `/build` now runs generated projects entirely inside Docker containers - Phases: pip install (with network) → pytest (no network) → ruff lint - Safety: 512MB RAM, 1 CPU, 5min timeout, read-only mount Auto-Fix Retry Loop - When tests fail, Opus receives test output + source code - Generates fixed code → re-runs tests in Docker → up to 2 retries - Full cycle	Medium	4/1/2026
v1.32.0	## LLM Build Pipeline — Description-Driven Code Generation ### What's New LLM Code Generation (`agent/build/codegen.py`) - `/build . --description "..."` now generates complete implementation files via Opus - Bridges natural language descriptions to deterministic WRITE_FILE build operations - Robust JSON parser handles markdown fences, newlines, trailing commas - Safety: only WRITE_FILE ops, relative paths, max operation cap Bilingual Task Classification - All classifier keyword sets	Medium	4/1/2026
v1.31.0	## Runtime Contract Closure Closes the remaining auth, coupling, and extraction-readiness gaps. ### Dashboard Authentication - `/dashboard` now requires API key (header or `?key=` query param) - Unauthenticated access → minimal login page (no HTML leak) ### Public API Discipline - Settlement service no longer accesses `._storage` private attribute - Archival API uses `get_storage_for_archival()` public method - `ControlPlaneStateService` exposes settlement + archival public methods ### Extra	Medium	4/1/2026
v1.30.0	## Deployment Contract Hardening Makes the agent safer and more predictable for self-host deployment. ### Deny-by-Default Enforcement - Removed AGENT_DEV_MODE bypass from review and build delivery approval - Policy enforcement is no longer environment-dependent — delivery without approval queue is always denied - No runtime path allows bypassing approval through env vars ### Explicit Configuration - paths.py raises RuntimeError if no valid project root (no silent `~/.agent-life-space`	Medium	4/1/2026
v1.29.0	## Settlement Workflow Closure Turns payment settlement from a foundation-level service into an operator-ready workflow. ### What's New Persistence - Settlement requests stored in SQLite — survive agent restart - Automatic load-on-init, persist-on-every-state-change Operator Workflow - `POST /api/operator/settlements/{id}/approve` — approve topup - `POST /api/operator/settlements/{id}/deny` — deny payment - `POST /api/operator/settlements/{id}/execute` — topup + auto-retry original c	Medium	4/1/2026
v1.28.1	## Phase 4 Enterprise Hardening — Complete Release This release consolidates all Phase 4 work (v1.25.1 through v1.28.1) into a single verified baseline. ### Highlights Operator Control Plane - 12 authenticated REST API endpoints under `/api/operator/` - Self-contained HTML dashboard at `/dashboard` with real-time metrics - Archive CSV export + download with path-traversal protection - Settlement Telegram commands (`/settlement`, approve, deny) Enterprise Hardening - 26 CI-enforced ar	Medium	4/1/2026
v1.24.1	## Phase 3 Features (v1.22.0 → v1.24.0) ### v1.22.0 — Provider Delivery Workflow + Runtime Telemetry - Enriched `/deliver` with provider outcome, receipt, attention, retry, outcome filters - `/report delivery` sub-command - `TelemetrySnapshot` model with job throughput, latency, cost, delivery health - `/telemetry [hours]` command with trend detection ### v1.23.0 — Seller-Side Obolos + Multi-Provider Gateway + Architecture Invariants - `seller_publish_v1` and `wallet_topup_v1` capability route	Medium	3/31/2026
v1.21.1	## v1.21.1 — Deployment portability and security fix ### Security Fix - Hostname redaction patterns now built dynamically from `AGENT_SERVER_NAME` — non-default deployments no longer leak hostnames in client-safe bundles ### Portability - Consolidation triggers derived from `get_agent_identity()` (was hardcoded "john", "b2jk") - Dockerfile fixed (source COPY before pip install) - docker-compose.yml: 7 missing env vars added - .env.example: added `AGENT_SANDBOX_ONLY`, `LLM_BACKEND`, section hea	Medium	3/31/2026
v1.21.0	## v1.21.0 — Phase 3: cost feedback + unified policy boundary ### Cost Estimation Feedback (T6-E1-S1 closed) - `get_cost_accuracy()` joins plan records with cost ledger - `/report cost` Telegram subcommand - `TraceRecordKind.COST_ACCURACY` ### Unified Policy Boundary (T5-E1-S1 closed) - `RuntimeActionRequest` + `evaluate_runtime_action()` — single entry point - Existing policy functions untouched 1407 passed, 4 skipped \| 376 ins, 5 del Full Changelog: https://github.com/B2JK-Industry	Medium	3/30/2026
v1.20.0	## v1.20.0 — Phase 3: runtime capability binding + delivery workflow ### Runtime Capability Binding (T4-E2-S3 — P0 closed) Review/verify/deliver planner phases now bind to execution policies and delivery policies with explicit IDs in metadata — no longer just planner profile labels. \| Phase \| Before \| After \| \|-------\|--------\|-------\| \| Review \| `source: planner_profile` \| `source: execution_policy` + `execution_policy_id` \| \| Verify \| `source: planner_profile` \| `source: planner_profile` (in	Medium	3/30/2026
v1.19.0	## v1.19.0 — Phase 3 kickoff: operator Telegram surface Phase 3 begins. Existing runtime capabilities (intake, planning, reporting) are now accessible from Telegram. ### New Telegram Commands \| Command \| Purpose \| \|---------\|---------\| \| `/intake` \| Unified operator intake — qualify, plan, execute review/build from chat \| \| `/report` \| Operator report with overview, inbox, budget views \| \| `/build` \| Shortcut for build intake \| ### Usage ``` /intake . --description "security audit" /intake ag	Medium	3/30/2026
v1.18.0	## v1.18.0 — Security hardening and conservative fixes Conservative release from full codebase audit. No files deleted, no methods removed, no interface changes. ### Security (3 fixes) \| Module \| Fix \| \|--------\|-----\| \| `sandbox.py` \| Pip package names validated against safe regex before shell interpolation \| \| `learning.py` \| Shell-quoted `_PROJECT_ROOT` via `shlex.quote()` in skill test commands \| \| `risk_templates.py` \| CSV export fields properly escaped (RFC 4180 + formula injection g	Medium	3/30/2026
v1.17.0	## v1.17.0 — Audit-driven quality release Quality release from full codebase audit: 7 bug fixes, documentation sync, clean Phase 3 foundation. ### Bug Fixes \| Severity \| Module \| Fix \| \|----------\|--------\|-----\| \| HIGH \| `cron.py` \| Month-boundary crash in morning report — `timedelta` instead of `replace(day+1)` \| \| HIGH \| `telegram_handler.py` \| Operator precedence bug made simple-prompt branch unreachable \| \| MEDIUM \| `store.py` \| `query_facts()` now filters by SEMANTIC+PROCEDUR	Medium	3/29/2026
v1.16.1	Patch release fixing fresh-install Telegram identity defaults. Highlights: - removes Daniel-specific owner and Slovak-only response defaults from runtime prompts - preserves the real Telegram display name while passing explicit owner status - makes owner and language behavior deployment-configurable via AGENT_OWNER_NAME, AGENT_OWNER_FULL_NAME, and AGENT_DEFAULT_LANGUAGE - adds regression coverage for runtime persona identity and Telegram owner resolution Verification: - ./.venv/bin/ruff check	Medium	3/28/2026
v1.16.0	Documented buyer-side Obolos gateway release. Highlights: - add a documented buyer-side Obolos API gateway path with provider-backed marketplace catalog, wallet balance, and slug-call capabilities - persist request/response artifacts, traces, and structured payment-required denials for external API calls - align release metadata, docs, and strategy snapshots for the post-Phase-2 buyer-side gateway slice Verification: - ./.venv/bin/ruff check . - ./.venv/bin/pytest -q - PATH="/Users/danielbabja	Medium	3/28/2026
v1.15.0	Phase 2 closure and release-readiness release. ## Highlights - deepens the bounded builder engine with deterministic `copy_file` and `move_file` operations - adds implementation-backed acceptance summaries over changed operations, paths, operation types, and implementation mode - adds deterministic release-readiness policy helpers, provider outcome reporting, and a real CLI/CI release gate - closes the Builder v1 / Phase 2 strategy slice with synced backlog docs and controlled-environment deplo	Medium	3/28/2026
v1.14.0	Builder now supports richer deterministic insert/delete-safe mutations with capability guardrails, while the external gateway now carries provider-specific request/receipt semantics and falls back on incomplete provider receipts too. Verification: - ./.venv/bin/ruff check . - ./.venv/bin/pytest -q (1337 passed, 4 skipped) - ./.venv/bin/pytest tests/test_build_domain.py tests/test_gateway.py tests/test_control_plane_jobs.py -q (122 passed) - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.to	Medium	3/28/2026
v1.13.0	Phase 2 provider gateway release. Highlights: - add a concrete obolos.tech provider catalog with capability routes, readiness reporting, env/vault-backed auth resolution, and route fallback - add provider-aware delivery handoff through the external gateway for build and review bundles - add review-quality trend telemetry with release labels, runtime duration, and previous-baseline regression deltas - sync strategy docs, backlog, and release metadata to the new Phase 2 state Verification: - ./.	Medium	3/28/2026
v1.12.0	Phase 2 verification hardening release. Highlights: - hardened builder verification discovery and command resolution across Python, Node/TS, Make, and CI hints - extended structured denial payloads across social/API, web, tool execution, and finance-budget boundaries - added golden review cases plus CI coverage for clean, secret, and unsafe-pattern verdicts - exposed the first explicit external gateway contract and enterprise-oriented data-handling rules - updated strategy docs, backlog, and re	Medium	3/28/2026
v1.11.0	Phase 2 structured acceptance release. Highlights: - structured acceptance criteria now flow through CLI, unified intake, planner output, and builder handoff - deterministic acceptance evaluators now cover structured workspace, change-set, verification-kind, and review-threshold checks - strategy docs and backlog state are updated for the larger Phase 2 structured-acceptance slice Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1303 passed, 4 skipped - .venv/bin/pytest tests/te	Medium	3/28/2026
v1.10.0	# v1.10.0 Phase 2 builder execution release. ## Highlights - Builder now supports a bounded local implementation engine for explicit structured workspace mutations, with deterministic `write_file`, `append_text`, `replace_text`, and `json_set` operations. - Build jobs now persist implementation mode plus per-operation execution results instead of flattening the mutable build step into an audit marker only. - CLI and unified operator intake can now carry structured implementation pla	Medium	3/28/2026
v1.9.1	Phase 2 acceptance clarity release. - Acceptance criteria now support explicit required-vs-optional semantics and evaluator hints parsed from CLI/operator intake strings. - Builder can now fail clearly on unmet required criteria with structured denial payloads and detailed unmet-criterion summaries. - Build delivery summaries now expose required/optional acceptance counts plus blocking-vs-optional unmet criteria. - Strategy docs now move the next builder work toward richer deterministic accepta	Medium	3/28/2026
v1.9.0	Phase 2 kickoff release. - Build execution now uses explicit source-aware execution policies and records policy traces before mutable workspace execution. - Runtime model now exposes higher-level local_owner, operator_controlled, and enterprise_hardened operating profiles. - Build delivery bundles now carry suite-level plus per-step verification artifacts and richer acceptance handoff summaries. - Strategy docs and release metadata are updated for the Phase 2 kickoff slice. Verification: .venv	Medium	3/28/2026
v1.8.2	Reviewer phase-1 closure release. Highlights: - adds operator-summary and copy-paste PR comment review artifacts - reuses those handoff summaries in client-safe evidence export - propagates structured denial payloads across core blocked flows and reporting - adds review eval smoke coverage to CI Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q (1285 passed, 4 skipped) - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.tools/node-v24.14.0-darwin-arm64/bin:/opt/homebrew/bin:/opt/h	Medium	3/28/2026
v1.8.1	Phase 1 delivery closure release. Highlights: - review delivery now converges on the shared delivery lifecycle with explicit approval and handoff state - retained artifacts now support an explicit prune workflow through the control plane and CLI - evidence export now supports a client-safe review mode for safer operator/external handoff - operator reporting now surfaces approval backlog detail and retention posture, including expired/pruned counts - strategy, backlog, and documentation are upda	Medium	3/28/2026
v1.8.0	Phase 1 closure hardening release. Highlights: - managed repo acquisition/import for supported git sources before runtime routing - evidence export packages with artifact traceability, retention, traces, costs, and runtime model metadata - explicit environment profiles for review/build/acquisition/export flows - multi-step approval thresholds for risky intake and delivery paths - budget-aware model escalation and richer persisted job telemetry Verification: - .venv/bin/ruff check . - .venv/bin	Medium	3/28/2026
v1.7.0	Review Runtime Convergence + Budget Governance release. Highlights: - converged Telegram and structured API review entrypoints through the shared runtime - deterministic review execution policies with durable control-plane traces - runtime hard-cap / stop-loss budget blocking plus approval-gated intake execution - richer operator budget posture reporting - repo-local builder toolchain fallback and no-criteria acceptance proxy Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1273	Medium	3/27/2026
v1.6.0	Unified control-plane persistence and retention release. ## Highlights - persist shared ProductJobRecord metadata for build and review jobs in the control plane - add retained artifact records with policy ids, expiry timestamps, recoverability, and derived retention status - record per-job usage, token, and cost data in a durable control-plane ledger - expose persisted jobs, retained artifacts, and cost ledger entries through the orchestrator, CLI, operator report, and strategy docs - extend sh	Medium	3/27/2026
v1.5.0	## Durable Planning and Delivery Lifecycle This release turns operator planning and builder delivery into durable control-plane state. ### Highlights - Persisted `JobPlan` handoff records with stable plan IDs and shared list/get surfaces - Durable execution traces for qualification, budget, capability, delivery, verification discovery, and review policy decisions - Workspace join queries linking workspaces to jobs, artifacts, approvals, and delivery bundles - Repo-aware builder verification di	Medium	3/27/2026
v1.4.5	Builder delivery package and operator health release. Highlights: - deterministic builder patch/diff artifacts plus build delivery package preview - richer acceptance evaluation using review, documentation, and target-file signals - approval linkage extended across jobs, artifacts, workspaces, and bundles - operator report now includes workspace health and worker execution summaries Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1255 passed, 4 skipped - PATH="/Users/danielbabj	Medium	3/27/2026
v1.4.4	Planner qualification and phase routing release. What shipped: - operator intake now resolves scope signals, risk factors, and a policy-backed budget envelope using BudgetPolicy plus live finance budget state when available - JobPlan now exposes explicit qualify/review/build/verify/deliver phases in preview and submit flows - planner output now assigns concrete build catalog capabilities plus planner profiles and structured budget metadata - planner-selected build catalog capability ids now flo	Medium	3/27/2026
v1.4.3	Runtime model and artifact planning release. Highlights: - explicit runtime coexistence rules via RuntimeModelService and --runtime-model - shared artifact query/recovery across build and review - JobPlan preview/submit output for unified operator intake - operator report now includes recent artifacts Verification: - .venv/bin/ruff check . - .venv/bin/pytest -q -> 1248 passed, 4 skipped - PATH="/Users/danielbabjak/Desktop/Agent_Life_Space/.tools/node-v24.14.0-darwin-arm64/bin:/opt/homebrew/bin	Medium	3/27/2026
v1.4.2	# Agent Life Space v1.4.2 - Control-Plane Expansion Control-plane expansion release for the post-`v1.4.1` main state. ## What changed ### Platform / Control Plane - `ReviewJob` now uses shared control-plane primitives directly - Shared job queries now cover build, review, task, job-runner, and agent-loop runtime records - Approval requests are now persistent and queryable with job/artifact linkage - Operator reporting now has a real runtime surface via `OperatorReportService` and `python -m a	Medium	3/27/2026
v1.4.0	# Agent Life Space v1.4.0 — Backlog Zero All items from the master backlog have been implemented. ## What's new - Multi-step approval — required_approvals, partial approval tracking - Routing confusion analysis — systematic confusion detection + fallback tests - Workspace recovery tests — 4 crash-recovery scenarios - Finance proposal lifecycle — end-to-end tests - Product identity — decision doc: personal sovereign operator - Release checklist — standardized process - *	Medium	3/26/2026
v1.3.0	# Agent Life Space v1.3.0 Completeness release — remaining backlog items. ## Memory - Factual/conversational separation — query_facts() vs query_conversations() - Consolidation pipeline — inferred → verified promotion, stale detection ## Learning - Rollback — reset skills to UNKNOWN, clear model failures - Learning report — avg confidence, mastered/failed metrics ## Workspace - Ownership — owner_id on workspaces - Immutable audit — SHA-256 hash-chained entries (tamper	Medium	3/26/2026
v1.2.0	# Agent Life Space v1.2.0 Operator-grade visibility and control. 5 PR, 1016 tests. ## API & Communication - API audit trail — every request logged (sender, IP, status, duration) - Replay protection wired — nonce + timestamp check in API handler - Rate-limit telemetry — by-sender abuse detection ## Finance - Budget policy — hard cap (block), soft cap (warn), per-tx approval cap - Budget forecast — remaining budget at each level ## Operator Visibility - *Memory inspection	Medium	3/26/2026
v1.1.0	# Agent Life Space v1.1.0 — Breakthrough Architecture 19 PR, 991 tests, ~8000 lines added. Zero regressions. ## Epistemic Memory - Provenance model (observed/user_asserted/inferred/verified/stale) - MemoryKind (fact/belief/claim/procedure) - FTS5 full-text search, conflict detection, audit report - Consolidation pipeline: inferred → verified promotion, stale detection ## Tool Governance - Capability manifest with risk/side-effect/approval per tool - ActionEnvelope — 4-step pipeline: request →	Medium	3/26/2026
v1.0.0	# Agent Life Space v1.0.0 Prvý stabilný release. Všetko od 0.1-beta zjednotené do jednej produkčnej verzie. ## 🧠 Core - 7-layer cascade — 5 vrstiev lokálneho spracovania pred LLM (šetrí tokeny) - Provider-agnostic LLM — ClaudeCliProvider, AnthropicProvider, OpenAiProvider - ModelTier system — FAST / BALANCED / POWERFUL mapované per provider - AgentBrain — channel-agnostic message processing, zero shared state - Tool use — 10 nástrojov pre LLM function calling + ToolUseLoop	Medium	3/26/2026
v0.1-beta	## First public beta Self-hosted autonomous AI agent with: - 7-layer cascade (minimizes API calls) - Learning system with behavioral changes (model escalation, prompt augmentation) - Docker sandbox (mandatory, isolated code execution) - Encrypted vault (ETH/BTC wallets, API keys) - 430+ tests, 8 E2E deployment scenarios ### Quick Start ```bash git clone https://github.com/B2JK-Industry/Agent_Life_Space.git cd Agent_Life_Space cp .env.example .env docker compose up -d Requirement	Medium	3/24/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

flow-nextPlan-first AI workflow plugin for Claude Code, OpenAI Codex, and Factory Droid. Zero-dep task tracking, worker subagents, Ralph autonomous mode, cross-model reviews.flow-next-v1.6.0

GENesis-AGIAutonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.v3.0b13

runtmOpen-source sandboxes where coding agents build and deploy. Spin up isolated environments where Claude Code, Cursor, and other agents code and deploy software.main@2026-05-25

GhostDeskGive any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker commv7.4.1

axonixA self-evolving AI agent that gets more useful every daymain@2026-04-13

More in Security

E2BOpen-source, secure environment with real-world tools for enterprise-grade agents.

vm0the easiest way to run natural language-described workflows automatically

AgenvoyAgentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution

clineAutonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.