VisionClaw-Agent-Public-Release

Open-source multi-tenant AI agent platform: 14 specialized agents, 195+ tools, 37+ AI models. Self-hosted. Fork and deploy your own AI operations team.

README

VisionClaw Agent

Open-Source Multi-Tenant AI Agent Workspace: Documents, Research & Workflows

Built for agencies, operators, and founders who want an always-on AI operations team they own and host themselves.

Created by Robert Washburn | huskyauto@gmail.com | Live demo: example.com

Not affiliated with the unrelated AR/wearable "VisionClaw" project. This is the AI agent platform.


What Is This?

VisionClaw Agent is an open-source, multi-tenant AI platform where 16 specialized agents work together to produce real deliverables: research reports, legal documents, financial models, marketing campaigns, slide decks, spreadsheets, and PDFs.

Instead of a single chatbot, you get a full agent workforce. Give it a task. The right agent picks it up, selects the right tools, coordinates with other agents when needed, and delivers a finished result. Every decision is traceable, every action is governed, and every integration degrades gracefully when not configured.

Fork it. Configure your API keys. Deploy. You have an AI operations team.

The app runs with just one LLM key and a Postgres database. Everything else (email, payments, voice, Drive) is optional and appears automatically when you add the key.

Over 147k lines of TypeScript across 348 files. 40+ pages. 226 tools. 62 skills. 37+ AI models. 6 providers. 130 tables. 40 governance rules.

Deploy your own copy

Platform | One-click
Replit | Open in Replit
Render | Deploy to Render
Railway | Deploy on Railway
Docker | docker compose up (see FORK-SETUP.md)

After deploy, you'll need a Postgres database with the vector extension (Render and Railway can provision one for you), one LLM key (OPENAI_API_KEY or ANTHROPIC_API_KEY), and a SESSION_SECRET. Everything else is optional.

Screenshot (VisionClaw Landing Page): landing page with live agent activity feed and command center stats

Screenshot (VisionClaw Setup Dashboard): first-run setup dashboard with real-time status of every integration


Try These Prompts

Once you're set up, paste any of these into the chat to see the platform in action:

Prompt | What Happens
"Research the top 5 competitors in [your industry] and build me a comparison spreadsheet" | Radar researches, Atlas structures data, exports a formatted .xlsx to Google Drive
"Draft a professional proposal for [client name] based on our last conversation" | Scribe pulls context from memory, writes a styled PDF, Proof reviews it for quality
"Analyze this contract for risks" (attach a PDF) | Luna scans for 20 risk patterns across 9 regulatory frameworks, scores compliance
"Create a weekly content calendar for our social media" | Teagan builds a structured plan with post ideas, hashtags, and optimal timing
"Give me a financial forecast for Q3 based on current revenue trends" | Cassandra models projections, generates charts, delivers an executive summary
"What happened in AI news this week?" | Neptune runs a deep research sweep across arXiv, HN, Reddit, and tech blogs

Tour: what you actually see

Real screenshots from the live instance at example.com.

  • Landing hero ("Hire an AI corporation, not another chatbot"): value prop in one line, with three real CTAs.
  • Command Center (16 agents, 226 tools, 37+ models, live workflows): live counts, recent ops with status pills, capability chips.
  • Agent Activity Feed: Neptune ships research, Luna runs compliance, Cassandra builds the Excel model, Felix delivers the styled PDF.
  • Mixture of Agents: 4 frontier proposers (Sonnet · GPT-4.1 · Gemini 2.5 Pro · DeepSeek Reasoner) feed a Claude Opus aggregator for ensemble-quality answers.
  • Gated Code Proposals (R25): nightly research generates real edits; a shadow verifier compiles each in an isolated git worktree before any human reviewer sees it.
  • Glasses Gateway: Meta Ray-Ban smart glasses stream POV video and audio to a tenant-isolated gateway with sub-second voice replies via Gemini Live.


Platform at a Glance

Metric | Count
AI Agents (Personas) | 16
Built-in Tools | 226
AI Models Supported | 37+
AI Providers | 6 (OpenAI, Anthropic, Google, xAI, OpenRouter, Perplexity)
Governance Rules | 40
Corporate Operation Scaffolds | 75
Corporate Departments | 12
Agent Skills | 62
Frontend Pages | 40+
API Endpoints | 300+
Database Tables | 130

How It Works

    User Request
         │
         ▼
┌──────────────────┐     ┌─────────────────────────────┐
│   Chat Engine    │────▶│   Agent Router              │
│  (SSE streaming) │     │   picks best agent for task │
└──────────────────┘     └───────┬─────────────────────┘
                                 │
                   ┌─────────────┼─────────────┐
                   ▼             ▼             ▼
              ┌──────────┐  ┌──────────┐  ┌──────────┐
              │  Felix   │  │ Minerva  │  │ Neptune  │  ... 16 agents
              │  (CEO)   │  │(Strategy)│  │(Research)│
              └────┬─────┘  └────┬─────┘  └────┬─────┘
                   │             │             │
                   ▼             ▼             ▼
            ┌─────────────────────────────────────────┐
            │          226 Tools                      │
            │  Search · Write · Build · Analyze ·     │
            │  Email · Pay · Generate · Research      │
            └────────────────────┬────────────────────┘
                                 │
                ┌────────────────┼────────────────┐
                ▼                ▼                ▼
          ┌────────────┐   ┌────────────┐   ┌────────────┐
          │ PostgreSQL │   │ Google     │   │ 6 AI       │
          │ + pgvector │   │ Drive      │   │ Providers  │
          │ 130 tables │   │ Storage    │   │ 37+ models │
          └────────────┘   └────────────┘   └────────────┘

Example flow: You say "Research competitor pricing and build me a comparison spreadsheet."

  1. The Chat Engine routes to Felix (CEO) who sees this needs research + document production
  2. Felix spawns Radar (Intelligence) to research competitors and Atlas (Metrics) to structure the data
  3. Radar uses web search and scraping tools, deposits findings into the knowledge base
  4. Atlas pulls findings, builds a formatted Excel spreadsheet, uploads to Google Drive
  5. You get back a summary with a download link, with no manual steps
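
The flow above is a small dependency graph: the spreadsheet cannot start until research finishes, and the summary waits on the spreadsheet. A minimal sketch of how such a graph could be grouped into parallel "waves" follows. The `Task` shape, the `planWaves` helper, and the task ids are hypothetical illustrations, not the platform's actual orchestrator API.

```typescript
// Hypothetical task shape; the real orchestrator's types are not shown in this README.
interface Task {
  id: string;
  agent: string;
  dependsOn: string[];
}

// Group tasks into waves: every task in a wave depends only on tasks from
// earlier waves, so all tasks inside one wave could run in parallel.
function planWaves(tasks: Task[]): string[][] {
  const done = new Set<string>();
  let remaining = tasks.slice();
  const waves: string[][] = [];

  while (remaining.length > 0) {
    const ready = remaining.filter((t) => t.dependsOn.every((d) => done.has(d)));
    if (ready.length === 0) {
      // Nothing can make progress: the graph has a cycle or an unknown id.
      throw new Error("Cycle or unknown dependency in task graph");
    }
    waves.push(ready.map((t) => t.id));
    ready.forEach((t) => done.add(t.id));
    remaining = remaining.filter((t) => !done.has(t.id));
  }
  return waves;
}

// Illustrative decomposition of the competitor-pricing request.
const exampleTasks: Task[] = [
  { id: "search", agent: "Radar", dependsOn: [] },
  { id: "scrape", agent: "Radar", dependsOn: [] },
  { id: "spreadsheet", agent: "Atlas", dependsOn: ["search", "scrape"] },
  { id: "summary", agent: "Felix", dependsOn: ["spreadsheet"] },
];

console.log(planWaves(exampleTasks));
```

Here the two research tasks land in the first wave, the spreadsheet in the second, and the summary in the third.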

The 16-Agent Team

Every agent has a defined role, personality, skill set, and operating rules. They work independently or collaborate through orchestration engines.

Agent | Role | What They Do
VisionClaw | Personal Assistant | Default conversational agent: handles general tasks, delegates complex ones
Felix | CEO / COO | Revenue strategy, task orchestration, multi-agent DAG decomposition
Forge | Staff Engineer | Code execution, engineering standards, infrastructure, security review
Teagan | Content Marketing | Social media strategy, content calendars, brand voice, ad copy
Blueprint | Innovation Lead | Skill creation, tool learning, self-improvement, capability expansion
Chief of Staff | Operations Director | System health monitoring, task routing, scheduling, daily operations
Scribe | Content Creator | Long-form writing, editing, SEO content, documentation, blog posts
Proof | Quality Reviewer | Proofreading, fact-checking, QA, content review, accuracy scoring
Radar | Intelligence Analyst | Market intelligence, competitive analysis, trend tracking, OSINT
Neptune | Deep Research | Academic analysis, overnight autonomous research, multimedia deep dives
Apollo | Revenue & Pipeline | Sales outreach, lead qualification, pipeline management, CRM
Atlas | Metrics & Reporting | Analytics, dashboards, KPI tracking, data visualization
Cassandra | CFO | Budgets, forecasting, P&L modeling, financial analysis
Luna | Legal & Compliance | Contract review, regulatory compliance, risk assessment, legal drafting
Minerva | Strategic Planner | Plan-of-record drafting, decision-theory analysis, Felix approval-loop partner; closes the auto-apply to strategic-plan loop for the proactive self-healing engine (R63)
Robert | Wellness Coach | Emotional-eating coach for the [Your Product] wellness layer (R56): CBT/DBT/ACT/IPT framing, somatic stress interventions, shame-spiral grounding scripts

Feature Overview

AI & Intelligence

  • 37+ AI Models with cost-aware auto-routing across OpenAI, Anthropic, Google Gemini, xAI Grok, OpenRouter, and Perplexity
  • Subscription-First Routing: connect your existing ChatGPT Plus or Gemini Advanced subscription via OAuth to use for inference at $0 API cost
  • Streaming Responses via Server-Sent Events (SSE): real-time token-by-token output
  • Thinking Mode: explainable reasoning with decision traces for complex problems
  • Model Failover: automatic fallback to healthy providers when one goes down
  • Context Window Management: automatic conversation compaction that preserves every fact before summarizing
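
To make cost-aware routing and model failover concrete, here is a minimal sketch. It assumes each model carries a health flag and a relative cost; `ModelOption`, `routeModels`, and `completeWithFailover` are hypothetical names, and the cost field is a placeholder rather than real pricing.

```typescript
interface ModelOption {
  id: string;              // placeholder identifier, not a real model name
  provider: string;
  costPer1kTokens: number; // made-up relative cost, for illustration only
  healthy: boolean;
}

// Keep only healthy models and order them cheapest-first.
// filter() returns a new array, so the caller's list is not mutated by sort().
function routeModels(options: ModelOption[]): ModelOption[] {
  return options
    .filter((o) => o.healthy)
    .sort((a, b) => a.costPer1kTokens - b.costPer1kTokens);
}

// Try each candidate in routing order, falling over to the next one when a call throws.
async function completeWithFailover(
  options: ModelOption[],
  call: (model: ModelOption) => Promise<string>,
): Promise<{ modelId: string; text: string }> {
  const errors: string[] = [];
  for (const model of routeModels(options)) {
    try {
      return { modelId: model.id, text: await call(model) };
    } catch (err) {
      errors.push(`${model.id}: ${err instanceof Error ? err.message : String(err)}`);
    }
  }
  throw new Error(`All providers failed: ${errors.join("; ")}`);
}
```

The `call` parameter stands in for whatever provider client is in use, which keeps the routing logic testable without network access.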

Document & Content Production

  • PDF Reports: executive-quality styled PDFs with cover pages, branded headers/footers, charts, and tables
  • Word Documents (.docx): professional documents with formatting, headers, and styles
  • Excel Spreadsheets (.xlsx): auto-formatted workbooks with formulas and conditional formatting
  • Google Slides: automated presentation generation delivered to Google Drive
  • Charts & Diagrams: Recharts visualizations and Mermaid.js diagrams rendered to PNG
  • PDF Form Filling: fill existing PDF forms programmatically
  • Invoices: professional invoices with line items, taxes, and branding

Research & Intelligence

  • Autonomous Overnight Research: configurable research programs that run autonomously, with LLM-judged experiment scoring and auto-deposit of findings into your knowledge base
  • Web Search: powered by Perplexity with Wikipedia and Jina fallbacks
  • Deep Web Scraping: Firecrawl integration for full-site crawling and markdown extraction
  • Trend Research: parallel scanning across Reddit, Hacker News, Polymarket, and X/Twitter
  • Competitive Intelligence: automated competitor analysis with structured output

Memory & Knowledge

  • Semantic Memory Palace: hierarchical memory organized by Wing and Room with three-tier recall (Hot/Warm/Cold)
  • Zero-Loss Compaction: full pre-compaction transcripts archived and recoverable; every fact extracted before conversation summarization
  • Vector Knowledge Base: RAG-powered knowledge retrieval with MMR diversity re-ranking
  • Temporal Knowledge: subject-predicate-object facts with time validity tracking
  • Dialectic User Modeling: three internal agents (Deriver, Dialectic, Dreamer) progressively build a profile of each user from conversations
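
MMR (maximal marginal relevance) re-ranking, mentioned in the Vector Knowledge Base bullet, trades relevance to the query against redundancy with results already picked. The sketch below is a generic textbook version over plain number arrays, assuming cosine similarity. The `mmr` function and its `lambda` parameter are illustrative and not taken from the platform's code.

```typescript
type Vec = number[];

// Cosine similarity; returns 0 for zero-length vectors to avoid dividing by zero.
function cosine(a: Vec, b: Vec): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// Greedy MMR: repeatedly pick the document maximizing
//   lambda * sim(doc, query) - (1 - lambda) * max sim(doc, alreadySelected).
// Returns the indices of the chosen documents in selection order.
function mmr(query: Vec, docs: Vec[], k: number, lambda = 0.5): number[] {
  const selected: number[] = [];
  const candidates = docs.map((_, i) => i);

  while (selected.length < k && candidates.length > 0) {
    let bestPos = 0;
    let bestScore = -Infinity;
    candidates.forEach((docIdx, pos) => {
      const relevance = cosine(query, docs[docIdx]);
      const redundancy =
        selected.length === 0
          ? 0
          : Math.max(...selected.map((s) => cosine(docs[docIdx], docs[s])));
      const score = lambda * relevance - (1 - lambda) * redundancy;
      if (score > bestScore) {
        bestScore = score;
        bestPos = pos;
      }
    });
    selected.push(candidates.splice(bestPos, 1)[0]);
  }
  return selected;
}
```

With `lambda` near 1 the ranking is driven by relevance alone; lower values penalize near-duplicates of documents that were already selected.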

Multi-Agent Orchestration

  • Crews: agent teams with defined roles, goals, and backstories working toward a shared objective
  • Flows: event-driven workflow pipelines that chain agent actions
  • Minds: 4-role deliberation system (Proposer, Critic, Synthesizer, Judge) for complex decisions
  • Auto-Orchestration: the COO automatically decomposes complex requests into DAG task graphs and delegates to specialists
  • Subagent Spawning: agents can spawn child agents for sub-tasks with full tool access
  • Chain of Debates: multi-persona deliberation where 3-6 specialists argue complex questions from different perspectives

Communication & Integrations

  • Email: built-in email server with tenant-specific inboxes, send/reply, and notification handling
  • WhatsApp: full bot integration for sending/receiving messages and approval workflows
  • Telegram: bot integration for external interaction
  • Discord: bot integration for team communication
  • Google Workspace: Gmail, Calendar, Sheets, Docs, Slides, and Contacts integration
  • Google Drive: primary storage for generated deliverables; every project gets a dedicated Drive folder with automatic backup

Payment Processing

  • Stripe: subscription management, checkout sessions, usage billing, and customer portal
  • Stripe Connect: tenants can connect their own Stripe accounts for white-label payment processing
  • Coinbase Commerce: cryptocurrency payments via hosted checkout
  • Coinbase CDP: on-chain wallet management and balance checks
  • Usage Metering: token tracking and feature access limits tied to billing tiers

Voice & Media

  • Text-to-Speech: ElevenLabs integration with 23+ voice profiles
  • Voice Conversations: real-time voice input/output with configurable wake words
  • Image Generation: DALL-E and Replit AI image generation
  • Video Production: scene-based MP4 pipeline with parallel TTS, Ken Burns motion, 25+ transitions, and background music

Project Management

  • Project Brain: filing cabinet system linking conversations, files, notes, and Google Drive assets to projects
  • Scheduled Tasks: cron-like automation for recurring agent work
  • Activity Logging: comprehensive system-wide activity tracking
  • Agent Board: visual overview of all agent activities and status

Governance & Safety

  • 40 Governance Rules: built-in rules controlling agent autonomy and behavior
  • Process Governor: enforces execution limits and approval requirements
  • Trust Engine: evaluates safety and reliability of tool calls; high-risk actions require human approval
  • Prompt Injection Scanner: detects and blocks malicious injection attempts
  • 3-Layer Failure Recovery, followed by a transparency report:
    1. Self-correction retry with adjusted parameters
    2. Lean mode fallback to a lighter model on overload
    3. Backup agent reroute to mapped specialist
    4. 5-part failure transparency (what failed, why, what was tried, what succeeded, what the user should know)
  • Critique Agent: every response auto-evaluated on accuracy, completeness, relevance, and clarity (scored 1-10); low scores trigger auto-refinement
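
The failure-recovery layers listed above can be read as a ladder of strategies tried in order, with a five-part report describing what happened. The following is a rough sketch under that reading. `Strategy`, `FailureReport`, and `runWithRecovery` are hypothetical names, and the field names simply mirror the five parts of the transparency report.

```typescript
// The five parts of the transparency report described in the list above.
interface FailureReport {
  whatFailed: string;
  why: string;
  whatWasTried: string[];
  whatSucceeded: string | null;
  userNote: string;
}

// One rung of the recovery ladder (labels are illustrative).
interface Strategy {
  label: string;
  run: () => Promise<string>;
}

interface Outcome {
  output: string | null;
  report: FailureReport | null; // null when the first attempt succeeded
}

// Run strategies in order until one succeeds, recording every failed attempt.
async function runWithRecovery(task: string, ladder: Strategy[]): Promise<Outcome> {
  const tried: string[] = [];
  let firstError = "";

  for (const strategy of ladder) {
    try {
      const output = await strategy.run();
      if (tried.length === 0) return { output, report: null };
      return {
        output,
        report: {
          whatFailed: task,
          why: firstError,
          whatWasTried: tried,
          whatSucceeded: strategy.label,
          userNote: `Completed via ${strategy.label} after ${tried.length} failed attempt(s).`,
        },
      };
    } catch (err) {
      if (tried.length === 0) {
        firstError = err instanceof Error ? err.message : String(err);
      }
      tried.push(strategy.label);
    }
  }

  return {
    output: null,
    report: {
      whatFailed: task,
      why: firstError,
      whatWasTried: tried,
      whatSucceeded: null,
      userNote: "No recovery strategy succeeded; manual follow-up is needed.",
    },
  };
}
```

A caller would pass the primary attempt first, then the retry, lean-mode, and backup-agent strategies in that order.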

Multi-Tenant Architecture

  • Full Tenant Isolation: each tenant has separate conversations, memory, projects, files, settings, and billing
  • Per-Tenant WhatsApp/Email/Payment: communication and payment channels isolated by tenant
  • Team Management: invite users, manage roles, and control access
  • API Keys: per-tenant API key management for external integrations

Developer & Admin Tools

  • Settings Dashboard: comprehensive admin panel with tabs for General, Payments, Integrations, Voice, Tools, Security, Data, and Tenants
  • Diagnostics: stuck task detection, health monitoring, provider latency testing
  • Heartbeat Engine: system health monitoring with configurable check intervals
  • Auto-Tuner: autonomous performance optimization that runs daily
  • Webhook System: inbound/outbound webhook triggers for external automation
  • MCP Server: Model Context Protocol server for AI tool integration
  • Backup & Restore: automated daily backups to Google Drive with manual export/import
  • Vault: secure credential storage for sensitive data

Technical Stack

Layer | Technology
Frontend | React 18, TypeScript, Vite, TailwindCSS, shadcn/ui, Wouter, TanStack Query v5
Backend | Express.js, TypeScript, Node.js 20+
Database | PostgreSQL with pgvector extension, Drizzle ORM
AI Routing | OpenAI, Anthropic, Google Gemini, xAI Grok, OpenRouter, Perplexity
Real-time | Server-Sent Events (SSE) for streaming
Auth | Email/Password with HMAC-SHA256, Admin PIN, Replit OAuth, Google OAuth
Validation | Zod schemas with drizzle-zod integration
Security | Helmet, CSRF protection, rate limiting, injection scanning
File Storage | Google Drive (primary), local uploads (fallback)
Payments | Stripe, Coinbase Commerce, Coinbase CDP
Voice | ElevenLabs TTS (23+ voices)
Search | Perplexity, Firecrawl, Jina, Wikipedia

Repository Structure

client/                       # React frontend
  src/
    pages/                    # 40+ route pages
    components/               # Reusable UI components (shadcn/ui)
    hooks/                    # Custom React hooks
    lib/                      # Utilities, query client, API helpers
server/                       # Express backend
  chat-engine.ts              # Core AI conversation engine with streaming
  tools.ts                    # 226 tool definitions and execution handlers
  routes.ts                   # 300+ API endpoints
  site-config.ts              # Centralized env-driven configuration
  seed.ts                     # Database seeding (130 tables, 40 rules, 16 personas)
  heartbeat.ts                # Background task scheduler
  agent-manager.ts            # Autonomous agent orchestration
  subagents.ts                # Hierarchical agent spawning
  agent-channels.ts           # Internal agent messaging system
  google-drive.ts             # Google Drive integration
  stripe-connect.ts           # Stripe payment processing
  coinbase-commerce.ts        # Crypto payment processing
  whatsapp.ts                 # WhatsApp bot integration
  email.ts                    # Email server and tenant inboxes
  scaffolding.ts              # 75 corporate operation scaffolds
shared/
  schema.ts                   # Drizzle ORM schema (130 tables)
scripts/
  clean-for-release.sh        # Sanitize codebase for public release
FORK-SETUP.md                 # Detailed setup instructions

Getting Started

Prerequisites

  • Node.js 20+ (or a Replit account)
  • PostgreSQL database with the pgvector extension
  • At least one AI provider API key (OpenAI, Anthropic, Google, or xAI)

Quick Start

# 1. Clone the repo
git clone https://github.com/Huskyauto/VisionClaw-Agent-Public-Release.git
cd VisionClaw-Agent-Public-Release

# 2. Install dependencies
npm install

# 3. Set required environment variables
export DATABASE_URL="postgresql://user:pass@host:5432/dbname"
export SESSION_SECRET="$(openssl rand -hex 32)"
export OPENAI_API_KEY="sk-..."   # Or ANTHROPIC_API_KEY, XAI_API_KEY, etc.

# 4. Start the platform
npm run dev

# 5. Open your browser
# Visit http://localhost:5000
# Fresh deploys auto-redirect to /setup

What Happens on First Run

In under 10 minutes, you go from git clone to a live dashboard with 16 agents, seeded governance, and a /setup checklist that tells you exactly what's configured and what's missing.

  1. The database auto-creates all 130 tables and the full index set
  2. 40 governance rules and 16 AI personas are seeded automatically
  3. You're redirected to the Setup Checklist at /setup showing what's configured
  4. Click Create Account: the first account becomes the admin
  5. Start chatting: the AI is ready to work

Environment Variables

See FORK-SETUP.md for the complete list. Here's the quick reference:

Required

Variable | What It Does
DATABASE_URL | PostgreSQL connection string
SESSION_SECRET | Random string for session encryption
One AI key | OPENAI_API_KEY, ANTHROPIC_API_KEY, XAI_API_KEY, or OPENROUTER_API_KEY
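
A startup check for these required variables might look like the sketch below. `checkRequiredEnv` is a hypothetical helper written for illustration; the platform's own validation (for example in site-config.ts) may work differently.

```typescript
type Env = Record<string, string | undefined>;

// Provider keys named in the table above; any one of them is enough.
const LLM_KEYS = [
  "OPENAI_API_KEY",
  "ANTHROPIC_API_KEY",
  "XAI_API_KEY",
  "OPENROUTER_API_KEY",
];

// Hypothetical helper: returns human-readable problems (an empty list means OK).
function checkRequiredEnv(env: Env): string[] {
  const problems: string[] = [];
  if (!env.DATABASE_URL) problems.push("DATABASE_URL is not set");
  if (!env.SESSION_SECRET) problems.push("SESSION_SECRET is not set");
  if (!LLM_KEYS.some((key) => Boolean(env[key]))) {
    problems.push(`Set at least one of: ${LLM_KEYS.join(", ")}`);
  }
  return problems;
}
```

In a Node.js process this would typically be called as `checkRequiredEnv(process.env)` before the server starts, logging the problems and exiting if the list is not empty.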

Recommended (Branding)

Variable | What It Does | Default
SITE_PLATFORM_NAME | Your platform's display name everywhere | VisionClaw
SITE_COMPANY_NAME | Company name for branding | Your Company
SITE_OWNER_EMAIL | Admin contact email | (empty)
SITE_WEBSITE_URL | Your public URL | (empty)

Optional (Unlock More Features)

Variable | What It Unlocks
ELEVENLABS_API_KEY | Voice synthesis (23+ voices, text-to-speech)
FIRECRAWL_API_KEY | Advanced web scraping and full-site crawling
BROWSERLESS_API_KEY | PDF generation and browser automation
STRIPE_LIVE_SECRET_KEY + STRIPE_LIVE_PUBLISHABLE_KEY | Payment processing and subscriptions
COINBASE_COMMERCE_API_KEY | Cryptocurrency payments
GOOGLE_DRIVE_ROOT_FOLDER_ID | Google Drive file storage and backups
AGENTMAIL_API_KEY + AGENTMAIL_INBOX | Email sending/receiving
TELEGRAM_BOT_TOKEN | Telegram bot integration
DISCORD_BOT_TOKEN | Discord bot integration
X_API_KEY + X_API_SECRET + X_ACCESS_TOKEN + X_ACCESS_TOKEN_SECRET | X/Twitter posting and search

Graceful Degradation

Features that aren't configured don't break the app; they gracefully disappear:

Missing Config | What Happens
No email key | Email, WhatsApp pages hidden from sidebar
No Telegram token | Telegram page hidden
No Stripe keys | Payments page hidden from admin panel
No Drive folder | Files saved locally; Drive tools show "not configured"
No ElevenLabs key | Voice tools return "not configured"
No Firecrawl/Browserless | Scraping tools fall back gracefully
No Coinbase keys | Crypto payment features disabled
No OAuth client IDs | OAuth connection buttons hidden

The /setup page gives you a real-time checklist showing exactly what's configured and what's not.
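
One way to picture this behavior is a set of feature flags derived from the environment, which the UI then uses to decide which pages to show. The sketch below is an assumption-laden illustration: `deriveFeatures` and `sidebarPages` are hypothetical helpers, and mapping the "email key" to the AGENTMAIL_* variables is a guess based on the optional-variables table, not confirmed behavior.

```typescript
type Env = Record<string, string | undefined>;

interface FeatureFlags {
  email: boolean;
  telegram: boolean;
  payments: boolean;
  drive: boolean;
  voice: boolean;
}

// True only when every listed variable is present and non-empty.
function hasAll(env: Env, keys: string[]): boolean {
  return keys.every((key) => Boolean(env[key]));
}

// Assumed mapping from environment variables to features (see lead-in).
function deriveFeatures(env: Env): FeatureFlags {
  return {
    email: hasAll(env, ["AGENTMAIL_API_KEY", "AGENTMAIL_INBOX"]),
    telegram: hasAll(env, ["TELEGRAM_BOT_TOKEN"]),
    payments: hasAll(env, ["STRIPE_LIVE_SECRET_KEY", "STRIPE_LIVE_PUBLISHABLE_KEY"]),
    drive: hasAll(env, ["GOOGLE_DRIVE_ROOT_FOLDER_ID"]),
    voice: hasAll(env, ["ELEVENLABS_API_KEY"]),
  };
}

// Core pages always show; optional pages follow their feature flag.
function sidebarPages(flags: FeatureFlags): string[] {
  const pages = ["Home", "Chat", "Projects", "Files"];
  if (flags.email) pages.push("Email", "WhatsApp");
  if (flags.telegram) pages.push("Telegram");
  if (flags.payments) pages.push("Payments");
  return pages;
}
```

Because the flags are computed from configuration rather than hard-coded, adding a key later makes the matching page appear without any code change.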


Admin Settings

Once logged in as admin, the Settings page (/settings) gives you control over everything:

Tab | What You Configure
General | Agent name, personality, default AI model, API keys, OAuth connections, billing
Payments | Stripe/Coinbase integration, pricing plans, subscription tiers
Integrations | Discord bot, public chat settings, webhooks, system hooks
Voice | Wake words, text-to-speech provider, voice profiles
Tools | Browser/search settings, code sandbox, safety limits, rate limiting
Security | Access PIN, auth health monitoring
Data | Backup to Google Drive (manual + automated at 3 AM UTC), export/import
Tenants | Multi-tenant management for agency deployments

Pages & Navigation

The platform includes 40+ pages organized by function:

Core: Home, Chat, Inbox, Email, Projects, Files, Documents

AI Management: Personas, Memory, Knowledge, Skills, Skills Marketplace, Agent Board, Agentic Operations

Intelligence: Research, Insights, Content Writing, Scheduled Tasks

Communication: WhatsApp, Telegram, Discord (with approval workflows)

Admin: Settings, Analytics, Activity Logs, Heartbeat, Team, API Keys, MCP, Webhooks, Channel Routing, Payments

Public: Landing Page, Architecture Overview, Login/Signup, Legal Pages (Terms, Privacy, About, Contact, Refund)


Agentic Design Patterns

These are the patterns we actually use in daily production, not just ideas from research papers:

  1. Parallel Tool Execution: read-only tools run concurrently via Promise.all; mutating tools execute sequentially for causal ordering
  2. Critique Agent / Self-Correction: every response auto-evaluated across 4 dimensions (accuracy, completeness, relevance, clarity); scores below 6/10 trigger auto-refinement
  3. Chain of Debates: 3-6 specialist agents argue complex questions from their domain expertise, then a recommendation is synthesized along with a consensus level
  4. Tree-of-Thought Reasoning: 2-5 distinct analytical branches evaluated by a meta-reasoning judge for optimal answers
  5. Auto-Orchestration: complex requests decomposed into DAG task graphs with dependency tracking and parallel execution
  6. Dialectic User Modeling: three agents (Deriver, Dialectic, Dreamer) progressively understand user preferences and behavior
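
The first pattern, parallel tool execution, can be sketched in a few lines. This is an illustration only: the `ToolCall` shape and `executeToolCalls` function are hypothetical, and running all reads before any writes is an assumption made here for simplicity.

```typescript
interface ToolCall {
  name: string;
  readOnly: boolean;
  run: () => Promise<string>;
}

async function executeToolCalls(calls: ToolCall[]): Promise<Record<string, string>> {
  const results: Record<string, string> = {};

  // Read-only tools have no side effects, so they are started together
  // and awaited as a group with Promise.all.
  const reads = calls.filter((call) => call.readOnly);
  await Promise.all(
    reads.map(async (call) => {
      results[call.name] = await call.run();
    }),
  );

  // Mutating tools run one at a time, in the order they were requested,
  // so each write sees the effects of the previous one.
  const writes = calls.filter((call) => !call.readOnly);
  for (const call of writes) {
    results[call.name] = await call.run();
  }
  return results;
}
```

Note that `Promise.all` rejects as soon as any read fails; a production version would likely want per-tool error handling (for example with `Promise.allSettled`) instead.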

Deployment

Self-hosted only. You must deploy on your own infrastructure: your own Replit account, your own server, or your own Docker host. We do not provide hosting, shared instances, or managed deployments. Every fork runs independently with its own database, API keys, and configuration.

The platform works on any Node.js hosting:

  • Replit: Create your own Replit account, import the repo, set secrets in the Secrets panel, hit Run
  • Railway/Render: Connect your repo, set env vars, deploy
  • Docker: docker-compose up -d (includes PostgreSQL with pgvector, ready out of the box)
  • VPS: Clone, npm install, set env vars, npm run dev
  • Port: Serves frontend and backend on a single port (default: 5000)

git clone https://github.com/Huskyauto/VisionClaw-Agent-Public-Release.git
cd VisionClaw-Agent-Public-Release
cp .env.example .env   # edit with your API keys
docker-compose up -d    # or: npm install && npm run dev

About the Name

VisionClaw Agent is an independent AI agent platform and is not related to the Intent-Lab/VisionClaw project (a smart glasses AI assistant for Meta Ray-Ban). This repo is a standalone, self-hosted multi-tenant operations platform. It works with just an LLM provider and PostgreSQL; no external ecosystem is required.


Roadmap

Areas under active development:

  • Modularization: Breaking down large server files (routes, tools) into domain-specific modules for easier navigation and community contribution
  • Type safety: Incremental migration from any types to strict TypeScript interfaces
  • CI/CD: GitHub Actions pipeline for lint, typecheck, and automated testing
  • Plugin architecture: Making it easier to add custom tools and agents without modifying core files
  • API documentation: OpenAPI/Swagger spec for the 300+ endpoints

Community contributions welcome; see CONTRIBUTING.md.


Built With

VisionClaw Agent was originally built and hosted on Replit, a collaborative cloud development platform that makes it easy to build, deploy, and share full-stack applications. Replit's integrated environment, managed PostgreSQL, one-click deployments, and AI-assisted development made it possible to go from idea to production-ready platform without managing infrastructure. If you're looking for the fastest way to fork and run your own instance, Replit is a great place to start.


License

MIT License: free to fork, modify, and deploy for any purpose. See LICENSE.



Release History

Version: v0.1.1 | Urgency: High | Date: 4/16/2026

Changes in v0.1.1

Security

  • LiveCanvas XSS fix: All agent-generated HTML now sanitized with DOMPurify before rendering. Title injection and opener takeover paths closed.
  • Heartbeat race condition fix: Atomic task claiming prevents duplicate execution in multi-instance deployments.
  • Public config PII stripped: Owner phone, EIN, and location no longer exposed to unauthenticated users.
  • Docker secrets enforced: docker-compose.yml now requires POSTGRES_PASSWORD and SE[...]

Version: v0.1.0 | Urgency: High | Date: 4/16/2026

VisionClaw Agent Platform v0.1.0

The first public release of VisionClaw, an open-source, multi-tenant AI agent platform with 14 specialized agents, 195+ tools, and 37+ AI model support.

Highlights

  • 14 AI Agents: CEO, Engineer, Content, Research, Legal, Finance, and more working as a coordinated team
  • 195+ Built-in Tools: document generation, web research, email, payments, voice, and browser automation
  • Multi-Provider AI: OpenAI, Anthropic, Google Gemini, xAI Grok, Op[...]


Similar Packages

  • claw-pilot (v0.81.1): Multi-agent orchestration runtime with task board, flow engine, budget control, MCP integration and real-time dashboard. Self-hosted on Linux/macOS.
  • gateway (v1.11.19): The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
  • ClawRouter (v0.12.159): The agent-native LLM router for OpenClaw. 41+ models, <1ms routing, USDC payments on Base & Solana via x402.
  • tweetsave-mcp (main@2026-04-21): Fetch Twitter/X content and convert it into blog posts using the MCP server for seamless integration and easy content management.
  • agentic-fleet-hub (master@2026-04-21): Self-hosted orchestration layer for autonomous AI agent teams. Shared memory, heartbeat scheduling, vault-first secrets, and cross-model peer review, with one command to deploy.