A self-improving AI orchestrator that runs 100% locally.
Single Go binary. 40+ tools. 20+ LLM providers. Multi-agent coordination.
Persistent memory with a knowledge graph. Browser automation. Computer control.
And it gets better at its job over time.
π Quick Start Β· β¨ Features Β· π Scorecard Β· ποΈ Architecture Β· π€ Contributing
Most AI assistants are cloud-locked chatbots that wait for you to type something. Sofia is different.
- π Local-first β No cloud dependency. Your data stays on your machine.
- β‘ Single binary β Written in Go. No Python, no Docker, no Node.js. Just
make build. - π§ Self-improving β Evolution engine analyzes past performance and optimizes itself.
- π€ Multi-agent β Spawn, coordinate, and retire agents dynamically. A2A protocol built-in.
- π Security-first β 35+ prompt injection defenses, PII detection, AES-256-GCM encryption.
- π 20+ LLM providers β OpenAI, Claude, Gemini, DeepSeek, Grok, and more. Automatic fallback.
git clone https://github.com/grasberg/sofia.git
cd sofia
make deps && make build
./build/sofia onboard # Initialize config & workspace
./build/sofia gateway # Start the web UI
# Open http://127.0.0.1:18795 β Models tab β add your API key β start chattingPrerequisites: Go 1.26+ (download)
Register domain names, publish web pages, read/edit files, run bash commands, manage Google Services (Gmail/Calendar) β all without human intervention.
Tiered memory system β short-term, long-term, episodic, and semantic (knowledge graph). Automatic consolidation and strategic forgetting keep memory efficient.
Delegate tasks to parallel agents. AgentArchitect creates and optimizes new agents on the fly. A2A mailbox messaging with broadcast.
OpenAI, Anthropic (Claude 4.5), Gemini, DeepSeek, Grok, MiniMax, Moonshot, Qwen, GitHub Copilot, and more. Automatic fallback chains with exponential backoff.
126+ built-in skills with expert personas. Automatic skill creation, refinement, and knowledge distillation. ClawHub marketplace for community skills.
5-phase EvolutionEngine: observe β diagnose β plan β act β verify. Post-task evaluation, prompt self-optimization, and code self-modification.
Autonomous computer control via screenshots and vision-LLM. Control mouse and keyboard on macOS and Linux.
Playwright-based web browsing. Click, fill forms, take screenshots, extract text β all autonomously.
Input validation, budget management, prompt injection defense (35+ patterns in 6 languages), PII detection, and action confirmation for high-risk operations.
π More Features
- Smart Tool Management β Semantic tool matching via embeddings, performance tracking, dynamic tool composition
- Lightning-Fast Execution β Caching of tool definitions, prompts, and connection pooling for low latency
- Autonomy & Proactivity β Long-term goals, context-aware triggers, proactive suggestions, self-initiated research
- MCP Client β Model Context Protocol support for external MCP servers and tools
- Gateway Mode β Built-in support for Telegram, Discord, Email, Web, and CLI
- Plan & Execute β Structured task planning with step-by-step tracking and sub-plans
- Shared Scratchpad β Key-value storage for agent-to-agent communication
- Cron Scheduling β Independent task scheduling with at/every/cron patterns
- Modern Web UI β Brutalist design with CRT effects, real-time updates, and file uploads
Evaluated by automated code analysis across 12 capability dimensions.
| Category | Score | Highlights |
|---|---|---|
| Multi-Agent Orchestration | 9/10 | Dependency-aware scheduling, semantic delegation, dynamic agent spawning, A2A messaging |
| Tool Use & Execution | 9/10 | 40+ tools, embeddings-based filtering, parallel execution, circuit breaker |
| Context & Memory | 9/10 | Knowledge graph, 4-layer memory, strategic forgetting, prompt cache |
| Planning & Reasoning | 9/10 | Hierarchical plans, doom loop detection, evaluation loop, auto-checkpoint |
| Safety & Guardrails | 9/10 | 35+ injection patterns, PII detection, secret scrubbing, AES-256-GCM |
| Provider Abstraction | 9/10 | 20+ providers, fallback chains, Bayesian quality ranking, retry with jitter |
| Channel Integration | 8/10 | Telegram, Discord, Email, Web, CLI with shared retry logic |
| Observability & Ops | 9/10 | SQLite audit logging, distributed tracing, health/metrics endpoints |
| Self-Improvement | 8/10 | 7-phase evolution loop, SafeModifier, performance-driven agent retirement |
| Skills & Extensibility | 8/10 | 126 skills, 40 agent templates, ClawHub marketplace, lazy loading |
| Evaluation & Testing | 8/10 | LLM-as-judge scoring, A/B comparison, 5 benchmark suites |
| Scheduling & Automation | 8/10 | 3 schedule types, heartbeat, goal-driven autonomy, context triggers |
| Avg: 8.6 |
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Channels (Input) β
β Telegram Β· Discord Β· Email Β· Web UI Β· CLI β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββ
β
ββββββββΌβββββββ
β Gateway β β HTTP/WebSocket/REST
ββββββββ¬βββββββ
β
ββββββββββββββΌβββββββββββββββββ
β Orchestrator β
β βββββββββββ΄βββββββββββ β
β β Evolution Engine β β
β β (Self-Improvement) β β
β ββββββββββββββββββββββ β
β β
β ββββββββββββ ββββββββββββ β
β β Agent 1 β β Agent 2 β β
β β (A2A) β β (A2A) β β
β βββββββ¬βββββ βββββββ¬βββββ β
ββββββββββΌββββββββββββββΌβββββββββ
β β
ββββββββββ΄ββββββββββββββ΄βββββββββ
β Tool Layer β
β File Β· Shell Β· Web Β· Browser β
β GitHub Β· Google Β· Bitcoin β
β Cron Β· Memory Β· Plan Β· MCP β
βββββββββββββββββ¬βββββββββββββββββ
β
βββββββββββΌββββββββββ
β LLM Providers β
β 20+ with fallback β
βββββββββββββββββββββ
| Sofia | AutoGPT | CrewAI | LangChain | OpenDevin | |
|---|---|---|---|---|---|
| Runs 100% locally | β | β | β | β | β |
| Single binary | β | β | β | β | β |
| 40+ built-in tools | β | Limited | Via plugins | Via plugins | Limited |
| Knowledge graph | β | Basic | β | β | β |
| Self-improving | β | β | β | β | β |
| Multi-agent orchestration | β | Basic | β | Basic | β |
| Browser automation | β | β | β | β | β |
| Computer use | β | β | β | β | β |
| 20+ LLM providers | β | Limited | Limited | β | Limited |
| Security guardrails | β | β | β | β | β |
| Language | Go | Python | Python | Python | Python |
~/.sofia/
βββ config.json # LLM providers, model selection, preferences
βββ workspace/
β βββ IDENTITY.md # Who Sofia is
β βββ SOUL.md # Personality & behavior rules
β βββ USER.md # User preferences
β βββ skills/ # 126+ skill definitions
β βββ agents/ # 40+ agent templates
βββ db/ # SQLite databases (memory, audit, goals)
βββ logs/ # Execution logs
- Plugin SDK β Third-party tool development kit
- Voice Interface β Speech-to-text and text-to-speech
- Mobile Companion β iOS/Android control app
- Team Mode β Multi-user collaboration
- Cloud Sync β Optional encrypted cloud backup
- Marketplace β Community skills and agents marketplace
We love contributions! See CONTRIBUTING.md for guidelines.
Whether you are fixing a bug, adding a feature, improving docs, or sharing a skill β every contribution matters.
Found a vulnerability? Please see SECURITY.md for responsible disclosure.
Sofia is released under the MIT License.
β Star us on GitHub Β· π Report a Bug Β· π¬ Join the Discussion
Made with β€οΈ by the Sofia community
