freshcrate
Home > MCP Servers > sofia

sofia

Autonomous local AI assistant in Go β€” 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving

Description

Autonomous local AI assistant in Go β€” 40+ tools, 20+ LLM providers, multi-agent orchestration, self-improving

README

πŸ€– Sofia

Your Local, Autonomous AI Colleague

GitHub starsGitHub forksLicense Go Version Last Commit Version PRs Welcome

A self-improving AI orchestrator that runs 100% locally.
Single Go binary. 40+ tools. 20+ LLM providers. Multi-agent coordination.
Persistent memory with a knowledge graph. Browser automation. Computer control.
And it gets better at its job over time.

πŸš€ Quick Start Β· ✨ Features Β· πŸ“Š Scorecard Β· πŸ—οΈ Architecture Β· 🀝 Contributing


Why Sofia?

Most AI assistants are cloud-locked chatbots that wait for you to type something. Sofia is different.

  • 🏠 Local-first β€” No cloud dependency. Your data stays on your machine.
  • ⚑ Single binary β€” Written in Go. No Python, no Docker, no Node.js. Just make build.
  • 🧠 Self-improving β€” Evolution engine analyzes past performance and optimizes itself.
  • πŸ€– Multi-agent β€” Spawn, coordinate, and retire agents dynamically. A2A protocol built-in.
  • πŸ” Security-first β€” 35+ prompt injection defenses, PII detection, AES-256-GCM encryption.
  • 🌐 20+ LLM providers β€” OpenAI, Claude, Gemini, DeepSeek, Grok, and more. Automatic fallback.

πŸš€ Quick Start

git clone https://github.com/grasberg/sofia.git
cd sofia
make deps && make build
./build/sofia onboard      # Initialize config & workspace
./build/sofia gateway      # Start the web UI
# Open http://127.0.0.1:18795 β†’ Models tab β†’ add your API key β†’ start chatting

Prerequisites: Go 1.26+ (download)


✨ Key Features

πŸ› οΈ Autonomous Tool Use

Register domain names, publish web pages, read/edit files, run bash commands, manage Google Services (Gmail/Calendar) β€” all without human intervention.

🧠 Advanced Memory

Tiered memory system β€” short-term, long-term, episodic, and semantic (knowledge graph). Automatic consolidation and strategic forgetting keep memory efficient.

πŸ€– Multi-Agent Orchestration

Delegate tasks to parallel agents. AgentArchitect creates and optimizes new agents on the fly. A2A mailbox messaging with broadcast.

🌐 20+ LLM Providers

OpenAI, Anthropic (Claude 4.5), Gemini, DeepSeek, Grok, MiniMax, Moonshot, Qwen, GitHub Copilot, and more. Automatic fallback chains with exponential backoff.

πŸ“š Skill System with Self-Learning

126+ built-in skills with expert personas. Automatic skill creation, refinement, and knowledge distillation. ClawHub marketplace for community skills.

πŸ”„ Evolution & Self-Improvement

5-phase EvolutionEngine: observe β†’ diagnose β†’ plan β†’ act β†’ verify. Post-task evaluation, prompt self-optimization, and code self-modification.

πŸ–₯️ Computer Use

Autonomous computer control via screenshots and vision-LLM. Control mouse and keyboard on macOS and Linux.

🌍 Browser Automation

Playwright-based web browsing. Click, fill forms, take screenshots, extract text β€” all autonomously.

πŸ›‘οΈ Guardrails & Security

Input validation, budget management, prompt injection defense (35+ patterns in 6 languages), PII detection, and action confirmation for high-risk operations.

πŸ“‹ More Features
  • Smart Tool Management β€” Semantic tool matching via embeddings, performance tracking, dynamic tool composition
  • Lightning-Fast Execution β€” Caching of tool definitions, prompts, and connection pooling for low latency
  • Autonomy & Proactivity β€” Long-term goals, context-aware triggers, proactive suggestions, self-initiated research
  • MCP Client β€” Model Context Protocol support for external MCP servers and tools
  • Gateway Mode β€” Built-in support for Telegram, Discord, Email, Web, and CLI
  • Plan & Execute β€” Structured task planning with step-by-step tracking and sub-plans
  • Shared Scratchpad β€” Key-value storage for agent-to-agent communication
  • Cron Scheduling β€” Independent task scheduling with at/every/cron patterns
  • Modern Web UI β€” Brutalist design with CRT effects, real-time updates, and file uploads

πŸ“Š Agentic AI Capability Scorecard

Evaluated by automated code analysis across 12 capability dimensions.

Category Score Highlights
Multi-Agent Orchestration 9/10 Dependency-aware scheduling, semantic delegation, dynamic agent spawning, A2A messaging
Tool Use & Execution 9/10 40+ tools, embeddings-based filtering, parallel execution, circuit breaker
Context & Memory 9/10 Knowledge graph, 4-layer memory, strategic forgetting, prompt cache
Planning & Reasoning 9/10 Hierarchical plans, doom loop detection, evaluation loop, auto-checkpoint
Safety & Guardrails 9/10 35+ injection patterns, PII detection, secret scrubbing, AES-256-GCM
Provider Abstraction 9/10 20+ providers, fallback chains, Bayesian quality ranking, retry with jitter
Channel Integration 8/10 Telegram, Discord, Email, Web, CLI with shared retry logic
Observability & Ops 9/10 SQLite audit logging, distributed tracing, health/metrics endpoints
Self-Improvement 8/10 7-phase evolution loop, SafeModifier, performance-driven agent retirement
Skills & Extensibility 8/10 126 skills, 40 agent templates, ClawHub marketplace, lazy loading
Evaluation & Testing 8/10 LLM-as-judge scoring, A/B comparison, 5 benchmark suites
Scheduling & Automation 8/10 3 schedule types, heartbeat, goal-driven autonomy, context triggers
Avg: 8.6

πŸ—οΈ Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    Channels (Input)                      β”‚
β”‚   Telegram  Β·  Discord  Β·  Email  Β·  Web UI  Β·  CLI    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                           β”‚
                    β”Œβ”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”
                    β”‚   Gateway    β”‚  ← HTTP/WebSocket/REST
                    β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”˜
                           β”‚
              β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
              β”‚         Orchestrator          β”‚
              β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”‚
              β”‚  β”‚   Evolution Engine  β”‚      β”‚
              β”‚  β”‚  (Self-Improvement) β”‚      β”‚
              β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β”‚
              β”‚                               β”‚
              β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚
              β”‚  β”‚  Agent 1  β”‚  β”‚  Agent 2  β”‚  β”‚
              β”‚  β”‚  (A2A)    β”‚  β”‚  (A2A)    β”‚  β”‚
              β”‚  β””β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜  β”‚
              β””β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”˜
                       β”‚             β”‚
              β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”
              β”‚         Tool Layer             β”‚
              β”‚  File Β· Shell Β· Web Β· Browser  β”‚
              β”‚  GitHub Β· Google Β· Bitcoin     β”‚
              β”‚  Cron Β· Memory Β· Plan Β· MCP    β”‚
              β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                              β”‚
                    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                    β”‚   LLM Providers    β”‚
                    β”‚  20+ with fallback β”‚
                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

βš”οΈ Comparison

Sofia AutoGPT CrewAI LangChain OpenDevin
Runs 100% locally βœ… ❌ ❌ ❌ ❌
Single binary βœ… ❌ ❌ ❌ ❌
40+ built-in tools βœ… Limited Via plugins Via plugins Limited
Knowledge graph βœ… Basic ❌ ❌ ❌
Self-improving βœ… ❌ ❌ ❌ ❌
Multi-agent orchestration βœ… Basic βœ… Basic ❌
Browser automation βœ… ❌ ❌ ❌ ❌
Computer use βœ… ❌ ❌ ❌ ❌
20+ LLM providers βœ… Limited Limited βœ… Limited
Security guardrails βœ… ❌ ❌ ❌ ❌
Language Go Python Python Python Python

πŸ“‚ Workspace Structure

~/.sofia/
β”œβ”€β”€ config.json          # LLM providers, model selection, preferences
β”œβ”€β”€ workspace/
β”‚   β”œβ”€β”€ IDENTITY.md      # Who Sofia is
β”‚   β”œβ”€β”€ SOUL.md          # Personality & behavior rules
β”‚   β”œβ”€β”€ USER.md          # User preferences
β”‚   β”œβ”€β”€ skills/          # 126+ skill definitions
β”‚   └── agents/          # 40+ agent templates
β”œβ”€β”€ db/                  # SQLite databases (memory, audit, goals)
└── logs/                # Execution logs

πŸ—ΊοΈ Roadmap

  • Plugin SDK β€” Third-party tool development kit
  • Voice Interface β€” Speech-to-text and text-to-speech
  • Mobile Companion β€” iOS/Android control app
  • Team Mode β€” Multi-user collaboration
  • Cloud Sync β€” Optional encrypted cloud backup
  • Marketplace β€” Community skills and agents marketplace

🀝 Contributing

We love contributions! See CONTRIBUTING.md for guidelines.

Whether you are fixing a bug, adding a feature, improving docs, or sharing a skill β€” every contribution matters.

πŸ›‘οΈ Security

Found a vulnerability? Please see SECURITY.md for responsible disclosure.

πŸ“„ License

Sofia is released under the MIT License.


⭐ Star us on GitHub Β· πŸ› Report a Bug Β· πŸ’¬ Join the Discussion

Made with ❀️ by the Sofia community

Release History

VersionChangesUrgencyDate
main@2026-04-11Latest activity on main branchHigh4/11/2026
0.0.0No release found β€” using repo HEADHigh4/11/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

ryvosOpen-source autonomous AI assistant with 5-tier security, 62 tools, 14 LLM providers. Written in Rust. Single binary.v0.9.0
mcp-anythingOne command to turn any codebase into an MCP serverv0.1.0
DaemoraOpen-source AI agent - any LLM, any MCP server, any channel. Self-hosted, autonomous, multi-tenant.2026.1.0-beta.0
aiA productive AI coworker that learns, self-improves, and ships work.main@2026-04-21
tweetsave-mcpπŸ“ Fetch Twitter/X content and convert it into blog posts using the MCP server for seamless integration and easy content management.main@2026-04-21