freshcrate
Home > RAG & Memory > antfly

Description

README

Antfly

Antfly is a distributed search engine built on etcd's raft library. It combines full-text search (BM25), vector similarity, and graph traversal over multimodal data β€” text, images, audio, and video. Embeddings, chunking, and graph edges are generated automatically as you write data. Built-in RAG agents tie it all together with retrieval-augmented generation.

Quickstart

Quick Start

# Start a single-node cluster with built-in ML inference
go run ./cmd/antfly swarm

# Or run with Docker
docker run -p 8080:8080 ghcr.io/antflydb/antfly:omni

That gives you the Antfarm dashboard at http://localhost:8080 β€” playgrounds for search, RAG, knowledge graphs, embeddings, reranking, and more.

See the quickstart guide for a full walkthrough.

Features

  • Hybrid search β€” full-text (BM25), dense vectors, and sparse vectors (SPLADE), all in one query
  • RAG agents β€” built-in retrieval-augmented generation with streaming, multi-turn chat, tool calling (web search, graph traversal), and confidence scoring
  • Graph indexes β€” automatic relationship extraction and graph traversal queries over your data
  • Multimodal β€” index and search images, audio, and video with CLIP, CLAP, and vision-language models
  • Reranking β€” cross-encoder reranking with score-based pruning to cut the noise
  • Aggregations β€” stats (sum/min/max/avg) and terms facets for analytics
  • Transactions β€” ACID transactions at the shard level with distributed coordination
  • Document TTL β€” automatic document expiration so you don't have to clean up yourself
  • S3 storage β€” store data in S3/MinIO/R2 for big cost savings and way faster shard splits
  • SIMD / SME acceleration β€” vector operations use hardware intrinsics via go-highway on x86 and ARM
  • Distributed β€” Raft consensus, automatic sharding and replication, horizontal scaling
  • Enrichment pipelines β€” configurable pipelines per index for embeddings, summaries, graph edges, and custom computed fields
  • Bring your own models β€” Ollama, OpenAI, Bedrock, Google, or run models locally with Termite
  • Auth β€” built-in user management with API keys, basic auth, and bearer tokens
  • Backup & restore β€” to local disk or S3
  • Kubernetes operator β€” deploy and manage clusters with the operator
  • MCP server β€” Model Context Protocol so LLMs can use Antfly as a tool
  • A2A protocol β€” Agent-to-Agent support for Google's A2A standard
  • Antfarm β€” web dashboard with playgrounds for search, RAG, knowledge graphs, embeddings, reranking, chunking, NER, OCR, and transcription

Documentation

antfly.io/docs

SDKs & Client Libraries

Language Package Source
Go github.com/antflydb/antfly/pkg/client pkg/client
TypeScript @antfly/sdk ts/packages/sdk
Python antfly py/
React @antfly/components ts/packages/components
PostgreSQL pgaf extension rs/pgaf

pgaf β€” PostgreSQL Extension

pgaf brings Antfly search into Postgres. Create an index, use the @@@ operator, and you're done:

CREATE INDEX idx_content ON docs USING antfly (content)
  WITH (url = 'http://localhost:8080/api/v1/', collection = 'my_docs');

SELECT * FROM docs WHERE content @@@ 'fix my computer';

React Components

@antfly/components gives you drop-in React components for search UIs β€” SearchBox, Autosuggest, Facet, Results, RAGBox, AnswerBox, plus streaming hooks like useAnswerStream and useCitations.

Termite β€” ML Inference

Termite handles the ML side: embeddings, chunking, reranking, classification, NER, OCR, transcription, generation, and more. It ships as an in-tree module (pkg/termite) and runs automatically in swarm mode β€” you don't need to set it up separately.

Libraries & Tools

Package What it does Source
docsaf Ingest content from filesystem, web crawl, git repos, and S3 pkg/docsaf
evalaf LLM/RAG/agent evaluation ("promptfoo for Go") pkg/evalaf
Genkit plugin Firebase Genkit integration for retrieval and docstore pkg/genkit/antfly

Architecture

Antfly uses a multi-raft design with separate consensus groups:

  • Metadata raft β€” table schemas, shard assignments, cluster topology
  • Storage rafts β€” one per shard, handling data, indexes, and queries

End-to-end chaos tests β€” inspired by Jepsen β€” cover node crashes, leader failures, shard splits under load, and cluster scaling. These tests run real multi-node clusters and inject faults to verify that Raft consensus, transactions, and replication behave correctly under failure.

Critical distributed protocols are formally specified and model-checked with TLA+:

Community

Join the Discord for support, discussion, and updates.

Interested in contributing? See CONTRIBUTING.md.

License

The core server is Elastic License 2.0 (ELv2). That means you can use it, modify it, self-host it, and build products on top of it β€” you just can't offer Antfly itself as a managed service. Everything else β€” the SDKs, React components, Termite, pgaf, docsaf, evalaf β€” is Apache 2.0. We tried to keep as much as possible under a permissive license.

Release History

VersionChangesUrgencyDate
v0.1.1Latest release: v0.1.1High4/21/2026
pkg/docsaf/v0.0.1Tag pkg/docsaf/v0.0.1Medium3/31/2026
ts/antfly/components/v0.0.14Tag ts/antfly/components/v0.0.14Low3/20/2026
ts/antfly/sdk/v0.0.14Tag ts/antfly/sdk/v0.0.14Low3/19/2026
ts/antfly/components/v0.0.13Tag ts/antfly/components/v0.0.13Low3/19/2026
v0.1.0Tag v0.1.0Low3/17/2026
v0.0.19Tag v0.0.19Low3/17/2026
v0.0.18Tag v0.0.18Low3/17/2026
pkg/operator/v0.0.16Tag pkg/operator/v0.0.16Low3/17/2026
pkg/client/v0.0.1Tag pkg/client/v0.0.1Low3/16/2026
pkg/libaf/v0.0.1Tag pkg/libaf/v0.0.1Low3/16/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

taleThe Sovereign AI Platform​ β€” Local AI models, agents, skills, and automations β€” on your own infrastructure, connected to your datav0.2.53
engraphLocal knowledge graph for AI agents. Hybrid search + MCP server for Obsidian vaults.v1.6.1
memory-engineAgentic memory built on Postgresv0.1.16
agentopsThe operational layer for coding agents. Memory, validation, and feedback loops that compound between sessions.v2.37.2
PageIndexπŸ“‘ PageIndex: Document Index for Vectorless, Reasoning-based RAGmain@2026-04-10