freshcrate
Home > Databases > matrixone

matrixone

AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.

Description

AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.

README

Connect with us:

matrixone16 matrixone16

Contents

What is MatrixOne?

MatrixOne is the industry's first database to bring Git-style version control to data, combined with MySQL compatibility, AI-native capabilities, and cloud-native architecture.

At its core, MatrixOne is a HTAP (Hybrid Transactional/Analytical Processing) database with a hyper-converged HSTAP engine that seamlessly handles transactional (OLTP), analytical (OLAP), full-text search, and vector search workloads in a single unified systemโ€”no data movement, no ETL, no compromises.

๐ŸŽฌ Git for Data - The Game Changer

Just as Git revolutionized code management, MatrixOne revolutionizes data management. Manage your database like code:

  • ๐Ÿ“ธ Instant Snapshots - Zero-copy snapshots in milliseconds, no storage explosion
  • โฐ Time Travel - Query data as it existed at any point in history
  • ๐Ÿ”€ Branch & Merge - Test migrations and transformations in isolated branches
  • โ†ฉ๏ธ Instant Rollback - Restore to any previous state without full backups
  • ๐Ÿ” Complete Audit Trail - Track every data change with immutable history

Why it matters: Data mistakes are expensive. Git for Data gives you the safety net and flexibility developers have enjoyed with Gitโ€”now for your most critical asset: your data.


๐ŸŽฏ Built for the AI Era

๐Ÿ—„๏ธ MySQL-Compatible

Drop-in replacement for MySQL. Use existing tools, ORMs, and applications without code changes. Seamless migration path.

๐Ÿค– AI-Native

Built-in vector search (IVF/HNSW) and full-text search. Build RAG apps and semantic search directlyโ€”no external vector databases needed.

โ˜๏ธ Cloud-Native

Storage-compute separation. Deploy anywhere. Elastic scaling. Kubernetes-native. Zero-downtime operations.


๐Ÿš€ One Database for Everything

The typical modern data stack:

๐Ÿ—„๏ธ MySQL for transactions โ†’ ๐Ÿ“Š ClickHouse for analytics โ†’ ๐Ÿ” Elasticsearch for search โ†’ ๐Ÿค– Pinecone for AI

The problem: 4 databases ยท Multiple ETL jobs ยท Hours of data lag ยท Sync nightmares

MatrixOne replaces all of them:

๐ŸŽฏ One database with native OLTP, OLAP, full-text search, and vector search. Real-time. ACID compliant. No ETL.

MatrixOne

โšก๏ธ Get Started in 60 Seconds

1๏ธโƒฃ Launch MatrixOne

docker run -d -p 6001:6001 --name matrixone matrixorigin/matrixone:latest

2๏ธโƒฃ Create Database

mysql -h127.0.0.1 -P6001 -p111 -uroot -e "create database demo"

3๏ธโƒฃ Connect & Query

Install Python SDK:

pip install matrixone-python-sdk

Vector search:

from matrixone import Client
from matrixone.orm import declarative_base
from sqlalchemy import Column, Integer, String, Text
from matrixone.sqlalchemy_ext import create_vector_column

# Create client and connect
client = Client()
client.connect(database='demo')

# Define model using MatrixOne ORM
Base = declarative_base()

class Article(Base):
    __tablename__ = 'articles'
    id = Column(Integer, primary_key=True, autoincrement=True)
    title = Column(String(200), nullable=False)
    content = Column(Text, nullable=False)
    embedding = create_vector_column(8, "f32")

# Create table using client API
client.create_table(Article)

# Insert some data using client API
articles = [
    {'title': 'Machine Learning Guide',
     'content': 'Comprehensive machine learning tutorial...',
     'embedding': [0.1, 0.2, 0.3, 0.15, 0.25, 0.35, 0.12, 0.22]},
    {'title': 'Python Programming',
     'content': 'Learn Python programming basics',
     'embedding': [0.2, 0.3, 0.4, 0.25, 0.35, 0.45, 0.22, 0.32]},
]
client.batch_insert(Article, articles)

client.vector_ops.create_ivf(
    Article,
    name='idx_embedding',
    column='embedding',
    lists=100,
    op_type='vector_l2_ops'
)

query_vector = [0.2, 0.3, 0.4, 0.25, 0.35, 0.45, 0.22, 0.32]
results = client.query(
    Article.title,
    Article.content,
    Article.embedding.l2_distance(query_vector).label("distance"),
).filter(Article.embedding.l2_distance(query_vector) < 0.1).execute()
for row in results.rows:
    print(f"Title: {row[0]}, Content: {row[1][:50]}...")

# Cleanup
client.drop_table(Article)  # Use client API
client.disconnect()

Fulltext Search:

...
from matrixone.sqlalchemy_ext import boolean_match

# Create fulltext index using SDK 
client.fulltext_index.create(
    Article,name='ftidx_content',columns=['title', 'content']
)

# Boolean search with must/should operators
results = client.query(
    Article.title,
    Article.content,
    boolean_match('title', 'content')
        .must('machine')
        .must('learning')
        .must_not('basics')
).execute()

# Results is a ResultSet object
for row in results.rows:
    print(f"Title: {row[0]}, Content: {row[1][:50]}...")
...

That's it! ๐ŸŽ‰ You're now running a production-ready database with Git-like snapshots, vector search, and full ACID compliance.

๐Ÿ’ก Want more control? Check out the Installation & Deployment section below for production-grade installation options.

๐Ÿ“– Python SDK Documentation โ†’

๐Ÿ“š Tutorials & Demos

Ready to dive deeper? Explore our comprehensive collection of hands-on tutorials and real-world demos:

๐ŸŽฏ Getting Started Tutorials

Tutorial Language/Framework Description
Java CRUD Demo Java Java application development
SpringBoot and JPA CRUD Demo Java SpringBoot with Hibernate/JPA
PyMySQL CRUD Demo Python Basic database operations with Python
SQLAlchemy CRUD Demo Python Python with SQLAlchemy ORM
Django CRUD Demo Python Django web framework
Golang CRUD Demo Go Go application development
Gorm CRUD Demo Go Go with Gorm ORM
C# CRUD Demo C# .NET application development
TypeScript CRUD Demo TypeScript TypeScript application development

๐Ÿš€ Advanced Features Tutorials

Tutorial Use Case Related MatrixOne Features
Pinecone-Compatible Vector Search AI & Search vector search, Pinecone-compatible API
IVF Index Health Monitoring AI & Search vector search, IVF index
HNSW Vector Index AI & Search vector search, HNSW index
Fulltext Natural Search AI & Search fulltext search, natural language
Fulltext Boolean Search AI & Search fulltext search, boolean operators
Fulltext JSON Search AI & Search fulltext search, JSON data
Hybrid Search AI & Search hybrid search, vector + fulltext + SQL
RAG Application Demo AI & Search RAG, vector search, fulltext search
Picture(Text)-to-Picture Search AI & Search multimodal search, image similarity
Dify Integration Demo AI & Search AI platform integration
HTAP Application Demo Performance HTAP, real-time analytics
Instant Clone for Multi-Team Development Performance instant clone, Git for Data
Safe Production Upgrade with Instant Rollback Performance snapshot, rollback, Git for Data

๐Ÿ“– View All Tutorials โ†’

๐Ÿ› ๏ธ Installation & Deployment

MatrixOne supports multiple installation methods. Choose the one that best fits your needs:

๐Ÿณ Local Multi-CN Development

Run a complete distributed cluster locally with multiple CN nodes, load balancing, and easy configuration management.

# Quick start
make dev-build && make dev-up

# Connect via proxy (load balanced)
mysql -h 127.0.0.1 -P 6001 -u root -p111

# Configure specific service (interactive editor)
make dev-edit-cn1          # Edit CN1 config
make dev-restart-cn1       # Restart only CN1 (fast!)

๐Ÿ“– Complete Development Guide โ†’ - Comprehensive guide covering standalone setup, multi-CN clusters, monitoring, metrics, configuration, and all make dev-* commands

๐ŸŽฏ Using mo_ctl Tool (Recommended for Production)

One-command deployment and lifecycle management with the official mo_ctl tool. Handles installation, upgrades, backups, and health monitoring automatically.

๐Ÿ“– Complete mo_ctl Installation Guide โ†’

โš™๏ธ Building from Source

Build MatrixOne from source for development, customization, or contributing. Requires Go 1.22, GCC/Clang, Git, and Make.

๐Ÿ“– Complete Build from Source Guide โ†’

๐Ÿณ Other Methods

Docker standalone, Kubernetes, binary packages, and more deployment options.

๐Ÿ“– All Installation Options โ†’

๐Ÿ”Ž Architecture

MatrixOne's architecture is as below:

MatrixOne

For more details, you can checkout MatrixOne Architecture Design.

๐Ÿ Python SDK

MatrixOne provides a comprehensive Python SDK for database operations, vector search, fulltext search, and advanced features like snapshots, PITR, and account management.

Key Features: High-performance async/await support, vector similarity search with IVF/HNSW indexing, fulltext search, metadata analysis, and complete type safety.

๐Ÿ“š Complete Documentation Documentation Status

๐Ÿ“– Python SDK README - Full features, installation, and usage guide

๐Ÿ“ฆ Installation: pip install matrixone-python-sdk

๐Ÿ™Œ Contributing

Contributions to MatrixOne are welcome from everyone.
See Contribution Guide for details on submitting patches and the contribution workflow.

๐Ÿ‘ All contributors

nnsgmsone
Nnsgmsone
XuPeng-SH
XuPeng-SH
zhangxu19830126
Fagongzi
reusee
Reusee
ouyuanning
Ouyuanning
daviszhen
Daviszhen
aunjgr
BRong Njam
sukki37
Maomao
iamlinjunhong
Iamlinjunhong
jiangxinmeng1
Jiangxinmeng1
jianwan0214
Jianwan0214
LeftHandCold
GreatRiver
w-zr
Wei Ziran
m-schen
Chenmingsong
dengn
Dengn
aptend
Aptend
lni
Lni
xzxiong
Jackson
YANGGMM
YANGGMM
qingxinhome
Qingxinhome
badboynt1
Nitao
broccoliSpicy
BroccoliSpicy
mooleetzi
Mooleetzi
fengttt
Fengttt
zzl200012
Kutori
lacrimosaprinz
Prinz
guguducken
Brown
dongdongyang33
Dongdongyang
JackTan25
Boyu Tan
cnutshell
Cui Guoke
JinHai-CN
Jin Hai
lignay
Matthew
bbbearxyz
Bbbearxyz
tianyahui-python
Tianyahui-python
wanglei4687
Wanglei
triump2020
Triump2020
heni02
Heni02
wanhanbo
Wanhanbo
iceTTTT
IceTTTT
volgariver6
LiuBo
taofengliu
ๅˆ˜้™ถๅณฐ
Ariznawlll
Ariznawlll
goodMan-code
GoodMan-code
yingfeng
Yingfeng
mklzl
Mklzl
jensenojs
Jensen
domingozhang
DomingoZhang
arjunsk
Arjun Sunil Kumar
chrisxu333
Nuo Xu
aressu1985
Aressu1985
matrix-meow
Mo-bot
zengyan1
Zengyan1
aylei
Aylei
noneback
NoneBack
WenhaoKong2001
Otter
richelleguice
Richelle Guice
yjw1268
Ryan
e1ijah1
Elijah
MatrixAdventurer
MatrixAdventurer
NTH19
NTH19
anitajjx
Anitajjx
whileskies
Whileskies
BePPPower
BePPPower
jiajunhuang
Jiajun Huang
Morranto
Morranto
Y7n05h
Y7n05h
songjiayang
Songjiayang
Abirdcfly
Abirdcfly
decster
Binglin Chang
Charlie17Li
Charlie17Li
DanielZhangQD
DanielZhangQD
Juneezee
Eng Zer Jun
ericsyh
Eric Shen
Fungx
Fungx
player-kirito
Kirito
JasonPeng1310
Jason Peng
ikenchina
O2
RinChanNOWWW
RinChanNOW!
TheR1sing3un
TheR1sing3un
chaixuqing
XuQing Chai
qqIsAProgrammer
Yiliang Qiu
yubindy
ZeYu Zhao
adlternative
ZheNing Hu
TszKitLo40
Zijie Lu
ZoranPandovski
Zoran Pandovski
yegetables
Ajian
bxiiiiii
Binxxi
coderzc
Coderzc
forsaken628
ColdWater
dr-lab
Dr-lab
florashi181
Florashi181
hiyoyolumi
Hiyoyolumi
jinfuchiang
Jinfu
sourcelliu
Liuguangliang
lokax
Lokax
lyfer233
Lyfer233
sundy-li
Sundyli
supermario1990
Supermario1990
lawrshen
Tjie
Toms1999
Toms
wuliuqii
Wuliuqii
xiw5
Xiyuedong
yclchuxue
Yclchuxue
ZtXavier
Zt

MatrixOne is licensed under the Apache License, Version 2.0.

Release History

VersionChangesUrgencyDate
v3.0.9Release date: Apr 1 2026 MatrixOne version: v3.0.9 MatrixOne 3.0.9 introduces improvements and bug fixes that enhance usability. Below are the major updates. ## Key Improvements ### Vector Search & AI Features *IVF-FLAT build parity: Port IVF-FLAT index build from main for consistent behavior across branches (#23737). *IVF create plan: Remove cross apply from ivf_create on 3.0-dev to simplify the plan (#23780). *IVF runtime robustness: Harden InMem IVF paths against distVec issueMedium4/1/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

vector-cache-optimizerโšก Optimize vector searches with a hyper-efficient cache that uses machine learning for faster, smarter data access and reduced costs.base-setup@2026-04-21
ai-daily-newsDeliver daily Chinese AI news summaries by filtering top global tech blog articles into concise, valuable insights for quick reading.main@2026-04-21
mddbA minimal, lightweight structured data store designed for small applications, scripts and automation workflows. Built for simplicity, portability and low overhead.v2.9.14
crateCrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.6.2.6
milvusMilvus is a high-performance, cloud-native vector database built for scalable vector ANN searchv2.6.15