freshcrate
Skin:/
Home > Databases > infinity

infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

README

Infinity logo

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor and full-text

Infinity is a cutting-edge AI-native database that provides a wide range of search capabilities for rich data types such as dense vector, sparse vector, tensor, full-text, and structured data. It provides robust support for various LLM applications, including search, recommenders, question-answering, conversational AI, copilot, content generation, and many more RAG (Retrieval-augmented Generation) applications.

âšĄī¸ Performance

Infinity performance comparison

🌟 Key Features

Infinity comes with high performance, flexibility, ease-of-use, and many features designed to address the challenges facing the next-generation AI applications:

🚀 Incredibly fast

  • Achieves 0.1 milliseconds query latency and 15K+ QPS on million-scale vector datasets.
  • Achieves 1 millisecond latency and 12K+ QPS in full-text search on 33M documents.

See the Benchmark report for more information.

🔮 Powerful search

  • Supports a hybrid search of dense embedding, sparse embedding, tensor, and full text, in addition to filtering.
  • Supports several types of rerankers including RRF, weighted sum and ColBERT.

🍔 Rich data types

Supports a wide range of data types including strings, numerics, vectors, and more.

🎁 Ease-of-use

  • Intuitive Python API. See the Python API
  • A single-binary architecture with no dependencies, making deployment a breeze.
  • Embedded in Python as a module and friendly to AI developers.

🎮 Get Started

This section provides guidance on deploying the Infinity database using Docker, with the client and server as separate processes.

Prerequisites

  • CPU: x86_64 with AVX2 support.
  • OS:
    • Linux with glibc 2.17+.
    • Windows 10+ with WSL/WSL2.
    • MacOS
  • Python: Python 3.11+.

Install Infinity server

Linux x86_64 & MacOS x86_64

sudo mkdir -p /var/infinity && sudo chown -R $USER /var/infinity
docker pull infiniflow/infinity:nightly
docker run -d --name infinity -v /var/infinity/:/var/infinity --ulimit nofile=500000:500000 --network=host infiniflow/infinity:nightly

Windows

If you are on Windows 10+, you must enable WSL or WSL2 to deploy Infinity using Docker. Suppose you've installed Ubuntu in WSL2:

  1. Follow this to enable systemd inside WSL2.

  2. Install docker-ce according to the instructions here.

  3. If you have installed Docker Desktop version 4.29+ for Windows: Settings > Features in development, then select Enable host networking.

  4. Pull the Docker image and start Infinity:

    sudo mkdir -p /var/infinity && sudo chown -R $USER /var/infinity
    docker pull infiniflow/infinity:nightly
    docker run -d --name infinity -v /var/infinity/:/var/infinity --ulimit nofile=500000:500000 --network=host infiniflow/infinity:nightly

Install Infinity client

pip install infinity-sdk==0.7.0.dev5

Run a vector search

import infinity

infinity_obj = infinity.connect(infinity.NetworkAddress("<SERVER_IP_ADDRESS>", 23817)) 
db_object = infinity_object.get_database("default_db")
table_object = db_object.create_table("my_table", {"num": {"type": "integer"}, "body": {"type": "varchar"}, "vec": {"type": "vector, 4, float"}})
table_object.insert([{"num": 1, "body": "unnecessary and harmful", "vec": [1.0, 1.2, 0.8, 0.9]}])
table_object.insert([{"num": 2, "body": "Office for Harmful Blooms", "vec": [4.0, 4.2, 4.3, 4.5]}])
res = table_object.output(["*"])
                  .match_dense("vec", [3.0, 2.8, 2.7, 3.1], "float", "ip", 2)
                  .to_pl()
print(res)

🔧 Deploy Infinity using binary

If you wish to deploy Infinity using binary with the server and client as separate processes, see the Deploy infinity using binary guide.

🔧 Build from Source

See the Build from Source guide.

📚 Document

📜 Roadmap

See the Infinity Roadmap 2025

🙌 Community

Release History

VersionChangesUrgencyDate
v0.7.0Release v0.7.0 created from 544d5b876b0214d62bce4a05f7790833648e16d0 at 2026-05-15 11:58:20+00:00 # Summary - Fast plaid tensor index - Mmap secondary index - Secondary functional index - JSON type and related JSON functions - JSON Index - Low cardinality optimization for secondary index and add boolean type support - ARM64 offical build - GO SDK - LIKE/NOT LIKE operators - Language-aware stemming support in RagTokenizer - Plaid Index Refactor - Aggregation Refactor ## What'sHigh5/15/2026
v0.7.0-dev7Release v0.7.0-dev7 created from ce179311837657b0db93abfa4a96178b6e44d94f at 2026-05-13 10:26:01+00:00 High5/13/2026
v0.7.0-dev6Release v0.7.0-dev6 created from 39c473403b13b4c9c29d98cce2dae92f025192bb at 2026-05-06 15:18:15+00:00 High5/6/2026
v0.7.0-dev5Release v0.7.0-dev5 created from 45692bc9cde22cf8eb9c0ce6632de752780a5b95 at 2026-03-27 11:44:26+00:00 Medium3/27/2026
nightly-arm64Release nightly-arm64 created from f326811b82cb1e12d64e11a1f18a07c69108bb78 at 2026-03-23 22:31:27+09:00 Medium3/23/2026
v0.7.0-dev3Release v0.7.0-dev3 created from fe2ea0334f8731fde75db7dfa53d98a13bc97f72 at 2026-02-27 18:05:15+08:00 Low2/27/2026
v0.7.0-dev2Release v0.7.0-dev2 created from b4440ad97ed3eea237a950dcbcec22c9bd28df34 at 2026-01-27 09:38:28+08:00Low1/26/2026
nightlyRelease nightly created from 18f164fab4ae2a4d3811fb9feb44c120833c5d51 at 2026-04-21 14:05:05+00:00 Low1/20/2026
v0.7.0-dev1Release v0.7.0-dev1 created from f69d7b3b7f3942fc56c31f210967694ba517cdbb at 2026-01-19 14:38:14+08:00Low1/16/2026
v0.6.15Release v0.6.15 created from 0ab28d4170b2419b6ea3f24b09b726fbeb217b6a at 2025-12-27 17:35:25+08:00 Low12/27/2025
v0.6.14Release v0.6.14 created from a2f898f391936aeeaa3b391997ab1af751e4d1c8 at 2025-12-26 18:05:39+08:00 Low12/26/2025
v0.6.13Release v0.6.13 created from 06489fd0040d131e1f743b2cb0b0b31296641ff5 at 2025-12-24 16:43:41+08:00 Low12/24/2025
v0.6.12Release v0.6.12 created from a17dc2a4ac0d4aa33e96cb5d0d8a639f11b10b48 at 2025-12-14 00:46:13+08:00 Low12/15/2025
v0.6.11Release v0.6.11 created from 50a007e86db445dfa185775eeb283500b4f30356 at 2025-12-08 13:45:15+08:00 Low12/8/2025
v0.6.10Release v0.6.10 created from a6417aa61707dcc63a28e5ef92138bb716452439 at 2025-12-04 10:32:14+08:00 Low12/4/2025
v0.6.9Release v0.6.9 created from e1d63bf2e4dbe9ef73e42c8c38c82e5b600d1ec8 at 2025-12-03 17:29:37+08:00 Low12/3/2025
v0.6.8Release v0.6.8 created from faa2996f0e497a9752f541126ec0aa96ff7fdcc4 at 2025-12-02 13:01:12+08:00 Low12/2/2025
v0.6.7Release v0.6.7 created from e68246822c0f33b10909d0f058f196421889b50d at 2025-11-25 18:42:49+08:00 Low11/25/2025
v0.6.6Release v0.6.6 created from 2f757bf1cf4a2ca39e158e01ddc955f1d4eb7a1b at 2025-11-19 20:03:27+08:00 Low11/19/2025
v0.6.5Release v0.6.5 created from 4433cc4110b0e2be738916cdd27541809de9427c at 2025-11-12 11:53:50+08:00 Low11/12/2025
v0.6.4Release v0.6.4 created from 4263035d00d241482e09a62e364f7d15d97d0364 at 2025-11-06 19:51:05+08:00 Low11/6/2025
v0.6.3Release v0.6.3 created from b18f68a6713c6c19fdfe0743a862a90571e6afbe at 2025-11-04 15:01:18+08:00Low11/4/2025
v0.6.2Release v0.6.2 created from 5dd3f6a688fe878305de88ff33e4d071089b29d6 at 2025-10-29 22:17:08+08:00 Low10/29/2025
v0.6.1Release v0.6.1 created from f410598d672765165f4a7b9c81b81e7e9fa9ff74 at 2025-10-21 20:12:37+08:00 Low10/25/2025
v0.6.0Release v0.6.0 created from 49a3a0a0b1857610a80bdf2d00f8fa75f169bf0d at 2025-10-14 11:37:28+00:00 Low10/14/2025
v0.6.0-dev7Release v0.6.0-dev7 created from 7b3389b1ff920e524d369559eb4823120e14f220 at 2025-10-10 02:14:24+00:00 Low10/10/2025
v0.6.0-dev6Release v0.6.0-dev6 created from adfcb10076f61c792560188c238e17e482e83d62 at 2025-09-29 09:52:52+00:00 Low9/29/2025
v0.6.0-dev5Release v0.6.0-dev5 created from e061e77bbd24984f49a538a1da80513a75021be1 at 2025-07-28 10:00:31+00:00 Low7/28/2025
v0.6.0-dev4## What's Changed * Add order by wrong column check. by @small-turtle-1 in https://github.com/infiniflow/infinity/pull/2515 * Refactor txn inferface before refactoring META data engine by @JinHai-CN in https://github.com/infiniflow/infinity/pull/2490 * Unnest2 by @small-turtle-1 in https://github.com/infiniflow/infinity/pull/2514 * Support rank features query:part2 by @yingfeng in https://github.com/infiniflow/infinity/pull/2518 * Fix empty datablock array. by @small-turtle-1 in https://gitLow7/23/2025
v0.6.0-dev3Release v0.6.0-dev3 created from d498c5dbff052d1e2452cd9eb8c5a2faff3ed422 at 2025-02-05 11:53:43+00:00 Low2/5/2025
v0.6.0-dev2Release v0.6.0-dev2 created from f5fbb58efa4563cfcf01351b63bf9c16e93ddb31 at 2025-01-15 09:40:23+00:00 Low1/15/2025
v0.6.0-dev1Release v0.6.0-dev1 created from edf6434d8c0866162019948fa5f3b2110fa49e15 at 2025-01-09 13:36:10+00:00 Low1/9/2025
v0.5.2Release v0.5.2 created from 2df22b55b05222cc38cf86f0ebae5d1b934aa22e at 2024-12-24 03:45:36+00:00 Low12/24/2024
v0.5.1Release v0.5.1 created from ebc567d1d4a488059aacc3d99089ae5090eed161 at 2024-12-23 12:50:09+00:00 Low12/23/2024
v0.5.0## Summary - Support for high availability architecture based on shared storage. - Support product quantization for dense vectors - Support 8 bit scalar quantization for dense vectors - Support binary dense vectors, and provide Hamming distance similarity. - Support search results caching and pagination. - Allow to specify comment when creating database and index. - Support functions including: regular expression, md5, substring, lower, upper, ltrim, rtrim, trim, char_posiition, sqrt, Low12/18/2024
v0.5.0-dev7Release v0.5.0-dev7 created from 4c0d416890270b89698fbfff42192c5e54b61e79 at 2024-12-16 08:57:29+00:00 Low12/16/2024
v0.5.0-dev6Release v0.5.0-dev6 created from 3f07ecf0da107a117f59549ee330bb8eecde3230 at 2024-12-12 04:02:06+00:00 Low12/12/2024
v0.5.0-dev5Release v0.5.0-dev5 created from 94cb416891284017bf52f581fb8fe7e7df6f1ab9 at 2024-11-23 03:48:26+00:00 Low11/23/2024
v0.5.0-dev4Release v0.5.0-dev4 created from 6f695bcbad418df742fc3e16f1dfc68f2576a188 at 2024-11-20 03:48:21+00:00 Low11/20/2024
v0.5.0-dev3Release v0.5.0-dev3 created from 65bf2084f48c33abb7c8fa34b94731f1a1bbdee1 at 2024-11-17 06:38:24+00:00 Low11/17/2024
v0.5.0-dev2Release v0.5.0-dev2 created from dfe9caf19a59b73406817d393f1a24e97dde81a4 at 2024-11-09 04:54:06+00:00 Low11/9/2024
v0.5.0-dev1Release v0.5.0-dev1 created from bca2188ffd6c7701dfd245c1efa429a293779a54 at 2024-10-29 08:41:08+00:00 Low10/29/2024
v0.4.0Added support: - S3 compatible storage - Different filters for each search way - Highlight/Min should match/keywords as filters for full text search, Japanese/Korean language support for full text search. - Int8/F16/BF16, single document multi-vector for dense embedding data. - IVF dense vector index. - IN operator,adding/deleting columns, parquet format importing and exporting Low10/15/2024
v0.4.0-dev5Release v0.4.0-dev5 created from 443f9f456de8a030049a326753d4129e74cf06d2 at 2024-10-15 06:59:29+00:00 Low10/15/2024
v0.4.0-dev4Release v0.4.0-dev4 created from 1e0ac828ccbf5d9dce9414c4ba69929e50d567d1 at 2024-10-14 15:26:28+00:00 Low10/14/2024
v0.4.0-dev3Release v0.4.0-dev3 created from e554c06c3976f981eb48b47b5e9ea5d0b24a652b at 2024-10-09 12:24:17+00:00 Low10/9/2024
v0.4.0-dev2Release v0.4.0-dev2 created from 14962ebe8e95e8f514b2c535151a795419ce5761 at 2024-09-23 02:33:08+00:00 Low9/23/2024
v0.4.0-dev1Release v0.4.0-dev1 created from 81d7c7f5d68662d75effc24ffc2ce1ff04266dff at 2024-09-12 01:21:10+00:00 Low9/12/2024
v0.3.0## What's Changed * Support simd functions dispatch at runtime by @yangzq50 in https://github.com/infiniflow/infinity/pull/1473 * Add Qdrant benchmark on sparse embedding of SPLADE by @Ami11111 in https://github.com/infiniflow/infinity/pull/1478 * Update simd support for aarch64 by @yangzq50 in https://github.com/infiniflow/infinity/pull/1481 * Fix document error by @JinHai-CN in https://github.com/infiniflow/infinity/pull/1483 * Update performance image by @JinHai-CN in https://github.com/Low9/3/2024
v0.3.0-dev8Release v0.3.0-dev8 created from 82b5e5765130e0fe8b641d3e5a828c577053ad2e at 2024-09-02 10:01:19+00:00 Low9/2/2024
v0.2.1## What's Changed * Fix build script for nanobind by @yangzq50 in https://github.com/infiniflow/infinity/pull/1390 * Fix a docusaurus display issue by @writinwaters in https://github.com/infiniflow/infinity/pull/1393 * Fix: use static link for nanobind by @yangzq50 in https://github.com/infiniflow/infinity/pull/1392 * Remove redundant code by @JinHai-CN in https://github.com/infiniflow/infinity/pull/1396 * Rerank hnsw lvq by @small-turtle-1 in https://github.com/infiniflow/infinity/pull/13Low7/15/2024
v0.2.0## What's Changed * Let thrift client reconnect on insert failure by @yuzhichang in https://github.com/infiniflow/infinity/pull/1156 * Declare fma requirement at CMakeLists.txt by @yuzhichang in https://github.com/infiniflow/infinity/pull/1162 * Update blockmax_maxscore_iterator.cppm by @eltociear in https://github.com/infiniflow/infinity/pull/1166 * Added python venv into CentOS 7 builder image by @yuzhichang in https://github.com/infiniflow/infinity/pull/1169 * Update infinity configs andLow6/26/2024
v0.1.1## What's Changed * Update benchmark.md by @writinwaters in https://github.com/infiniflow/infinity/pull/1158 * Fix benchmark doc error by @JinHai-CN in https://github.com/infiniflow/infinity/pull/1159 * Add specific pysdk version to README by @JinHai-CN in https://github.com/infiniflow/infinity/pull/1160 * Editorial updates to RESTful APIs by @writinwaters in https://github.com/infiniflow/infinity/pull/1165 * Added quickstart by @writinwaters in https://github.com/infiniflow/infinity/pull/1Low5/10/2024

Dependencies & License Audit

Loading dependencies...

Similar Packages

reasonkit-mem🚀 Build memory and retrieval infrastructure for ReasonKit, enhancing data management and access for your applications with ease and efficiency.main@2026-05-31
SeekStormSeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.v3.2.1
weaviateWeaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cv1.37.7
knowhereVector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.v3.0.3
endeeEndee.io – A high-performance vector database, designed to handle up to 1B vectors on a single node, delivering significant performance gains through optimized indexing and execution. Also available i1.3.5

More from infiniflow

ragflowRAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

More in Databases

milvusMilvus is a high-performance, cloud-native vector database built for scalable vector ANN search
WeKnoraLLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
ai-real-estate-assistantAdvanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
alibabacloud-adb20211201Alibaba Cloud adb (20211201) SDK Library for Python