txtai

Home > Databases > txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

agents ai ai-agents embeddings information-retrieval language-model large-language-models llm python vector-database

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

README

All-in-one AI framework

txtai is an all-in-one AI framework for semantic search, LLM orchestration and language model workflows.

The key component of txtai is an embeddings database, which is a union of vector indexes (sparse and dense), graph networks and relational databases.

This foundation enables vector search and/or serves as a powerful knowledge source for large language model (LLM) applications.

Build autonomous agents, retrieval augmented generation (RAG) processes, multi-model workflows and more.

Summary of txtai features:

🔎 Vector search with SQL, object storage, topic modeling, graph analysis and multimodal indexing
📄 Create embeddings for text, documents, audio, images and video
💡 Pipelines powered by language models that run LLM prompts, question-answering, labeling, transcription, translation, summarization and more
↪️️ Workflows to join pipelines together and aggregate business logic. txtai processes can be simple microservices or multi-model workflows.
🤖 Agents that intelligently connect embeddings, pipelines, workflows and other agents together to autonomously solve complex problems
⚙️ Web and Model Context Protocol (MCP) APIs. Bindings available for JavaScript, Java, Rust and Go.
🔋 Batteries included with defaults to get up and running fast
☁️ Run local or scale out with container orchestration

txtai is built with Python 3.10+, Hugging Face Transformers, Sentence Transformers and FastAPI. txtai is open-source under an Apache 2.0 license.

Note

NeuML is the company behind txtai and we provide AI consulting services around our stack. Schedule a meeting or send a message to learn more.

We're also building an easy and secure way to run hosted txtai applications with txtai.cloud.

Why txtai?

New vector databases, LLM frameworks and everything in between are sprouting up daily. Why build with txtai?

Up and running in minutes with pip or Docker

# Get started in a couple lines
import txtai

embeddings = txtai.Embeddings()
embeddings.index(["Correct", "Not what we hoped"])
embeddings.search("positive", 1)
#[(0, 0.29862046241760254)]

Built-in API makes it easy to develop applications using your programming language of choice

# app.yml
embeddings:
    path: sentence-transformers/all-MiniLM-L6-v2

CONFIG=app.yml uvicorn "txtai.api:app"
curl -X GET "http://localhost:8000/search?query=positive"

Run local - no need to ship data off to disparate remote services
Work with micromodels all the way up to large language models (LLMs)
Low footprint - install additional dependencies and scale up when needed
Learn by example - notebooks cover all available functionality

Use Cases

The following sections introduce common txtai use cases. A comprehensive set of over 70 example notebooks and applications are also available.

Semantic Search

Build semantic/similarity/vector/neural search applications.

Traditional search systems use keywords to find data. Semantic search has an understanding of natural language and identifies results that have the same meaning, not necessarily the same keywords.

Get started with the following examples.

Notebook	Description
Introducing txtai ▶️	Overview of the functionality provided by txtai
Similarity search with images	Embed images and text into the same space for search
Build a QA database	Question matching with semantic search
Semantic Graphs	Explore topics, data connectivity and run network analysis

LLM Orchestration

Autonomous agents, retrieval augmented generation (RAG), chat with your data, pipelines and workflows that interface with large language models (LLMs).

See below to learn more.

Notebook	Description
Prompt templates and task chains	Build model prompts and connect tasks together with workflows
Integrate LLM frameworks	Integrate llama.cpp, LiteLLM and custom generation frameworks
Build knowledge graphs with LLMs	Build knowledge graphs with LLM-driven entity extraction
Parsing the stars with txtai	Explore an astronomical knowledge graph of known stars, planets, galaxies

Agents

Agents connect embeddings, pipelines, workflows and other agents together to autonomously solve complex problems.

txtai agents are built on top of the smolagents framework. This supports all LLMs txtai supports (Hugging Face, llama.cpp, OpenAI / Claude / AWS Bedrock via LiteLLM). Agent prompting with agents.md and skill.md are also supported.

Check out this Agent Quickstart Example. Additional examples are listed below.

Notebook	Description
Granting autonomy to agents	Agents that iteratively solve problems as they see fit
TxtAI got skills	Integrate skill.md files with your agent
Agent Tools ▶️	Learn about the txtai agent toolkit
Analyzing LinkedIn Company Posts with Graphs and Agents	Exploring how to improve social media engagement with AI

Retrieval augmented generation

Retrieval augmented generation (RAG) reduces the risk of LLM hallucinations by constraining the output with a knowledge base as context. RAG is commonly used to "chat with your data".

Check out this RAG Quickstart Example. Additional examples are listed below.

Notebook	Description
Build RAG pipelines with txtai ▶️	Guide on retrieval augmented generation including how to create citations
RAG is more than Vector Search	Context retrieval via Web, SQL and other sources
GraphRAG with Wikipedia and GPT OSS	Deep graph search powered RAG
Speech to Speech RAG ▶️	Full cycle speech to speech workflow with RAG

Language Model Workflows

Language model workflows, also known as semantic workflows, connect language models together to build intelligent applications.

While LLMs are powerful, there are plenty of smaller, more specialized models that work better and faster for specific tasks. This includes models for extractive question-answering, automatic summarization, text-to-speech, transcription and translation.

Check out this Workflow Quickstart Example. Additional examples are listed below.

Notebook	Description
Run pipeline workflows ▶️	Simple yet powerful constructs to efficiently process data
Building abstractive text summaries	Run abstractive text summarization
Transcribe audio to text	Convert audio files to text
Translate text between languages	Streamline machine translation and language detection

Installation

The easiest way to install is via pip and PyPI

pip install txtai

Python 3.10+ is supported. Using a Python virtual environment is recommended.

See the detailed install instructions for more information covering optional dependencies, environment specific prerequisites, installing from source, conda support and how to run with containers.

Model guide

See the table below for the current recommended models. These models all allow commercial use and offer a blend of speed and performance.

Component	Model(s)
Embeddings	all-MiniLM-L6-v2
Image Captions	BLIP
Labels - Zero Shot	DeBERTa v3 Zeroshot
Labels - Fixed	Fine-tune with training pipeline
Large Language Model (LLM)	Gemma 4 31B
Summarization	DistilBART
Text-to-Speech	ESPnet JETS
Transcription	Whisper
Translation	OPUS Model Series

Models can be loaded as either a path from the Hugging Face Hub or a local directory. Model paths are optional, defaults are loaded when not specified. For tasks with no recommended model, txtai uses the default models as shown in the Hugging Face Tasks guide.

See the following links to learn more.

Powered by txtai

The following applications are powered by txtai.

Application	Description
rag	Retrieval Augmented Generation (RAG) application
ncoder	Open-Source AI coding agent
paperai	AI for medical and scientific papers
annotateai	Automatically annotate papers with LLMs

In addition to this list, there are also many other open-source projects, published research and closed proprietary/commercial projects that have built on txtai in production.

Documentation

Full documentation on txtai including configuration settings for embeddings, pipelines, workflows, API and a FAQ with common questions/issues is available.

Contributing

For those who would like to contribute to txtai, please see this guide.

Release History

Version	Changes	Urgency	Date
v9.11.0	This release adds support for the turbovec ANN backend and LiteParse text extraction See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add turbovec an ANN backend (#1109) - Add text extraction with LiteParse (#1118) Improvements -------------------------- - Migrate from ggml-py to ggml-python (#1105) - Revert noisy logging workaround when fixed upstream (#1106) - Add reindex parameter for API (#1111) - Use g	High	7/1/2026
v9.10.0	This release adds LiteRT vector support and Knowledge Distillation training See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for LiteRT vectors (#1097) - Add URLRetrieve pipeline (#1100) - Add a Knowledge Distillation Trainer (#1103) - Add a Progressive Distillation Example Notebook (#1104) Improvements -------------------------- - Reduce noisy logging messages with Transformers v5 (#1102)	High	6/4/2026
v9.9.0	This release adds a zero dependency minimal install, LiteRT and Transformers v5 support See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Zero dependency minimal install (#1089, #1090, #1091, #1092, #1093, #1094) - Transformers v5 Support (#1083, #1084, #1085, #1086, #1087, #1088) - Add LiteRT-LM LLM (#1095)	High	5/12/2026
v9.8.0	This release is primarily a bug fix, security and compatibility release. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for last token pooling (#1072) - Add support for streaming Labels pipeline (#1075) - Add safeopen parameter for Textractor (#1077) Improvements -------------------------- - Support Authorization tokens with MCP endpoints (#899) - Update embeddings documentation to mention load	High	4/29/2026
v9.7.0	This release adds the new TxtAI Coding Agent Toolkit See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add coding agent toolkit (#1054, #1055, #1056, #1057, #1058, #1059, #1060, #1061) - Add agent tools example notebook (#1062) Improvements -------------------------- - Apply sigmoid to fused BB25 scores (#1053) Bug Fixes -------------------------- - Fix translation chunk concatenation when showmodels is en	Medium	3/20/2026
v9.6.0	This release adds many great new Embeddings database features: BB25 normalization, ngram tokenization, database expression indexes and conditional JSON data storage See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Bayesian normalization "bb25" method for keyword scoring (#1023, #1037, Thank you @sigridjineth) - Add BB25 normalization for sparse encoders (#1042, #1046 Thank you @jaepil) - Add log-odds conjuncti	Low	2/25/2026
v9.5.0	TxtAI got skills! This release adds a lot of great new agent capabilities: agents.md, skill.md and memory See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for agents.md (#1014) - Add support for skill.md (#1015) - Add support for Agent memory (#1016) - Add ability to store agent memory by session (#1025) - Add example demonstrating how to use agents.md files (#1018) - Add example demonstrating how t	Low	2/12/2026
v9.4.1	This release adds the following new features, improvements and bug fixes. _Note that the Docker build will not be updated since it installs all packages by default;_ Bug Fixes -------------------------- - httpx import at module level causes ModuleNotFoundError for Embeddings-only usage (#1012) - Add minimal build workflow (#1013)	Low	1/23/2026
v9.4.0	Adds OpenCode integration, improved instruction prompts support for vectors, additional keyword tokenization methods and more See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add OpenCode LLM (#1010) - Add additional tokenization methods for keyword search (#1006) - Add example notebook covering OpenCode LLM (#1011) - Add example notebook showing how to finetune LLMs (#1008) Improvements ---------------------	Low	1/21/2026
v9.3.0	Expands RAG to any function, new quickstart examples and many improvements See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for any callable search function in RAG pipeline (#1000) - Add notebook covering any search method with the RAG Pipeline (#1002) - Add RAG Quickstart Example (#991) - Add Agent Quickstart Example (#992) - Add Workflow Quickstart Example (#996) Improvements ------------------	Low	12/22/2025
v9.2.0	This release is primarily a bug fix and compatibility release. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add tuples option to Segmentation pipeline (#979) - Add stripthink parameter to LLM API endpoint (#980) - Support stripthink in streaming LLM responses (#981) Improvements -------------------------- - Update deprecated parameter for HFTrainer (#978) - Update llama.cpp vectors to support default len	Low	11/21/2025
v9.1.0	This release focuses on vector "un-databases" - store vectors with NumPy, Torch and even GGUF from llama.cpp! See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add GGML ANN (#975) - Add quantization support to Torch ANN (#971) - Add support for Safetensors storage to NumPy/Torch ANN backend (#969) - Add example notebook demonstrating how to access low level vector APIs (#973) - Add GraphRAG example notebook with	Low	11/4/2025
v9.0.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update build script for macOS builds (#964) - Additional checks for tar file extraction (#965)	Low	9/15/2025
v9.0.0	### 🎉 We're excited to announce the release of txtai 9.0 🎉 _If you like txtai, please remember to give it a ⭐!_ 9.0 adds first class support for sparse vector models (i.e. SPLADE), late interaction models (i.e. ColBERT), fixed dimensional encoding (i.e. MUVERA) and reranking pipelines. There are also plenty of improvements and bug fixes! New Features -------------------------- - Add sparse vector scoring (#924) - Add IVFFlat indexing for sparse vectors (#929) - Add pgsparse sc	Low	8/28/2025
v8.6.0	This release fixes a number of integration issues with downstream libraries and other performance improvements. See below for full details on the new features, improvements and bug fixes. Improvements -------------------------- - Handling truncation for the Similarity pipeline (#882) - Update tagline to the all-in-one AI framework (#901) Bug Fixes -------------------------- - Encoding issue with latest version of LiteLLM (#902) - Fix bug with latest version of smolagents (#906	Low	6/10/2025
v8.5.0	This release migrates from Transformers Agents to smolagents, adds Model Context Protocol (MCP) support and now requires Python 3.10+ See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Migrate to smolagents (#890) - Add Model Context Protocol (MCP) Support (#892) - Add support for MCP servers to Agent Framework (#898) - Require Python 3.10 (#897) Improvements -------------------------- - Lazy load list of tran	Low	4/14/2025
v8.4.0	This release adds support for vision LLMs, graph vector search, embeddings checkpoints, observability and an OpenAI-compatible API See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for vision models to HF LLM pipeline (#884) - Add similar query clause to graph queries (#875) - Feature Request: Embeddings index checkpointing (#695) - Feature Request: Enhance observability and tracing capabilities (#869)	Low	3/11/2025
v8.3.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Ensure staticvectors is installed before calling method (#876)	Low	2/12/2025
v8.3.0	This release adds support for GLiNER, Chonkie, Kokoro TTS and Static Vectors See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for GLiNER models (#862) Thank you @urchade - Add semantic chunking pipeline (#812) Thank you @bhavnicksm - Add Kokoro TTS support to TextToSpeech pipeline (#854) Thank you @hexgrad - Add staticvectors inference (#859) - Add example notebook for Entity Extraction with GLiNER (	Low	2/11/2025
v8.2.0	This release simplifies LLM chat messages, adds attribute filtering to Graph RAG and enables multi-cpu/gpu vector encoding See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add defaultrole to LLM pipeline (#841) - Feature Request: Graph RAG - Add extra attributes (#684) - Support graph=True in embeddings config (#848) - Support pulling attribute data in graph.scan (#849) - Encoding using multiple-GPUs (#541) - A	Low	1/9/2025
v8.1.0	This release adds Docling integration, Embeddings context managers and significant database component enhancements See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add text extraction with Docling (#814) - Add Embeddings context manager (#832) - Add support for halfvec and bit vector types with PGVector ANN (#839) - Persist embeddings components to specified schema (#829) - Add example notebook that analyzes the	Low	12/10/2024
v8.0.0	### 🎉 We're excited to announce the release of txtai 8.0 🎉 _If you like txtai, please remember to give it a ⭐!_ 8.0 introduces agents. Agents automatically create workflows to answer multi-faceted user requests. Agents iteratively prompt and/or interface with tools to step through a process and ultimately come to an answer for a request. This release also adds support for Model2Vec vectorization. See below for more. New Features -------------------------- - Add txtai agents 🚀	Low	11/18/2024
v7.5.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update translation pipeline to use hf_hub_download for language detection (#803)	Low	10/25/2024
v7.5.0	This release adds Speech to Speech RAG, new TTS models and Generative Audio features See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Speech to Speech example notebook (#789) - Add streaming speech generation (#784) - Add a microphone pipeline (#785) - Add an audio playback pipeline (#786) - Add Text to Audio pipeline (#792) - Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793) - Add	Low	10/14/2024
v7.4.0	This release adds the SQLite ANN, new text extraction features and a programming language neutral embeddings index format See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add SQLite ANN (#780) - Enhance markdown support for Textractor (#758) - Update txtai index format to remove Python-specific serialization (#769) - Add new functionality to RAG application (#753) - Add bm25s library to benchmarks (#757) Thank y	Low	9/5/2024
v7.3.0	This release adds a new RAG front-end application template, streaming LLM and streaming RAG support along with significant text extraction improvements See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for streaming LLM generation (#680) - Add RAG API endpoint (#735) - Add RAG deepdive notebook (#737) - Add RAG example application (#743) Improvements -------------------------- - Improve textractor	Low	7/15/2024
v7.2.0	This release adds Postgres integration for all components, LLM Chat Messages and vectorization with llama.cpp/LiteLLM See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add pgvector ANN backend (#698) - Add RDBMS Graph (#699) - Add notebook covering txtai integration with Postgres (#701) - Add Postgres Full Text Scoring (#713) - Add support for chat messages in LLM pipeline (#718) - Add support for LiteLLM vector	Low	5/31/2024
v7.1.0	This release adds dynamic embeddings vector support along with semantic graph and RAG improvements See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for dynamic vector dimensions (#674) - Add batch node and edge creation for graphs (#693) - Add notebook on Retrieval Augmented and Guided Generation (#694) Improvements -------------------------- - Pass options to underlying vector models (#675) - Mo	Low	4/19/2024
v7.0.0	### 🎉 We're excited to announce the release of txtai 7.0 🎉 _If you like txtai, please remember to give it a ⭐!_ 7.0 introduces the next generation of the semantic graph. This release adds support for graph search, advanced graph traversal and graph RAG. It also adds binary support to the API, index format improvements and training LoRA/QLoRA models. See below for more. New Features -------------------------- - Add indexing of embeddings graph relationships (#525) - Expand the grap	Low	2/21/2024
v6.3.0	This release adds new LLM inference methods, API Authorization and RAG improvements 📄 New LLM methods. llama.cpp and LiteLLM support added. LLM pipeline now supports Hugging Face models, GGUF files and LLM API inference all with one line of code. 🔒 API Authorization. Adds support for API keys and pluggable authentication methods when running through txtai API. See below for full details on the new features, improvements and bug fixes. New Features --------------------------	Low	1/2/2024
v6.2.0	This release adds binary quantization, bind parameters for multimedia SQL queries and performance improvements ⚡ Scalar quantization. Supports 1 bit (binary) through 8 bit quantization. Can dramatically reduce vector storage requirements. 🚀 SQL bind parameters. Enables searching binary content with SQL statements, along with being a standard best practice. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add	Low	11/8/2023
v6.1.0	This release adds metadata support for client-server databases and custom scoring implementations 🗃️ Client-server database integration. Store index metadata in Postgres, MariaDB/MySQL, MSSQL and more. 🖹 Custom scoring implementations. Store keyword index data in systems such as Elasticsearch. Similar to functionality already available in vector index component. See below for full details on the new features, improvements and bug fixes. New Features --------------------------	Low	9/26/2023
v6.0.0	### 🥳 We're excited to announce the release of txtai 6.0 🥳 _This significant milestone release marks txtai's 3 year birthday🎉 If you like txtai, please remember to give it a ⭐!_ 6.0 adds sparse, hybrid and subindexes to the embeddings interface. It also makes significant improvements to the LLM pipeline workflow. See below for more. Breaking changes The vast majority of changes are fully backwards compatible. New features are only enabled when specified. The only breaking cha	Low	8/10/2023
v5.5.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Wrap DuckDB as conditional import (#464)	Low	4/27/2023
v5.5.0	This release adds workflow streams and DuckDB as a database backend ↪️️ Workflow streams enable server-side processing of large datasets. Streams iteratively pass content to workflows, no need to pass bulk data through the API. 🦆 DuckDB is a new database backend. Certain larger non-vector driven queries and aggregations will now run significantly faster than with SQLite. See below for full details on the new features, improvements and bug fixes. New Features -------------------	Low	4/20/2023
v5.4.0	This release adds prompt templates, conversational task chaining and Hugging Face Hub integration 📃 Prompt templates dynamically generate text using workflow task inputs. This enables chaining multiple prompts and models together. 🤗 Embeddings now integrate with the Hugging Face Hub! Easily share and load embeddings indexes. There is a full embeddings index available for English Wikipedia. See below for full details on the new features, improvements and bug fixes. New Features	Low	3/6/2023
v5.3.0	This release adds embeddings-guided and prompt-driven search along with a number of methods to train language models 🔎 Prompt-driven search is a big step forward towards conversational search in txtai. With this release, complex prompts can now be passed to txtai to customize how search results are returned. Lots of exciting possibilities on this front, stay tuned. 💡 The trainer pipeline now has support for training language models from scratch. It supports masked language modeling (	Low	2/6/2023
v5.2.0	This release adds TextToSpeech and Cross-Encoder pipelines. The performance of the `embeddings.batchtransform` method was significantly improved, enabling a speed up in building semantic graphs. Default configuration is now available for Embeddings, allowing an Embeddings instance to be created with no arguments like Pipelines. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Cross-Encoder support to Similarity pip	Low	12/20/2022
v5.1.0	This release adds new model support for the translation pipeline, OpenAI Whisper support in the transcription pipeline and ARM Docker images. Topic modeling was also updated with improvements, including how to use BM25/TF-IDF indexes to drive topic models. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Multiarch docker image (#324) - Add notebook covering classic topic modeling with BM25 (#360) Improvements ----	Low	10/18/2022
v5.0.0	### 🎈🎉🥳 We're excited to announce the release of txtai 5.0! 🥳🎉🎈 _Thank you to the txtai community! Please remember to ⭐ txtai!_ txtai 5.0 is a major new release. This release adds the semantic graph along with enabling external integrations. It also adds a number of improvements and bug fixes. New Features -------------------------- - Add scoring-based search (#327) - Add notebook demonstrating functionality of individual embeddings components (#328) - Add SQL expression col	Low	9/27/2022
v4.6.0	### 🎈🎉🥳 txtai turns 2 🎈🎉🥳 ### _We're excited to release the 25th version of txtai marking it's 2 year anniversary. Thank you to the txtai community. Please remember to ⭐ txtai!_ txtai 4.6 is a large but backwards compatible release! This release adds better integration between embeddings and workflows. It also adds a number of significant performance improvements and bug fixes. New Features -------------------------- - Add transform workflow action to application (#281) - Add	Low	8/15/2022
v4.5.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scripts to train bashsql query translation model (#271) - Add QA database example notebook (#272) - Add CITATION file (#273) Improvements -------------------------- - Improve efficiency of external vectors (#275) - Refactor vectors package to improve code reuse (#276) - Add logic to detect external vectors method (#277) Bug Fixes -------------------------- -	Low	5/17/2022
v4.4.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add semantic search explainability (#248) - Add notebook covering model explainability (#249) - Add txtai console (#252) - Add sequences pipeline (#261) - Add scripts to train query translation models (#265) - Add query translation logic in embeddings searches (#266) - Add notebook for query translation (#269) Improvements -------------------------- - Update HFTrain	Low	4/20/2022
v4.3.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fix word embeddings regression with batch transformation (#245)	Low	3/11/2022
v4.3.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook covering txtai embeddings index file structure (#237) - Add Image Hash pipeline (#240) - Add support for custom SQL functions in embeddings queries (#241) - Add notebook for Embeddings SQL functions (#243) - Add notebook for near-duplicate image detection (#244) Improvements -------------------------- - Rename SQLException to SQLError (#232) - Refactor A	Low	3/10/2022
v4.2.1	This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fixed mislabeled API config definition (#231)	Low	2/28/2022
v4.2.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook for workflow notifications (#225) - Add default and custom docker configurations (#226) - Create docker configuration for AWS Lambda (#228) - Add support for loading/storing embedding indexes on cloud storage (#229) Improvements -------------------------- - Add support for SQL \|\| operator (#223) - Add flag to disable loading index data in API (#230) Bu	Low	2/24/2022
v4.1.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add entity extraction pipeline (#203) - Add workflow scheduling (#206) - Add workflow search task to API (#210) - Add Console Task (#215) - Add Export Task (#216) - Add notebook for workflow scheduling (#218) Improvements -------------------------- - Default documentation theme using system preference (#197) - Improve multi-user experience for workflow application (	Low	2/3/2022
v4.0.0	### 🎈🎉🥳 We're excited to announce the release of txtai 4.0! 🥳🎉🎈 _Thank you to the growing txtai community. This couldn't be done without you. Please remember to ⭐ txtai if it has been helpful._ txtai 4.0 is a major release with a significant number of new features. This release adds content storage, querying with sql, object storage, reindexing, index compression, external vectors and more! To quantify the changes, the code base increased by 50% with 36 resolved issues, by far th	Low	1/11/2022
v3.7.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add object detection pipeline (#148) - Add image caption pipeline (#149) - Add retrieval task (#150) - Add no-op pipeline (#152) - Add new workflow functionality (#155) Improvements -------------------------- - Add korean translation to README.md. Thank you @0206pdh! (#138) - Add links to external articles (#139) - Update example applications to be consistent (#140)	Low	11/23/2021
v3.6.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add post workflow action to API (#129) - Add tabular pipeline (#134) - Enhance ServiceTask to support additional use cases (#135) - Add notebook for tabular pipeline (#136) - Add topn option to extractor pipeline (#137) Improvements -------------------------- - Refactor registering new auto models to use methods in Transformers library (#128) - Update workflow exampl	Low	11/8/2021
v3.5.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scikit-learn to ONNX export pipeline (#124) - Add registry methods for auto models (#126) - Add notebook to demonstrate loading scikit-learn and PyTorch models (#127) Improvements -------------------------- - Add parameter to return raw model outputs for labels pipeline (#123) - Add parameter to use standard pooling for TransformersVectors (#125) Bug Fixes ----	Low	10/18/2021
v3.4.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Create notebook using extractive qa to build structured data (#117) - Modify extractor pipeline to support similarity pipeline backed context (#119) Improvements -------------------------- - Improve performance of extractor context queries (#120) Bug Fixes -------------------------- - Update labels pipeline to filter text classification output (#116) - Fix issues w	Low	10/7/2021
v3.3.0	This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add ONNX export pipeline (#107) - Add notebook for ONNX pipeline (#108) - Add ONNX support for Embeddings and Pipelines (#109) - Support QA models in Trainer pipeline (#111) - Add notebook for training QA models (#115 ) Improvements -------------------------- - Remove deprecated packages (#114) Bug Fixes -------------------------- - Fix issues with latest Trans	Low	9/10/2021

Dependencies & License Audit

Loading dependencies...

Similar Packages

PageIndex📑 PageIndex: Document Index for Vectorless, Reasoning-based RAGv0.3.0.dev3

server-nexeLocal AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.v1.0.7

uniAISyllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen0.0.0

onyxOpen Source AI Platform - AI Chat with advanced features that works with every LLMv4.4.2

bigragSelf-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST APImain@2026-07-23

More in Databases

alibabacloud-adb20211201Alibaba Cloud adb (20211201) SDK Library for Python

milvusMilvus is a high-performance, cloud-native vector database built for scalable vector ANN search

onyxOpen Source AI Platform - AI Chat with advanced features that works with every LLM

sentence-transformersEmbeddings, Retrieval, and Reranking