| v9.10.0 | **This release adds LiteRT vector support and Knowledge Distillation training** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for LiteRT vectors (#1097) - Add URLRetrieve pipeline (#1100) - Add a Knowledge Distillation Trainer (#1103) - Add a Progressive Distillation Example Notebook (#1104) Improvements -------------------------- - Reduce noisy logging messages with Transformers v5 (#1102) | High | 6/4/2026 |
| v9.9.0 | **This release adds a zero dependency minimal install, LiteRT and Transformers v5 support** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Zero dependency minimal install (#1089, #1090, #1091, #1092, #1093, #1094) - Transformers v5 Support (#1083, #1084, #1085, #1086, #1087, #1088) - Add LiteRT-LM LLM (#1095) | High | 5/12/2026 |
| v9.8.0 | **This release is primarily a bug fix, security and compatibility release.** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for last token pooling (#1072) - Add support for streaming Labels pipeline (#1075) - Add safeopen parameter for Textractor (#1077) Improvements -------------------------- - Support Authorization tokens with MCP endpoints (#899) - Update embeddings documentation to mention load | High | 4/29/2026 |
| v9.7.0 | **This release adds the new TxtAI Coding Agent Toolkit** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add coding agent toolkit (#1054, #1055, #1056, #1057, #1058, #1059, #1060, #1061) - Add agent tools example notebook (#1062) Improvements -------------------------- - Apply sigmoid to fused BB25 scores (#1053) Bug Fixes -------------------------- - Fix translation chunk concatenation when showmodels is en | Medium | 3/20/2026 |
| v9.6.0 | **This release adds many great new Embeddings database features: BB25 normalization, ngram tokenization, database expression indexes and conditional JSON data storage** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Bayesian normalization "bb25" method for keyword scoring (#1023, #1037, Thank you @sigridjineth) - Add BB25 normalization for sparse encoders (#1042, #1046 Thank you @jaepil) - Add log-odds conjuncti | Low | 2/25/2026 |
| v9.5.0 | **TxtAI got skills! This release adds a lot of great new agent capabilities: agents.md, skill.md and memory** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for agents.md (#1014) - Add support for skill.md (#1015) - Add support for Agent memory (#1016) - Add ability to store agent memory by session (#1025) - Add example demonstrating how to use agents.md files (#1018) - Add example demonstrating how t | Low | 2/12/2026 |
| v9.4.1 | This release adds the following new features, improvements and bug fixes. _Note that the Docker build will not be updated since it installs all packages by default;_ Bug Fixes -------------------------- - httpx import at module level causes ModuleNotFoundError for Embeddings-only usage (#1012) - Add minimal build workflow (#1013) | Low | 1/23/2026 |
| v9.4.0 | **Adds OpenCode integration, improved instruction prompts support for vectors, additional keyword tokenization methods and more** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add OpenCode LLM (#1010) - Add additional tokenization methods for keyword search (#1006) - Add example notebook covering OpenCode LLM (#1011) - Add example notebook showing how to finetune LLMs (#1008) Improvements --------------------- | Low | 1/21/2026 |
| v9.3.0 | **Expands RAG to any function, new quickstart examples and many improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for any callable search function in RAG pipeline (#1000) - Add notebook covering any search method with the RAG Pipeline (#1002) - Add RAG Quickstart Example (#991) - Add Agent Quickstart Example (#992) - Add Workflow Quickstart Example (#996) Improvements ------------------ | Low | 12/22/2025 |
| v9.2.0 | **This release is primarily a bug fix and compatibility release.** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add tuples option to Segmentation pipeline (#979) - Add stripthink parameter to LLM API endpoint (#980) - Support stripthink in streaming LLM responses (#981) Improvements -------------------------- - Update deprecated parameter for HFTrainer (#978) - Update llama.cpp vectors to support default len | Low | 11/21/2025 |
| v9.1.0 | **This release focuses on vector "un-databases" - store vectors with NumPy, Torch and even GGUF from llama.cpp!** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add GGML ANN (#975) - Add quantization support to Torch ANN (#971) - Add support for Safetensors storage to NumPy/Torch ANN backend (#969) - Add example notebook demonstrating how to access low level vector APIs (#973) - Add GraphRAG example notebook with | Low | 11/4/2025 |
| v9.0.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update build script for macOS builds (#964) - Additional checks for tar file extraction (#965) | Low | 9/15/2025 |
| v9.0.0 | **We're excited to announce the release of txtai 9.0** _If you like txtai, please remember to give it a ⭐!_ 9.0 adds first-class support for sparse vector models (e.g. SPLADE), late interaction models (e.g. ColBERT), fixed dimensional encoding (e.g. MUVERA) and reranking pipelines. There are also plenty of improvements and bug fixes! New Features -------------------------- - Add sparse vector scoring (#924) - Add IVFFlat indexing for sparse vectors (#929) - Add pgsparse sc | Low | 8/28/2025 |
| v8.6.0 | **This release fixes a number of integration issues with downstream libraries and other performance improvements.** See below for full details on the new features, improvements and bug fixes. Improvements -------------------------- - Handling truncation for the Similarity pipeline (#882) - Update tagline to the all-in-one AI framework (#901) Bug Fixes -------------------------- - Encoding issue with latest version of LiteLLM (#902) - Fix bug with latest version of smolagents (#906 | Low | 6/10/2025 |
| v8.5.0 | **This release migrates from Transformers Agents to smolagents, adds Model Context Protocol (MCP) support and now requires Python 3.10+** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Migrate to smolagents (#890) - Add Model Context Protocol (MCP) Support (#892) - Add support for MCP servers to Agent Framework (#898) - Require Python 3.10 (#897) Improvements -------------------------- - Lazy load list of tran | Low | 4/14/2025 |
| v8.4.0 | **This release adds support for vision LLMs, graph vector search, embeddings checkpoints, observability and an OpenAI-compatible API** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for vision models to HF LLM pipeline (#884) - Add similar query clause to graph queries (#875) - Feature Request: Embeddings index checkpointing (#695) - Feature Request: Enhance observability and tracing capabilities (#869) | Low | 3/11/2025 |
| v8.3.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Ensure staticvectors is installed before calling method (#876) | Low | 2/12/2025 |
| v8.3.0 | **This release adds support for GLiNER, Chonkie, Kokoro TTS and Static Vectors** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for GLiNER models (#862) Thank you @urchade - Add semantic chunking pipeline (#812) Thank you @bhavnicksm - Add Kokoro TTS support to TextToSpeech pipeline (#854) Thank you @hexgrad - Add staticvectors inference (#859) - Add example notebook for Entity Extraction with GLiNER ( | Low | 2/11/2025 |
| v8.2.0 | **This release simplifies LLM chat messages, adds attribute filtering to Graph RAG and enables multi-cpu/gpu vector encoding** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add defaultrole to LLM pipeline (#841) - Feature Request: Graph RAG - Add extra attributes (#684) - Support graph=True in embeddings config (#848) - Support pulling attribute data in graph.scan (#849) - Encoding using multiple-GPUs (#541) - A | Low | 1/9/2025 |
| v8.1.0 | **This release adds Docling integration, Embeddings context managers and significant database component enhancements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add text extraction with Docling (#814) - Add Embeddings context manager (#832) - Add support for halfvec and bit vector types with PGVector ANN (#839) - Persist embeddings components to specified schema (#829) - Add example notebook that analyzes the | Low | 12/10/2024 |
| v8.0.0 | **We're excited to announce the release of txtai 8.0** _If you like txtai, please remember to give it a ⭐!_ 8.0 introduces agents. Agents automatically create workflows to answer multi-faceted user requests. Agents iteratively prompt and/or interface with tools to step through a process and ultimately come to an answer for a request. This release also adds support for Model2Vec vectorization. See below for more. New Features -------------------------- - Add txtai agents | Low | 11/18/2024 |
| v7.5.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update translation pipeline to use hf_hub_download for language detection (#803) | Low | 10/25/2024 |
| v7.5.0 | **This release adds Speech to Speech RAG, new TTS models and Generative Audio features** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Speech to Speech example notebook (#789) - Add streaming speech generation (#784) - Add a microphone pipeline (#785) - Add an audio playback pipeline (#786) - Add Text to Audio pipeline (#792) - Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793) - Add | Low | 10/14/2024 |
| v7.4.0 | **This release adds the SQLite ANN, new text extraction features and a programming language neutral embeddings index format** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add SQLite ANN (#780) - Enhance markdown support for Textractor (#758) - Update txtai index format to remove Python-specific serialization (#769) - Add new functionality to RAG application (#753) - Add bm25s library to benchmarks (#757) Thank y | Low | 9/5/2024 |
| v7.3.0 | **This release adds a new RAG front-end application template, streaming LLM and streaming RAG support along with significant text extraction improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for streaming LLM generation (#680) - Add RAG API endpoint (#735) - Add RAG deepdive notebook (#737) - Add RAG example application (#743) Improvements -------------------------- - Improve textractor | Low | 7/15/2024 |
| v7.2.0 | **This release adds Postgres integration for all components, LLM Chat Messages and vectorization with llama.cpp/LiteLLM** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add pgvector ANN backend (#698) - Add RDBMS Graph (#699) - Add notebook covering txtai integration with Postgres (#701) - Add Postgres Full Text Scoring (#713) - Add support for chat messages in LLM pipeline (#718) - Add support for LiteLLM vector | Low | 5/31/2024 |
| v7.1.0 | **This release adds dynamic embeddings vector support along with semantic graph and RAG improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for dynamic vector dimensions (#674) - Add batch node and edge creation for graphs (#693) - Add notebook on Retrieval Augmented and Guided Generation (#694) Improvements -------------------------- - Pass options to underlying vector models (#675) - Mo | Low | 4/19/2024 |
| v7.0.0 | **We're excited to announce the release of txtai 7.0** _If you like txtai, please remember to give it a ⭐!_ 7.0 introduces the next generation of the semantic graph. This release adds support for graph search, advanced graph traversal and graph RAG. It also adds binary support to the API, index format improvements and training LoRA/QLoRA models. See below for more. New Features -------------------------- - Add indexing of embeddings graph relationships (#525) - Expand the grap | Low | 2/21/2024 |
| v6.3.0 | **This release adds new LLM inference methods, API Authorization and RAG improvements** New LLM methods. llama.cpp and LiteLLM support added. LLM pipeline now supports Hugging Face models, GGUF files and LLM API inference all with one line of code. API Authorization. Adds support for API keys and pluggable authentication methods when running through txtai API. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- | Low | 1/2/2024 |
| v6.2.0 | **This release adds binary quantization, bind parameters for multimedia SQL queries and performance improvements** Scalar quantization. Supports 1 bit (binary) through 8 bit quantization. Can dramatically reduce vector storage requirements. SQL bind parameters. Enables searching binary content with SQL statements, along with being a standard best practice. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add | Low | 11/8/2023 |
| v6.1.0 | **This release adds metadata support for client-server databases and custom scoring implementations** Client-server database integration. Store index metadata in Postgres, MariaDB/MySQL, MSSQL and more. Custom scoring implementations. Store keyword index data in systems such as Elasticsearch. Similar to functionality already available in vector index component. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- | Low | 9/26/2023 |
| v6.0.0 | **🥳 We're excited to announce the release of txtai 6.0 🥳** _This significant milestone release marks txtai's 3 year birthday. If you like txtai, please remember to give it a ⭐!_ 6.0 adds sparse, hybrid and subindexes to the embeddings interface. It also makes significant improvements to the LLM pipeline workflow. See below for more. **Breaking changes** The vast majority of changes are fully backwards compatible. New features are only enabled when specified. The only breaking cha | Low | 8/10/2023 |
| v5.5.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Wrap DuckDB as conditional import (#464) | Low | 4/27/2023 |
| v5.5.0 | **This release adds workflow streams and DuckDB as a database backend** Workflow streams enable server-side processing of large datasets. Streams iteratively pass content to workflows, so there is no need to pass bulk data through the API. 🦆 DuckDB is a new database backend. Certain larger non-vector driven queries and aggregations will now run significantly faster than with SQLite. See below for full details on the new features, improvements and bug fixes. New Features ------------------- | Low | 4/20/2023 |
| v5.4.0 | **This release adds prompt templates, conversational task chaining and Hugging Face Hub integration** Prompt templates dynamically generate text using workflow task inputs. This enables chaining multiple prompts and models together. 🤗 Embeddings now integrate with the Hugging Face Hub! Easily share and load embeddings indexes. There is a full embeddings index available for English Wikipedia. See below for full details on the new features, improvements and bug fixes. New Features | Low | 3/6/2023 |
| v5.3.0 | **This release adds embeddings-guided and prompt-driven search along with a number of methods to train language models** Prompt-driven search is a big step forward towards conversational search in txtai. With this release, complex prompts can now be passed to txtai to customize how search results are returned. Lots of exciting possibilities on this front, stay tuned. The trainer pipeline now has support for training language models from scratch. It supports masked language modeling ( | Low | 2/6/2023 |
| v5.2.0 | This release adds TextToSpeech and Cross-Encoder pipelines. The performance of the `embeddings.batchtransform` method was significantly improved, enabling a speed up in building semantic graphs. Default configuration is now available for Embeddings, allowing an Embeddings instance to be created with no arguments like Pipelines. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Cross-Encoder support to Similarity pip | Low | 12/20/2022 |
| v5.1.0 | This release adds new model support for the translation pipeline, OpenAI Whisper support in the transcription pipeline and ARM Docker images. Topic modeling was also updated with improvements, including how to use BM25/TF-IDF indexes to drive topic models. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Multiarch docker image (#324) - Add notebook covering classic topic modeling with BM25 (#360) Improvements ---- | Low | 10/18/2022 |
| v5.0.0 | **🥳 We're excited to announce the release of txtai 5.0! 🥳** _Thank you to the txtai community! Please remember to ⭐ txtai!_ txtai 5.0 is a major new release. This release adds the semantic graph along with enabling external integrations. It also adds a number of improvements and bug fixes. New Features -------------------------- - Add scoring-based search (#327) - Add notebook demonstrating functionality of individual embeddings components (#328) - Add SQL expression col | Low | 9/27/2022 |
| v4.6.0 | **🥳 txtai turns 2 🥳** _We're excited to release the 25th version of txtai, marking its 2 year anniversary. Thank you to the txtai community. Please remember to ⭐ txtai!_ txtai 4.6 is a large but backwards compatible release! This release adds better integration between embeddings and workflows. It also adds a number of significant performance improvements and bug fixes. New Features -------------------------- - Add transform workflow action to application (#281) - Add | Low | 8/15/2022 |
| v4.5.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scripts to train bashsql query translation model (#271) - Add QA database example notebook (#272) - Add CITATION file (#273) Improvements -------------------------- - Improve efficiency of external vectors (#275) - Refactor vectors package to improve code reuse (#276) - Add logic to detect external vectors method (#277) Bug Fixes -------------------------- - | Low | 5/17/2022 |
| v4.4.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add semantic search explainability (#248) - Add notebook covering model explainability (#249) - Add txtai console (#252) - Add sequences pipeline (#261) - Add scripts to train query translation models (#265) - Add query translation logic in embeddings searches (#266) - Add notebook for query translation (#269) Improvements -------------------------- - Update HFTrain | Low | 4/20/2022 |
| v4.3.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fix word embeddings regression with batch transformation (#245) | Low | 3/11/2022 |
| v4.3.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook covering txtai embeddings index file structure (#237) - Add Image Hash pipeline (#240) - Add support for custom SQL functions in embeddings queries (#241) - Add notebook for Embeddings SQL functions (#243) - Add notebook for near-duplicate image detection (#244) Improvements -------------------------- - Rename SQLException to SQLError (#232) - Refactor A | Low | 3/10/2022 |
| v4.2.1 | This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fixed mislabeled API config definition (#231) | Low | 2/28/2022 |
| v4.2.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook for workflow notifications (#225) - Add default and custom docker configurations (#226) - Create docker configuration for AWS Lambda (#228) - Add support for loading/storing embedding indexes on cloud storage (#229) Improvements -------------------------- - Add support for SQL || operator (#223) - Add flag to disable loading index data in API (#230) Bu | Low | 2/24/2022 |
| v4.1.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add entity extraction pipeline (#203) - Add workflow scheduling (#206) - Add workflow search task to API (#210) - Add Console Task (#215) - Add Export Task (#216) - Add notebook for workflow scheduling (#218) Improvements -------------------------- - Default documentation theme using system preference (#197) - Improve multi-user experience for workflow application ( | Low | 2/3/2022 |
| v4.0.0 | **🥳 We're excited to announce the release of txtai 4.0! 🥳** _Thank you to the growing txtai community. This couldn't be done without you. Please remember to ⭐ txtai if it has been helpful._ txtai 4.0 is a major release with a significant number of new features. This release adds content storage, querying with SQL, object storage, reindexing, index compression, external vectors and more! To quantify the changes, the code base increased by 50% with 36 resolved issues, by far th | Low | 1/11/2022 |
| v3.7.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add object detection pipeline (#148) - Add image caption pipeline (#149) - Add retrieval task (#150) - Add no-op pipeline (#152) - Add new workflow functionality (#155) Improvements -------------------------- - Add Korean translation to README.md. Thank you @0206pdh! (#138) - Add links to external articles (#139) - Update example applications to be consistent (#140) | Low | 11/23/2021 |
| v3.6.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add post workflow action to API (#129) - Add tabular pipeline (#134) - Enhance ServiceTask to support additional use cases (#135) - Add notebook for tabular pipeline (#136) - Add topn option to extractor pipeline (#137) Improvements -------------------------- - Refactor registering new auto models to use methods in Transformers library (#128) - Update workflow exampl | Low | 11/8/2021 |
| v3.5.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scikit-learn to ONNX export pipeline (#124) - Add registry methods for auto models (#126) - Add notebook to demonstrate loading scikit-learn and PyTorch models (#127) Improvements -------------------------- - Add parameter to return raw model outputs for labels pipeline (#123) - Add parameter to use standard pooling for TransformersVectors (#125) Bug Fixes ---- | Low | 10/18/2021 |
| v3.4.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Create notebook using extractive qa to build structured data (#117) - Modify extractor pipeline to support similarity pipeline backed context (#119) Improvements -------------------------- - Improve performance of extractor context queries (#120) Bug Fixes -------------------------- - Update labels pipeline to filter text classification output (#116) - Fix issues w | Low | 10/7/2021 |
| v3.3.0 | This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add ONNX export pipeline (#107) - Add notebook for ONNX pipeline (#108) - Add ONNX support for Embeddings and Pipelines (#109) - Support QA models in Trainer pipeline (#111) - Add notebook for training QA models (#115) Improvements -------------------------- - Remove deprecated packages (#114) Bug Fixes -------------------------- - Fix issues with latest Trans | Low | 9/10/2021 |