freshcrate
Skin:/
Home > Databases > txtai

txtai

๐Ÿ’ก All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

๐Ÿ’ก All-in-one AI framework for semantic search, LLM orchestration and language model workflows

README

All-in-one AI framework

Version GitHub last commit GitHub issues Join Slack Build Status Coverage Status

txtai is an all-in-one AI framework for semantic search, LLM orchestration and language model workflows.

architecture architecture

The key component of txtai is an embeddings database, which is a union of vector indexes (sparse and dense), graph networks and relational databases.

This foundation enables vector search and/or serves as a powerful knowledge source for large language model (LLM) applications.

Build autonomous agents, retrieval augmented generation (RAG) processes, multi-model workflows and more.

Summary of txtai features:

  • ๐Ÿ”Ž Vector search with SQL, object storage, topic modeling, graph analysis and multimodal indexing
  • ๐Ÿ“„ Create embeddings for text, documents, audio, images and video
  • ๐Ÿ’ก Pipelines powered by language models that run LLM prompts, question-answering, labeling, transcription, translation, summarization and more
  • โ†ช๏ธ๏ธ Workflows to join pipelines together and aggregate business logic. txtai processes can be simple microservices or multi-model workflows.
  • ๐Ÿค– Agents that intelligently connect embeddings, pipelines, workflows and other agents together to autonomously solve complex problems
  • โš™๏ธ Web and Model Context Protocol (MCP) APIs. Bindings available for JavaScript, Java, Rust and Go.
  • ๐Ÿ”‹ Batteries included with defaults to get up and running fast
  • โ˜๏ธ Run local or scale out with container orchestration

txtai is built with Python 3.10+, Hugging Face Transformers, Sentence Transformers and FastAPI. txtai is open-source under an Apache 2.0 license.

Note

NeuML is the company behind txtai and we provide AI consulting services around our stack. Schedule a meeting or send a message to learn more.

We're also building an easy and secure way to run hosted txtai applications with txtai.cloud.

Why txtai?

why why

New vector databases, LLM frameworks and everything in between are sprouting up daily. Why build with txtai?

  • Up and running in minutes with pip or Docker
# Get started in a couple lines
import txtai

embeddings = txtai.Embeddings()
embeddings.index(["Correct", "Not what we hoped"])
embeddings.search("positive", 1)
#[(0, 0.29862046241760254)]
  • Built-in API makes it easy to develop applications using your programming language of choice
# app.yml
embeddings:
    path: sentence-transformers/all-MiniLM-L6-v2
CONFIG=app.yml uvicorn "txtai.api:app"
curl -X GET "http://localhost:8000/search?query=positive"
  • Run local - no need to ship data off to disparate remote services
  • Work with micromodels all the way up to large language models (LLMs)
  • Low footprint - install additional dependencies and scale up when needed
  • Learn by example - notebooks cover all available functionality

Use Cases

The following sections introduce common txtai use cases. A comprehensive set of over 70 example notebooks and applications are also available.

Semantic Search

Build semantic/similarity/vector/neural search applications.

demo

Traditional search systems use keywords to find data. Semantic search has an understanding of natural language and identifies results that have the same meaning, not necessarily the same keywords.

search search

Get started with the following examples.

Notebook Description
Introducing txtai โ–ถ๏ธ Overview of the functionality provided by txtai Open In Colab
Similarity search with images Embed images and text into the same space for search Open In Colab
Build a QA database Question matching with semantic search Open In Colab
Semantic Graphs Explore topics, data connectivity and run network analysis Open In Colab

LLM Orchestration

Autonomous agents, retrieval augmented generation (RAG), chat with your data, pipelines and workflows that interface with large language models (LLMs).

llm

See below to learn more.

Notebook Description
Prompt templates and task chains Build model prompts and connect tasks together with workflows Open In Colab
Integrate LLM frameworks Integrate llama.cpp, LiteLLM and custom generation frameworks Open In Colab
Build knowledge graphs with LLMs Build knowledge graphs with LLM-driven entity extraction Open In Colab
Parsing the stars with txtai Explore an astronomical knowledge graph of known stars, planets, galaxies Open In Colab

Agents

Agents connect embeddings, pipelines, workflows and other agents together to autonomously solve complex problems.

agent

txtai agents are built on top of the smolagents framework. This supports all LLMs txtai supports (Hugging Face, llama.cpp, OpenAI / Claude / AWS Bedrock via LiteLLM). Agent prompting with agents.md and skill.md are also supported.

Check out this Agent Quickstart Example. Additional examples are listed below.

Notebook Description
Granting autonomy to agents Agents that iteratively solve problems as they see fit Open In Colab
TxtAI got skills Integrate skill.md files with your agent Open In Colab
Agent Tools โ–ถ๏ธ Learn about the txtai agent toolkit Open In Colab
Analyzing LinkedIn Company Posts with Graphs and Agents Exploring how to improve social media engagement with AI Open In Colab

Retrieval augmented generation

Retrieval augmented generation (RAG) reduces the risk of LLM hallucinations by constraining the output with a knowledge base as context. RAG is commonly used to "chat with your data".

rag rag

Check out this RAG Quickstart Example. Additional examples are listed below.

Notebook Description
Build RAG pipelines with txtai โ–ถ๏ธ Guide on retrieval augmented generation including how to create citations Open In Colab
RAG is more than Vector Search Context retrieval via Web, SQL and other sources Open In Colab
GraphRAG with Wikipedia and GPT OSS Deep graph search powered RAG Open In Colab
Speech to Speech RAG โ–ถ๏ธ Full cycle speech to speech workflow with RAG Open In Colab

Language Model Workflows

Language model workflows, also known as semantic workflows, connect language models together to build intelligent applications.

flows flows

While LLMs are powerful, there are plenty of smaller, more specialized models that work better and faster for specific tasks. This includes models for extractive question-answering, automatic summarization, text-to-speech, transcription and translation.

Check out this Workflow Quickstart Example. Additional examples are listed below.

Notebook Description
Run pipeline workflows โ–ถ๏ธ Simple yet powerful constructs to efficiently process data Open In Colab
Building abstractive text summaries Run abstractive text summarization Open In Colab
Transcribe audio to text Convert audio files to text Open In Colab
Translate text between languages Streamline machine translation and language detection Open In Colab

Installation

install install

The easiest way to install is via pip and PyPI

pip install txtai

Python 3.10+ is supported. Using a Python virtual environment is recommended.

See the detailed install instructions for more information covering optional dependencies, environment specific prerequisites, installing from source, conda support and how to run with containers.

Model guide

models

See the table below for the current recommended models. These models all allow commercial use and offer a blend of speed and performance.

Component Model(s)
Embeddings all-MiniLM-L6-v2
Image Captions BLIP
Labels - Zero Shot DeBERTa v3 Zeroshot
Labels - Fixed Fine-tune with training pipeline
Large Language Model (LLM) Gemma 4 31B
Summarization DistilBART
Text-to-Speech ESPnet JETS
Transcription Whisper
Translation OPUS Model Series

Models can be loaded as either a path from the Hugging Face Hub or a local directory. Model paths are optional, defaults are loaded when not specified. For tasks with no recommended model, txtai uses the default models as shown in the Hugging Face Tasks guide.

See the following links to learn more.

Powered by txtai

The following applications are powered by txtai.

apps

Application Description
rag Retrieval Augmented Generation (RAG) application
ncoder Open-Source AI coding agent
paperai AI for medical and scientific papers
annotateai Automatically annotate papers with LLMs

In addition to this list, there are also many other open-source projects, published research and closed proprietary/commercial projects that have built on txtai in production.

Further Reading

further further

Documentation

Full documentation on txtai including configuration settings for embeddings, pipelines, workflows, API and a FAQ with common questions/issues is available.

Contributing

For those who would like to contribute to txtai, please see this guide.

Release History

VersionChangesUrgencyDate
v9.10.0**This release adds LiteRT vector support and Knowledge Distillation training** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for LiteRT vectors (#1097) - Add URLRetrieve pipeline (#1100) - Add a Knowledge Distillation Trainer (#1103) - Add a Progressive Distillation Example Notebook (#1104) Improvements -------------------------- - Reduce noisy logging messages with Transformers v5 (#1102) High6/4/2026
v9.9.0**This release adds a zero dependency minimal install, LiteRT and Transformers v5 support** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Zero dependency minimal install (#1089, #1090, #1091, #1092, #1093, #1094) - Transformers v5 Support (#1083, #1084, #1085, #1086, #1087, #1088) - Add LiteRT-LM LLM (#1095) High5/12/2026
v9.8.0**This release is primarily a bug fix, security and compatibility release.** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for last token pooling (#1072) - Add support for streaming Labels pipeline (#1075) - Add safeopen parameter for Textractor (#1077) Improvements -------------------------- - Support Authorization tokens with MCP endpoints (#899) - Update embeddings documentation to mention loadHigh4/29/2026
v9.7.0**This release adds the new TxtAI Coding Agent Toolkit** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add coding agent toolkit (#1054, #1055, #1056, #1057, #1058, #1059, #1060, #1061) - Add agent tools example notebook (#1062) Improvements -------------------------- - Apply sigmoid to fused BB25 scores (#1053) Bug Fixes -------------------------- - Fix translation chunk concatenation when showmodels is enMedium3/20/2026
v9.6.0**This release adds many great new Embeddings database features: BB25 normalization, ngram tokenization, database expression indexes and conditional JSON data storage** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Bayesian normalization "bb25" method for keyword scoring (#1023, #1037, Thank you @sigridjineth) - Add BB25 normalization for sparse encoders (#1042, #1046 Thank you @jaepil) - Add log-odds conjunctiLow2/25/2026
v9.5.0**TxtAI got skills! This release adds a lot of great new agent capabilities: agents.md, skill.md and memory** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for agents.md (#1014) - Add support for skill.md (#1015) - Add support for Agent memory (#1016) - Add ability to store agent memory by session (#1025) - Add example demonstrating how to use agents.md files (#1018) - Add example demonstrating how tLow2/12/2026
v9.4.1This release adds the following new features, improvements and bug fixes. _Note that the Docker build will not be updated since it installs all packages by default;_ Bug Fixes -------------------------- - httpx import at module level causes ModuleNotFoundError for Embeddings-only usage (#1012) - Add minimal build workflow (#1013)Low1/23/2026
v9.4.0**Adds OpenCode integration, improved instruction prompts support for vectors, additional keyword tokenization methods and more** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add OpenCode LLM (#1010) - Add additional tokenization methods for keyword search (#1006) - Add example notebook covering OpenCode LLM (#1011) - Add example notebook showing how to finetune LLMs (#1008) Improvements ---------------------Low1/21/2026
v9.3.0**Expands RAG to any function, new quickstart examples and many improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for any callable search function in RAG pipeline (#1000) - Add notebook covering any search method with the RAG Pipeline (#1002) - Add RAG Quickstart Example (#991) - Add Agent Quickstart Example (#992) - Add Workflow Quickstart Example (#996) Improvements ------------------Low12/22/2025
v9.2.0**This release is primarily a bug fix and compatibility release.** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add tuples option to Segmentation pipeline (#979) - Add stripthink parameter to LLM API endpoint (#980) - Support stripthink in streaming LLM responses (#981) Improvements -------------------------- - Update deprecated parameter for HFTrainer (#978) - Update llama.cpp vectors to support default lenLow11/21/2025
v9.1.0**This release focuses on vector "un-databases" - store vectors with NumPy, Torch and even GGUF from llama.cpp!** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add GGML ANN (#975) - Add quantization support to Torch ANN (#971) - Add support for Safetensors storage to NumPy/Torch ANN backend (#969) - Add example notebook demonstrating how to access low level vector APIs (#973) - Add GraphRAG example notebook with Low11/4/2025
v9.0.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update build script for macOS builds (#964) - Additional checks for tar file extraction (#965) Low9/15/2025
v9.0.0### ๐ŸŽ‰ We're excited to announce the release of txtai 9.0 ๐ŸŽ‰ _If you like txtai, please remember to give it a โญ!_ 9.0 adds first class support for sparse vector models (i.e. SPLADE), late interaction models (i.e. ColBERT), fixed dimensional encoding (i.e. MUVERA) and reranking pipelines. There are also plenty of improvements and bug fixes! New Features -------------------------- - Add sparse vector scoring (#924) - Add IVFFlat indexing for sparse vectors (#929) - Add pgsparse scLow8/28/2025
v8.6.0**This release fixes a number of integration issues with downstream libraries and other performance improvements.** See below for full details on the new features, improvements and bug fixes. Improvements -------------------------- - Handling truncation for the Similarity pipeline (#882) - Update tagline to the all-in-one AI framework (#901) Bug Fixes -------------------------- - Encoding issue with latest version of LiteLLM (#902) - Fix bug with latest version of smolagents (#906Low6/10/2025
v8.5.0**This release migrates from Transformers Agents to smolagents, adds Model Context Protocol (MCP) support and now requires Python 3.10+** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Migrate to smolagents (#890) - Add Model Context Protocol (MCP) Support (#892) - Add support for MCP servers to Agent Framework (#898) - Require Python 3.10 (#897) Improvements -------------------------- - Lazy load list of tranLow4/14/2025
v8.4.0**This release adds support for vision LLMs, graph vector search, embeddings checkpoints, observability and an OpenAI-compatible API** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for vision models to HF LLM pipeline (#884) - Add similar query clause to graph queries (#875) - Feature Request: Embeddings index checkpointing (#695) - Feature Request: Enhance observability and tracing capabilities (#869)Low3/11/2025
v8.3.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Ensure staticvectors is installed before calling method (#876)Low2/12/2025
v8.3.0**This release adds support for GLiNER, Chonkie, Kokoro TTS and Static Vectors** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for GLiNER models (#862) Thank you @urchade - Add semantic chunking pipeline (#812) Thank you @bhavnicksm - Add Kokoro TTS support to TextToSpeech pipeline (#854) Thank you @hexgrad - Add staticvectors inference (#859) - Add example notebook for Entity Extraction with GLiNER (Low2/11/2025
v8.2.0**This release simplifies LLM chat messages, adds attribute filtering to Graph RAG and enables multi-cpu/gpu vector encoding** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add defaultrole to LLM pipeline (#841) - Feature Request: Graph RAG - Add extra attributes (#684) - Support graph=True in embeddings config (#848) - Support pulling attribute data in graph.scan (#849) - Encoding using multiple-GPUs (#541) - ALow1/9/2025
v8.1.0**This release adds Docling integration, Embeddings context managers and significant database component enhancements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add text extraction with Docling (#814) - Add Embeddings context manager (#832) - Add support for halfvec and bit vector types with PGVector ANN (#839) - Persist embeddings components to specified schema (#829) - Add example notebook that analyzes theLow12/10/2024
v8.0.0### ๐ŸŽ‰ We're excited to announce the release of txtai 8.0 ๐ŸŽ‰ _If you like txtai, please remember to give it a โญ!_ 8.0 introduces agents. Agents automatically create workflows to answer multi-faceted user requests. Agents iteratively prompt and/or interface with tools to step through a process and ultimately come to an answer for a request. This release also adds support for Model2Vec vectorization. See below for more. New Features -------------------------- - Add txtai agents ๐Ÿš€ Low11/18/2024
v7.5.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Update translation pipeline to use hf_hub_download for language detection (#803) Low10/25/2024
v7.5.0**This release adds Speech to Speech RAG, new TTS models and Generative Audio features** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Speech to Speech example notebook (#789) - Add streaming speech generation (#784) - Add a microphone pipeline (#785) - Add an audio playback pipeline (#786) - Add Text to Audio pipeline (#792) - Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793) - Add Low10/14/2024
v7.4.0**This release adds the SQLite ANN, new text extraction features and a programming language neutral embeddings index format** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add SQLite ANN (#780) - Enhance markdown support for Textractor (#758) - Update txtai index format to remove Python-specific serialization (#769) - Add new functionality to RAG application (#753) - Add bm25s library to benchmarks (#757) Thank yLow9/5/2024
v7.3.0**This release adds a new RAG front-end application template, streaming LLM and streaming RAG support along with significant text extraction improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for streaming LLM generation (#680) - Add RAG API endpoint (#735) - Add RAG deepdive notebook (#737) - Add RAG example application (#743) Improvements -------------------------- - Improve textractorLow7/15/2024
v7.2.0**This release adds Postgres integration for all components, LLM Chat Messages and vectorization with llama.cpp/LiteLLM** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add pgvector ANN backend (#698) - Add RDBMS Graph (#699) - Add notebook covering txtai integration with Postgres (#701) - Add Postgres Full Text Scoring (#713) - Add support for chat messages in LLM pipeline (#718) - Add support for LiteLLM vectorLow5/31/2024
v7.1.0**This release adds dynamic embeddings vector support along with semantic graph and RAG improvements** See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add support for dynamic vector dimensions (#674) - Add batch node and edge creation for graphs (#693) - Add notebook on Retrieval Augmented and Guided Generation (#694) Improvements -------------------------- - Pass options to underlying vector models (#675) - MoLow4/19/2024
v7.0.0### ๐ŸŽ‰ We're excited to announce the release of txtai 7.0 ๐ŸŽ‰ _If you like txtai, please remember to give it a โญ!_ 7.0 introduces the next generation of the semantic graph. This release adds support for graph search, advanced graph traversal and graph RAG. It also adds binary support to the API, index format improvements and training LoRA/QLoRA models. See below for more. New Features -------------------------- - Add indexing of embeddings graph relationships (#525) - Expand the grapLow2/21/2024
v6.3.0**This release adds new LLM inference methods, API Authorization and RAG improvements** ๐Ÿ“„ New LLM methods. llama.cpp and LiteLLM support added. LLM pipeline now supports Hugging Face models, GGUF files and LLM API inference all with one line of code. ๐Ÿ”’ API Authorization. Adds support for API keys and pluggable authentication methods when running through txtai API. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- Low1/2/2024
v6.2.0**This release adds binary quantization, bind parameters for multimedia SQL queries and performance improvements** โšก Scalar quantization. Supports 1 bit (binary) through 8 bit quantization. Can dramatically reduce vector storage requirements. ๐Ÿš€ SQL bind parameters. Enables searching binary content with SQL statements, along with being a standard best practice. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Low11/8/2023
v6.1.0**This release adds metadata support for client-server databases and custom scoring implementations** ๐Ÿ—ƒ๏ธ Client-server database integration. Store index metadata in Postgres, MariaDB/MySQL, MSSQL and more. ๐Ÿ–น Custom scoring implementations. Store keyword index data in systems such as Elasticsearch. Similar to functionality already available in vector index component. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- Low9/26/2023
v6.0.0### ๐Ÿฅณ We're excited to announce the release of txtai 6.0 ๐Ÿฅณ _This significant milestone release marks txtai's 3 year birthday๐ŸŽ‰ If you like txtai, please remember to give it a โญ!_ 6.0 adds sparse, hybrid and subindexes to the embeddings interface. It also makes significant improvements to the LLM pipeline workflow. See below for more. **Breaking changes** The vast majority of changes are fully backwards compatible. New features are only enabled when specified. The only breaking chaLow8/10/2023
v5.5.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Wrap DuckDB as conditional import (#464)Low4/27/2023
v5.5.0**This release adds workflow streams and DuckDB as a database backend** โ†ช๏ธ๏ธ Workflow streams enable server-side processing of large datasets. Streams iteratively pass content to workflows, no need to pass bulk data through the API. ๐Ÿฆ† DuckDB is a new database backend. Certain larger non-vector driven queries and aggregations will now run significantly faster than with SQLite. See below for full details on the new features, improvements and bug fixes. New Features -------------------Low4/20/2023
v5.4.0**This release adds prompt templates, conversational task chaining and Hugging Face Hub integration** ๐Ÿ“ƒ Prompt templates dynamically generate text using workflow task inputs. This enables chaining multiple prompts and models together. ๐Ÿค— Embeddings now integrate with the Hugging Face Hub! Easily share and load embeddings indexes. There is a full embeddings index available for English Wikipedia. See below for full details on the new features, improvements and bug fixes. New Features Low3/6/2023
v5.3.0**This release adds embeddings-guided and prompt-driven search along with a number of methods to train language models** ๐Ÿ”Ž Prompt-driven search is a big step forward towards conversational search in txtai. With this release, complex prompts can now be passed to txtai to customize how search results are returned. Lots of exciting possibilities on this front, stay tuned. ๐Ÿ’ก The trainer pipeline now has support for training language models from scratch. It supports masked language modeling (Low2/6/2023
v5.2.0This release adds TextToSpeech and Cross-Encoder pipelines. The performance of the `embeddings.batchtransform` method was significantly improved, enabling a speed up in building semantic graphs. Default configuration is now available for Embeddings, allowing an Embeddings instance to be created with no arguments like Pipelines. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Add Cross-Encoder support to Similarity pipLow12/20/2022
v5.1.0This release adds new model support for the translation pipeline, OpenAI Whisper support in the transcription pipeline and ARM Docker images. Topic modeling was also updated with improvements, including how to use BM25/TF-IDF indexes to drive topic models. See below for full details on the new features, improvements and bug fixes. New Features -------------------------- - Multiarch docker image (#324) - Add notebook covering classic topic modeling with BM25 (#360) Improvements ----Low10/18/2022
v5.0.0### ๐ŸŽˆ๐ŸŽ‰๐Ÿฅณ We're excited to announce the release of txtai 5.0! ๐Ÿฅณ๐ŸŽ‰๐ŸŽˆ _Thank you to the txtai community! Please remember to โญ txtai!_ txtai 5.0 is a major new release. This release adds the semantic graph along with enabling external integrations. It also adds a number of improvements and bug fixes. New Features -------------------------- - Add scoring-based search (#327) - Add notebook demonstrating functionality of individual embeddings components (#328) - Add SQL expression colLow9/27/2022
v4.6.0### ๐ŸŽˆ๐ŸŽ‰๐Ÿฅณ txtai turns 2 ๐ŸŽˆ๐ŸŽ‰๐Ÿฅณ ### _We're excited to release the 25th version of txtai marking it's 2 year anniversary. Thank you to the txtai community. Please remember to โญ txtai!_ txtai 4.6 is a large but backwards compatible release! This release adds better integration between embeddings and workflows. It also adds a number of significant performance improvements and bug fixes. New Features -------------------------- - Add transform workflow action to application (#281) - AddLow8/15/2022
v4.5.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scripts to train bashsql query translation model (#271) - Add QA database example notebook (#272) - Add CITATION file (#273) Improvements -------------------------- - Improve efficiency of external vectors (#275) - Refactor vectors package to improve code reuse (#276) - Add logic to detect external vectors method (#277) Bug Fixes -------------------------- - Low5/17/2022
v4.4.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add semantic search explainability (#248) - Add notebook covering model explainability (#249) - Add txtai console (#252) - Add sequences pipeline (#261) - Add scripts to train query translation models (#265) - Add query translation logic in embeddings searches (#266) - Add notebook for query translation (#269) Improvements -------------------------- - Update HFTrainLow4/20/2022
v4.3.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fix word embeddings regression with batch transformation (#245)Low3/11/2022
v4.3.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook covering txtai embeddings index file structure (#237) - Add Image Hash pipeline (#240) - Add support for custom SQL functions in embeddings queries (#241) - Add notebook for Embeddings SQL functions (#243) - Add notebook for near-duplicate image detection (#244) Improvements -------------------------- - Rename SQLException to SQLError (#232) - Refactor ALow3/10/2022
v4.2.1This release adds the following new features, improvements and bug fixes. Bug Fixes -------------------------- - Fixed mislabeled API config definition (#231) Low2/28/2022
v4.2.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add notebook for workflow notifications (#225) - Add default and custom docker configurations (#226) - Create docker configuration for AWS Lambda (#228) - Add support for loading/storing embedding indexes on cloud storage (#229) Improvements -------------------------- - Add support for SQL || operator (#223) - Add flag to disable loading index data in API (#230) BuLow2/24/2022
v4.1.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add entity extraction pipeline (#203) - Add workflow scheduling (#206) - Add workflow search task to API (#210) - Add Console Task (#215) - Add Export Task (#216) - Add notebook for workflow scheduling (#218) Improvements -------------------------- - Default documentation theme using system preference (#197) - Improve multi-user experience for workflow application (Low2/3/2022
v4.0.0### ๐ŸŽˆ๐ŸŽ‰๐Ÿฅณ We're excited to announce the release of txtai 4.0! ๐Ÿฅณ๐ŸŽ‰๐ŸŽˆ _Thank you to the growing txtai community. This couldn't be done without you. Please remember to โญ txtai if it has been helpful._ txtai 4.0 is a major release with a significant number of new features. This release adds content storage, querying with sql, object storage, reindexing, index compression, external vectors and more! To quantify the changes, the code base increased by 50% with 36 resolved issues, by far thLow1/11/2022
v3.7.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add object detection pipeline (#148) - Add image caption pipeline (#149) - Add retrieval task (#150) - Add no-op pipeline (#152) - Add new workflow functionality (#155) Improvements -------------------------- - Add korean translation to README.md. Thank you @0206pdh! (#138) - Add links to external articles (#139) - Update example applications to be consistent (#140)Low11/23/2021
v3.6.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add post workflow action to API (#129) - Add tabular pipeline (#134) - Enhance ServiceTask to support additional use cases (#135) - Add notebook for tabular pipeline (#136) - Add topn option to extractor pipeline (#137) Improvements -------------------------- - Refactor registering new auto models to use methods in Transformers library (#128) - Update workflow examplLow11/8/2021
v3.5.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add scikit-learn to ONNX export pipeline (#124) - Add registry methods for auto models (#126) - Add notebook to demonstrate loading scikit-learn and PyTorch models (#127) Improvements -------------------------- - Add parameter to return raw model outputs for labels pipeline (#123) - Add parameter to use standard pooling for TransformersVectors (#125) Bug Fixes ----Low10/18/2021
v3.4.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Create notebook using extractive qa to build structured data (#117) - Modify extractor pipeline to support similarity pipeline backed context (#119) Improvements -------------------------- - Improve performance of extractor context queries (#120) Bug Fixes -------------------------- - Update labels pipeline to filter text classification output (#116) - Fix issues wLow10/7/2021
v3.3.0This release adds the following new features, improvements and bug fixes. New Features -------------------------- - Add ONNX export pipeline (#107) - Add notebook for ONNX pipeline (#108) - Add ONNX support for Embeddings and Pipelines (#109) - Support QA models in Trainer pipeline (#111) - Add notebook for training QA models (#115 ) Improvements -------------------------- - Remove deprecated packages (#114) Bug Fixes -------------------------- - Fix issues with latest TransLow9/10/2021

Dependencies & License Audit

Loading dependencies...

Similar Packages

PageIndex๐Ÿ“‘ PageIndex: Document Index for Vectorless, Reasoning-based RAGmain@2026-06-02
server-nexeLocal AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine โ€” zero data sent to external services.v1.0.5-beta
uniAISyllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate โ€” built to help studen0.0.0
ai-real-estate-assistantAdvanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.v5.0.7
redis-vl-pythonRedis Vector Library (RedisVL) -- the AI-native Python client for Redis.v0.20.0

More in Databases

milvusMilvus is a high-performance, cloud-native vector database built for scalable vector ANN search
WeKnoraLLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
ai-real-estate-assistantAdvanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
alibabacloud-adb20211201Alibaba Cloud adb (20211201) SDK Library for Python