freshcrate
Skin:/
Home > Databases > llama_index

llama_index

LlamaIndex is the leading document agent and OCR platform

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

LlamaIndex is the leading document agent and OCR platform

README

๐Ÿ—‚๏ธ LlamaIndex ๐Ÿฆ™

PyPI - Downloads Build GitHub contributors Discord Twitter RedditAsk AI

LlamaIndex OSS (by LlamaIndex) is an open-source framework to build agentic applications. Parse is our enterprise platform for agentic OCR, parsing, extraction, indexing and more. You can use LlamaParse with this framework or on its own; see LlamaParse below for signup and product links.

๐Ÿ“š Documentation:

Building with LlamaIndex typically involves working with LlamaIndex core and a chosen set of integrations (or plugins). There are two ways to start building with LlamaIndex in Python:

  1. Starter: llama-index. A starter Python package that includes core LlamaIndex as well as a selection of integrations.

  2. Customized: llama-index-core. Install core LlamaIndex and add your chosen LlamaIndex integration packages on LlamaHub that are required for your application. There are over 300 LlamaIndex integration packages that work seamlessly with core, allowing you to build with your preferred LLM, embedding, and vector store providers.

The LlamaIndex Python library is namespaced such that import statements which include core imply that the core package is being used. In contrast, those statements without core imply that an integration package is being used.

# typical pattern
from llama_index.core.xxx import ClassABC  # core submodule xxx
from llama_index.xxx.yyy import (
    SubclassABC,
)  # integration yyy for submodule xxx

# concrete example
from llama_index.core.llms import LLM
from llama_index.llms.openai import OpenAI

LlamaParse (document agent platform)

LlamaParse is its own platformโ€”focused on document agents and agentic OCR. It includes Parse (parsing), LlamaAgents (deployed document agents), Extract (structured extraction), and Index (ingest and RAG). You can use it with the LlamaIndex framework or standalone.

  • Sign up for LlamaParse โ€” Create an account and get your API key.
  • Parse โ€” Agentic OCR and document parsing (130+ formats). Docs
  • Extract โ€” Structured data extraction from documents. Docs
  • Index โ€” Ingest, index, and RAG pipelines. Docs
  • Split โ€” Split large documents into subcategories. Docs
  • Agents โ€” Build end-to-end document agents with Workflows and Agent Builder. Docs

Important Links

Documentation

X (formerly Twitter)

LinkedIn

Reddit

Discord

๐Ÿš€ Overview

NOTE: This README is not updated as frequently as the documentation. Please check out the documentation above for the latest updates!

Context

  • LLMs are a phenomenal piece of technology for knowledge generation and reasoning. They are pre-trained on large amounts of publicly available data.
  • How do we best augment LLMs with our own private data?

We need a comprehensive toolkit to help perform this data augmentation for LLMs.

Proposed Solution

That's where LlamaIndex comes in. LlamaIndex is a "data framework" to help you build LLM apps. It provides the following tools:

  • Offers data connectors to ingest your existing data sources and data formats (APIs, PDFs, docs, SQL, etc.).
  • Provides ways to structure your data (indices, graphs) so that this data can be easily used with LLMs.
  • Provides an advanced retrieval/query interface over your data: Feed in any LLM input prompt, get back retrieved context and knowledge-augmented output.
  • Allows easy integrations with your outer application framework (e.g. with LangChain, Flask, Docker, ChatGPT, or anything else).

LlamaIndex provides tools for both beginner users and advanced users. Our high-level API allows beginner users to use LlamaIndex to ingest and query their data in 5 lines of code. Our lower-level APIs allow advanced users to customize and extend any module (data connectors, indices, retrievers, query engines, reranking modules), to fit their needs.

๐Ÿ’ก Contributing

Interested in contributing? Contributions to LlamaIndex core as well as contributing integrations that build on the core are both accepted and highly encouraged! See our Contribution Guide for more details.

New integrations should meaningfully integrate with existing LlamaIndex framework components. At the discretion of LlamaIndex maintainers, some integrations may be declined.

๐Ÿ“„ Documentation

Full documentation can be found here

Please check it out for the most up-to-date tutorials, how-to guides, references, and other resources!

๐Ÿ’ป Example Usage

# custom selection of integrations to work with core
pip install llama-index-core
pip install llama-index-llms-openai
pip install llama-index-llms-ollama
pip install llama-index-embeddings-huggingface

Examples are in the docs/examples folder. Indices are in the indices folder (see list of indices below).

To build a simple vector store index using OpenAI:

import os

os.environ["OPENAI_API_KEY"] = "YOUR_OPENAI_API_KEY"

from llama_index.core import VectorStoreIndex, SimpleDirectoryReader

documents = SimpleDirectoryReader("YOUR_DATA_DIRECTORY").load_data()
index = VectorStoreIndex.from_documents(documents)

To build a simple vector store index using non-OpenAI LLMs, e.g. LLMs hosted through Ollama:

from llama_index.core import Settings, VectorStoreIndex, SimpleDirectoryReader
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.ollama import Ollama
from transformers import AutoTokenizer

# set the LLM
Settings.llm = Ollama(
    model="llama-3.1:latest",
    request_timeout=360.0,
)

# set tokenizer to match LLM
Settings.tokenizer = AutoTokenizer.from_pretrained(
    "meta-llama/Llama-3.1-8B-Instruct"
)

# set the embed model
Settings.embed_model = HuggingFaceEmbedding(
    model_name="BAAI/bge-small-en-v1.5"
)

documents = SimpleDirectoryReader("YOUR_DATA_DIRECTORY").load_data()
index = VectorStoreIndex.from_documents(
    documents,
)

To query:

query_engine = index.as_query_engine()
query_engine.query("YOUR_QUESTION")

By default, data is stored in-memory. To persist to disk (under ./storage):

index.storage_context.persist()

To reload from disk:

from llama_index.core import StorageContext, load_index_from_storage

# rebuild storage context
storage_context = StorageContext.from_defaults(persist_dir="./storage")
# load index
index = load_index_from_storage(storage_context)

A note on Verification of Build Assets

By default, llama-index-core includes a _static folder that contains the nltk and tiktoken cache that is included with the package installation. This ensures that you can easily run llama-index in environments with restrictive disk access permissions at runtime.

To verify that these files are safe and valid, we use the github attest-build-provenance action. This action will verify that the files in the _static folder are the same as the files in the llama-index-core/llama_index/core/_static folder.

To verify this, you can run the following script (pointing to your installed package):

#!/bin/bash
STATIC_DIR="venv/lib/python3.13/site-packages/llama_index/core/_static"
REPO="run-llama/llama_index"

find "$STATIC_DIR" -type f | while read -r file; do
    echo "Verifying: $file"
    gh attestation verify "$file" -R "$REPO" || echo "Failed to verify: $file"
done

๐Ÿ“– Citation

Reference to cite if you use LlamaIndex in a paper:

@software{Liu_LlamaIndex_2022,
author = {Liu, Jerry},
doi = {10.5281/zenodo.1234},
month = {11},
title = {{LlamaIndex}},
url = {https://github.com/jerryjliu/llama_index},
year = {2022}
}

Release History

VersionChangesUrgencyDate
v0.14.22# Release Notes ## [2026-05-14] ### llama-index-agent-agentmesh [0.2.0] - mass uv lock --upgrade ([#21638](https://github.com/run-llama/llama_index/pull/21638)) ### llama-index-callbacks-agentops [0.5.0] - chore(deps): bump the pip group across 55 directories with 3 updates ([#21435](https://github.com/run-llama/llama_index/pull/21435)) - mass uv lock --upgrade ([#21638](https://github.com/run-llama/llama_index/pull/21638)) ### llama-index-callbacks-aim [0.4.1] - mass uv lock --upgrade ([High5/14/2026
v0.14.21# Release Notes ## [2026-04-21] ### llama-index-callbacks-honeyhive [0.5.0] - chore(deps): bump the pip group across 87 directories with 2 updates ([#21382](https://github.com/run-llama/llama_index/pull/21382)) - chore(deps): bump the pip group across 68 directories with 2 updates ([#21394](https://github.com/run-llama/llama_index/pull/21394)) ### llama-index-core [0.14.21] - fix(core): prevent `KeyError` in `DocumentSummaryIndex.delete_nodes` when invalid node ID is provided ([#21067](httpHigh4/21/2026
v0.14.20# Release Notes ## [2026-04-03] ### llama-index-agent-agentmesh [0.2.0] - fix vulnerability with nltk ([#21275](https://github.com/run-llama/llama_index/pull/21275)) ### llama-index-callbacks-agentops [0.5.0] - chore(deps): bump the uv group across 50 directories with 2 updates ([#21164](https://github.com/run-llama/llama_index/pull/21164)) - chore(deps): bump the uv group across 24 directories with 1 update ([#21219](https://github.com/run-llama/llama_index/pull/21219)) - chore(deps): bumpHigh4/3/2026
v0.14.19# Release Notes ## [2026-03-25] ### llama-index-agent-agentmesh [0.2.0] - chore(deps): bump the uv group across 49 directories with 1 update ([#21083](https://github.com/run-llama/llama_index/pull/21083)) ### llama-index-callbacks-argilla [0.5.0] - chore(deps): bump the uv group across 3 directories with 1 update ([#21069](https://github.com/run-llama/llama_index/pull/21069)) ### llama-index-core [0.14.19] - fix: pass `delete_from_docstore` parameter in `BaseIndex.delete_ref_doc` ([#20990Medium3/25/2026
v0.14.18# Release Notes ## [2026-03-16] ### llama-index-agent-agentmesh [0.2.0] - chore: deprecate python 3.9 once and for all ([#20956](https://github.com/run-llama/llama_index/pull/20956)) ### llama-index-agent-azure [0.3.0] - chore: deprecate python 3.9 once and for all ([#20956](https://github.com/run-llama/llama_index/pull/20956)) ### llama-index-callbacks-agentops [0.5.0] - chore: deprecate python 3.9 once and for all ([#20956](https://github.com/run-llama/llama_index/pull/20956)) ### llamLow3/16/2026
v0.14.16# Release Notes ## [2026-03-10] ### llama-index-core [0.14.16] - Add token-bucket rate limiter for LLM and embedding API calls ([#20712](https://github.com/run-llama/llama_index/pull/20712)) - Fix/20706 chonkie init doc ([#20713](https://github.com/run-llama/llama_index/pull/20713)) - fix: pass tool_choice through FunctionCallingProgram ([#20740](https://github.com/run-llama/llama_index/pull/20740)) - feat: Multimodal LLMReranker ([#20743](https://github.com/run-llama/llama_index/pull/20743))Low3/10/2026
v0.14.15# Release Notes ## [2026-02-18] ### llama-index-agent-agentmesh [0.1.0] - [Integration] AgentMesh: Trust Layer for LlamaIndex Agents ([#20644](https://github.com/run-llama/llama_index/pull/20644)) ### llama-index-core [0.14.15] - Support basic operations for multimodal types ([#20640](https://github.com/run-llama/llama_index/pull/20640)) - Feat recursive llm type support ([#20642](https://github.com/run-llama/llama_index/pull/20642)) - fix: remove redundant metadata_seperator field from TexLow2/18/2026
v0.14.14# Release Notes ## [2026-02-10] ### llama-index-callbacks-wandb [0.4.2] - Fix potential crashes and improve security defaults in core components ([#20610](https://github.com/run-llama/llama_index/pull/20610)) ### llama-index-core [0.14.14] - fix: catch pydantic ValidationError in VectorStoreQueryOutputParser ([#20450](https://github.com/run-llama/llama_index/pull/20450)) - fix: distinguish empty string from None in MediaResource.hash ([#20451](https://github.com/run-llama/llama_index/pull/2Low2/10/2026
v0.14.13# Release Notes ## [2026-01-21] ### llama-index-core [0.14.13] - feat: add early_stopping_method parameter to agent workflows ([#20389](https://github.com/run-llama/llama_index/pull/20389)) - feat: Add token-based code splitting support to CodeSplitter ([#20438](https://github.com/run-llama/llama_index/pull/20438)) - Add RayIngestionPipeline integration for distributed data ingestion ([#20443](https://github.com/run-llama/llama_index/pull/20443)) - Added the multi-modal version of the CondensLow1/21/2026
v0.14.12# Release Notes ## [2025-12-30] ### llama-index-callbacks-agentops [0.4.1] - Feat/async tool spec support ([#20338](https://github.com/run-llama/llama_index/pull/20338)) ### llama-index-core [0.14.12] - Feat/async tool spec support ([#20338](https://github.com/run-llama/llama_index/pull/20338)) - Improve `MockFunctionCallingLLM` ([#20356](https://github.com/run-llama/llama_index/pull/20356)) - fix(openai): sanitize generic Pydantic model schema names ([#20371](https://github.com/run-llama/lLow12/30/2025
v0.14.10# Release Notes ## [2025-12-04] ### llama-index-core [0.14.10] - feat: add mock function calling llm ([#20331](https://github.com/run-llama/llama_index/pull/20331)) ### llama-index-llms-qianfan [0.4.1] - test: fix typo 'reponse' to 'response' in variable names ([#20329](https://github.com/run-llama/llama_index/pull/20329)) ### llama-index-tools-airweave [0.1.0] - feat: add Airweave tool integration with advanced search features ([#20111](https://github.com/run-llama/llama_index/pull/20111Low12/4/2025
v0.14.9# Release Notes ## [2025-12-02] ### llama-index-agent-azure [0.2.1] - fix: Pin azure-ai-projects version to prevent breaking changes ([#20255](https://github.com/run-llama/llama_index/pull/20255)) ### llama-index-core [0.14.9] - MultiModalVectorStoreIndex now returns a multi-modal ContextChatEngine. ([#20265](https://github.com/run-llama/llama_index/pull/20265)) - Ingestion to vector store now ensures that \_node-content is readable ([#20266](https://github.com/run-llama/llama_index/pull/20Low12/2/2025
v0.14.8# Release Notes ## [2025-11-10] ### llama-index-core [0.14.8] - Fix ReActOutputParser getting stuck when "Answer:" contains "Action:" ([#20098](https://github.com/run-llama/llama_index/pull/20098)) - Add buffer to image, audio, video and document blocks ([#20153](https://github.com/run-llama/llama_index/pull/20153)) - fix(agent): Handle multi-block ChatMessage in ReActAgent ([#20196](https://github.com/run-llama/llama_index/pull/20196)) - Fix/20209 ([#20214](https://github.com/run-llama/llamaLow11/10/2025
v0.14.7# Release Notes ## [2025-10-30] ### llama-index-core [0.14.7] - Feat/serpex tool integration ([#20141](https://github.com/run-llama/llama_index/pull/20141)) - Fix outdated error message about setting LLM ([#20157](https://github.com/run-llama/llama_index/pull/20157)) - Fixing some recently failing tests ([#20165](https://github.com/run-llama/llama_index/pull/20165)) - Fix: update lock to latest workflow and fix issues ([#20173](https://github.com/run-llama/llama_index/pull/20173)) - fix: ensuLow10/30/2025
v0.14.6# Release Notes ## [2025-10-26] ### llama-index-core [0.14.6] - Add allow_parallel_tool_calls for non-streaming ([#20117](https://github.com/run-llama/llama_index/pull/20117)) - Fix invalid use of field-specific metadata ([#20122](https://github.com/run-llama/llama_index/pull/20122)) - update doc for SemanticSplitterNodeParser ([#20125](https://github.com/run-llama/llama_index/pull/20125)) - fix rare cases when sentence splits are larger than chunk size ([#20147](https://github.com/run-llama/lLow10/26/2025
v0.14.5# Release Notes ## [2025-10-15] ### llama-index-core [0.14.5] - Remove debug print ([#20000](https://github.com/run-llama/llama_index/pull/20000)) - safely initialize RefDocInfo in Docstore ([#20031](https://github.com/run-llama/llama_index/pull/20031)) - Add progress bar for multiprocess loading ([#20048](https://github.com/run-llama/llama_index/pull/20048)) - Fix duplicate node positions when identical text appears multiple times in document ([#20050](https://github.com/run-llama/llama_indeLow10/15/2025
v0.14.4# Release Notes ## [2025-09-24] ### llama-index-core [0.14.4] - fix pre-release installs ([#20010](https://github.com/run-llama/llama_index/pull/20010)) ### llama-index-embeddings-anyscale [0.4.2] - fix llm deps for openai ([#19944](https://github.com/run-llama/llama_index/pull/19944)) ### llama-index-embeddings-baseten [0.1.2] - fix llm deps for openai ([#19944](https://github.com/run-llama/llama_index/pull/19944)) ### llama-index-embeddings-fireworks [0.4.2] - fix llLow10/3/2025
v0.14.3# Release Notes ## [2025-09-24] ### llama-index-core [0.14.3] - Fix Gemini thought signature serialization ([#19891](https://github.com/run-llama/llama_index/pull/19891)) - Adding a ThinkingBlock among content blocks ([#19919](https://github.com/run-llama/llama_index/pull/19919)) ### llama-index-llms-anthropic [0.9.0] - Adding a ThinkingBlock among content blocks ([#19919](https://github.com/run-llama/llama_index/pull/19919)) ### llama-index-llms-baseten [0.1.4] - added kimik2 0905 and reLow9/24/2025
v0.14.2# Release Notes Low9/16/2025
v0.14.1.post1# Release Notes Low9/15/2025
v0.14.1# Release Notes Low9/15/2025
v0.14.0# Release Notes ## [2025-09-08] **NOTE:** All packages have been bumped to handle the latest llama-index-core version. ### `llama-index-core` [0.14.0] - breaking: bumped `llama-index-workflows` dependency to 2.0 - Improve stacktraces clarity by avoiding wrapping errors in WorkflowRuntimeError - Remove deprecated checkpointer feature - Remove deprecated sub-workflows feature - Remove deprecated `send_event` method from Workflow class (still existing on the Context class) - Remove deLow9/8/2025
v0.13.6# Release Notes Low9/7/2025
v0.13.5# Release Notes ## [2025-09-04] ### `llama-index-core` [0.13.5] - feat: add thinking delta field to AgentStream events to expose from LLM responses (#19785) - fix: fix path handling in SimpleDirectoryReader and PDFReader path fix (#19794) ### `llama-index-llms-bedrock-converse` [0.9.0] - feat: add system prompt and tool caching config kwargs to BedrockConverse (#19737) ### `llama-index-llms-litellm` [0.6.2] - fix: Handle missing tool call IDs with UUID fallback (#19789) - fix: Fix criticaLow9/4/2025
v0.13.4# Release Notes ## [2025-09-01] ### `llama-index-core` [0.13.4] - feat: Add PostgreSQL schema support to Memory and SQLAlchemyChatStore (#19741) - feat: add missing sync wrapper of put_messages in memory (#19746) - feat: add option for an initial tool choice in FunctionAgent (#19738) - fix: Calling ContextChatEngine with a QueryBundle (instead of a string) (#19714) ### `llama-index-embeddings-baseten` [0.1.0] - feat: baseten integration (#19710) ### `llama-index-embeddings-iLow9/2/2025
v0.13.3.post1# Release Notes Low8/29/2025
v0.13.3# Release Notes ## [2025-08-22] ### `llama-index-core` [0.13.3] - fix: add timeouts on image `.get()` requests (#19723) - fix: fix StreamingAgentChatResponse losses message bug (#19674) - fix: Fixing crashing when retrieving from empty vector store index (#19706) - fix: Calling ContextChatEngine with a QueryBundle (instead of a string) (#19714) - fix: Fix faithfulness evaluate crash when no images provided (#19686) ### `llama-index-embeddings-heroku` [0.1.0] - feat: Adds support for HerokuELow8/22/2025
v0.13.2.post1# Release Notes - docs fixes Low8/14/2025
v0.13.2# Release Notes ## [2025-08-14] ### `llama-index-core` [0.13.2] - feat: allow streaming to be disabled in agents (#19668) - fix: respect the value of NLTK_DATA env var if present (#19664) - fix: Order preservation and fetching in batch non-cached embeddings in `a/get_text_embedding_batch()` (#19536) ### `llama-index-embeddings-ollama` [0.8.1] - fix: Access embedding output (#19635) - fix: use normalized embeddings (#19622) ### `llama-index-graph-rag-cognee` [0.3.0] - fix: Update and fix cLow8/14/2025
v0.13.1# Release Notes ## [2025-08-08] ### `llama-index-core` [0.13.1] - fix: safer token counting in messages (#19599) - fix: Fix Document truncation in `FunctionTool._parse_tool_output` (#19585) - feat: Enabled partially formatted system prompt for ReAct agent (#19598) ### `llama-index-embeddings-ollama` [0.8.0] - fix: use /embed instead of /embeddings for ollama (#19622) ### `llama-index-embeddings-voyageai` [0.4.1] - feat: Add support for voyage context embeddings (#19590) ### `llama-index-Low8/8/2025
v0.13.0.post3# Release Notes Low8/8/2025
v0.13.0.post2# Release Notes Low8/5/2025
v0.13.0.post1# Release Notes Low7/31/2025
v0.13.0# Release Notes **NOTE:** All packages have been bumped to handle the latest llama-index-core version. ### `llama-index-core` [0.13.0] - breaking: removed deprecated agent classes, including `FunctionCallingAgent`, the older `ReActAgent` implementation, `AgentRunner`, all step workers, `StructuredAgentPlanner`, `OpenAIAgent`, and more. All users should migrate to the new workflow based agents: `FunctionAgent`, `CodeActAgent`, `ReActAgent`, and `AgentWorkflow` (#19529) - breaking: removLow7/31/2025
v0.12.52.post1# Release Notes Low7/28/2025
v0.12.52# Release Notes ## [2025-07-22] ### `llama-index-core` [0.12.52.post1] - fix: do not write system prompt to memory in agents (#19512) ### `llama-index-core` [0.12.52] - fix: Fix missing prompt in async MultiModalLLMProgram calls (#19504) - fix: Properly raise errors from docstore, fixes Vector Index Retrieval for `stores_text=True/False` (#19501) ### `llama-index-indices-managed-bge-m3` [0.5.0] - feat: optimize memory usage for BGEM3Index persistence (#19496) ### `llama-Low7/23/2025
v0.12.51# Release Notes ## [2025-07-21] ### `llama-index-core` [0.12.51] - feat: Enhance FunctionTool with auto type conversion for basic Python types like date when using pydantic fields in functions (#19479) - fix: Fix retriever KeyError when using FAISS and other vector stores that do no store text (#19476) - fix: add system prompt to memory and use it also for structured generation (#19490) ### `llama-index-readers-azstorage-blob` [0.3.2] - fix: Fix metadata serialization issue in ALow7/22/2025
v0.12.50# Release Notes ## [2025-07-19] ### `llama-index-core` [0.12.50] - feat: support html table extraction in MarkdownElementNodeParser (#19449) - fix/slightly breaking: make `get_cache_dir()` function more secure by changing default location (#19415) - fix: resolve race condition in SQLAlchemyChatStore with precise timestamps (#19432) - fix: update document store import to use BaseDocumentStore in DocumentContextExtractor (#19466) - fix: improve empty retrieval check in vector index retriever (#Low7/19/2025
v0.12.49# Release Notes ## [2025-07-14] ### `llama-index-core` [0.12.49] - fix: skip tests on CI (#19416) - fix: fix structured output (#19414) - Fix: prevent duplicate triplets in SimpleGraphStore.upsert_triplet (#19404) - Add retry capability to workflow agents (#19393) - chore: modifying raptors dependencies with stricter rules to avoid test failures (#19394) - feat: adding a first implementation of structured output in agents (#19337) - Add tests for and fix issues with Vector Store nLow7/14/2025
v0.12.48# Release Notes ## [2025-07-09] ### `llama-index-core` [0.12.48] - fix: convert dict chat_history to ChatMessage objects in AgentWorkflowStartEvent (#19371) - fix: Replace ctx.get/set with ctx.store.get/set in Context (#19350) - Bump the pip group across 6 directories with 1 update (#19357) - Make fewer trips to KV store during Document Hash Checks (#19362) - Don't store Copy of document in metadata and properly return Nodes (#19343) - Bump llama-index-core from 0.12.8 to 0.12.41 iLow7/9/2025
v0.12.47# Release Notes ### `llama-index-core` [0.12.47] - feat: add default `max_iterations` arg to `.run()` of 20 for agents (#19035) - feat: set `tool_required` to `True` for `FunctionCallingProgram` and structured LLMs where supported (#19326) - fix: fix missing raw in agent workflow events (#19325) - fix: fixed parsing of empty list in parsing json output (#19318) - chore: Deprecate Multi Modal LLMs (#19115) - All existing multi-modal llms are now extensions of their base `LLM` counterLow7/7/2025
v0.12.46.post1# Release Notes Low7/3/2025
v0.12.46# Release Notes ## [2025-07-02] ### `llama-index-core` [0.12.46] - feat: Add async delete and insert to vector store index (#19281) - fix: Fixing ChatMessage to str handling of empty inputs (#19302) - fix: fix function tool context detection with typed context (#19309) - fix: inconsistent ref node handling (#19286) - chore: simplify citation block schema (#19308) ### `llama-index-embeddings-google-genai` [0.2.1] - chore: bump min google-genai version (#19304) ### `llama-indLow7/3/2025
v0.12.45# Release Notes ## [2025-06-30] ### `llama-index-core` [0.12.45] - feat: allow tools to output content blocks (#19265) - feat: Add chat UI events and models to core package (#19242) - fix: Support loading `Node` from ingestion cache (#19279) - fix: Fix SemanticDoubleMergingSplitterNodeParser not respecting max_chunk_size (#19235) - fix: replace `get_doc_id()` with `id_` in base index (#19266) - chore: remove usage and references to deprecated Context get/set API (#19275) - chore: Low7/1/2025
v0.12.44.post1# Release Notes Low6/30/2025
v0.12.44# Release Notes ### `llama-index-core` [0.12.44] - feat: Adding a `CachePoint` content block for caching chat messages (#19193) - fix: fix react system header formatting in workflow agent (#19158) - fix: fix ReActOutputParser when no "Thought:" prefix is produced by the LLM (#19190) - fix: Fixed string striping in react output parser (#19192) - fix: properly handle system prompt for CodeAct agent (#19191) - fix: Exclude raw field in AgentStream event to fix potential serialization issLow6/26/2025
v0.12.43# Release Notes ### `llama-index-core` [0.12.43] - feat: Make BaseWorkflowAgent a workflow itself (#19052) - fix: make the progress bar of title extractor unified (#19131) - fix: Use `get_tqdm_iterable` in SimpleDirectoryReader (#18722) - chore: move out Workflows code to `llama-index-workflows` and keeping backward compatibility (#19043) - chore: move instrumentation code out to its own package `llama-index-instrumentation` (#19062) ### `llama-index-llms-bedrock-converse` [0.7.2] Low6/19/2025
v0.12.42# Release Notes ### `llama-index-core` [0.12.42] - fix: pass input message to memory get (#19054) - fix: use async memory operations within async functions (#19032) - fix: Using uuid instead of hashing for broader compatibility in SQLTableNodeMapping (#19011) ### `llama-index-embeddings-bedrock` [0.5.1] - feat: Update aioboto3 dependency (#19015) ### `llama-index-indices-managed-llama-cloud` [0.7.7] - feat: figure retrieval SDK integration (#19017) - fix: Return empty list wLow6/12/2025
v0.12.41# Release Notes ### `llama-index-core` [0.12.41] - feat: Add MutableMappingKVStore for easier caching (#18893) - fix: async functions in tool specs (#19000) - fix: properly apply file limit to SimpleDirectoryReader (#18983) - fix: overwriting of LLM callback manager from Settings (#18951) - fix: Adding warning in the docstring of JsonPickleSerializer for the user to deserialize only safe things, rename to PickleSerializer (#18943) - fix: ImageDocument path and url checking to ensure tLow6/7/2025
v0.12.40# Release Notes ### `llama-index-core` [0.12.40] - feat: Add StopEvent step validation so only one workflow step can handle StopEvent (#18932) - fix: Add compatibility check before providing `tool_required` to LLM args (#18922) ### `llama-index-embeddings-cohere` [0.5.1] - fix: add batch size validation with 96 limit for Cohere API (#18915) ### `llama-index-llms-anthropic` [0.7.2] - feat: Support passing static AWS credentials to Anthropic Bedrock (#18935) - fix: Handle untesLow6/3/2025
v0.12.39# Release Notes ## [2025-05-30] ### `llama-index-core` [0.12.39] - feat: Adding Resource to perform dependency injection in Workflows (docs coming soon!) (#18884) - feat: Add `tool_required` param to function calling LLMs (#18654) - fix: make prefix and response non-required for hitl events (#18896) - fix: SelectionOutputParser when LLM chooses no choices (#18886) ### `llama-index-indices-managed-llama-cloud` [0.7.2] - feat: add non persisted composite retrieval (#18908) ###Low5/30/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

cognitaRAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry 0.0.0
txtai๐Ÿ’ก All-in-one AI framework for semantic search, LLM orchestration and language model workflowsv9.10.0
PageIndex๐Ÿ“‘ PageIndex: Document Index for Vectorless, Reasoning-based RAGmain@2026-06-02
mem0Universal memory layer for AI Agentscli-node-v0.2.8
Awesome-RAG-Production๐Ÿš€ Build and scale reliable Retrieval-Augmented Generation (RAG) systems with this curated collection of tools, frameworks, and best practices.main@2026-06-07

More in Databases

orbitOne API for 20+ LLM providers, your databases, and your files โ€” self-hosted, open-source AI gateway with RAG, voice, and guardrails.
alibabacloud-adb20211201Alibaba Cloud adb (20211201) SDK Library for Python
milvusMilvus is a high-performance, cloud-native vector database built for scalable vector ANN search
qdrantQdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/