freshcrate
Skin:/
Home > RAG & Memory > spiceai

spiceai

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

README

spice oss logo

๐Ÿ“„ Docs | โšก๏ธ Quickstart | ๐Ÿง‘โ€๐Ÿณ Cookbook

Spice is a SQL query, search, and LLM-inference engine, written in Rust, for data apps and agents.

Spice.ai Open Source accelerated data query and LLM-inference engine

Spice provides four industry standard APIs in a lightweight, portable runtime (single binary/container):

  1. SQL Query & Search: HTTP, Arrow Flight, Arrow Flight SQL, ODBC, JDBC, and ADBC APIs; vector_search and text_search UDTFs.
  2. OpenAI-Compatible APIs: HTTP APIs for OpenAI SDK compatibility, local model serving (CUDA/Metal accelerated), and hosted model gateway.
  3. Iceberg Catalog REST APIs: A unified Iceberg REST Catalog API.
  4. MCP HTTP+SSE APIs: Integration with external tools via Model Context Protocol (MCP) using HTTP and Server-Sent Events (SSE).

๐ŸŽฏ Goal: Developers can focus on building data apps and AI agents confidently, knowing they are grounded in data.

Spice's primary features include:

  • Data Federation: SQL query across any database, data warehouse, or data lake. Scale from single-node to distributed multi-node query execution. Learn More.
  • Data Materialization and Acceleration: Materialize, accelerate, and cache database queries with Arrow, DuckDB, SQLite, PostgreSQL, or Spice Cayenne (Vortex). Read the MaterializedView interview - Building a CDN for Databases
  • Enterprise Search: Keyword, vector, and full-text search with Tantivy-powered BM25 and petabyte-scale vector similarity search via Amazon S3 Vectors or pgvector for structured and unstructured data.
  • AI apps and agents: An AI-database powering retrieval-augmented generation (RAG) and intelligent agents with OpenAI-compatible APIs and MCP integration. Learn More.

If you want to build with DataFusion, DuckDB, or Vortex, Spice provides a simple, flexible, and production-ready engine you can just use.

๐Ÿ“ฃ Read the Spice.ai 1.0-stable announcement.

Spice is built-on industry leading technologies including Apache DataFusion, Apache Arrow, Arrow Flight, SQLite, and DuckDB.

How Spice works.

๐ŸŽฅ Watch the CMU Databases Accelerating Data and AI with Spice.ai Open-Source

๐ŸŽฅ Watch How to Query Data using Spice, OpenAI, and MCP

๐ŸŽฅ Watch How to search with Amazon S3 Vectors

Why Spice?

Spice.ai

Spice simplifies building data-driven AI applications and agents by making it fast and easy to query, federate, and accelerate data from one or more sources using SQL, while grounding AI in real-time, reliable data. Co-locate datasets with apps and AI models to power AI feedback loops, enable RAG and search, and deliver fast, low-latency data-query and AI-inference with full control over cost and performance.

Latest Capabilities

  • Spice Cayenne Data Accelerator: Simplified multi-file acceleration using the Vortex columnar format + SQLite metadata. Delivers DuckDB-comparable performance without single-file scaling limitations.
  • Multi-Node Distributed Query: Scale query execution across multiple nodes with Apache Ballista integration for improved performance on large datasets.
  • Acceleration Snapshots: Bootstrap accelerations from S3 for fast cold starts (seconds vs. minutes). Supports ephemeral storage with persistent recovery.
  • Iceberg Table Writes: Write to Iceberg tables using standard SQL INSERT INTO for data ingestion and transformationโ€”no Spark required.
  • Petabyte-Scale Vector Search: Native Amazon S3 Vectors integration manages the full vector lifecycle from ingestion to embedding to querying. SQL-integrated hybrid search with RRF.

How is Spice different?

  1. AI-Native Runtime: Spice combines data query and AI inference in a single engine, for data-grounded AI and accurate AI.

  2. Application-Focused: Designed to run distributed at the application and agent level, often as a 1:1 or 1:N mapping between app and Spice instance, unlike traditional data systems built for many apps on one centralized database. Itโ€™s common to spin up multiple Spice instancesโ€”even one per tenant or customer.

  3. Dual-Engine Acceleration: Supports both OLAP (Arrow/DuckDB) and OLTP (SQLite/PostgreSQL) engines at the dataset level, providing flexible performance across analytical and transactional workloads.

  4. Disaggregated Storage: Separation of compute from disaggregated storage, co-locating local, materialized working sets of data with applications, dashboards, or ML pipelines while accessing source data in its original storage.

  5. Edge to Cloud Native: Deploy as a standalone instance, Kubernetes sidecar, microservice, or clusterโ€”across edge/POP, on-prem, and public clouds. Chain multiple Spice instances for tier-optimized, distributed deployments.

How does Spice compare?

Data Query and Analytics

Feature Spice Trino / Presto Dremio ClickHouse Materialize
Primary Use-Case Data & AI apps/agents Big data analytics Interactive analytics Real-time analytics Real-time analytics
Primary deployment model Sidecar Cluster Cluster Cluster Cluster
Federated Query Support โœ… โœ… โœ… โ€• โ€•
Acceleration/Materialization โœ… (Arrow, SQLite, DuckDB, PostgreSQL) Intermediate storage Reflections (Iceberg) Materialized views โœ… (Real-time views)
Catalog Support โœ… (Iceberg, Unity Catalog, AWS Glue) โœ… โœ… โ€• โ€•
Query Result Caching โœ… โœ… โœ… โœ… Limited
Multi-Modal Acceleration โœ… (OLAP + OLTP) โ€• โ€• โ€• โ€•
Change Data Capture (CDC) โœ… (Debezium) โ€• โ€• โ€• โœ… (Debezium)

AI Apps and Agents

Feature Spice LangChain LlamaIndex AgentOps.ai Ollama
Primary Use-Case Data & AI apps Agentic workflows RAG apps Agent operations LLM apps
Programming Language Any language (HTTP interface) JavaScript, Python Python Python Any language (HTTP interface)
Unified Data + AI Runtime โœ… โ€• โ€• โ€• โ€•
Federated Data Query โœ… โ€• โ€• โ€• โ€•
Accelerated Data Access โœ… โ€• โ€• โ€• โ€•
Tools/Functions โœ… (MCP HTTP+SSE) โœ… โœ… Limited Limited
LLM Memory โœ… โœ… โ€• โœ… โ€•
Hybrid Search โœ… (Keyword, Vector, & Full-Text-Search) โœ… โœ… Limited Limited
Caching โœ… (Query and results caching) Limited โ€• โ€• โ€•
Embeddings โœ… (Built-in & pluggable models/DBs) โœ… โœ… Limited โ€•

โœ… = Fully supported โŒ = Not supported Limited = Partial or restricted support

Example Use-Cases

Data-grounded Agentic AI Applications

  • OpenAI-compatible API: Connect to hosted models (OpenAI, Anthropic, xAI, Amazon Bedrock) or deploy locally (Llama, NVIDIA NIM) with OpenAI Responses API support for advanced interactions. AI Gateway Recipe
  • Federated Data Access: Query using SQL and NSQL (text-to-SQL) across databases, data warehouses, and data lakes with advanced query push-down for fast retrieval. Scale to distributed multi-node query execution with Apache Ballista. Federated SQL Query Recipe
  • Search and RAG: Search and retrieve context with accelerated embeddings for retrieval-augmented generation (RAG) workflows. Native Amazon S3 Vectors integration for petabyte-scale vector search. Full-text search (FTS) via Tantivy-powered BM25 and vector similarity search (VSS) integrated into SQL via text_search and vector_search UDTFs. Reciprocal rank fusion (RRF) for hybrid search. Amazon S3 Vectors Cookbook Recipe
  • LLM Memory and Observability: Store and retrieve history and context for AI agents while gaining deep visibility into data flows, model performance, and traces. LLM Memory Recipe | Observability & Monitoring Features Documentation

Database CDN and Query Mesh

  • Data Acceleration: Co-locate materialized datasets in Arrow, SQLite, DuckDB, PostgreSQL, or Cayenne (Vortex+SQLite) with applications for sub-second query. Bootstrap from snapshots stored in S3 for fast cold starts. Write to Iceberg tables with standard SQL INSERT INTO. DuckDB Data Accelerator Recipe
  • Resiliency and Local Dataset Replication: Maintain application availability with local replicas of critical datasets. Recover from federated source outages using acceleration snapshots. Local Dataset Replication Recipe
  • Responsive Dashboards: Enable fast, real-time analytics by accelerating data for frontends and BI tools with configurable refresh schedules. Sales BI Dashboard Demo
  • Simplified Legacy Migration: Use a single endpoint to unify legacy systems with modern infrastructure, including federated SQL querying across multiple sources. Federated SQL Query Recipe

Retrieval-Augmented Generation (RAG)

  • Unified Search with Vector Similarity: Perform efficient vector similarity search across structured and unstructured data sources with native Amazon S3 Vectors integration for petabyte-scale vector storage and querying. The Spice runtime manages the vector lifecycle: ingesting data, embedding it using AWS Bedrock (Amazon Titan, Cohere), HuggingFace models, or Model2Vec (500x faster static embeddings), and storing in S3 Vector buckets or pgvector. Supports cosine similarity, Euclidean distance, or dot product. SQL-integrated search via vector_search and text_search UDTFs with hybrid search using reciprocal rank fusion (RRF). Example: SELECT * FROM vector_search(my_table, 'search query', 10) WHERE condition ORDER BY _score;. Amazon S3 Vectors Cookbook Recipe
  • Semantic Knowledge Layer: Define a semantic context model to enrich data for AI. Semantic Model Feature Documentation
  • Text-to-SQL: Convert natural language queries into SQL using built-in NSQL and sampling tools for accurate query. Text-to-SQL Recipe

FAQ

  • Is Spice a cache? No specifically; you can think of Spice data acceleration as an active cache, materialization, or data prefetcher. A cache would fetch data on a cache-miss while Spice prefetches and materializes filtered data on an interval, trigger, or as data changes using CDC. In addition to acceleration Spice supports results caching.

  • Is Spice a CDN for databases? Yes, a common use-case for Spice is as a CDN for different data sources. Using CDN concepts, Spice enables you to ship (load) a working set of your database (or data lake, or data warehouse) where it's most frequently accessed, like from a data-intensive application or for AI context.

โžก๏ธ Docs FAQ

Watch a 30-sec BI dashboard acceleration demo

BI.dashboard.acceleration.with.Spice.mp4

See more demos on YouTube.

Supported Data Connectors

Name Description Status Protocol/Format
databricks (mode: delta_lake) Databricks Stable S3/Delta Lake
delta_lake Delta Lake Stable Delta Lake
dremio Dremio Stable Arrow Flight
duckdb DuckDB Stable Embedded
file File Stable Parquet, CSV
github GitHub Stable GitHub API
postgres PostgreSQL Stable
s3 S3 Stable Parquet, CSV
mysql MySQL Stable
spice.ai Spice.ai Stable Arrow Flight
graphql GraphQL Release Candidate JSON
dynamodb Amazon DynamoDB Release Candidate
databricks (mode: spark_connect) Databricks Beta Spark Connect
flightsql FlightSQL Beta Arrow Flight SQL
iceberg Apache Iceberg Beta Parquet
mssql Microsoft SQL Server Beta Tabular Data Stream (TDS)
odbc ODBC Beta ODBC
snowflake Snowflake Beta Arrow
spark Spark Beta Spark Connect
oracle Oracle Alpha Oracle ODPI-C
abfs Azure BlobFS Alpha Parquet, CSV
clickhouse Clickhouse Alpha
debezium Debezium CDC Alpha Kafka + JSON
gcs, gs Google Cloud Storage Alpha Parquet, CSV, JSON
kafka Kafka Alpha Kafka + JSON
ftp, sftp FTP/SFTP Alpha Parquet, CSV
glue AWS Glue Alpha Iceberg, Parquet, CSV
http, https HTTP(s) Alpha Parquet, CSV, JSON
imap IMAP Alpha IMAP Emails
localpod Local dataset replication Alpha
mongodb MongoDB Alpha
sharepoint Microsoft SharePoint Alpha Unstructured UTF-8 documents
scylladb ScyllaDB Alpha
smb SMB (Server Message Block) Alpha SMB
elasticsearch ElasticSearch Roadmap

Supported Data Accelerators

Name Description Status Engine Modes
cayenne Spice Cayenne (Vortex) Release Candidate file
arrow In-Memory Arrow Records Stable memory
duckdb Embedded DuckDB Stable memory, file
postgres Attached PostgreSQL Release Candidate N/A
sqlite Embedded SQLite Release Candidate memory, file

Supported Model Providers

Name Description Status ML Format(s) LLM Format(s)
openai OpenAI (or compatible) LLM endpoint Release Candidate - OpenAI-compatible HTTP endpoint
file Local filesystem Release Candidate ONNX GGUF, GGML, SafeTensor
huggingface Models hosted on HuggingFace Release Candidate ONNX GGUF, GGML, SafeTensor
spice.ai Models hosted on the Spice.ai Cloud Platform ONNX OpenAI-compatible HTTP endpoint
azure Azure OpenAI - OpenAI-compatible HTTP endpoint
bedrock Amazon Bedrock (Nova models) Alpha - OpenAI-compatible HTTP endpoint
anthropic Models hosted on Anthropic Alpha - OpenAI-compatible HTTP endpoint
xai Models hosted on xAI Alpha - OpenAI-compatible HTTP endpoint

Supported Embeddings Providers

Name Description Status ML Format(s) LLM Format(s)*
openai OpenAI (or compatible) LLM endpoint Release Candidate - OpenAI-compatible HTTP endpoint
file Local filesystem Release Candidate ONNX GGUF, GGML, SafeTensor
huggingface Models hosted on HuggingFace Release Candidate ONNX GGUF, GGML, SafeTensor
model2vec Static embeddings (500x faster) Release Candidate Model2Vec -
azure Azure OpenAI Alpha - OpenAI-compatible HTTP endpoint
bedrock AWS Bedrock (e.g., Titan, Cohere) Alpha - OpenAI-compatible HTTP endpoint

Supported Vector Stores

Name Description Status
s3_vectors Amazon S3 Vectors for petabyte-scale vector storage and querying Alpha
pgvector PostgreSQL with pgvector extension Alpha
duckdb_vector DuckDB with vector extension for efficient vector storage and search Alpha
sqlite_vec SQLite with sqlite-vec extension for lightweight vector operations Alpha

Supported Catalogs

Catalog Connectors connect to external catalog providers and make their tables available for federated SQL query in Spice. Configuring accelerations for tables in external catalogs is not supported. The schema hierarchy of the external catalog is preserved in Spice.

Name Description Status Protocol/Format
spice.ai Spice.ai Cloud Platform Stable Arrow Flight
unity_catalog Unity Catalog Stable Delta Lake
databricks Databricks Beta Spark Connect, S3/Delta Lake
iceberg Apache Iceberg Beta Parquet
glue AWS Glue Alpha CSV, Parquet, Iceberg

โšก๏ธ Quickstart (Local Machine)

quickstart.mp4

Installation

Install the Spice CLI:

On macOS, Linux, and WSL:

curl https://install.spiceai.org | /bin/bash

Or using brew:

brew install spiceai/spiceai/spice

On Windows using PowerShell:

iex ((New-Object System.Net.WebClient).DownloadString("https://install.spiceai.org/Install.ps1"))

Usage

Step 1. Initialize a new Spice app with the spice init command:

spice init spice_qs

A spicepod.yaml file is created in the spice_qs directory. Change to that directory:

cd spice_qs

Step 2. Start the Spice runtime:

spice run

Example output will be shown as follows:

2025/01/20 11:26:10 INFO Spice.ai runtime starting...
2025-01-20T19:26:10.679068Z  INFO runtime::init::dataset: No datasets were configured. If this is unexpected, check the Spicepod configuration.
2025-01-20T19:26:10.679716Z  INFO runtime::flight: Spice Runtime Flight listening on 127.0.0.1:50051
2025-01-20T19:26:10.679786Z  INFO runtime::metrics_server: Spice Runtime Metrics listening on 127.0.0.1:9090
2025-01-20T19:26:10.680140Z  INFO runtime::http: Spice Runtime HTTP listening on 127.0.0.1:8090
2025-01-20T19:26:10.879126Z  INFO runtime::init::results_cache: Initialized sql results cache; max size: 128.00 MiB, item ttl: 1s

The runtime is now started and ready for queries.

Step 3. In a new terminal window, add the spiceai/quickstart Spicepod. A Spicepod is a package of configuration defining datasets and ML models.

spice add spiceai/quickstart

The spicepod.yaml file will be updated with the spiceai/quickstart dependency.

version: v1
kind: Spicepod
name: spice_qs
dependencies:
  - spiceai/quickstart

The spiceai/quickstart Spicepod will add a taxi_trips data table to the runtime which is now available to query by SQL.

2025-01-20T19:26:30.011633Z  INFO runtime::init::dataset: Dataset taxi_trips registered (s3://spiceai-demo-datasets/taxi_trips/2024/), acceleration (arrow), results cache enabled.
2025-01-20T19:26:30.013002Z  INFO runtime::accelerated_table::refresh_task: Loading data for dataset taxi_trips
2025-01-20T19:26:40.312839Z  INFO runtime::accelerated_table::refresh_task: Loaded 2,964,624 rows (399.41 MiB) for dataset taxi_trips in 10s 299ms

Step 4. Start the Spice SQL REPL:

spice sql

The SQL REPL inferface will be shown:

Welcome to the Spice.ai SQL REPL! Type 'help' for help.

show tables; -- list available tables
sql>

Enter show tables; to display the available tables for query:

sql> show tables;
+---------------+--------------+---------------+------------+
| table_catalog | table_schema | table_name    | table_type |
+---------------+--------------+---------------+------------+
| spice         | public       | taxi_trips    | BASE TABLE |
| spice         | runtime      | query_history | BASE TABLE |
| spice         | runtime      | metrics       | BASE TABLE |
+---------------+--------------+---------------+------------+

Time: 0.022671708 seconds. 3 rows.

Enter a query to display the longest taxi trips:

SELECT trip_distance, total_amount FROM taxi_trips ORDER BY trip_distance DESC LIMIT 10;

Output:

+---------------+--------------+
| trip_distance | total_amount |
+---------------+--------------+
| 312722.3      | 22.15        |
| 97793.92      | 36.31        |
| 82015.45      | 21.56        |
| 72975.97      | 20.04        |
| 71752.26      | 49.57        |
| 59282.45      | 33.52        |
| 59076.43      | 23.17        |
| 58298.51      | 18.63        |
| 51619.36      | 24.2         |
| 44018.64      | 52.43        |
+---------------+--------------+

Time: 0.045150667 seconds. 10 rows.

โš™๏ธ Runtime Container Deployment

Using the Docker image locally:

docker pull spiceai/spiceai

In a Dockerfile:

from spiceai/spiceai:latest

Using Helm:

helm repo add spiceai https://helm.spiceai.org
helm install spiceai spiceai/spiceai

๐ŸŽ๏ธ Next Steps

Explore the Spice.ai Cookbook

The Spice.ai Cookbook is a collection of recipes and examples for using Spice. Find it at https://github.com/spiceai/cookbook.

Using Spice.ai Cloud Platform

Access ready-to-use Spicepods and datasets hosted on the Spice.ai Cloud Platform using the Spice runtime. A list of public Spicepods is available on Spicerack: https://spicerack.org/.

To use public datasets, create a free account on Spice.ai:

  1. Visit spice.ai and click Try for Free. Try for Free

  2. After creating an account, create an app to generate an API key. Create App

Once set up, you can access ready-to-use Spicepods including datasets. For this demonstration, use the taxi_trips dataset from the Spice.ai Quickstart.

Step 1. Initialize a new project.

# Initialize a new Spice app
spice init spice_app

# Change to app directory
cd spice_app

Step 2. Log in and authenticate from the command line using the spice login command. A pop up browser window will prompt you to authenticate:

spice login

Step 3. Start the runtime:

# Start the runtime
spice run

Step 4. Configure the dataset:

In a new terminal window, configure a new dataset using the spice dataset configure command:

spice dataset configure

Enter a dataset name that will be used to reference the dataset in queries. This name does not need to match the name in the dataset source.

dataset name: (spice_app) taxi_trips

Enter the description of the dataset:

description: Taxi trips dataset

Enter the location of the dataset:

from: spice.ai/spiceai/quickstart/datasets/taxi_trips

Select y when prompted whether to accelerate the data:

Locally accelerate (y/n)? y

You should see the following output from your runtime terminal:

2024-12-16T05:12:45.803694Z  INFO runtime::init::dataset: Dataset taxi_trips registered (spice.ai/spiceai/quickstart/datasets/taxi_trips), acceleration (arrow, 10s refresh), results cache enabled.
2024-12-16T05:12:45.805494Z  INFO runtime::accelerated_table::refresh_task: Loading data for dataset taxi_trips
2024-12-16T05:13:24.218345Z  INFO runtime::accelerated_table::refresh_task: Loaded 2,964,624 rows (8.41 GiB) for dataset taxi_trips in 38s 412ms.

Step 5. In a new terminal window, use the Spice SQL REPL to query the dataset

spice sql
SELECT tpep_pickup_datetime, passenger_count, trip_distance from taxi_trips LIMIT 10;

The output displays the results of the query along with the query execution time:

+----------------------+-----------------+---------------+
| tpep_pickup_datetime | passenger_count | trip_distance |
+----------------------+-----------------+---------------+
| 2024-01-11T12:55:12  | 1               | 0.0           |
| 2024-01-11T12:55:12  | 1               | 0.0           |
| 2024-01-11T12:04:56  | 1               | 0.63          |
| 2024-01-11T12:18:31  | 1               | 1.38          |
| 2024-01-11T12:39:26  | 1               | 1.01          |
| 2024-01-11T12:18:58  | 1               | 5.13          |
| 2024-01-11T12:43:13  | 1               | 2.9           |
| 2024-01-11T12:05:41  | 1               | 1.36          |
| 2024-01-11T12:20:41  | 1               | 1.11          |
| 2024-01-11T12:37:25  | 1               | 2.04          |
+----------------------+-----------------+---------------+

Time: 0.00538925 seconds. 10 rows.

You can experiment with the time it takes to generate queries when using non-accelerated datasets. You can change the acceleration setting from true to false in the datasets.yaml file.

๐Ÿ“„ Documentation

Comprehensive documentation is available at spiceai.org/docs.

Over 45 quickstarts and samples available in the Spice Cookbook.

๐Ÿ”Œ Extensibility

Spice.ai is designed to be extensible with extension points documented at EXTENSIBILITY.md. Build custom Data Connectors, Data Accelerators, Catalog Connectors, Secret Stores, Models, or Embeddings.

๐Ÿ”จ Upcoming Features

๐Ÿš€ See the Roadmap for upcoming features.

๐Ÿค Connect with us

We greatly appreciate and value your support! You can help Spice in a number of ways:

โญ๏ธ star this repo! Thank you for your support! ๐Ÿ™

Release History

VersionChangesUrgencyDate
v2.0.0# Spice v2.0.0 (June 4, 2026) Spice v2.0.0 is the next major release of Spice and a major milestone in the project's development, advancing Spice from a single-node engine into a distributed data and query platform built for enterprise AI agents. These agents need low-latency, governed access to data spread across many production systems, and because they generate their own queries autonomously, that access has to be sandboxed, observable, and able to absorb occasional heavy analytical querieHigh6/5/2026
v1.11.6# Spice v1.11.6 (May 6, 2026) Spice v1.11.6 is a patch release improving **DynamoDB** bootstrap ingestion performance and making **Helm chart** health probes configurable. ## What's New in v1.11.6 ### DynamoDB: Faster Bootstrap Ingestion Improved [DynamoDB](https://spiceai.org/docs/components/data-connectors/dynamodb) bootstrap ingestion throughput for `refresh_mode: changes` datasets ([#10616](https://github.com/spiceai/spiceai/pull/10616)). Large tables now maintain a high, steady High5/6/2026
v2.0.0-rc.3# Spice v2.0.0-rc.3 (Apr 18, 2026) v2.0.0-rc.3 is the third release candidate for advanced testing of v2.0, building on [v2.0.0-rc.2](https://github.com/spiceai/spiceai/releases/tag/v2.0.0-rc.2). Highlights in this release candidate include: - **HTTP Connector Enhancements** with OAuth2 refresh-token authentication, query-parameter pagination, and map-to-array conversion for broader API compatibility - **Databricks and Unity Catalog Reliability Improvements** with resilience controls, improveHigh4/20/2026
v2.0.0-unstableThis is the release candidate 2.0.0-unstableMedium4/10/2026
v2.0.0-rc.2# Spice v2.0.0-rc.2 (Apr 9, 2026) v2.0.0-rc.2 is the second release candidate for advanced testing of v2.0, building on [v2.0.0-rc.1](https://github.com/spiceai/spiceai/releases/tag/v2.0.0-rc.1). Highlights in this release candidate include: - **Distributed Spice Cayenne Query and Write Improvements** with data-local query routing and partition-aware write-through - **DataFusion v52.4.0 Upgrade** with aligned `arrow-rs`, `datafusion-federation`, and `datafusion-table-providers` - **MERGE INTOMedium4/10/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sMedium4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sLow4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sLow4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sLow4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sLow4/1/2026
v1.11.5# Spice v1.11.5 (Apr 1, 2026) Spice v1.11.5 is a patch release improving **`on_zero_results: use_source`** fallback performance, **Delta Lake** timestamp predicate data skipping, **S3 Parquet** read performance, **PostgreSQL** partitioned table support, **Cayenne** target file size handling, and preparing the CLI for v2.0 runtime upgrades. ## What's New in v1.11.5 ### `on_zero_results: use_source` Fallback Performance Improvement Improved the [`on_zero_results: use_source`](https://sLow4/1/2026
v1.11.4# Spice v1.11.4 (Mar 12, 2026) Spice v1.11.4 is a patch release improving **S3** metadata column query robustness and enabling **`on_zero_results: use_source`** for accelerated views. ## What's New in v1.11.4 ### Accelerated Views: `on_zero_results: use_source` Support Accelerated [views](https://spiceai.org/docs/components/views) now support the [`on_zero_results: use_source`](https://spiceai.org/docs/features/data-acceleration/data-refresh#behavior-on-zero-results) configuration ([Low3/12/2026
v1.11.3Spice v1.11.3 is a patch release fixing schema consistency issues in the **S3** and **FlightSQL** data connectors, improving **CDC cache invalidation**, and enhancing the **HTTP** data connector's error handling and response metadata. ## What's New in v1.11.3 ### S3 Data Connector Fix Fixed an issue where queries using metadata columns (`location`, `last_modified`, `size`) on [S3](https://spiceai.org/docs/components/data-connectors/s3) datasets produced `Input field name does not match Low3/9/2026
v2.0.0-rc.1# Spice v2.0.0-rc.1 (Mar 2, 2026) v2.0.0-rc.1 is the first release candidate for early testing of v2.0. Highlights in this release candidate include: - **Active-Active Highly-Available Distributed Query** that is object-store-native and built on Apache Ballista, with dynamic cluster sizing, distributed ingestion, and cluster observability - **Spice Cayenne RC** with staged append writes, file-based retention deletes, composite partitioning, and distributed ingestion - **DataFusion v52.2.0 UpgLow3/4/2026
v1.11.2Spice v1.11.2 is a patch release that tunes the behavior of the HTTP data connector. ## What's New in v1.11.2 - **HTTP Data Connector**: HTTP 429 (Too Many Requests) responses are now treated as retryable and non-cacheable, preventing rate-limited responses from being stored in the HTTP cache accelerator. ## Contributors - [@sgrebnov](https://github.com/sgrebnov) ## Breaking Changes No breaking changes. ## Cookbook Updates No new cookbook recipes. The [Spice Cookbook](Low2/18/2026
v1.11.1# Spice v1.11.1 (Feb 9, 2026) v1.11.1 is a patch release improving [Spice Cayenne](https://spiceai.org/docs/components/data-accelerators/cayenne) accelerator reliability and performance, enhancing **DynamoDB Streams** and **HTTP** data connectors, and fixing issues in **Federated Task History** and **FlightSQL**. ## What's New in v1.11.1 ### Spice Cayenne Accelerator Improvements This release includes stability and performance fixes for the [Spice Cayenne](https://spiceai.org/docs/coLow2/10/2026
v1.11.0In Spice v1.11.0, **Spice Cayenne reaches Beta status** with acceleration snapshots, Key-based deletion vectors, and Amazon S3 Express One Zone support. **DataFusion has been upgraded to v51** along with **Arrow v57.2**, and **iceberg-rust v0.8.0**. v1.11 adds several **DynamoDB & DynamoDB Streams improvements** such as JSON nesting, and adds significant improvements to **Distributed Query** with active-active schedulers and mTLS for enterprise-grade high-availability and secure cluster communicLow1/28/2026
v1.11.0-rc.3# Spice v1.11.0-rc.3 (Jan 22, 2026) v1.11.0-rc.3 is a patch release that includes improvements to Hash Indexing for Arrow Acceleration and fixes for TLS connections with Flight SQL endpoints. ## What's New in v1.11.0-rc.3 ### Hash Indexing for Arrow Acceleration (experimental) Arrow-based accelerations now support hash indexing for faster point lookups on equality predicates. Hash indexes provide O(1) average-case lookup performance for columns with high cardinality. **Features**: - PrimarLow1/23/2026
v1.11.0-rc.2# Spice v1.11.0-rc.2 (Jan 20, 2026) v1.11.0-rc.2 is the second release candidate for advanced test of v1.11. It brings **Spice Cayenne to Beta status** with acceleration snapshots support, a new **ScyllaDB Data Connector**, upgrades to **DataFusion v51**, **Arrow 57.2**, and **iceberg-rust v0.8.0**. It includes significant improvements to distributed query, caching, and observability. ## What's New in v1.11.0-rc.2 ### Spice Cayenne Accelerator Reaches Beta [Spice Cayenne](https://spiceai.orgLow1/21/2026
v1.11.0-rc.1# Spice v1.11.0-rc.1 (Jan 5, 2026) v1.11.0-rc.1 is the first release candidate for early testing of v1.11 features including **Distributed Query with mTLS** for enterprise-grade secure cluster communication, new **SMB and NFS Data Connectors** for direct network-attached storage access, **Prepared Statements** for improved query performance and security, **Cayenne Accelerator Enhancements** with Key-based deletion vectors and Amazon S3 Express One Zone support, **Google LLM Support** for expaLow1/7/2026
v1.10.4# Spice v1.10.4 (Jan 5, 2026) v1.10.4 is a patch release with fixes for **Kafka/Debezium batch commits**, **ABFSS URL support** for Azure Data Lake Storage Gen2, and improved **column projection handling** for location metadata columns. ## What's New in v1.10.4 ### Additional Improvements & Bug Fixes - **Reliability**: Fixed Kafka and Debezium batch commit handling to properly commit offsets across all partitions. Previously, only the last message's offset was committed, which could Low1/5/2026
v1.10.3# Spice v1.10.3 (Dec 29, 2025) v1.10.3 is a patch release with improved startup reliability, fixes for Azure BlobFS versioned containers, S3 custom endpoint query resolution, and a fix for the OpenAI Responses API. ## What's New in v1.10.3 ### Additional Improvements & Bug Fixes - **Reliability**: Telemetry exporter initialization now runs asynchronously, preventing blocked startup in environments with network restrictions (e.g., Kubernetes with restrictive network policies). - **ReLow12/29/2025
v1.10.2# Spice v1.10.2 (Dec 22, 2025) v1.10.2 introduces **Tiered Caching Acceleration with Localpod** for multi-layer acceleration architectures, **Periodic Acceleration Snapshots** with configurable intervals, **DynamoDB JSON Nesting** for column consolidation, and **Kafka/Debezium Batching** for faster data ingestion. This release also includes fixes for SQLite accelerator decimal/date handling and real-time status reporting for the `/v1/datasets` and `/v1/models` API endpoints. ## What's New Low12/22/2025
v1.10.1# Spice v1.10.1 (Dec 15, 2025) v1.10.1 is a patch release with **Cayenne accelerator improvements** including configurable compression strategies and improved partition ID handling, **isolated refresh runtime** for better query API responsiveness, and **security hardening**. In addition, the GO SDK, gospice v8 has been released. ## What's New in v1.10.1 ### Cayenne Accelerator Improvements Several improvements and bug fixes for the [Cayenne data accelerator](https://spiceai.org/docs/Low12/16/2025
v1.10.0# Spice v1.10.0 (Dec 9, 2025) Spice v1.10.0 introduces a new **Caching Acceleration Mode** with stale-while-revalidate (SWR) semantics for disk-persisted, low-latency queries with background refresh. This release also adds the **TinyLFU eviction policy** for the SQL results cache, a preview of the **DynamoDB Streams connector** for real-time CDC, **S3 location predicate pruning** for faster partitioned queries, improved **distributed query execution**, and multiple security hardening improvemLow12/9/2025
v1.9.2.d2541e8## What's Changed * 1.9.2 Release Notes (cherry-pick) by @krinart in https://github.com/spiceai/spiceai/pull/8315 * release/1.9: cherry pick security fixes by @phillipleblanc in https://github.com/spiceai/spiceai/pull/8303 **Full Changelog**: https://github.com/spiceai/spiceai/compare/v1.9.2...v1.9.2.d2541e8Low12/4/2025
v1.10.0-rc.1# Spice v1.10.0-rc1 (Dec 2, 2025) v1.10.0-rc1 is a release candidate for early testing of v1.10 features including an all new `caching` acceleration mode, `tiny_lfu` caching policy, a new DynamoDB Streams connector (Preview), improvements to the DynamoDB connector, faster distributed query execution, S3 connector improvements, and security hardening for v1.10.0-stable. ## What's New in v1.10.0-rc1 ### Caching Acceleration Mode with SWR and TinyLFU This release introduces a new `caching` [accLow12/3/2025
v1.9.2# Spice v1.9.2 (Nov 26, 2025) v1.9.2 is a patch release that fixes a bug in SQL query results cache metrics emission, ensuring cache performance metrics are properly available for monitoring and observability. ## What's New in v1.9.2 ## SQL Results Cache Metrics Fixed The SQL query results cache metrics were not being properly emitted to the Prometheus metrics endpoint after startup. This release fixes the issue, ensuring all cache-related metrics are now correctly available for moniLow11/26/2025
v1.9.1# Spice v1.9.1 (Nov 24, 2025) v1.9.1 introduces **Amazon Bedrock Nova 2 Multimodal embeddings support** with high-dimensional vectors up to 3,072 dimensions and purpose-optimized embeddings for semantic search and retrieval operations, **DynamoDB timestamp filter pushdown** for more efficient append-mode acceleration with configurable time formatting, **HTTP Data Connector health probe configuration** for improved endpoint validation reliability, and **Spice .NET SDK v0.2** with expanded .NETLow11/25/2025
v1.9.0# Spice v1.9.0 (Nov 18, 2025) v1.9.0-stable introduces **Spice Cayenne**, a new high-performance data accelerator built on the **Vortex** columnar format that delivers better than DuckDB performance without single-file scaling limitations, and a preview of **Multi-Node Distributed Query** based on Apache Ballista. v1.9.0 also upgrades to DataFusion v50, DuckDB v1.4.2, and Delta-Kernel v0.16 for even higher query performance, expands search capabilities with full-text search on views and multiLow11/19/2025
v1.9.0-rc.4This release candidate brings **DuckDB v1.4.2**, **Cayenne partitioning improvements**, and comprehensive **security hardening** across the CLI, data connectors, runtime, and MCP. v1.9.0-rc.4 also includes **MySQL and PostgreSQL** connector improvements with fixed nullability inferences and full-text search support, **DynamoDB consistency** improvements, **HTTP connector** validation and UX enhancements, and numerous reliability and performance optimizations. Significant work was also made to teLow11/18/2025
v1.9.0-rc.3# Spice v1.9.0-rc.3 (Nov 12, 2025) This release candidate improves partitioned table data filtering logic from the second release candidate for v1.9.0, which introduces **Spice Cayenne**, a new high-performance data accelerator built on the **Vortex** columnar format that delivers better than DuckDB performance without single-file scaling limitations and a preview of **Multi-Node Distributed Query** based on Apache Ballista. v1.9.0 also upgrades to DataFusion v50 and DuckDB v1.4.1 for even hiLow11/13/2025
v1.9.0-rc.2# Spice v1.9.0-rc.2 (Nov 11, 2025) This is the second release candidate for v1.9.0, which introduces **Spice Cayenne**, a new high-performance data accelerator built on the **Vortex** columnar format that delivers better than DuckDB performance without single-file scaling limitations and a preview of **Multi-Node Distributed Query** based on Apache Ballista. v1.9.0-rc.2 also upgrades to DataFusion v50 and DuckDB v1.4.1 for even higher query performance, expands search capabilities with full-tLow11/11/2025
v1.7.3# Spice v1.7.3 (Nov 06, 2025) Spice v1.7.3 is a focused patch release that improves AWS SDK credential handling by adding retry logic for transient network failures. ## What's Fixed - **AWS SDK credential resilience:** Improved credential initialization with automatic retry using Fibonacci backoff for ConnectorError failures resulting in more reliable connections to AWS services. ## Upgrading To upgrade to v1.7.3, use one of the following methods: **CLI**: ```console spice upgrade ``` **Low11/7/2025
v1.9.0-rc.1# Spice v1.9.0-rc.1 (Nov 3, 2025) This is the first release candidate for v1.9.0, which introduces **Cayenne**, a new high-performance data accelerator built on the Vortex columnar format that delivers DuckDB-comparable performance without scaling limitations. This release also upgrades to DataFusion v50 for improved query performance, expands search capabilities with full-text search on views and multi-column embeddings, includes significant DynamoDB and DuckDB accelerator improvements, and Low11/4/2025
v1.7.2# Spice v1.7.2 (Oct 30, 2025) Spice v1.7.2 is a focused patch release that hardens dataset refresh handling when a downstream dependency panics. Instances now recover automatically from refresh worker panics triggered by downstream dependencies (such as corrupted Parquet files), and operators gain visibility into these events through a new metric. ## What's Fixed - **Refresh worker resilience:** The acceleration refresh loop now catches and recovers from panics raised by the underlying Arrow Low10/30/2025
v1.8.3# Spice v1.8.3 (Oct 27, 2025) Spice v1.8.3 is a patch release focused on performance, reliability, and observability. This release delivers optimizations for DuckDB acceleration, parameterized queries, and query plans. A new opt-in dedicated thread pool for queries is now in preview. ## What's New in v1.8.3 ### DuckDB Data Accelerator Improvements - **Connection Pool Sizing**: The DuckDB accelerator now supports a configurable `connection_pool_size` parameter, supporting fine-grainedLow10/28/2025
v1.8.2# Spice v1.8.2 (Oct 21, 2025) Spice v1.8.2 is a patch release focused on reliability, validation, performance, and bug fixes, with improvements across DuckDB acceleration, S3 Vectors, document tables, and HTTP search. ## What's New in v1.8.2 ### Support Table Relations in `/v1/search` HTTP Endpoint Spice now supports table relations for the `additional_columns` and `where` parameters in the `/v1/search` endpoint. This enables improved search for multi-dataset use cases, where filtersLow10/22/2025
v1.8.1# Spice v1.8.1 (Oct 13, 2025) Spice v1.8.1 is a patch release that adds Acceleration Snapshots Indexes, and includes a number of bug fixes and performance improvements. ## What's New in v1.8.1 ### Acceleration Snapshot Indexes - **Management of [Acceleration Snapshots](https://spiceai.org/docs/features/data-acceleration/snapshots)** has been improved by adopting an Iceberg-inspired `metadata.json`, which now encodes pointer IDs, schema serialization, and robust checksum and size, whiLow10/13/2025
v1.8.0# Spice v1.8.0 (Oct 6, 2025) Spice v1.8.0 delivers major advances in data writes, scalable vector search, and now in previewโ€”**managed acceleration snapshots** for fast cold starts. This release introduces **write support for Iceberg tables** using standard SQL `INSERT INTO`, **partitioned S3 Vector indexes** for petabyte-scale vector search, and **preview of the AI SQL function** for direct LLM integration in SQL. Additional improvements include improved reliability, and the v3.0.3 release oLow10/7/2025
v1.7.1# Spice v1.7.1 (Sep 29, 2025) Spice v1.7.1 is a patch release focused on search improvements, bug fixes, and performance enhancements. This release introduces the Reciprocal Rank Fusion (RRF) user-defined table function (UDTF) for hybrid search, improves vector and text search reliability, and resolves several issues across the runtime, connectors, and query engine. ## What's New in v1.7.1 **Reciprocal Rank Fusion (RRF) UDTF**: Spice now supports Reciprocal Rank Fusion (RRF) as a user-dLow9/30/2025
v1.7.0# Spice v1.7.0 (Sep 23, 2025) Spice v1.7.0 upgrades to DataFusion v49 for improved performance and query optimization, introduces real-time full-text search indexing for CDC streams, EmbeddingGemma support for high-quality embeddings, new search table functions powering the `/v1/search` API, embedding request caching for faster and cost-efficient search and indexing, and OpenAI Responses API tool calls with streaming. This release also includes numerous bug fixes across CDC streams, vector seLow9/24/2025
v1.6.1# Spice v1.6.1 (Sep 1, 2025) Spice 1.6.1 is a patch release that provides improved Kafka type inference and JSON flattening support, alongside several bug fixes. ## What's New in v1.6.1 **Improved Kafka Type Inference**: Improve Kafka type inference by configuring the number of Kafka messages sampled during schema inference. Increasing the sample size can improve the robustness and reliability of inferred schemas, especially in cases where data contains optional fields or varying structLow9/1/2025
v1.6.0# Spice v1.6.0 (Aug 26, 2025) Spice 1.6.0 upgrades DataFusion to v48, reducing expressions memory footprint by ~50% for faster planning and lower memory usage, eliminating unnecessary projections in queries, optimizing string functions like `ascii` and `character_length` for up to 3x speedup, and accelerating unbounded aggregate window functions by 5.6x. The release adds Kafka and MongoDB connectors for real-time streaming and NoSQL data acceleration, supports OpenAI Responses API for advanceLow8/27/2025
v1.5.2# Spice v1.5.2 (Aug 4, 2025) Spice v1.5.2 introduces a new Amazon Bedrock Models Provider for converse API (Nova) compatible models, AWS Redshift support using the Postgres data connector, and Hadoop Catalog Support for Iceberg tables along with several bug fixes and improvements. ## What's New in v1.5.2 **Amazon Bedrock Models Provider**: Adds a new Amazon Bedrock LLM Provider. Models compatible with the [Converse API](https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ConveLow8/12/2025
v1.5.1# Spice v1.5.1 (July 28, 2025) Spice v1.5.1 expands the GitHub data connector to include pull-request comments, adds a configurable rate limiting for AWS Bedrock embedding models, expands partition pruning with inequality operators, and adds client-supplied cache keys for granular caching control in the HTTP and Arrow Flight SQL APIs. ## What's New in v1.5.1 **GitHub Data Connector Pull Request Comments**: Configure GitHub pulls datasets to include comments. Example Spicepod.yaml: Low7/29/2025
v1.5.0# Spice v1.5.0 (July 21, 2025) Spice v1.5.0 brings major upgrades to search and retrieval. It introduces native support for Amazon S3 Vectors, enabling petabyte scale vector search directly from S3 vector buckets, alongside SQL-integrated vector and tantivy-powered full-text search, partitioning for DuckDB acceleration, and automated refreshes for search indexes and views. It includes the AWS Bedrock Embeddings Model Provider, the Oracle Database connector, and the now-stable Spice.ai Cloud DLow7/22/2025
v1.5.0-rc.3# Spice v1.5.0-rc.3 (July 16, 2025) This is the third release candidate for v1.5.0, building on the capabilities introduced in [v1.5.0-rc.2](v1.5.0-rc.2.md). This release introduces native support for Amazon S3 Vectors, enabling petabyte scale vector search directly from S3 vector buckets, alongside SQL-integrated vector and full-text search, partitioning for DuckDB acceleration, and automated refreshes for search indexes and views. It includes the AWS Bedrock Embeddings Model Provider, the OraLow7/17/2025
v1.5.0-rc.2# Spice v1.5.0-rc.2 (July 14, 2025) This is the second release candidate for v1.5.0, which introduces SQL-integrated vector and full-text search, partitioning for DuckDB acceleration, and automated refreshes for search indexes and views. It adds a new AWS Bedrock Embeddings Model Provider, a new Oracle Database connector, and promotes the Spice.ai Cloud Data Connector to stable, alongside multi-column vector search for expanded search. This release also upgrades DuckDB from v1.1.3 to v1.3.2, acLow7/15/2025
v1.5.0-rc.1# Spice v1.5.0-rc.1 (July 7, 2025) This is the first release candidate for v1.5.0, which introduces partitioning for DuckDB acceleration, SQL-integrated vector and full-text search, and automated refreshes for search indexes and views. It adds a new AWS Bedrock Embeddings Model Provider, a new Oracle Database connector, and promotes the Spice.ai Cloud Data Connector to stable, alongside multi-column vector search for expanded search. ## What's New in v1.5.0-rc.1 **Partitioned AcceleratiLow7/8/2025
v1.4.0# Spice v1.4.0 (June 17, 2025) This release upgrades DataFusion to v47 and Arrow to v55 for faster queries, more efficient Parquet/CSV handling, and improved reliability. It introduces the AWS Glue Catalog and Data Connectors for native access to Glue-managed data on S3, and adds support for Databricks U2M OAuth for secure Databricks user authentication. New Cron-based dataset refreshes and worker schedules enable automated task management, while dataset and search results caching improvemLow6/18/2025
v1.4.0-rc.1# Spice v1.4.0-rc.1 (June 11, 2025) This release candidate for v1.4.0 upgrades DataFusion to v47 and Arrow to v55 for faster queries, more efficient Parquet/CSV handling, and improved reliability. It introduces the AWS Glue Catalog and Data Connectors for native access to Glue-managed data on S3 and supports Databricks U2M OAuth for secure Databricks user authentication. New Cron-based dataset refreshes and worker schedules enable automated task management, while dataset and search results caLow6/11/2025
v1.3.2# Spice v1.3.2 (June 3, 2025) Spice v1.3.2 improves DuckDB acceleration to accept `ORDER BY rand()` and `ORDER BY NULL` SQL queries, and supports the `TIMESTAMP_NTZ(0)` (timestamp with seconds precision) type in Snowflake. ## Contributors - [@phillipleblanc](https://github.com/phillipleblanc) - [@kczimm](https://github.com/kczimm) ## Breaking Changes No breaking changes. ## Cookbook Updates No new cookbook recipes. The [Spice Cookbook](https://spiceai.org/cookbook) now includes 67 recipeLow6/3/2025
v1.3.1# Spice v1.3.1 (May 25, 2025) Spice v1.3.1 includes improvements to Databricks SQL Warehouse support and parameterized query handling, along with several bugfixes. ## What's New in v1.3.1 - **Databricks SQL Warehouse** Added support for the `STRUCT` type, enabled join pushdown for queries within the same SQL Warehouse and added projection to logical plans to force federation with correct SQL dialect. - **SQL Improvements**: Fixed an issue where `ILike` was incorrectly optimized to stLow5/26/2025
v1.3.0# Spice v1.3.0 (May 19, 2025) Spice v1.3.0 accelerates data and AI applications with significantly improved query performance, reliability, and expanded Databricks integration. New support for the Databricks SQL Statement Execution API enables direct SQL queries on Databricks SQL Warehouses, complementing Mosaic AI model serving and embeddings (introduced in v1.2.2) and existing Databricks catalog and dataset integrations. This release upgrades to DataFusion v46, optimizes results caching perfoLow5/20/2025
v1.2.2# Spice v1.2.2 (May 12, 2025) Spice v1.2.2 introduces support for Databricks Mosaic AI model serving and embeddings, alongside the existing Databricks catalog and dataset integrations. It adds configurable service ports in the Helm chart and resolves several bugs to improve stability and performance. ## Highlights in v1.2.2 - **Databricks Model & Embedding Provider**: Spice integrates with [Databricks Model Serving](https://www.databricks.com/product/model-serving) for models and embeddLow5/13/2025
v1.2.1# Spice v1.2.1 (May 6, 2025) Spice v1.2.1 includes several data connector fixes and improves query performance for accelerated views. This release also introduces Databricks Service Principal (M2M OAuth) authentication and expands parameterized queries. ## Highlights in v1.2.1 - **Databricks Service Principal Support**: Databricks datasets and catalogs now support Machine-to-Machine (M2M) OAuth authentication via Service Principals, enabling secure machine connections to Databricks. Low5/6/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

awesome-opensource-aiCurated list of the best truly open-source AI projects, models, tools, and infrastructure.main@2026-06-06
RAGHubA community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.main@2026-05-26
swiftideFast, streaming indexing, query, and agentic LLM applications in Rustv0.32.1
raptorNo descriptionmain@2026-06-05
MiniSearchMinimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.spacemain@2026-06-05

More in RAG & Memory

vllmA high-throughput and memory-efficient inference and serving engine for LLMs
awesome-opensource-aiCurated list of the best truly open-source AI projects, models, tools, and infrastructure.
antflyNo description
generative-ai-for-beginners21 Lessons, Get Started Building with Generative AI