freshcrate
Skin:/
Home > MCP Servers > UltraRAG

UltraRAG

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Why this rank:Strong adoptionRelease freshnessHealthy release cadence

Description

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

README

UltraRAG

Less Code, Lower Barrier, Faster Deployment

OpenBMB%2FUltraRAG | Trendshift

HomepageDocumentationDatasetPaper Daily ็ฎ€ไฝ“ไธญๆ–‡ ย |ย  English


Latest News ๐Ÿ”ฅ

  • [2026.01.23] ๐ŸŽ‰ UltraRAG 3.0 Released: Say no to "black box" developmentโ€”make every line of reasoning logic clearly visible ๐Ÿ‘‰ ๐Ÿ“– Blog
  • [2026.01.20] ๐ŸŽ‰ AgentCPM-Report Model Released! DeepResearch is finally localized: 8B on-device writing agent AgentCPM-Report is open-sourced ๐Ÿ‘‰ ๐Ÿค— Model
Previous News
  • [2025.11.11] ๐ŸŽ‰ UltraRAG 2.1 Released: Enhanced knowledge ingestion & multimodal support, with a more complete unified evaluation system!
  • [2025.09.23] New daily RAG paper digest, updated every day ๐Ÿ‘‰ ๐Ÿ“– Papers
  • [2025.09.09] Released a Lightweight DeepResearch Pipeline local setup tutorial ๐Ÿ‘‰ ๐Ÿ“บ bilibili ยท ๐Ÿ“– Blog
  • [2025.09.01] Released a step-by-step UltraRAG installation and full RAG walkthrough video ๐Ÿ‘‰ ๐Ÿ“บ bilibili ยท ๐Ÿ“– Blog
  • [2025.08.28] ๐ŸŽ‰ UltraRAG 2.0 Released! UltraRAG 2.0 is fully upgraded: build a high-performance RAG with just a few dozen lines of code, empowering researchers to focus on ideas and innovation! We have preserved the UltraRAG v2 code, which can be viewed at v2.
  • [2025.01.23] UltraRAG Released! Enabling large models to better comprehend and utilize knowledge bases. The UltraRAG 1.0 code is still available at v1.

๐Ÿ’ก About UltraRAG

UltraRAG is the first lightweight RAG development framework based on the Model Context Protocol (MCP) architecture design, jointly launched by THUNLP at Tsinghua University, NEUIR at Northeastern University, OpenBMB, and AI9stars.

Designed for research exploration and industrial prototyping, UltraRAG standardizes core RAG components (Retriever, Generation, etc.) as independent MCP Servers, combined with the powerful workflow orchestration capabilities of the MCP Client. Developers can achieve precise orchestration of complex control structures such as conditional branches and loops simply through YAML configuration.

UltraRAG Architecture

๐Ÿ–ฅ๏ธ UltraRAG UI

UltraRAG UI transcends the boundaries of traditional chat interfaces, evolving into a visual RAG Integrated Development Environment (IDE) that combines orchestration, debugging, and demonstration.

The system features a powerful built-in Pipeline Builder that supports bidirectional real-time synchronization between "Canvas Construction" and "Code Editing," allowing for granular online adjustments of pipeline parameters and prompts. Furthermore, it introduces an Intelligent AI Assistant to empower the entire development lifecycle, from pipeline structural design to parameter tuning and prompt generation. Once constructed, logic flows can be converted into interactive dialogue systems with a single click. The system seamlessly integrates Knowledge Base Management components, enabling users to build custom knowledge bases for document Q&A. This truly realizes a one-stop closed loop, spanning from underlying logic construction and data governance to final application deployment.

UltraRAG.Seamless.Integration.of.Development.Deployment.mp4

โœจ Key Highlights

๐Ÿš€ Low-Code Orchestration of Complex Workflows

Inference Orchestration: Natively supports control structures such as sequential, loop, and conditional branches. Developers only need to write YAML configuration files to implement complex iterative RAG logic in dozens of lines of code.

โšก Modular Extension and Reproduction

Atomic Servers: Based on the MCP architecture, functions are decoupled into independent Servers. New features only need to be registered as function-level Tools to seamlessly integrate into workflows, achieving extremely high reusability.

๐Ÿ“Š Unified Evaluation and Benchmark Comparison

Research Efficiency: Built-in standardized evaluation workflows, ready-to-use mainstream research benchmarks. Through unified metric management and baseline integration, significantly improves experiment reproducibility and comparison efficiency.

๐ŸŽฏ Rapid Interactive Prototype Generation

One-Click Delivery: Say goodbye to tedious UI development. With just one command, Pipeline logic can be instantly converted into an interactive conversational Web UI, shortening the distance from algorithm to demonstration.

๐Ÿ“ฆ Installation

We provide two installation methods: local source code installation (recommended using uv for package management) and Docker container deployment

Method 1: Source Code Installation

We strongly recommend using uv to manage Python environments and dependencies, as it can greatly improve installation speed.

Prepare Environment

If you haven't installed uv yet, please execute:

## Direct installation
pip install uv
## Download
curl -LsSf https://astral.sh/uv/install.sh | sh

Download Source Code

git clone https://github.com/OpenBMB/UltraRAG.git --depth 1
cd UltraRAG

Install Dependencies

Choose one of the following modes to install dependencies based on your use case:

A: Create a New Environment Use uv sync to automatically create a virtual environment and synchronize dependencies:

  • Core dependencies: If you only need to run basic core functions, such as only using UltraRAG UI:

    uv sync
  • Full installation: If you want to fully experience UltraRAG's retrieval, generation, corpus processing, and evaluation functions, please run:

    uv sync --all-extras
  • On-demand installation: If you only need to run specific modules, keep the corresponding --extra as needed, for example:

    uv sync --extra retriever   # Retrieval module only
    uv sync --extra generation  # Generation module only

Once installed, activate the virtual environment:

# Windows CMD
.venv\Scripts\activate.bat

# Windows Powershell
.venv\Scripts\Activate.ps1

# macOS / Linux
source .venv/bin/activate

B: Install into an Existing Environment To install UltraRAG into your currently active Python environment, use uv pip:

# Core dependencies
uv pip install -e .

# Full installation
uv pip install -e ".[all]"

# On-demand installation
uv pip install -e ".[retriever]"

Method 2: Docker Container Deployment

If you prefer not to configure a local Python environment, you can deploy using Docker.

Get Code and Images

# 1. Clone the repository
git clone https://github.com/OpenBMB/UltraRAG.git --depth 1
cd UltraRAG

# 2. Prepare the image (choose one)
# Option A: Pull from Docker Hub
docker pull hdxin2002/ultrarag:v0.3.0-base-cpu # Base version (CPU)
docker pull hdxin2002/ultrarag:v0.3.0-base-gpu # Base version (GPU)
docker pull hdxin2002/ultrarag:v0.3.0          # Full version (GPU)

# Option B: Build locally
docker build -t ultrarag:v0.3.0 .

# 3. Start container (port 5050 is automatically mapped)
docker run -it --gpus all -p 5050:5050 <docker_image_name>

Start the Container

# Start the container (Port 5050 is mapped by default)
docker run -it --gpus all -p 5050:5050 <docker_image_name>

Note: After the container starts, UltraRAG UI will run automatically. You can directly access http://localhost:5050 in your browser to use it.

Verify Installation

After installation, run the following example command to check if the environment is normal:

ultrarag run examples/experiments/sayhello.yaml

If you see the following output, the installation is successful:

Hello, UltraRAG v3!

๐Ÿš€ Quick Start

We provide complete tutorial examples from beginner to advanced. Whether you are conducting academic research or building industrial applications, you can find guidance here. Welcome to visit the Documentation for more details.

๐Ÿ”ฌ Research Experiments

Designed for researchers, providing data, experimental workflows, and visualization analysis tools.

  • Getting Started: Learn how to quickly run standard RAG experimental workflows based on UltraRAG.
  • Evaluation Data: Download the most commonly used public evaluation datasets in the RAG field and large-scale retrieval corpora, directly for research benchmark testing.
  • Case Analysis: Provides a visual Case Study interface to deeply track each intermediate output of the workflow, assisting in analysis and error attribution.
  • Code Integration: Learn how to directly call UltraRAG components in Python code to achieve more flexible customized development.

๐Ÿ› ๏ธ Demo Systems

Designed for developers and end users, providing complete UI interaction and complex application cases.

  • Quick Start: Learn how to start UltraRAG UI and familiarize yourself with various advanced configurations in administrator mode.
  • Deployment Guide: Detailed production environment deployment tutorials, covering the setup of Retriever, Generation models (LLM), and Milvus vector database.
  • Deep Research: Flagship case, deploy a Deep Research Pipeline. Combined with the AgentCPM-Report model, it can automatically perform multi-step retrieval and integration to generate tens of thousands of words of survey reports.

๐Ÿค Contributing

Thanks to the following contributors for their code submissions and testing. We also welcome new members to join us in collectively building a comprehensive RAG ecosystem!

You can contribute by following the standard process: Fork this repository โ†’ Submit Issues โ†’ Create Pull Requests (PRs).

โญ Support Us

If you find this repository helpful for your research, please consider giving us a โญ to show your support.

Star History Chart

๐Ÿ’ฌ Contact Us

  • For technical issues and feature requests, please use GitHub Issues.
  • For questions about usage, feedback, or any discussions related to RAG technologies, you are welcome to join our WeChat group, Feishu group, and Discord to exchange ideas with us.
  • If you have any questions, feedback, or would like to get in touch, please feel free to reach out to us via email at yanyk.thu@gmail.com
WeChat Group QR Code
WeChat Group
Feishu Group QR Code
Feishu Group
Join Discord

๐Ÿ“– Publications

Papers

  1. Shi Yu, Chaoyue Tang, Bokai Xu, Junbo Cui, Junhao Ran, Yukun Yan, Zhenghao Liu, Shuo Wang, Xu Han, Zhiyuan Liu, Maosong Sun. (2025) VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents. arXiv:2410.10594 and In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025).

  2. Xinze Li, Sen Mei, Zhenghao Liu, Yukun Yan, Shuo Wang, Shi Yu, Zheni Zeng, Hao Chen, Ge Yu, Zhiyuan Liu, Maosong Sun, Chenyan Xiong. (2025) RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards. arXiv:2410.13509 and In Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025).

  3. Kunlun Zhu, Yifan Luo, Dingling Xu, Yukun Yan, Zhenghao Liu, Shi Yu, Ruobing Wang, Shuo Wang, Yishan Li, Nan Zhang, Xu Han, Zhiyuan Liu, Maosong Sun. (2025) RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework. arXiv:2408.01262 and In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025).

  4. Ruobing Wang, Qingfei Zhao, Yukun Yan, Daren Zha, Yuxuan Chen, Shi Yu, Zhenghao Liu, Yixuan Wang, Shuo Wang, Xu Han, Zhiyuan Liu, Maosong Sun. (2025) DeepNote: Note-Centric Deep Retrieval-Augmented Generation. arXiv:2410.08821 and In Findings of the Association for Computational Linguistics: EMNLP 2025.

Models

  1. Yishan Li, Wentong Chen, Yukun Yan, Mingwei Li, Sen Mei, Xiaorong Wang, Kunpeng Liu, Xin Cong, Shuo Wang, Zhong Zhang, Yaxi Lu, Zhenghao Liu, Yankai Lin, Zhiyuan Liu, Maosong Sun. (2026) AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research. arXiv:2602.06540.

  2. OpenBMB. MiniCPM-Embedding-Light. Hugging Face Model Card.

Release History

VersionChangesUrgencyDate
v0.3.0.2Release date: 2026.4.9 ## Highlights This release delivers a major end-to-end memory upgrade for UltraRAG, introducing persistent user memory, project memory retrieval, and a dedicated memory-aware RAG demo. It also makes the demo experience significantly more stateful and personalized with SQLite-backed authentication, persistent chat sessions, nickname and model settings management, and knowledge base visibility controls supporting public, private, and shared access. The frontend is sHigh4/9/2026
v0.3.0.1Release date: 2026.3.10 ## Highlights This release introduces expanded document parsing for `.doc `and `.wps` formats, alongside new web search capabilities for the demo environment. We have significantly enhanced the chat experience by adding `DOCX` and `markdown` export functionality, downloadable text actions, and a streamlined flow for session creation and interruption handling. The citation system has been further optimized with automatic renumbering and improved source sorting for beMedium3/10/2026
v0.3.0Release date: 2026.1.23 ## Highlights Introducing UltraRAG 3.0: Reject "Black Box" Development. Make Every Line of Inference Logic Visible! UltraRAG 3.0 solves the "Last Mile" problem in RAG development, developed by THUNLP, NEUIR, OpenBMB & AI9Stars. This release represents a significant milestone with **129 pull requests** merged, bringing major enhancements to functionality, UI/UX, and system stability. * WYSIWYG Pipeline Builder: From logic to prototype in seconds. Our dual-mode buiLow1/26/2026
v0.2.1.3Release date: 2026.1.12 ## Highlights This release focuses on improving system stability and core logic. We have addressed a path indexing bug in the generation server to ensure reliable image rendering. Additionally, the search-o1 pipeline has been completely reimplemented, resolving previous implementation flaws. ## What's Changed 1. Fixed index reference for image paths in generation server. by @xhd0728 #146 2. Reimplemented the search-o1 pipeline by @mssssss123 @lifelsl #158Low1/12/2026
v0.2.1.2Release date: 2025.11.25 ## Highlights This release introduces a refreshed UltraRAG front-end UI, fixes several logical issues, and adds new ToolCall and PipelineCall capabilities for directly invoking UltraRAG tools or pipelines from your own code. The retriever server has been further optimized to support full deployment without repeated corpus/index initialization, significantly improving experimental efficiency. We also refine the GPU/CPU configuration logic for more stable and flexiblLow11/25/2025
v0.2.1.1Release date: 2025.11.13 ## Highlights This release improves the stability and flexibility of UltraRAG. Retriever and index are now fully decoupled, with added support for Milvus and Faiss. The VisRAG 2.0 pipeline is newly supported, and compatibility with chonkie 1.4.0+ has been updated. Several pipelines, including Search-o1 and WebNote, have been fixed. User experience is improved through progress-bar support during document parsing, a script for saving retrieval results, and zoom-in sLow11/13/2025
v0.2.1**Release date:** 2025.10.22 ## Highlights 1. Comprehensive Multimodal Upgrade: Both the Retriever and Generation Servers now support multimodal inputs, enabling a complete end-to-end multimodal workflow from retrieval to generation. 2. Corpus Parsing and Chunking Redesign: The Corpus Server adds multi-format file parsing with deep MinerU integration, supporting token-level, sentence-level, and customizable chunking strategies to flexibly adapt to diverse corpus structures. 3. Unified Low10/22/2025
v0.2.0**Release date:** 2025.08.28 ## Highlights UltraRAG 2.0 introduces a complete redesign of the system architecture, fully adopting the MCP-based modular structure. This release significantly improves flexibility, extensibility, and developer experience, enabling researchers and engineers to rapidly build, customize, and reproduce complex RAG pipelines with minimal overhead. The core of UltraRAG 2.0 is the new MCP Serverโ€“Client workflow, providing a clean separation between functional cLow10/21/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

solon-aiJava AI application development framework (supports LLM-tool,skill; RAG; MCP; Agent-ReAct,Team-Agent). Compatible with java8 ~ java25. It can also be embedded in SpringBoot, jFinal, Vert.x, Quarkus, av3.10.6
synaptic-memoryBrain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.v0.16.0
mcp-rag-agent๐Ÿ” Build a production-ready RAG system that combines LangGraph and MCP integration for precise, context-aware AI-driven question answering.main@2026-06-06
claude-code-proxyMonitor and visualize your Claude Code API interactions with Claude Code Proxy. Easily set up a transparent proxy and live dashboard. ๐Ÿ› ๏ธ๐Ÿš€main@2026-06-06
AstrBotAgentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. โœจv4.25.3

More in MCP Servers

AstrBotAgentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. โœจ
agentscopeBuild and run agents you can see, understand and trust.
claude-plugins-officialOfficial, Anthropic-managed directory of high quality Claude Code Plugins.
langchain4jLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme