freshcrate
Skin:/
Home > MCP Servers > ragflow

ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

README

README in English ็ฎ€ไฝ“ไธญๆ–‡็‰ˆ่‡ช่ฟฐๆ–‡ไปถ ็น้ซ”็‰ˆไธญๆ–‡่‡ช่ฟฐๆ–‡ไปถ ๆ—ฅๆœฌ่ชžใฎREADME ํ•œ๊ตญ์–ด Bahasa Indonesia Portuguรชs(Brasil) README en Franรงais README in Arabic Tรผrkรงe README

follow on X(Twitter) Static Badge docker pull infiniflow/ragflow:v0.25.0 Latest Release license Ask DeepWiki

infiniflow%2Fragflow | Trendshift
๐Ÿ“• Table of Contents

๐Ÿ’ก What is RAGFlow?

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs. It offers a streamlined RAG workflow adaptable to enterprises of any scale. Powered by a converged context engine and pre-built agent templates, RAGFlow enables developers to transform complex data into high-fidelity, production-ready AI systems with exceptional efficiency and precision.

๐ŸŽฎ Demo

Try our demo at https://cloud.ragflow.io.

๐Ÿ”ฅ Latest Updates

  • 2026-03-24 RAGFlow Skill on OpenClaw โ€” Provides an official skill for accessing RAGFlow datasets via OpenClaw.
  • 2025-12-26 Supports 'Memory' for AI agent.
  • 2025-11-19 Supports Gemini 3 Pro.
  • 2025-11-12 Supports data synchronization from Confluence, S3, Notion, Discord, Google Drive.
  • 2025-10-23 Supports MinerU & Docling as document parsing methods.
  • 2025-10-15 Supports orchestrable ingestion pipeline.
  • 2025-08-08 Supports OpenAI's latest GPT-5 series models.
  • 2025-08-01 Supports agentic workflow and MCP.
  • 2025-05-23 Adds a Python/JavaScript code executor component to Agent.
  • 2025-05-05 Supports cross-language query.
  • 2025-03-19 Supports using a multi-modal model to make sense of images within PDF or DOCX files.

๐ŸŽ‰ Stay Tuned

โญ๏ธ Star our repository to stay up-to-date with exciting new features and improvements! Get instant notifications for new releases! ๐ŸŒŸ

๐ŸŒŸ Key Features

๐Ÿญ "Quality in, quality out"

  • Deep document understanding-based knowledge extraction from unstructured data with complicated formats.
  • Finds "needle in a data haystack" of literally unlimited tokens.

๐Ÿฑ Template-based chunking

  • Intelligent and explainable.
  • Plenty of template options to choose from.

๐ŸŒฑ Grounded citations with reduced hallucinations

  • Visualization of text chunking to allow human intervention.
  • Quick view of the key references and traceable citations to support grounded answers.

๐Ÿ” Compatibility with heterogeneous data sources

  • Supports Word, slides, excel, txt, images, scanned copies, structured data, web pages, and more.

๐Ÿ›€ Automated and effortless RAG workflow

  • Streamlined RAG orchestration catered to both personal and large businesses.
  • Configurable LLMs as well as embedding models.
  • Multiple recall paired with fused re-ranking.
  • Intuitive APIs for seamless integration with business.

๐Ÿ”Ž System Architecture

๐ŸŽฌ Get Started

๐Ÿ“ Prerequisites

  • CPU >= 4 cores
  • RAM >= 16 GB
  • Disk >= 50 GB
  • Docker >= 24.0.0 & Docker Compose >= v2.26.1
  • gVisor: Required only if you intend to use the code executor (sandbox) feature of RAGFlow.

Tip

If you have not installed Docker on your local machine (Windows, Mac, or Linux), see Install Docker Engine.

๐Ÿš€ Start up the server

  1. Ensure vm.max_map_count >= 262144:

    To check the value of vm.max_map_count:

    $ sysctl vm.max_map_count

    Reset vm.max_map_count to a value at least 262144 if it is not.

    # In this case, we set it to 262144:
    $ sudo sysctl -w vm.max_map_count=262144

    This change will be reset after a system reboot. To ensure your change remains permanent, add or update the vm.max_map_count value in /etc/sysctl.conf accordingly:

    vm.max_map_count=262144
  2. Clone the repo:

    $ git clone https://github.com/infiniflow/ragflow.git
  3. Start up the server using the pre-built Docker images:

Caution

All Docker images are built for x86 platforms. We don't currently offer Docker images for ARM64. If you are on an ARM64 platform, follow this guide to build a Docker image compatible with your system.

The command below downloads the v0.25.0 edition of the RAGFlow Docker image. See the following table for descriptions of different RAGFlow editions. To download a RAGFlow edition different from v0.25.0, update the RAGFLOW_IMAGE variable accordingly in docker/.env before using docker compose to start the server.

   $ cd ragflow/docker

   # git checkout v0.25.0
   # Optional: use a stable tag (see releases: https://github.com/infiniflow/ragflow/releases)
   # This step ensures the **entrypoint.sh** file in the code matches the Docker image version.

   # Use CPU for DeepDoc tasks:
   $ docker compose -f docker-compose.yml up -d

   # To use GPU to accelerate DeepDoc tasks:
   # sed -i '1i DEVICE=gpu' .env
   # docker compose -f docker-compose.yml up -d

Note: Prior to v0.22.0, we provided both images with embedding models and slim images without embedding models. Details as follows:

RAGFlow image tag Image size (GB) Has embedding models? Stable?
v0.21.1 โ‰ˆ9 โœ”๏ธ Stable release
v0.21.1-slim โ‰ˆ2 โŒ Stable release

Starting with v0.22.0, we ship only the slim edition and no longer append the -slim suffix to the image tag.

  1. Check the server status after having the server up and running:

    $ docker logs -f docker-ragflow-cpu-1

    The following output confirms a successful launch of the system:

          ____   ___    ______ ______ __
         / __ \ /   |  / ____// ____// /____  _      __
        / /_/ // /| | / / __ / /_   / // __ \| | /| / /
       / _, _// ___ |/ /_/ // __/  / // /_/ /| |/ |/ /
      /_/ |_|/_/  |_|\____//_/    /_/ \____/ |__/|__/
    
     * Running on all addresses (0.0.0.0)

    If you skip this confirmation step and directly log in to RAGFlow, your browser may prompt a network abnormal error because, at that moment, your RAGFlow may not be fully initialized.

  2. In your web browser, enter the IP address of your server and log in to RAGFlow.

    With the default settings, you only need to enter http://IP_OF_YOUR_MACHINE (sans port number) as the default HTTP serving port 80 can be omitted when using the default configurations.

  3. In service_conf.yaml.template, select the desired LLM factory in user_default_llm and update the API_KEY field with the corresponding API key.

    See llm_api_key_setup for more information.

    The show is on!

๐Ÿ”ง Configurations

When it comes to system configurations, you will need to manage the following files:

  • .env: Keeps the fundamental setups for the system, such as SVR_HTTP_PORT, MYSQL_PASSWORD, and MINIO_PASSWORD.
  • service_conf.yaml.template: Configures the back-end services. The environment variables in this file will be automatically populated when the Docker container starts. Any environment variables set within the Docker container will be available for use, allowing you to customize service behavior based on the deployment environment.
  • docker-compose.yml: The system relies on docker-compose.yml to start up.

The ./docker/README file provides a detailed description of the environment settings and service configurations which can be used as ${ENV_VARS} in the service_conf.yaml.template file.

To update the default HTTP serving port (80), go to docker-compose.yml and change 80:80 to <YOUR_SERVING_PORT>:80.

Updates to the above configurations require a reboot of all containers to take effect:

$ docker compose -f docker-compose.yml up -d

Switch doc engine from Elasticsearch to Infinity

RAGFlow uses Elasticsearch by default for storing full text and vectors. To switch to Infinity, follow these steps:

  1. Stop all running containers:

    $ docker compose -f docker/docker-compose.yml down -v

Warning

-v will delete the docker container volumes, and the existing data will be cleared.

  1. Set DOC_ENGINE in docker/.env to infinity.

  2. Start the containers:

    $ docker compose -f docker-compose.yml up -d

Warning

Switching to Infinity on a Linux/arm64 machine is not yet officially supported.

๐Ÿ”ง Build a Docker image

This image is approximately 2 GB in size and relies on external LLM and embedding services.

git clone https://github.com/infiniflow/ragflow.git
cd ragflow/
docker build --platform linux/amd64 -f Dockerfile -t infiniflow/ragflow:nightly .

Or if you are behind a proxy, you can pass proxy arguments:

docker build --platform linux/amd64 \
  --build-arg http_proxy=http://YOUR_PROXY:PORT \
  --build-arg https_proxy=http://YOUR_PROXY:PORT \
  -f Dockerfile -t infiniflow/ragflow:nightly .

๐Ÿ”จ Launch service from source for development

  1. Install uv and pre-commit, or skip this step if they are already installed:

    pipx install uv pre-commit
  2. Clone the source code and install Python dependencies:

    git clone https://github.com/infiniflow/ragflow.git
    cd ragflow/
    uv sync --python 3.12 # install RAGFlow dependent python modules
    uv run python3 download_deps.py
    pre-commit install
  3. Launch the dependent services (MinIO, Elasticsearch, Redis, and MySQL) using Docker Compose:

    docker compose -f docker/docker-compose-base.yml up -d

    Add the following line to /etc/hosts to resolve all hosts specified in docker/.env to 127.0.0.1:

    127.0.0.1       es01 infinity mysql minio redis sandbox-executor-manager
    
  4. If you cannot access HuggingFace, set the HF_ENDPOINT environment variable to use a mirror site:

    export HF_ENDPOINT=https://hf-mirror.com
  5. If your operating system does not have jemalloc, please install it as follows:

    # Ubuntu
    sudo apt-get install libjemalloc-dev
    # CentOS
    sudo yum install jemalloc
    # OpenSUSE
    sudo zypper install jemalloc
    # macOS
    sudo brew install jemalloc
  6. Launch backend service:

    source .venv/bin/activate
    export PYTHONPATH=$(pwd)
    bash docker/launch_backend_service.sh
  7. Install frontend dependencies:

    cd web
    npm install
  8. Launch frontend service:

    npm run dev

    The following output confirms a successful launch of the system:

  9. Stop RAGFlow front-end and back-end service after development is complete:

    pkill -f "ragflow_server.py|task_executor.py"

๐Ÿ“š Documentation

๐Ÿ“œ Roadmap

See the RAGFlow Roadmap 2026

๐Ÿ„ Community

๐Ÿ™Œ Contributing

RAGFlow flourishes via open-source collaboration. In this spirit, we embrace diverse contributions from the community. If you would like to be a part, review our Contribution Guidelines first.

Release History

VersionChangesUrgencyDate
v0.25.6## Summary Released on May 26, 2026. ### New features - Agent: Adds a **Browser** component that enables AI to autonomously navigate and interact with web pages. [#14888](https://github.com/infiniflow/ragflow/pull/14888) ### Improvements - RAG: RAPTOR construction now introduces AHC mode (ฮจ-RAG), which expands semantics from the document level to the dataset level. Not only is index construction performance significantly higher than the previous RAPTOR, but it also outperforms theHigh5/27/2026
v0.25.5## Summary Released on May 20, 2026. ### New features - Adds local & SSH providers in admin panel. [#15039](https://github.com/infiniflow/ragflow/pull/15039) ### Improvements - Accelerated dataset search path, reducing latency by 50โ€“100% by removing expensive vector fetch and rerank similarity computation steps. [#14970](https://github.com/infiniflow/ragflow/pull/14970) - Pushes metadata filters down to Infinity, significantly speeding up metadata filtering. [#14974](https://githHigh5/20/2026
v0.25.4 ## Summary ### New features - Adds a generic, configuration-driven RESTful API data source connector. ### Improvements - Agent tag management with filtering and sorting. - Widget customization and persistence. ### Model support - Adds gpt-5.4-mini and gpt-5.4-nano to the OpenAI model list ### Bug fixes Fixed dataset document download route. ## What's Changed * Feature/generic api connector by @ahmadintisar in https://github.com/infiniflow/ragflow/pull/13545 * fix:High5/14/2026
v0.25.2## Summary ### Improvements - API refactoring and unification: Continues the transition of web APIs to RESTful conventions, ensuring backward compatibility for all legacy endpoints. ### Data source - Introduces a lightweight snapshot mechanism for synchronizing deleted files across eight data sourcesโ€”including Moodle, DingTalk AI Table, and RSSโ€”ensuring a faithful reflection of all remote data sources. [#14362](https://github.com/infiniflow/ragflow/issues/14362)[#14499](https://githuHigh5/9/2026
v0.25.1## Summary ### Improvements - API refactoring and unification: Standardizes web APIs to RESTful conventions across all endpoints, unifying document creation and indexing flows while maintaining backward compatibility. - Parsing optimizations: Adds [OpenDataLoader](https://github.com/opendataloader-project/opendataloader-pdf) PDF backend. [#14097](https://github.com/infiniflow/ragflow/pull/14097) - Introduces lazy loading and chunked parsing for large PDFs (&gt;50 pages), significantly reHigh4/30/2026
v0.25.0Release v0.25.0 created from a33d0737cdcd71fd220e2869c84f37225f0b51b5 at 2026-04-21 17:32:49+08:00 ## What's Changed * Docs: Added v0.24.0 release notes by @writinwaters in https://github.com/infiniflow/ragflow/pull/13096 * Fix: upload image files by @Magicbook1108 in https://github.com/infiniflow/ragflow/pull/13071 * Fix: Correct Gemini embedding model name in llm_factories.json by @ahmadintisar in https://github.com/infiniflow/ragflow/pull/13051 * Fix: Make time_utils tests timezone-indHigh4/21/2026
v0.24.0# Summary ## New features - Memory - Introduces APIs and an SDK for developer integration. - Adds Memory extraction log display in the console for improved debugging and tracing. - Dataset - Added support for batch management of Metadata. - Renamed "ToC (Table of Contents)" to "PageIndex". - Agent - Launches a new Chat-like Agent conversation management interface that retains Sessions and dialogue history. - Introduces a multi-Sandbox mechanism, currently supporting locLow2/10/2026
v0.23.1## v0.23.1 Released on December 31, 2025. ### Improvements - Memory: Enhances the stability of memory extraction when all memory types are selected. - RAG: Refines the context window extraction strategy for images and tables. ### Fixed issues - Memory: - The RAGFlow server failed to start if an empty memory object existed. - Unable to delete a newly created empty Memory. - RAG: MDX file parsing was not supported. ### Data sources - GitHub - Gitlab - Asana - IMALow12/31/2025
v0.23.0## v0.23.0 Released on December 27, 2025. ### New features - Memory - Implements a **Memory** interface for managing memory. - Supports configuring context via the **Retrieval** or **Message** component. - Agent - Improves the **Agent** component's performance by refactoring the underlying architecture. - The **Agent** component can now output structured data for use in downstream components. - Supports using webhook to trigger agent execution. - Supports voice Low12/27/2025
nightlyRelease nightly created from 74b44e1aa3ecd6687b3aa4ef731d0187720c3cb5 at 2026-04-21 21:38:13+08:00 Low12/1/2025
v0.22.1Release v0.22.1 created from cfdccebb178bc2c49eacf321e5da483ed1e48511 at 2025-11-19 20:13:52+08:00 ## Summary ### Improvements - Agent: - Supports exporting Agent outputs in Word or Markdown formats. - Adds a **List operations** component. - Adds a **Variable aggregator** component. - Data sources: - Supports S3-compatible data sources, e.g., MinIO. - Adds data synchronization with JIRA. - Continues the redesign of the **Profile** page layouts. - Upgrades the Flask webLow11/19/2025
v0.22.0Released on November 12, 2025. ## Breaking Changes - From this release onwards, we ship only the slim edition (without embedding models) Docker image and no longer append the `-slim` suffix to the image tag. ## New Features - Dataset: - Supports data synchronization from five online sources (AWS S3, Google Drive, Notion, Confluence, and Discord). - RAPTOR can be built across an entire dataset or on individual documents. - Ingestion pipeline: Supports [Docling document parsing]Low11/12/2025
v0.21.1Release v0.21.1 created from de24e74b4c90ec154965c1bc7dc7a0fe79e669a1 at 2025-10-23 19:08:09+08:00 ## New features - Experimental: Adds support for PDF document parsing using MinerU. See [here](./faq.mdx#how-to-use-mineru-to-parse-pdf-documents). ## Improvements - Enhances UI/UX for the dataset and personal center pages. - Upgrades RAGFlow's document engine, [Infinity](https://github.com/infiniflow/infinity), to v0.6.1. ## Fixed issues - An issue with video parsing. ## WhatLow10/23/2025
v0.21.0Released on October 15, 2025. ### New features - Orchestratable ingestion pipeline: Supports customized data ingestion and cleansing workflows, enabling users to flexibly design their data flows or directly apply the official data flow templates on the canvas. - GraphRAG & RAPTOR write process optimized: Replaces the automatic incremental build process with manual batch building, significantly reducing construction overhead. - Long-context RAG: Automatically generates document-level tablLow10/15/2025
v0.20.5## Summary ### Improvements - Agent Performance Optimized: Improved planning and reflection speed for simple tasks; optimized concurrent tool calls for parallelizable scenarios, significantly reducing overall response time. - Agent Prompt Framework exposed: Developers can now customize and override framework-level prompts in the system prompt section, enhancing flexibility and control. - Execute SQL Component Enhanced: Replaced the original variable reference component with a text input fiLow9/10/2025
v0.20.4## Summary ### Improvements - Agent component: Completes Chinese localization for the Agent component. - Introduces the `ENABLE_TIMEOUT_ASSERTION` environment variable to enable or disable timeout assertions for file parsing tasks. - Dataset: - Improves Markdown file parsing, with AST support to avoid unintended chunking. - Enhances HTML parsing, supporting bs4-based HTML tag traversal. ### Added models ZHIPU GLM-4.5 ### New Agent templates Ecommerce Customer Service WoLow8/27/2025
v0.20.3# Summary ## Improvements - Revamps the user interface for the Datasets, Chat, and Search pages. - Search and Chat: Introduces document-level metadata filtering, allowing automatic or manual filtering during chats or searches. - Search: Supports creating search apps tailored to various business scenarios - Chat: Supports comparing answer performance of up to three chat model settings on a single Chat page. - Agent: - Implements a toggle in the Agent component to enable or disable citLow8/20/2025
v0.20.2Release v0.20.2Low8/19/2025
v0.20.1Release v0.20.1 created from 9b026fc5b6f234a3abdf954f359bc2f9e99d9ba8 at 2025-08-08 18:51:10+08:00 # Summary ## New Features - The Retrieval components now support the dynamic specification of knowledge base names using variables. - The user interface now includes a French language option. ## Added Models - GPT 5 - Claude 4.1 ## New agent Templates (both workflow and agentic) - SQL Assistant Workflow: Empowers non-technical teams (e.g., operations, product) to independently query buLow8/8/2025
v0.20.0Released on August 1, 2025. ## Features and improvements - Unified orchestration of both Agents and Workflows. - A comprehensive refactor of the Agent, greatly enhancing its capabilities and usability, with support for Multi-Agent configurations, planning and reflection, and visual functionalities. - Fully realized MCP functionality, allowing for MCP Server import, Agents functioning as MCP Clients, and RAGFlow itself operating as an MCP Server. - Access to runtime logs for Agents. - CLow8/1/2025
v0.19.1## v0.19.1 Released on June 23, 2025. ### Fixed issues - A memory leak issue during high-concurrency requests. - Fixed a freezing issue when parsing large files with GraphRAG entity resolution enabled. [#8223](https://github.com/infiniflow/ragflow/pull/8223) - A context error occurring when using Sandbox in standalone mode. [#8340](https://github.com/infiniflow/ragflow/pull/8340) - An excessive CPU usage issue caused by Ollama. [#8216](https://github.com/infiniflow/ragflow/pull/8216)Low6/20/2025
v0.19.0Released on May 26, 2025. ### New features - Cross-language search is supported in the Knowledge and Chat modules, enhancing search accuracy and user experience in multilingual environments, such as in Chinese-English knowledge bases. - Agent component: A new Code component supports Python and JavaScript scripts, enabling developers to handle more complex tasks like dynamic data processing. - Enhanced image display: Images in Chat and Search now render directly within responses, rather tLow5/23/2025
v0.18.0Release v0.18.0 created from 94181a990b957ed302952b4de17583d2b44f3099 at 2025-04-23 14:11:59+08:00 ## New features - MCP server: enables access to RAGFlow's knowledge bases via MCP. - DeepDoc supports adopting VLM model as a processing pipeline during document layout recognition, enabling in-depth analysis of images in PDFs. - Agent version control: all updates are continuously logged and can be rolled back to a previous version via export. - Team collaboration: Agents can be shared witLow4/23/2025
v0.17.2### Improvements - Adds OpenAI-compatible APIs. - Introduces a German user interface. - Accelerates knowledge graph extraction. - Enables Tavily-based web search in the **Retrieval** agent component. - Adds Tongyi-Qianwen QwQ models (OpenAI-compatible). - Supports CSV files in the **General** chunk method. ### Fixed issues - Unable to add models via Ollama/Xinference, an issue introduced in v0.17.1. ### Related APIs #### HTTP APIs [Create chat completion](./references/httpLow3/13/2025
v0.17.1## Improvements - Improves English tokenization quality. - Improves the table extraction logic in Markdown document parsing. - Updates SiliconFlow's model list. - Supports parsing XLS files (Excel97~2003) with improved corresponding error handling. - Supports Huggingface rerank models. - Enables relative time expressions ("now", "yesterday", "last week", "next year", and more) in the **Rewrite** agent component. ## Fixed issues - A repetitive knowledge graph extraction issue. - IsLow3/11/2025
v0.17.0## Summary - AI chat: Implements Deep Research for agentic reasoning. To activate this, enable the **Reasoning** toggle under the **Prompt Engine** tab of your chat assistant dialogue. - AI chat: Leverages Tavily-based web search to enhance contexts in agentic reasoning. To activate this, enter the correct Tavily API key under the **Assistant Setting** tab of your chat assistant dialogue. - AI chat: Supports initiating a chat without specifying knowledge bases. - AI chat: HTML files can alsoLow3/2/2025
v0.16.0## Summary - Updated model list to support DeepSeek R1 and DeepSeek V3, available on both DeepSeek and SiliconFlow - GraphRAG enhancements and refactoring: Support KG maintenance over the whole knowledge base dynamically; Support Light and General working mode---entity deduplication and community detection become optional. - Support for Tagging to improve search quality. You can maintain the tags, and tag the chunks of other knowledge bases, to improve the search relevance. - Support IteratoLow2/6/2025
v0.15.1## What's Changed * Fix release.yml by @yuzhichang in https://github.com/infiniflow/ragflow/pull/4100 * Miscellaneous updates to session APIs by @writinwaters in https://github.com/infiniflow/ragflow/pull/4097 * add typo locale by @isthaison in https://github.com/infiniflow/ragflow/pull/4099 * Fixed infinity exception SCORE() / SCORE_FACTORS() requires Fusion or MATCH TEXT or MATCH TENSOR by @yuzhichang in https://github.com/infiniflow/ragflow/pull/4110 * Separated list_agents() from sessioLow12/25/2024
v0.15.0Released on December 18, 2024. Cheers! ### New features - Introduces additional Agent-specific APIs. - Supports using page rank score to improve retrieval performance when searching across multiple knowledge bases. - Offers an iframe in Chat and Agent to facilitate the integration of RAGFlow into your webpage. - Adds a Helm chart for deploying RAGFlow on Kubernetes. - Supports importing or exporting an agent in JSON format. - Supports step run for Agent components/tools. - Adds a newLow12/18/2024
v0.14.1## Summary Mainly addressing several serious bugs in 0.14.0 and issues with Infinity configuration, the updates are listed below: * Infinity configuration can be changed in docker/infinity_conf.toml. After modifying the file, restart docker compose to take effect. * Solve the problem that the Chunk interface cannot be displayed or modified. * Solve the problem of garbled Chinese text parsing. * Solve the Elasticsearch 'Not found error' problem. * Solve the problem of Polars library compatiLow11/28/2024
v0.14.0## Summary - Supports Infinity as the alternative doc engine to elasticsearch - Add Agent support for variables and optimize interaction such as auto-saving - Added Andrew Ng's 3-step translation Agent template and SEO optimized blog generation Agent template. - Added API for Agent - English synonyms are supported during retrieval - Optimized the calculation of term weight, such that the recall time is reduced by half - More resilient task executor for document parsing and chunking - StrLow11/25/2024
v0.13.0## Summary - Added team support - Upgrade UI for Agent - Added HTTP API for knowledge base, document parsing, dialogs, etc. Python SDK is also provided, which can be installed via `pip install ragflow-sdk==0.13.0`. - Added smart chunking for Markdown - Added context retrieval through entity extraction from LLM - Added Invoke operator for Agent - Added support for DIFY external knowledge base API - Added integration for ChatGPT-on-WeChat - Added support for GLM4-9B, Yi-Lightning ## Low10/31/2024
v0.12.0## Summary * Adds new Docker image of slim version will doesn't including embedding and reranker models. * Improves the effect of long context conversion. * Supports OpenTTS and SparkTSS * Adds new Excel parsing method, user can choose the output data format between HTML and row text. * User can remove the added model provider. ## What's Changed * Add Multi-Language Descriptions for 'Switch' Component and Update Message Assistant Placeholder by @Defozo in https://github.com/infiniflowLow9/30/2024
v0.11.0## Summary - Adds AI search, aiming to provide PerplexityAI for each enterprise. - TTS output based on FishAudio, Tongyi Qwen TTS. - Supports using Postgres instead of MySQL to store RAGFlow metadata. - Supports using S3/Azure blob as file storage. - Supports LLMs above Anthropic, Voyage AI, Google Cloud. - Supports Tencent Cloud ASR, to convert speech files into text. - Adds several financial-related agent operators. - Adds a medical assistant agent. - Adds RAGFlow evaluation benchmarkLow9/14/2024
v0.10.0## Release Summary - Supports text to SQL by RAG - Provides Agent API - Provide the health check on task_executor process. - Add more Agent operator: Github, DeepL, BaiduFanyi, QWeather, ExecSQL, GoogleScholar - Supports file type: eml - Supports embedding model: Cohere - Supports more LLMs or model services : GPT-4o-mini, PerfXCloud, TogetherAI, Upstage, Novita.AI, 01.AI, SiliconFlow, XunFei Spark, Baidu Yiyan, Tencent Hunyuan ## What's Changed * refine loginfo about graprag progressLow8/23/2024
v0.9.0## Release Summary 1. Support GraphRAG 2. Add more Agent operators: keyword extraction, Baidu/duckduckgo/pubmed/wikipedia/bing/google search operators. 3. Add support for audio files (speech-to-text recognition). 4. Add support for Gemini, Groq and other LLMs. 5. Support for reranker provided by xinference. 6. Add support for LM studio/OpenRouter/LocalAI/Nvidia api and other inference frameworks/engines/services. ## What's Changed * feat: translate the to field #918 by @cike8899 in httLow8/6/2024
v0.8.0## What's Changed * Update README by @JinHai-CN in https://github.com/infiniflow/ragflow/pull/1001 * fix bug 994 ,991 by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/1004 * fix #994 by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/1006 * Update info by @JinHai-CN in https://github.com/infiniflow/ragflow/pull/1005 * Add file rag/svr/discord_svr.py by @guoyuhao2330 in https://github.com/infiniflow/ragflow/pull/1008 * fix chunk modification bug by @KevinHuSh in https://Low7/8/2024
v0.7.0Major updates: - Integrates [BCE](https://github.com/netease-youdao/BCEmbedding) and [BGE](https://github.com/FlagOpen/FlagEmbedding) reranker models. - Integrates [Jina](https://jina.ai/embeddings/) embedding and reranker models. - Supports LLM Baichuan and VolcanoArk. - Supports [RAPTOR](https://arxiv.org/html/2401.18059v1) for better text retrieval. - Adds a new API for removing documents. - Supports a new file format: HTML. - Supports the ARM platform. ## What's Changed * Fix miLow5/30/2024
v0.6.0Major updates: - Streaming output. - Text chunks retrieval API. - Supports skipping layout recognition for general layout. - Provide system components status monitoring. ## What's Changed * make sure the error will be recorded. by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/672 * make sure to raise exception if redis is not there by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/674 * fix coordinate error by @KevinHuSh in https://github.com/infiniflow/ragflow/puLow5/21/2024
v0.5.0RAGFlow v0.5.0 is released. The major updates in this release include improving scheduling mechanisms, a lot of bugs have been fixed, and new models such as DeepSeek-V2, being integrated. ## What's Changed * fix bug about fetching file from minio by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/574 * fix: display the current language directly at the top and do not dispโ€ฆ by @cike8899 in https://github.com/infiniflow/ragflow/pull/579 * make cites in conversation API configurable bLow5/8/2024
v0.4.0## What's Changed * add file management feature by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/560 * feat: translate FileManager #345 by @cike8899 in https://github.com/infiniflow/ragflow/pull/558 * fix rename bug by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/562 * feat: add Tooltip to action icon of FileManager by @cike8899 in https://github.com/infiniflow/ragflow/pull/561 * fix bug of file management by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/56Low4/26/2024
v0.3.2## What's Changed * Updated badge link by @writinwaters in https://github.com/infiniflow/ragflow/pull/545 * chore: disable Kibana volume storage in Docker Compose by @huangbaichao in https://github.com/infiniflow/ragflow/pull/548 * fix youdao bug by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/551 * Update version to 0.3.2 by @JinHai-CN in https://github.com/infiniflow/ragflow/pull/550 ## New Contributors * @huangbaichao made their first contribution in https://github.com/infLow4/26/2024
v0.3.1## What's Changed * Update format by @JinHai-CN in https://github.com/infiniflow/ragflow/pull/467 * fix: ๐Ÿ› Fix duplicate ports in docker-compose by @moyueheng in https://github.com/infiniflow/ragflow/pull/472 * add redis to accelerate access of minio by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/482 * feat: support markdown files by @wrfly in https://github.com/infiniflow/ragflow/pull/483 * fix ollama issuet push by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/486Low4/25/2024
v0.3.0## What's Changed * fix gb2312 encoding issue by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/394 * Added FAQs by @writinwaters in https://github.com/infiniflow/ragflow/pull/395 * feat: modify the description of qa by @cike8899 in https://github.com/infiniflow/ragflow/pull/406 * Add automation scripts to support displaying environment information such as RAGFlow repository version, operating system, Python version, etc. in a Linux environment for users to report issues. by @ysyx2Low4/19/2024
v0.2.0## What's Changed * change version number by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/368 * make `<xxxx>` visiable by @ooooo-create in https://github.com/infiniflow/ragflow/pull/369 * enlarge max file number per user limit by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/373 * build ragflow docker image from scratch by @yangjie407 in https://github.com/infiniflow/ragflow/pull/376 * Add bce-embedding and fastembed by @KevinHuSh in https://github.com/infiniflow/ragfLow4/16/2024
v0.1.0## What's Changed * use onnx models, new deepdoc by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/68 * feat: fetch conversation and delete chat dialog by @cike8899 in https://github.com/infiniflow/ragflow/pull/69 * rename vision, add layour and tsr recognizer by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/70 * init README of deepdoc, add picture processer. by @KevinHuSh in https://github.com/infiniflow/ragflow/pull/71 * refine README by @KevinHuSh in https://github.cLow4/15/2024

Dependencies & License Audit

Loading dependencies...

Similar Packages

deer-flowAn open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tamain@2026-06-06
nexentNexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles โ€” unified tools, skills, memory, and orchestration with built-in constraints, feedbacv2.2.0
mcp-audit๐ŸŒŸ Track token consumption in real-time with MCP Audit. Diagnose context bloat and unexpected spikes across MCP servers and tools efficiently.main@2026-06-06
DeepAnalyze๐Ÿ” Empower data scientists with DeepAnalyze, a tool that leverages large language models for automated data analysis and insights generation.main@2026-06-05
ag2AG2 (formerly AutoGen): The Open-Source AgentOS.Join us at: https://discord.gg/sNGSwQME3xv0.13.3

More from infiniflow

infinityThe AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

More in MCP Servers

AstrBotAgentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. โœจ
agentscopeBuild and run agents you can see, understand and trust.
claude-plugins-officialOfficial, Anthropic-managed directory of high quality Claude Code Plugins.
langchain4jLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme