quarkus-docling
Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem
Description
Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem
README
This is a Quarkus extension for the Docling project. Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.
- 🗂️ Parsing of multiple document formats incl. PDF, DOCX, XLSX, HTML, images, and more
- 📑 Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more
- 🧬 Unified, expressive DoclingDocument representation format
- ↪️ Various export formats and options, including Markdown, HTML, and lossless JSON
- 🔒 Local execution capabilities for sensitive data and air-gapped environments
- 🤖 Plug-and-play integrations incl. LangChain, LlamaIndex, Crew AI & Haystack for agentic AI
- 🔍 Extensive OCR support for scanned PDFs and images
- 🥚 Support of several Visual Language Models (SmolDocling)
- 💻 Simple and convenient CLI
Currently, this extension is a set of wrappers around the Docling Java project, which communicates with a Docling Serve instance via a REST API. This extension also provides a Dev Service and Dev UI integrations.
The eventual goal is to unify the DoclingDocument format with LangChain4j's Document abstraction so that Docling can be used in a LangChain4j RAG pipeline for ingesting data.
Take a look at the documentation for more information.
Or you can see an example with a video at: https://github.com/lordofthejars-ai/mission-impossible-rag
Thanks goes to these wonderful people (emoji key):
Eric Deandrea 💻 🚧 |
Alex Soto 💻 🚧 🖋 📖 🤔 |
Alina Yurenko 🐛 |
This project follows the all-contributors specification. Contributions of any kind welcome!
Release History
| Version | Changes | Urgency | Date |
|---|---|---|---|
| 1.3.1 | ## What's Changed * Reuse Ecosystem workflow by @gastaldi in https://github.com/quarkiverse/quarkus-docling/pull/112 * Bump dependabot/fetch-metadata from 2 to 3 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/113 * Bump docling-java.version from 0.5.0 to 0.5.1 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/114 * Bump io.quarkiverse.wiremock:quarkus-wiremock-test from 1.5.3 to 1.6.1 by @dependabot[bot] in https://github.com/quarkiverse/quark | High | 5/5/2026 |
| 1.3.0 | ## What's Changed * Update to docling-java 0.5.0 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/109 * Release 1.3.0 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/110 **Full Changelog**: https://github.com/quarkiverse/quarkus-docling/compare/1.2.3...1.3.0 | Medium | 3/20/2026 |
| 1.2.3 | ## What's Changed * Fix empty keyword entry in quarkus-extension.yaml by @krickert in https://github.com/quarkiverse/quarkus-docling/pull/96 * Bump org.apache.maven.plugins:maven-compiler-plugin from 3.14.1 to 3.15.0 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/97 * docs: add alina-yur as a contributor for bug by @allcontributors[bot] in https://github.com/quarkiverse/quarkus-docling/pull/99 * Update to docling-java 0.4.4 by @edeandrea in https://github.com/quarkive | Low | 2/25/2026 |
| 1.2.2 | ## What's Changed * Release 1.2.2 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/94 **Full Changelog**: https://github.com/quarkiverse/quarkus-docling/compare/1.2.1...1.2.2 | Low | 1/26/2026 |
| 1.2.1 | ## What's Changed * Add Scalar documentation endpoint support in Dev UI by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/89 * Bump io.quarkiverse.wiremock:quarkus-wiremock-test from 1.5.2 to 1.5.3 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/91 * Upgrade `docling-java` to v0.4.2 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/92 * Release 1.2.1 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/93 **Full C | Low | 1/1/2026 |
| 1.2.0 | ## What's Changed * Adds InputStream methods to DoclingService by @lordofthejars in https://github.com/quarkiverse/quarkus-docling/pull/79 * Upgrade `docling-java` to v0.4.0 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/83 * Upgrade `docling-java` to v0.4.1 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/84 * release 1.20 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/85 * Release 1.2.0 by @edeandrea in https://github.com/quarkive | Low | 12/17/2025 |
| 1.1.0 | ## What's Changed * Bump docling-java.version from 0.1.3 to 0.1.4 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/71 * Bump docling-java.version from 0.1.4 to 0.1.5 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/72 * Update to docling 0.2.0 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/74 * Add check to detect running Docling container before starting dev services by @edeandrea in https://github.com/quarkiverse/quarkus | Low | 12/8/2025 |
| 1.0.1 | ## What's Changed * Bump io.swagger.parser.v3:swagger-parser from 2.1.27 to 2.1.31 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/49 * Bump io.quarkiverse.antora:quarkus-antora from 2.2.0 to 3.25.0 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/50 * Bump org.assertj:assertj-core from 3.27.3 to 3.27.4 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/51 * Bump actions/checkout from 4 to 5 by @dependabot[bot] in https | Low | 11/22/2025 |
| 0.0.4 | ## What's Changed * Update docling devservice and add test to enforce client version is aligned by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/47 * Release 0.0.4 by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/48 **Full Changelog**: https://github.com/quarkiverse/quarkus-docling/compare/0.0.3...0.0.4 | Low | 7/23/2025 |
| 0.0.3 | ## What's Changed * Bump io.quarkiverse.antora:quarkus-antora from 2.1.0 to 2.2.0 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/36 * Bump io.quarkiverse.wiremock:quarkus-wiremock-test from 1.5.0 to 1.5.1 by @dependabot[bot] in https://github.com/quarkiverse/quarkus-docling/pull/41 * Add icon in dev ui by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/44 * Update version by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/45 **Ful | Low | 7/23/2025 |
| 0.0.2 | ## What's Changed * Bump io.quarkiverse.antora:quarkus-antora from 2.0.0 to 2.1.0 by @dependabot in https://github.com/quarkiverse/quarkus-docling/pull/32 * Creation of basic High Level Api by @lordofthejars in https://github.com/quarkiverse/quarkus-docling/pull/34 * Update project.yml to v0.0.2 by @lordofthejars in https://github.com/quarkiverse/quarkus-docling/pull/35 ## New Contributors * @lordofthejars made their first contribution in https://github.com/quarkiverse/quarkus-docling/pull/34 | Low | 6/25/2025 |
| 0.0.1 | ## What's Changed * docs: add edeandrea as a contributor for code, and maintenance by @allcontributors in https://github.com/quarkiverse/quarkus-docling/pull/2 * .gitignore by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/4 * Update .all-contributorsrc by @edeandrea in https://github.com/quarkiverse/quarkus-docling/pull/5 * docs: add lordofthejars as a contributor for code, and maintenance by @allcontributors in https://github.com/quarkiverse/quarkus-docling/pull/6 * Initial | Low | 6/13/2025 |
