freshcrate

Tag: #document-processing

4 packages • ⭐ 1,230 total stars

pdf-reader-mcp v2.3.1 • 🌿 Growing • ⭐ 630

📄 Production-ready MCP server for PDF processing - 5-10x faster with parallel processing and 94%+ test coverage

pdf_oxide v0.3.37 • 🌳 Mature • ⭐ 547

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.

qdrant-loader qdrant-loader-v1.0.0 • 🌱 Seedling • ⭐ 36

Enterprise-ready vector database toolkit for building searchable knowledge bases from multiple data sources. Supports multi-project management, automatic ingestion from Confluence/JIRA/Git, intelligen

quarkus-docling 1.3.0 • 🌱 Seedling • ⭐ 17

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem