freshcrate

Tag: #document-processing

5 packages • ⭐ 1,349 total stars

pdf-reader-mcp • v2.4.3 • 🌳 Mature • ⭐ 657

📄 Production-ready MCP server for PDF processing - 5-10x faster with parallel processing and 94%+ test coverage

pdf_oxide • v0.3.57 • 🌳 Mature • ⭐ 630

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.

qdrant-loader • qdrant-loader-v1.0.2 • 🌱 Seedling • ⭐ 37

Enterprise-ready vector database toolkit for building searchable knowledge bases from multiple data sources. Supports multi-project management, automatic ingestion from Confluence/JIRA/Git, intelligen

quarkus-docling • 1.3.1 • 🌱 Seedling • ⭐ 18

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem

My_AI • v7.4.0 • 🌱 Seedling • ⭐ 7

Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your