freshcrate
Skin:/
Home > #pdf

Tag: #pdf

9 packages • ⭐ 124,320 total stars

MinerUmineru-3.2.2-released🏛️ Flagship60,769

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

doclingv2.97.0🏛️ Flagship58,310

SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.

pdf2image1.17.0🌳 Mature1,959

A wrapper around the pdftoppm and pdftocairo command line tools to convert PDF to a PIL Image list.

pdfrw0.4🌳 Mature1,911

PDF file reader/writer library

pyhankomaster@2026-06-03🌳 Mature711

Tools for stamping and signing PDF files

pdf_oxidev0.3.57🌳 Mature630

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.

astrbot_plugin_office_assistantv1.9.1🌱 Seedling24

这是一个为 AstrBot 设计的 Office 助手插件。它赋予大语言模型(LLM)直接操作文件的能力,支持读取并分析多种格式文件,以及生成 Office 文档和office互转pdf的功能

cf-browserv2.0.0🌱 Seedling5

Open-source Cloudflare Browser Rendering proxy — 10 MCP tools for Claude Code (content, screenshot, PDF, markdown, scrape, JSON AI extraction, links, a11y, crawl)

ragtable-extractmain@2026-06-04🌱 Seedling1

Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.