freshcrate

Search results for "data-extraction"

2 results found (Rust)
pdf_oxide 📁 v0.3.37 🌳 Mature 547

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.

kreuzberg 📁 v4.9.2 🌳 Mature 7,479

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python