Binary package ocrmypdf/14.0.1+dfsg1-1 in crimson @ pureos

ocrmypdf - 14.0.1+dfsg1-1 main

OCRmyPDF generates a searchable PDF/A file from a regular PDF
that contains only images.
.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
.
Some other main features:
.
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested, deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, with a test suite and
continuous integration
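
As a rough illustration of how the optional features listed above map onto
the command line, the following sketch builds an ocrmypdf invocation as an
argument list. The helper name build_ocrmypdf_command and the file names are
hypothetical and used only for this example; the flags themselves (-l,
--deskew, --clean, --jobs) are ones ocrmypdf is known to accept. The sketch
only constructs the command and does not run it.

```python
import shlex


def build_ocrmypdf_command(input_pdf, output_pdf, languages=("eng",),
                           deskew=False, clean=False, jobs=None):
    """Build an argument list for the ocrmypdf CLI (hypothetical helper).

    The result can be handed to subprocess.run(); nothing is executed here.
    """
    cmd = ["ocrmypdf"]
    if languages:
        # Tesseract language codes are combined with '+', e.g. eng+deu.
        cmd += ["-l", "+".join(languages)]
    if deskew:
        # Straighten pages before OCR.
        cmd.append("--deskew")
    if clean:
        # Clean pages before OCR.
        cmd.append("--clean")
    if jobs is not None:
        # Cap the number of pages processed in parallel.
        cmd += ["--jobs", str(jobs)]
    cmd += [input_pdf, output_pdf]
    return cmd


if __name__ == "__main__":
    # scan.pdf and scan_ocr.pdf are placeholder file names.
    command = build_ocrmypdf_command(
        "scan.pdf", "scan_ocr.pdf",
        languages=("eng", "deu"), deskew=True, jobs=2,
    )
    print(shlex.join(command))
```

To actually run the command, the list could be passed to subprocess.run with
check=True, assuming ocrmypdf and the requested Tesseract language data are
installed.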

Priority: optional
Section: graphics
Suites: amber byzantium crimson dawn landing
Maintainer: Debian Python Team <team+python [꩜] tracker.debian.org>


Dependencies

ghostscript (>= 9.18~dfsg~)
icc-profiles-free
python3-pdfminer (>= 20181108+dfsg-3)
python3-pil
python3-pkg-resources
python3-reportlab
python3-pikepdf (>= 5.0.1)
python3-pluggy
python3-coloredlogs
tesseract-ocr (>= 4.0.0)
zlib1g
python3-deprecation
python3-img2pdf (>= 0.3.0)
python3-importlib-resources | python3 (>> 3.9)
python3-packaging
python3-tqdm
python3-typing-extensions | python3 (>> 3.10)
python3:any
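
The entries above follow the Debian dependency notation: "|" separates
alternatives, a parenthesised operator and version give a version constraint
(">=" means that version or later, ">>" means strictly later), and a suffix
such as ":any" is an architecture qualifier. The sketch below shows one way
to split such a line into its parts. It is a simplified illustration, not the
parser used by dpkg or apt, and parse_dependency is a hypothetical name
introduced for this example.

```python
import re

# One relation: name, optional ":arch" qualifier, optional "(op version)".
_RELATION = re.compile(
    r"^\s*(?P<name>[a-z0-9][a-z0-9+.\-]*)"
    r"(?::(?P<arch>[a-z0-9\-]+))?"
    r"\s*(?:\(\s*(?P<op><<|<=|=|>=|>>)\s*(?P<version>[^)\s]+)\s*\))?\s*$"
)


def parse_dependency(line):
    """Split one dependency line into a list of alternatives.

    Each alternative is a dict with the keys name, arch, op and version;
    parts that are absent in the input are None.
    """
    alternatives = []
    for part in line.split("|"):
        match = _RELATION.match(part)
        if match is None:
            raise ValueError(f"unrecognised dependency: {part.strip()!r}")
        alternatives.append(match.groupdict())
    return alternatives


if __name__ == "__main__":
    for entry in (
        "ghostscript (>= 9.18~dfsg~)",
        "python3-importlib-resources | python3 (>> 3.9)",
        "python3:any",
    ):
        print(parse_dependency(entry))
```

Read this way, the line "python3-importlib-resources | python3 (>> 3.9)" is
satisfied either by the backport package or by a python3 version strictly
later than 3.9.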

Installed Size: 568.3 kB
Architectures: all

Versions

14.0.1+dfsg1-1 all