deb_control_files:
- control
- md5sums
- postinst
- prerm
deb_fields:
Architecture: all
Depends: ghostscript (>= 9.18~dfsg~), icc-profiles-free, python3-pdfminer (>= 20181108+dfsg-3),
python3-pil, python3-pkg-resources, python3-reportlab, python3-pikepdf (>= 5.0.1),
python3-pluggy, python3-coloredlogs, tesseract-ocr (>= 4.0.0), zlib1g, python3-deprecation,
python3-img2pdf (>= 0.3.0), python3-importlib-resources | python3 (>> 3.9), python3-packaging,
python3-tqdm, python3-typing-extensions | python3 (>> 3.10), python3:any
Description: |-
add an OCR text layer to PDF files
OCRmyPDF generates a searchable PDF/A file from a regular PDF
containing only images, allowing it to be searched.
.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
.
Some other main features:
.
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, a test suite and continuous
integration.
Homepage: https://github.com/jbarlow83/OCRmyPDF
Installed-Size: '555'
Maintainer: Debian Python Team <team+python@tracker.debian.org>
Package: ocrmypdf
Priority: optional
Recommends: unpaper, pngquant
Section: graphics
Suggests: ocrmypdf-doc, python-watchdog, img2pdf
Version: 14.0.1+dfsg1-1
srcpkg_name: ocrmypdf
srcpkg_version: 14.0.1+dfsg1-1