HOCR2PDF(1) | ExactImage Manual | HOCR2PDF(1) |
NAME¶
hocr2pdf - hOCR to PDF converter of the ExactImage toolkitSYNOPSIS¶
hocr2pdf
[ option...]
{-i | --input} input-file {
-o | --output} output-file
hocr2pdf
{ -h | --help}
DESCRIPTION¶
ExactImage is a fast C++ image processing library. Unlike many other library frameworks it allows operation in several color spaces and bit depths natively, resulting in low memory and computational requirements. hocr2pdf creates well layouted, searchable PDF files from hOCR (annotated HTML) input obtained from an OCR system.OPTIONS¶
-i file, --input fileRead image from the specified file. Note that
input hOCR is read from the standard input.
-o file, --output file
Save output PDF to the specified file.
-n, --no-image
Don't place the image over the text. By
default the text layer is hidden behind the image.
-s, --sloppy-text
Sloppily place text, group words, do not draw
single glyphs.
-r n, --resolution n
Override resolution of the input image to
n dpi. The default resolution (if not specified in the input file) is
300 dpi.
-h, --help
Display help text and exit.
EXAMPLE¶
$ hocr2pdf -i scan.tiff -o test.pdf < cuneiform-out.hocr
SEE ALSO¶
AUTHORS¶
Jakub Wilk <jwilk@debian.org>Wrote this manual page for the Debian
system.
http://www.exactcode.de/site/open_source/exactimage/
This manual page incorporates texts found on
the ExactImage homepage.
COPYRIGHT¶
This manual page was written for the Debian system (and may be used by others). Permission is granted to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 or (at your option) any later version published by the Free Software Foundation. On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL-2.05/26/2015 | hocr2pdf |