Scroll to navigation

GETPDFTEXT(1p) User Contributed Perl Documentation GETPDFTEXT(1p)

NAME

getpdftext - Extracts and print the text from one or more PDF pages

SYNOPSIS

 getpdftext [options] infile.pdf [<pagenums>]
 Options:
   -c --check          just validates the page instead of printing it
   -g --geometry       just computes geometry, prints nothing
   -v --verbose        print diagnostic messages
   -h --help           verbose help message
   -V --version        print CAM::PDF version
 <pagenums> is a comma-separated list of page numbers.
      Ranges like '2-6' allowed in the list
      Example: 4-6,2,12,8-9

DESCRIPTION

Extracts all of the text from the specified PDF page(s) and prints them to STDOUT. If no pages are specified, all pages are processed.

The "--check" and "--geometry" modes are distinctly different. They are used primarily for debugging.

SEE ALSO

CAM::PDF

renderpdf

AUTHOR

See CAM::PDF

2022-12-08 perl v5.36.0