Date: Thu, 28 Mar 2024 08:17:55 -0400 (EDT) Message-ID: <1315417987.27686.1711628275993@lyrasis1-roc-mp1> Subject: Exported From Confluence MIME-Version: 1.0 Content-Type: multipart/related; boundary="----=_Part_27685_1141252131.1711628275993" ------=_Part_27685_1141252131.1711628275993 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Content-Location: file:///C:/exported.html
pdftotext is a utility that comes as part of the Foolabs Xpdf package. I= t is used by the P= DF Solution Pack to extract text from text-based PDFs so that it can be= appended to the object as a FULL_TEXT datastream.
pdftotext is installed as part of Xpdf, which can be found at Foolabs' o= fficial site, http://www.foolabs.com/xpdf/download= .html. For Windows and Mac installations, a binary installer exists the= re; for Linux installations, however, you may compile it from source, use t= he binaries from the site, or much more simply use your distribution's pack= age manager to install it automatically; on Debian- and Ubuntu-based system= s, this can be accomplished by running:
apt-get= install xpdf-utils
More information on how to integrate pdftotext with Islandora can be fou= nd on the PDF Solu= tion Pack page.