Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Installation

Note

For Linux and Mac installations: While it is likely that your distribution's package manager (or homebrew on a Mac, as Tesseract's site suggests) may contain Tesseract in one of its repositories, it is EXTREMELY unlikely that it will be the correct version. For the Islandora OCR module to create OCR derivatives, Tesseract 3.02.02 or higher is required. At the time of writing, this is the latest stable version. THIS MEANS THAT IT IS LIKELY THAT YOU WILL HAVE TO COMPILE IT FROM SOURCE.

Tesseract is managed by a team at Google; the latest stable release can be found on the downloads page of their website, https://code.google.com/p/tesseract-ocr/downloads/list. For those . A binary installer exists for Windows, and specific instructions for installing on a Mac through homebrew can be found in the Tesseract readme here: https://code.google.com/p/tesseract-ocr/wiki/ReadMe. For Linux users, or any others compiling it from source, you will need to make sure that you also have the Leptonica library installed, and that you have appropriate source building tools.

...