tesseract-ocr 5.3.0
Optical character recognition engine
Tesseract is an optical character recognition (OCR) engine with very high accuracy. It supports many languages, output text formatting, hOCR positional information and page layout analysis. Several image formats are supported through the Leptonica library. It can also detect whether text is monospaced or proportional. Support for the English language is included by default. To add support for more languages, the tesseract-ocr-tessdata-fast
package should be installed.
- Website: https://github.com/tesseract-ocr/tesseract
- Licenses: ASL 2.0
- Package source: gnu/packages/ocr.scm
- Builds: See build status
- Issues: See known issues
Installation
Install tesseract-ocr 5.3.0
as follows:
guix install tesseract-ocr@5.3.0
Or install the latest version:
guix install tesseract-ocr
You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the guix shell
documentation for more information.