This package browser is in early development. Mind the rough edges.

tesseract-ocr 5.3.0

Optical character recognition engine

Tesseract is an optical character recognition (OCR) engine with very high accuracy. It supports many languages, output text formatting, hOCR positional information and page layout analysis. Several image formats are supported through the Leptonica library. It can also detect whether text is monospaced or proportional. Support for the English language is included by default. To add support for more languages, the tesseract-ocr-tessdata-fast package should be installed.

Installation

Install tesseract-ocr 5.3.0 as follows:

guix install tesseract-ocr@5.3.0

Or install the latest version:

guix install tesseract-ocr

You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the guix shell documentation for more information.