Unicode text segmentation for OCaml
Uuseg is an OCaml library for segmenting Unicode text. It implements the locale independent Unicode text segmentation algorithms to detect grapheme cluster, word and sentence boundaries and the Unicode line breaking algorithm to detect line break opportunities.
The library is independent from any IO mechanism or Unicode text data structure and it can process text without a complete in-memory representation.
- Website: https://erratique.ch/software/uuseg
- Licenses: ISC
- Package source: gnu/packages/ocaml.scm
- Builds: See build status
- Issues: See known issues
ocaml-uuseg 15.0.0 as follows:
guix install email@example.com
Or install the latest version:
guix install ocaml-uuseg
You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the
guix shell documentation for more information.