docx2txt 1.4
Recover text from `.docx' files, with good formatting
docx2txt
is a Perl based command line utility to convert Microsoft Office .docx
documents to equivalent text documents. Latest version supports following features during text extraction.
Character conversions; currency characters are converted to respective names like Euro.
Capitalisation of text blocks.
Center and right justification of text fitting in a line of (configurable) 80 columns.
Horizontal ruler, line breaks, paragraphs separation, tabs.
Indicating hyperlinked text along with the hyperlink (configurable).
Handling (bullet, decimal, letter, roman) lists along with (attempt at) indentation.
- Website: https://docx2txt.sourceforge.net
- Licenses: GPL 3+
- Package source: gnu/packages/textutils.scm
- Builds: See build status
- Issues: See known issues
Installation
Install docx2txt 1.4
as follows:
guix install docx2txt@1.4
Or install the latest version:
guix install docx2txt
You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the guix shell
documentation for more information.