htmlcxx

Simple non-validating CSS1 and HTML parser for C++

htmlcxx is a simple non-validating CSS1 and HTML parser for C++. Although there are several other HTML parsers available, htmlcxx has some characteristics that make it unique:

  • STL-like navigation of the DOM tree, using Kasper Peeters' excellent tree.hh library

  • It is possible to reproduce exactly, character by character, the original document from the parse tree

  • Bundled CSS parser

  • Optional parsing of attributes

  • C++ code that looks like C++ (not so true anymore)

  • Offsets of tags/elements in the original document are stored in the nodes of the DOM tree
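Two of the properties above go together: if every node records where it sits in the original text, the document can be rebuilt exactly. The following is a minimal standalone sketch of that idea. It is not htmlcxx's code or API. The `Span` struct and the `split_spans` and `rebuild` functions are hypothetical names introduced only for illustration, and the tokenizer deliberately ignores comments, attributes, and nesting.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical record for one piece of the input. It is loosely analogous to
// a parse-tree node that remembers its position; it is NOT htmlcxx's node type.
struct Span {
    bool is_tag;          // true for "<...>" markup, false for text between tags
    std::size_t offset;   // index of the first character in the original document
    std::size_t length;   // number of characters covered
};

// Split `doc` into consecutive tag and text spans without validating anything.
// Every character of the input ends up in exactly one span.
std::vector<Span> split_spans(const std::string& doc) {
    std::vector<Span> spans;
    std::size_t pos = 0;
    while (pos < doc.size()) {
        std::size_t end = 0;
        bool is_tag = false;
        if (doc[pos] == '<') {
            const std::size_t close = doc.find('>', pos);
            if (close == std::string::npos) {
                end = doc.size();      // unterminated '<': keep the rest as text
            } else {
                end = close + 1;       // include the closing '>'
                is_tag = true;
            }
        } else {
            end = doc.find('<', pos);  // text runs until the next '<'
            if (end == std::string::npos) end = doc.size();
        }
        spans.push_back({is_tag, pos, end - pos});
        pos = end;
    }
    return spans;
}

// Rebuild the document by copying each span out of the original text.
// Because the spans cover the input without gaps, the result equals `doc`.
std::string rebuild(const std::string& doc, const std::vector<Span>& spans) {
    std::string out;
    for (const Span& s : spans) {
        assert(s.offset + s.length <= doc.size());
        out.append(doc, s.offset, s.length);
    }
    return out;
}
```

Under these assumptions, calling `split_spans` on `<p>Hi <b>there</b></p>` yields six spans, and passing them to `rebuild` returns the input unchanged. The non-validating approach shows up in the unterminated-`<` branch: malformed input is kept as text rather than rejected.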

Installation

Install the latest version of htmlcxx as follows:

guix install htmlcxx

Or install a particular version:

guix install htmlcxx@0.87

You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the guix shell documentation for more information.

Badge code

You can use the following badge to inform users of htmlcxx about the latest version available in Guix.

Example HTML:

<a href='https://packages.guix.gnu.org/packages/htmlcxx'><img src='https://packages.guix.gnu.org/packages/htmlcxx/badges/latest-version.svg' alt='Version in GNU Guix'></a>

Example Markdown:

[![Version in GNU Guix](https://packages.guix.gnu.org/packages/htmlcxx/badges/latest-version.svg)](https://packages.guix.gnu.org/packages/htmlcxx)

Example Org:

[[https://packages.guix.gnu.org/packages/htmlcxx][https://packages.guix.gnu.org/packages/htmlcxx/badges/latest-version.svg]]