Tag Soup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML. Tag Soup is a library for parsing HTML/XML.
RELAX NG Tutorial (errata), OASIS Committee Specification, 3 December 2001. A more recent, but not officially approved, Working Draft of 26 March 2003 is also available.
RELAX NG Compact Syntax Tutorial, OASIS Working Draft, 26 March 2003. A tutorial for RELAX NG using the compact syntax, not assuming prior knowledge of XML syntax.
Also supports other schema languages, including W3C XML Schema.
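To illustrate the kind of cleanup the Tag Soup entry above describes (messy HTML in, well-formed XML out), here is a minimal sketch in Python. It is not Tag Soup's own algorithm or API: it uses only Python's standard `html.parser` module, and the names `SoupToXml`, `soup_to_xml` and `VOID` are made up for this example. Unlike a real tag-soup parser it knows nothing about HTML content models (for instance, it nests consecutive `<li>` elements instead of closing them), and it does not guard against duplicate attributes.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr

# A few HTML elements that never take a closing tag (illustrative, not exhaustive).
VOID = {"br", "hr", "img", "input", "meta", "link"}


class SoupToXml(HTMLParser):
    """Toy converter: re-emits HTML as XML by balancing open and close tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []    # output fragments, joined at the end
        self.stack = []  # names of the elements that are currently open

    def handle_starttag(self, tag, attrs):
        # Valueless attributes (e.g. "checked") get their own name as the value.
        attr_text = "".join(
            f" {name}={quoteattr(value if value is not None else name)}"
            for name, value in attrs
        )
        if tag in VOID:
            self.out.append(f"<{tag}{attr_text}/>")
        else:
            self.out.append(f"<{tag}{attr_text}>")
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray close tag with no matching open tag: ignore it
        # Close every element opened since the matching start tag.
        while True:
            open_tag = self.stack.pop()
            self.out.append(f"</{open_tag}>")
            if open_tag == tag:
                break

    def handle_data(self, data):
        # Escape &, < and > so the text is legal XML character data.
        self.out.append(escape(data))


def soup_to_xml(html):
    parser = SoupToXml()
    parser.feed(html)
    parser.close()
    # Close whatever is still open at the end of the input.
    while parser.stack:
        parser.out.append(f"</{parser.stack.pop()}>")
    return "".join(parser.out)
```

Calling `soup_to_xml("<p>Hello <b>world<br><i>again</p>")` yields a string in which every opened element is closed again in the right order, so an XML parser will accept it.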
XML Schema 1.1 Part 2: Datatypes and XML Schema 1.1 Part 1: Structures are available.
December 13, 2001 Eric van der Vlist and Lisa Rein Updated December 2001.
This guide includes details of tools for working with the W3C XML Schema Definition Language, as well as some tools for processing alternative schema languages such as RELAX NG and Schematron.
This page tries to give a comparison of existing free C or C++ XML parser libraries. Arabica: SAX2, DOM, XPath and a partial XSLT implementation, to be used with expat, libxml, Xerces or MSXML. The objective of Arabica is to provide a nice standard C++ implementation of the DOM and SAX2 interfaces on top of these parsers. Can be cumbersome to get up and running on Windows, since it depends heavily on other GNU infrastructure (glib etc.). You have to have libxml2 installed in order to use libxml++. On Windows it is difficult to install and set up because of multiple dependencies on other GNU libraries. It supports a large set of common character encodings but has only limited DTD support.
It includes both full-blown and lightweight parsers. I'll start with a comparison table giving a quick overview of all available libraries. Hence, Arabica has to be set up and built for one of the underlying parsers before use. Included documentation is sparse, but a lot of external sites provide docs, examples and tutorials. For example, it uses Glib::ustring instead of std::string. For Win32, libxml++ 1.0 is better suited than libxml++ 2.x, due to a nasty glibmm dependency. Various third-party SAX/DOM wrappers and language bindings. As usual for a Microsoft product, there is a lot of high-quality documentation and many articles.
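The comparison above keeps returning to the SAX and DOM interfaces. As a rough illustration of how the two styles differ, the sketch below extracts the same data both ways. It uses Python's standard `xml.sax` and `xml.dom.minidom` modules as stand-ins, not any of the C or C++ libraries listed here, and the sample document and the names `BookTitles`, `titles_via_sax` and `titles_via_dom` are invented for the example.

```python
import xml.dom.minidom
import xml.sax

# Made-up sample document for the illustration.
DOC = "<library><book id='1'>SAX</book><book id='2'>DOM</book></library>"


class BookTitles(xml.sax.ContentHandler):
    """SAX style: the parser pushes events and we keep only what we need."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_book = False
        self._text = []

    def startElement(self, name, attrs):
        if name == "book":
            self._in_book = True
            self._text = []

    def characters(self, content):
        # Text may arrive in several chunks, so collect the pieces.
        if self._in_book:
            self._text.append(content)

    def endElement(self, name):
        if name == "book":
            self.titles.append("".join(self._text))
            self._in_book = False


def titles_via_sax(document):
    handler = BookTitles()
    xml.sax.parseString(document.encode("utf-8"), handler)
    return handler.titles


def titles_via_dom(document):
    # DOM style: build the whole tree in memory first, then query it.
    tree = xml.dom.minidom.parseString(document)
    return [node.firstChild.data for node in tree.getElementsByTagName("book")]
```

Both functions return the same list of titles for `DOC`. The SAX version never holds the whole document in memory, which is the usual argument for lightweight parsers; the DOM version is shorter and allows random access to the tree.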