March 22, 2007

TagSoup is a Java-based SAX2 parser for HTML documents, including malformed ones.

Version 1.0.5

License GPL

Platform Linux

Supported Languages English

Homepage mercury.ccil.org

Developed by John Cowan

TagSoup is a SAX2 parser written in Java that is designed to parse HTML as it is actually found on the web, including documents that are not well-formed or valid. Because it reports its results through the standard SAX interface, even badly broken markup can be processed with ordinary SAX-based tools.
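As a rough illustration of what the SAX interface looks like in practice, the sketch below collects link targets from a document. The class name `LinkCollector` and its methods are hypothetical and not part of TagSoup; the code relies only on the standard `org.xml.sax` API, so it accepts any `XMLReader`.

```java
import java.util.ArrayList;
import java.util.List;

import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper (not part of TagSoup): gathers href values from <a> elements.
class LinkCollector extends DefaultHandler {
    private final List<String> links = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Prefer the namespace-aware local name; fall back to the raw name.
        String name = (localName == null || localName.isEmpty()) ? qName : localName;
        if ("a".equalsIgnoreCase(name)) {
            // Try the namespace-aware lookup first, then the plain qualified name.
            String href = atts.getValue("", "href");
            if (href == null) {
                href = atts.getValue("href");
            }
            if (href != null) {
                links.add(href);
            }
        }
    }

    List<String> links() {
        return links;
    }

    // Parses the source with the supplied reader and returns the collected links.
    static List<String> collect(XMLReader reader, InputSource source) throws Exception {
        LinkCollector handler = new LinkCollector();
        reader.setContentHandler(handler);
        reader.parse(source);
        return handler.links();
    }
}
```

With the TagSoup jar on the classpath, passing `new org.ccil.cowan.tagsoup.Parser()` as the reader should let the same handler work on HTML that is not well-formed; with a regular XML parser it only accepts well-formed input.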

TagSoup is not intended to clean up bad HTML permanently, as some other applications do. Instead, it parses HTML on the fly, which makes it useful for developers who need to process a lot of HTML content programmatically.

TagSoup offers several options, including writing output to individual files, emitting clean HTML, suppressing the XML declaration, and suppressing bogons (elements TagSoup does not recognize) and default attribute values. Developers can also specify the input encoding and the output format.
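When TagSoup is used as a library, options such as bogon and default-attribute suppression are typically controlled through SAX features. The sketch below shows one defensive way to set such a feature. The class `ReaderOptions` is hypothetical, and the two feature URIs are given as I understand TagSoup to define them; treat them as assumptions and check them against the documentation for your version.

```java
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;
import org.xml.sax.XMLReader;

// Hypothetical helper (not part of TagSoup) for setting SAX features safely.
class ReaderOptions {
    // Assumed TagSoup feature URIs; verify against your TagSoup version.
    static final String IGNORE_BOGONS =
        "http://www.ccil.org/~cowan/tagsoup/features/ignore-bogons";
    static final String DEFAULT_ATTRIBUTES =
        "http://www.ccil.org/~cowan/tagsoup/features/default-attributes";

    // Returns true if the reader accepted the feature, false if it rejected it.
    static boolean trySetFeature(XMLReader reader, String uri, boolean value) {
        try {
            reader.setFeature(uri, value);
            return true;
        } catch (SAXNotRecognizedException | SAXNotSupportedException e) {
            // The reader does not know this feature or cannot change it.
            return false;
        }
    }
}
```

Under these assumptions, a call such as `ReaderOptions.trySetFeature(reader, ReaderOptions.IGNORE_BOGONS, true)` would ask a TagSoup reader to drop unknown elements, while a parser that does not know the feature simply reports `false`.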

The main fix in the latest release concerns HTML comments: a bug previously caused any > character to terminate a comment prematurely. Other changes include support for the new version of Saxon as an XSLT processor, improved documentation of the SAX features and properties specific to TagSoup, and the ability to reuse a single parser instance across multiple parses.
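Reusing one parser instance can be pictured as follows. This is a minimal sketch against the standard `XMLReader` interface; the class `ReusedReader` and its method are hypothetical names chosen for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper (not part of TagSoup): parses many documents with one reader.
class ReusedReader {
    // Returns the number of start tags seen in each document, in input order.
    static List<Integer> countElements(XMLReader reader, List<String> documents) throws Exception {
        List<Integer> counts = new ArrayList<>();
        for (String doc : documents) {
            final int[] count = {0};
            // Install a fresh handler per document; the reader itself is reused.
            reader.setContentHandler(new DefaultHandler() {
                @Override
                public void startElement(String uri, String localName, String qName, Attributes atts) {
                    count[0]++;
                }
            });
            reader.parse(new InputSource(new StringReader(doc)));
            counts.add(count[0]);
        }
        return counts;
    }
}
```

The reader is created once by the caller and passed in, so the cost of constructing it is paid a single time regardless of how many documents are processed.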

Overall, TagSoup is a reliable and robust HTML parser suited to a variety of use cases. If you work with messy or unstructured HTML, or need to feed HTML content into SAX-based tools, it is worth considering.

What's New

Version 1.0.5: N/A

Free Download 51K
