• Home
  • Most Popular
  • Submit
  • About Us
  • Contact Us

Softpile

Free Downloads

Categories
  • Home
  • Most Popular
  • Communications
  • Desktop
  • Games & Entertainment
  • Graphic Apps
  • Network & Internet
  • Security & Privacy
  • System Utilities
Alternative to itextsharp 2022.11.10347
IronPDF offers an itextsharp alternative for HTML to PDF conversion with C# code examples, documentation, and ...
VShell Server for Linux and Mac 4.8
VShell is a versatile and secure file transfer server that supports multiple protocols and is compatible ...
PDF Studio PDF Editor for Linux 2022
PDF Studio is a cost-effective PDF editor that delivers full compatibility with the PDF Standard. It's ...
VQ Probe for Linux 1.5
VQ Probe is a comprehensive software tool that enables objective and subjective video quality analysis. The ...
Resilient Server 2.3
This Debian GNU/Linux (Buster) based software has a customized partitioning scheme that enhances robustness against filesystem ...
Valentina Studio for Linux 9.6
Valentina Studio is a cross-platform GUI manager for Mac, Windows, and Linux. It allows users to ...
VPN Lifeguard for Linux 1.0.58
The software monitors VPN connection and automatically terminates apps during connection loss, re-establishes the connection and ...
G_Viewer 0.84
G_Viewer is a Linux software that serves as both a file system and photo/image viewer. It ...
Checksome File Hash Tool for Linux 1.1
This software allows for the generation and verification of file hashes. It is a quick and ...
KeyWrangler Password Manager for Linux 1.2
A password management software that is secure, offline and extensible. It offers military-grade encryption to protect ...
Home Linux htmlcxx Download

htmlcxx

October 13, 2008
htmlcxx is a C++ software that efficiently parses HTML and CSS1 without validation. It provides a simple and straightforward solution for C++ developers to extract meaningful data from web pages.
Version 0.83
License LGPL
Platform Linux
Supported Languages English
Homepage htmlcxx.sourceforge.net
Developed by Davi de Castro Reis and Robson Braga Arajo
htmlcxx project is a C++ parser for HTML and CSS1 that does not validate. While there are several other parsers available, htmlcxx is unique in some aspects. It allows for STL-like navigation of the DOM tree, utilizing the tree.hh library from Kasper Peeters. It can reproduce the original document character by character from the parse tree. It also comes with a bundled CSS parser and offers optional parsing of attributes. Its C++ code resembles C++, and offsets of tags/elements are stored in the nodes of the DOM tree.

The parsing approach of htmlcxx is created to mimic Mozilla Firefox's behavior. As a result, users should expect similar parse trees created by Firefox. However, unlike Firefox, htmlcxx does not insert non-existent items in HTML, promising the exact bytes originally contained in the HTML document when serializing the DOM tree.

Using htmlcxx is relatively easy; the following example demonstrates how it works:

#include < htmlcxx/html/ParserDom.h > ... // Parse some HTML code string html = "< html >< body >hey< /body >< /html >"; HTML::ParserDom parser; tree< HTML::Node > dom = parser.parseTree(html); // Print the entire DOM tree cout ::iterator end = dom.end(); for (; it != end; ++it) { if (it->tagName() == "A") { it->parseAttributes(); cout attributes("href"); } } // Dump all text of the document it = dom.begin(); end = dom.end(); for (; it != end; ++it) { if ((!it->isTag()) && (!it->isComment())) { cout text(); } }

Overall, htmlcxx project is a useful and unique option for C++ developers in need of an HTML and CSS parser.
What's New

Version 0.83: N/A

Free Download 410K
337
  • Share on:

Most Popular

  1. Quicksilver Forums 1.4.2
    152
  2. Dvgrab 3.4
    95
  3. DynVPN 1.0
    86
  4. CherryTV 0.1
    82
  5. SlideMap 1.2.2
    73
  6. porm r2
    68
  7. fuseftp 0.8
    67
  8. K!rash 1.0
    67
  9. Swiftfox 3.0b5pre-2
    66
  10. Free Orion 0.3.13
    66

Related Downloads

AutoProfile
AutoProfile is a Gaim extension for personalized away messages and profiles. Users ...
NOCC
NOCC is a PHP-based webmail client that lets users access their POP3 ...
Cluttermm Tutorial
Cluttermm tutorial package provides an easy-to-follow guide for efficiently using Cluttermm, a ...
Open Pinball Simulator
Open Pinball Simulator is a project that aims to simulate the experience ...
bnrt
Bnrt offers a client-server software solution for efficient rendering of Blender animations, ...
Nightwatch
Nightwatch is a versatile build scheduler that supports multiple build styles such ...
bigFORTH/MINOS
bigFORTH/MINOS is a Forth that includes a user interface library and rapid ...
raidmon
Raidmon monitors Linux MD software RAID devices by running as a daemon ...
The DaisyPlayer Project
This software enables users to play Digital Talking Books (DTB) on their ...
qcapan
qcapan enables the download, creation, testing, and installation of Perl modules from ...
Copyright © 1999-2025 Softpile Free Downloads
  • Most Popular
  • Submit
  • About Us
  • Contact Us
  • Privacy Policy
  • Disclaimer
  • Terms of Use

Can we use your data to tailor ads for you?

Our partners will collect data and use cookies for ad personalization and measurement.

By choosing "I agree", closing this pop-up or clicking on any element on the page, you agree to the use of cookies to help us provide you with a better user experience.

Learn how Softpile and our partners collect and use data.

You can change your choice at any time in our privacy center.

Cookie Settings

Our website stores four types of cookies. At any time you can choose which cookies you accept and which you refuse. You can read more about what cookies are and what types of cookies we store in our Cookie Policy.

are necessary for technical reasons. Without them, this website may not function properly.

are necessary for specific functionality on the website. Without them, some features may be disabled.

allow us to analyse website use and to improve the visitor's experience.

allow us to personalise your experience and to send you relevant content and offers, on this website and other websites.