Best HTML Parser

Best HTML Parser

am 08.01.2003 08:13:14 von Blesson Paul

Hi all
Which is the Best HTML Parser in C/C++

regards
Blesson Paul

Blesson Paul,
Software Engineer,
Integrated Computing Environments Ltd,
Cochin Special Economic Zone,
Cochin - 682 030, India.
Tel: +91-484-2413 083(O), +91-484-2413 096(O)
+91-481-2380 893(R)

Re: [apache-modules] Best HTML Parser

am 08.01.2003 18:53:57 von Charles Reitzel

I am partial to HTML Tidy for a few reasons:

1) cross-platform, reentrant C library
2) very forgiving of sloppy, malformed markup
3) produces clean markup - XHTML if requested
4) C++, Perl, Pascal, COM and .NET bindings available,
others easily done with SWIG

But I must admit, as one of the primary developers, I am probably
biased. But if you need to get your markup cleaned up so that you can
apply XML tools to it, it is probably the best game in town.

For more info: http://tidy.sourceforge.net/

take it easy,
Charlie


At 12:43 PM 1/8/2003 +0530, Blesson Paul wrote:
>Hi all
> Which is the Best HTML Parser in C/C++
>
>regards
>Blesson Paul