Best HTML Parser
am 08.01.2003 08:13:14 von Blesson Paul
Hi all
Which is the Best HTML Parser in C/C++
regards
Blesson Paul
Blesson Paul,
Software Engineer,
Integrated Computing Environments Ltd,
Cochin Special Economic Zone,
Cochin - 682 030, India.
Tel: +91-484-2413 083(O), +91-484-2413 096(O)
+91-481-2380 893(R)
Re: [apache-modules] Best HTML Parser
am 08.01.2003 18:53:57 von Charles Reitzel
I am partial to HTML Tidy for a few reasons:
1) cross-platform, reentrant C library
2) very forgiving of sloppy, malformed markup
3) produces clean markup - XHTML if requested
4) C++, Perl, Pascal, COM and .NET bindings available,
others easily done with SWIG
But I must admit, as one of the primary developers, I am probably
biased. But if you need to get your markup cleaned up so that you can
apply XML tools to it, it is probably the best game in town.
For more info: http://tidy.sourceforge.net/
take it easy,
Charlie
At 12:43 PM 1/8/2003 +0530, Blesson Paul wrote:
>Hi all
> Which is the Best HTML Parser in C/C++
>
>regards
>Blesson Paul