HTML::TreeBuilder eating my entities using perl 5.8.x

HTML::TreeBuilder eating my entities using perl 5.8.x

am 18.05.2005 14:05:05 von michalowski

hi there!

Sorry for cross posting to comp.lang.perl.modules, but this seems to be
the right place.

I'm using HTML-TreeBuilder to parse HTML code that actually contains
entities in it's text nodes. Working with perl 5.6.1 and the newest
HTML::TreeBuilder/HTML::Tree/HTML::Parser versions was satisfying, but
using perl 5.8.4 there is a problem with a different entity treatment.
Perl 5.8.4 translates the entities in the text nodes (such as ⚔)
into unicode characters (2 bytes) and actually doesn't ask me before
;-)
Is there a possibility to make HTML::TreeBuilder with perl 5.6.x and
perl 5.8.x react the same way (storing entities as they are)?

best regards,
Michael