HTML::Parser - string "<>" at end of a document
am 07.07.2006 17:37:30 von paul.bijnensIt seems that the string "<>" at the end of some html document
is interpreted as "comment", and not as text.
Is this a bug, or this this some obscure syntax in HTML that I do
not know about.
(In my program I frequently need to parse some short string
as html. Like "Press the buttons <>". Bingo.)
Here is a small program that demonstrates the behaviour.
Even adding a space or newline at the end of the string makes
the resulting token "T" again.
$ perl -MHTML::TokeParser -le '
$s = "<>xx<>";
$p = HTML::TokeParser->new(\$s);
print join "\t", @$t while ($t=$p->get_token()),
'
T <>xx
C <>
--
Paul Bijnens, xplanation Technology Services Tel +32 16 397.511
Technologielaan 21 bus 2, B-3001 Leuven, BELGIUM Fax +32 16 397.512
http://www.xplanation.com/ email: Paul.Bijnens@xplanation.com
************************************************************ ***********
* I think I've got the hang of it now: exit, ^D, ^C, ^\, ^Z, ^Q, ^^, *
* F6, quit, ZZ, :q, :q!, M-Z, ^X^C, logoff, logout, close, bye, /bye, *
* stop, end, F3, ~., ^]c, +++ ATH, disconnect, halt, abort, hangup, *
* PF4, F20, ^X^X, :D::D, KJOB, F14-f-e, F8-e, kill -1 $$, shutdown, *
* init 0, kill -9 1, Alt-F4, Ctrl-Alt-Del, AltGr-NumLock, Stop-A, ... *
* ... "Are you sure?" ... YES ... Phew ... I'm out *
************************************************************ ***********