Re: charsets generally and with lwp

on 21.10.2009 21:26:41 by Bill Luebkert

Hildegard Schedthelm wrote:
> Hello folks
>
> I'm having some trouble with a Perl script that you can see below.
> The problem is that some German special characters (umlauts) are not
> displayed as they should be. This seems to be an encoding issue: either
> the internal Perl variables have the wrong encoding, or the LWP module
> does when grabbing the HTML.
> Additionally, the output that writes the data into the MS Access DB
> could also have the wrong encoding. How can we remove this uncertainty?
> What can I do to ensure the right encoding at all levels?

Your retrieval part looks OK. I would just dump the data in hex prior
to insertion into the DB and verify that the bytes are what you expect.
If everything is correct going in, that leaves only the DB as the
culprit. If the DB is converting the data and you also use Perl to
query it, you could do your own conversion on the way in and out. If
not, and some conversion is still happening on the way in, I would
check ODBC for options that may alleviate the problem. (I haven't
played with ODBC much.)
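
The hex check and the convert-on-entry-and-exit idea above could be
sketched roughly as follows. This is only an illustration, not the
original poster's script: hex_dump, to_db and from_db are hypothetical
helper names, and the assumption that the Access/ODBC side expects
Windows-1252 (cp1252) bytes is a guess that should be verified against
the actual setup.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# Return the bytes of a string as space-separated hex pairs.
# Expects octets (an already encoded string), not decoded characters.
sub hex_dump {
    my ($bytes) = @_;
    return join ' ', map { sprintf '%02x', ord } split //, $bytes;
}

# ASSUMPTION: the database side wants cp1252 bytes. Adjust as needed.
my $DB_ENCODING = 'cp1252';

# Hypothetical helper: convert Perl characters to bytes before an INSERT.
sub to_db {
    my ($chars) = @_;
    return encode($DB_ENCODING, $chars);
}

# Hypothetical helper: convert bytes read from the DB back to characters.
sub from_db {
    my ($octets) = @_;
    return decode($DB_ENCODING, $octets);
}

# Sample text: "Muenchen" spelled with u-umlaut (U+00FC), as characters.
my $text = "M\x{fc}nchen";

my $latin1 = hex_dump(encode('ISO-8859-1', $text));
my $utf8   = hex_dump(encode('UTF-8',      $text));

print "ISO-8859-1: $latin1\n";   # umlaut appears as the single byte fc
print "UTF-8:      $utf8\n";     # umlaut appears as the two bytes c3 bc
print "to DB:      ", hex_dump(to_db($text)), "\n";
```

If the dump taken just before the INSERT shows c3 bc where the database
expects fc (or the other way round), the mismatch is on the Perl side;
if the bytes are right going in but wrong coming out, the DB or the ODBC
layer is the more likely place to look.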

_______________________________________________
ActivePerl mailing list
ActivePerl@listserv.ActiveState.com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs