Way to Parse HTML fields to Excel Spreadsheet

Way to Parse HTML fields to Excel Spreadsheet

am 14.01.2008 22:59:15 von ghadley_00

Hi,

I've got a text file that contains the html of approximately 514 web
pages, concatenated into 1 file. Each html page has approximately the
same format, with data in tables. Can anyone recommend a way I can
parse the document to get the fields as variables and write them out
as an excel spreadsheet? [e.g. using win32 ole]

Thanks,

George

ghadley_00@yahoo.com

Re: Way to Parse HTML fields to Excel Spreadsheet

am 14.01.2008 23:17:24 von glex_no-spam

ghadley_00@yahoo.com wrote:
> Hi,
>
> I've got a text file that contains the html of approximately 514 web
> pages, concatenated into 1 file. Each html page has approximately the
> same format, with data in tables. Can anyone recommend a way I can
> parse the document to get the fields as variables and write them out
> as an excel spreadsheet? [e.g. using win32 ole]

Parse the HTML - There are many modules and possibly HTML::TableExtract
might be useful, or possibly a regular expression might be enough.
Write out the data.

To really answer your question you should show us an example of the data
(NOT the entire file), and what you've tried. See the Posting Guidelines.