Modules to produce a local cache of a web page?

Modules to produce a local cache of a web page?

am 24.01.2009 00:28:39 von david.effa

This is a multipart message in MIME format.
--===============0070378011==
Content-Type: multipart/alternative;
boundary="=_alternative 00810DB086257547_="

This is a multipart message in MIME format.
--=_alternative 00810DB086257547_=
Content-Type: text/plain; charset="US-ASCII"

Hi,

Does anyone know of some of modules that will fetch an URL and save it to
a local system (easy) but also download all the page elements as well
(basically render the page and store the contents locally)? I would also
like the fetched URL home page URLs for all the elements downloaded to be
rewritten to point to the location (of my choosing "Base" Relative or
Absolute) where all the render elements ended up. Basically I want to do
a local cache of a remote web page.

Best regards,

David

--=_alternative 00810DB086257547_=
Content-Type: text/html; charset="US-ASCII"



Hi,



Does anyone know of some of modules
that will fetch an URL and save it to a local system (easy) but also download
all the page elements as well (basically render the page and store the
contents locally)?  I would also like the fetched URL home page URLs
for all the elements downloaded to be rewritten to point to the location
(of my choosing "Base" Relative or Absolute) where all the render
elements ended up.  Basically I want to do a local cache of a remote
web page.




Best regards,



David


--=_alternative 00810DB086257547_=--

--===============0070378011==
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
ActivePerl mailing list
ActivePerl@listserv.ActiveState.com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs
--===============0070378011==--

Re: Modules to produce a local cache of a web page?

am 24.01.2009 00:54:41 von lyle

David J Effa wrote:
>
> Hi,
>
> Does anyone know of some of modules that will fetch an URL and save it
> to a local system (easy) but also download all the page elements as
> well (basically render the page and store the contents locally)? I
> would also like the fetched URL home page URLs for all the elements
> downloaded to be rewritten to point to the location (of my choosing
> "Base" Relative or Absolute) where all the render elements ended up.
> Basically I want to do a local cache of a remote web page.

I would have thought LWP::UserAgent would have been able to do this...

But I haven't checked :/


Lyle

_______________________________________________
ActivePerl mailing list
ActivePerl@listserv.ActiveState.com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs

Re: Modules to produce a local cache of a web page?

am 24.01.2009 01:50:17 von Bill Luebkert

David J Effa wrote:
>
> Hi,
>
> Does anyone know of some of modules that will fetch an URL and save it
> to a local system (easy) but also download all the page elements as well
> (basically render the page and store the contents locally)? I would
> also like the fetched URL home page URLs for all the elements downloaded
> to be rewritten to point to the location (of my choosing "Base" Relative
> or Absolute) where all the render elements ended up. Basically I want
> to do a local cache of a remote web page.

You can use WWW::Mech or LWP::UserAgent with HTML::LinkExtor or lots
of other ways including letting your browser do it for you using the
save option (if it supports it). Randall Schwartz has a script somewhere
that does something similar, but I can't find it at the moment - I know
I modified it to do something similar - it's on one of his websites as
a coding exampple possibly for an article he wrote.

I'll send a couple of examples of extracting links that you can play with.
_______________________________________________
ActivePerl mailing list
ActivePerl@listserv.ActiveState.com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs