Avoiding Alarm Clocks While Spidering

Avoiding Alarm Clocks While Spidering

am 18.01.2005 02:50:26 von justin.tang

Hi All:
I am currently running a spider program derieved from an open source
Search Engine program call SWISH-E, the spider.pl file that I am using uses
the LWP::RobotUA class. Now, the way I have it set up is that I have a
program that would prep the spider program with a list of sites to spider,
the the program would call the spider using the backtick(``). From there
on, the spider becomes a zombie program, outputing results to a local
textfile for me to review later. The problem I'm running into is that it
seems like the LWP class has a timeout function implemented that would sleep
the process after a period of time with a message saying "Alarm clock."
What is happening is that, since my process is a zombie, when it is put to
sleep the system kills the process. Is there anyway around this situation?
Is there a command or flag in LWP::RobotUA that I can set so it would not be
put to sleep. Thanks in advance!

-Justin

Re: Avoiding Alarm Clocks While Spidering

am 18.01.2005 10:14:30 von gisle

"Justin Tang" writes:

> I am currently running a spider program derieved from an open source
> Search Engine program call SWISH-E, the spider.pl file that I am using uses
> the LWP::RobotUA class. Now, the way I have it set up is that I have a
> program that would prep the spider program with a list of sites to spider,
> the the program would call the spider using the backtick(``). From there
> on, the spider becomes a zombie program, outputing results to a local
> textfile for me to review later. The problem I'm running into is that it
> seems like the LWP class has a timeout function implemented that would sleep
> the process after a period of time with a message saying "Alarm clock."
> What is happening is that, since my process is a zombie, when it is put to
> sleep the system kills the process. Is there anyway around this situation?
> Is there a command or flag in LWP::RobotUA that I can set so it would not be
> put to sleep.

There is the 'use_sleep' attribute that you might set to a FALSE value.

Regards,
Gisle