problem w/ robotrules record parse
problem w/ robotrules record parse
am 21.09.2005 18:04:33 von Matthew.van.Eerde
I've run into the same problem J and T ran into here:
http://www.mail-archive.com/libwww@perl.org/msg05452.html
Namely, malformed robots.txt files out there are being parsed somewhat =
draconianly by WWW::RobotRules.pm.
I've cobbled together a "fixed" version of RobotRules.pm -- how can I =
get it reviewed and ultimately blessed by the LWP community?
http://www.geocities.com/mvaneerde/RobotRules.pm.txt
--=20
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
Re: problem w/ robotrules record parse
am 21.09.2005 18:07:49 von Andy
On Wed, Sep 21, 2005 at 09:04:33AM -0700, Matthew.van.Eerde@hbinc.com (Matthew.van.Eerde@hbinc.com) wrote:
> I've cobbled together a "fixed" version of RobotRules.pm -- how can
> I get it reviewed and ultimately blessed by the LWP community?
It's not a community thing. Send a patch directly to Gisle, the owner
of LWP.
xoxo,
Andy
--
Andy Lester => andy@petdance.com => www.petdance.com => AIM:petdance
Re: problem w/ robotrules record parse
am 21.09.2005 18:55:34 von gisle
Andy Lester writes:
> On Wed, Sep 21, 2005 at 09:04:33AM -0700, Matthew.van.Eerde@hbinc.com (Matthew.van.Eerde@hbinc.com) wrote:
> > I've cobbled together a "fixed" version of RobotRules.pm -- how can
> > I get it reviewed and ultimately blessed by the LWP community?
>
> It's not a community thing. Send a patch directly to Gisle, the owner
> of LWP.
I still prefer patches to be posted to this list instead of me
directly. That way others might comment or pick up the patch for
their local use even if I'm not able to process it timely.
--Gisle
RE: problem w/ robotrules record parse
am 21.09.2005 19:01:02 von Matthew.van.Eerde
Gisle Aas wrote:
> Andy Lester writes:
>=20
>> Matthew.van.Eerde@hbinc.com (Matthew.van.Eerde@hbinc.com) wrote:=20
>>> I've cobbled together a "fixed" version of RobotRules.pm
>>=20
>> Send a patch directly to Gisle
>=20
> I still prefer patches to be posted to this list
Here's the patch, for the list.
http://www.geocities.com/mvaneerde/RobotRules.patch.txt
--=20
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
RE: problem w/ robotrules record parse
am 21.09.2005 19:58:51 von Matthew.van.Eerde
Matthew.van.Eerde wrote:
> Gisle Aas wrote:
>> Andy Lester writes:
>>=20
>>> Matthew.van.Eerde@hbinc.com (Matthew.van.Eerde@hbinc.com) wrote:
>>>> I've cobbled together a "fixed" version of RobotRules.pm
>>>=20
>>> Send a patch directly to Gisle
>>=20
>> I still prefer patches to be posted to this list
>=20
> Here's the patch, for the list.
>=20
> http://www.geocities.com/mvaneerde/RobotRules.patch.txt
And here's a smaller patch - only eleven new lines of code - which =
should have the same net effect.
http://www.geocities.com/mvaneerde/RobotRules.patch-3.txt
--=20
Matthew.van.Eerde (at) hbinc.com 805.964.4554 x902
Hispanic Business Inc./HireDiversity.com Software Engineer
Re: problem w/ robotrules record parse
am 21.09.2005 20:11:15 von gisle
writes:
> And here's a smaller patch - only eleven new lines of code - which
> should have the same net effect.
This patch looks good. I'll apply it. Can you provide an update to
t/robot/rules.t as well?
Regards,
Gisle
Re: problem w/ robotrules record parse
am 21.09.2005 21:38:29 von gisle
writes:
> I've added a "warn" line in the case where a record separation is assumed... see
> http://www.geocities.com/mvaneerde/RobotRules.patch-4.txt
>
> rules.t patch:
> http://www.geocities.com/mvaneerde/rules-patch.txt
These patches have now been applied. Thanks!
--Gisle