[Date Prev][Date Next][Thread Prev][Thread Next][Interchange by date
][Interchange by thread
]
[ic] RobotUA
Grant said:
> I've had my RobotUA all set up for a few days, but examining my rotated
> access_log files, the robots aren't getting any further than this:
>
> 66.196.65.16 - - [25/Nov/2002:18:30:41 -0800] "GET /robots.txt HTTP/1.0"
> 200 0 "-" "Mozilla/3.0 (Slurp/si; slurp@inktomi.com;
> http://www.inktomi.com/slurp.html)"
> 66.196.65.16 - - [25/Nov/2002:18:30:42 -0800] "GET / HTTP/1.0" 301 330
> "-" "Mozilla/3.0 (Slurp/si; slurp@inktomi.com;
> http://www.inktomi.com/slurp.html)"
>
> Here's my RobotUA entry:
>
> RobotUA WebCrawler, BaiDuSpider, ZyBorg, almaden.ibm, Googlebot, Slurp,
> Girafabo
> t, ia_archiver, LinkWalker, MSIECrawler
>
> Has anyone verified that this directive really works to clean up the
> URLs for spidering?
>
> - Grant
Get a web browser that allows you to change the User Agent like Konqueror
or even w3m, and then turn off cookies. Go to your web site and look at
it. This will tell you if your configuration is working as you expedted
to.
---
Philip S. Hempel
- Follow-Ups:
- [ic] RobotUA
- From: interchange-users@icdevgroup.org (Jonathan Clark)