My Robots.txt
• Working On My Website
Mostly I'm protecting myself from bandwidth drain. Not as big an issue as it used to be. But in some cases the bots worked the scripts so heavily that the CPU drain on the server was pretty bad.
First the usual stuff:
User-agent: *
Disallow: /cgi-bin
Disallow: /mt
Disallow: /common
PicSearch's psbot was consuming much bandwidth grabbing images. Mostly from affliate pages. Near as I can tell Become.com is a fancy scraper site.
User-agent: psbot
User-agent: BecomeBot
Disallow: /
Some times Google just goes berserk grabbing images. Again, mostly affilate program stuff. Never got any good out of it.
User-agent: Googlebot-Image
Disallow: /*.gif$
User-agent: Googlebot-Image
Disallow: /*.jpg$
User-agent: Googlebot-Image
Disallow: /
Jeeves will just gobble and gobble but send you very few hits. And sometimes slurp is just too hungry.
User-agent: Teoma
Crawl-delay: 60
User-agent: Slurp
Crawl-delay: 30
Comments
Thank you for posting that. Google has been grabbing stuff from some of my sites like a maniac but not sending traffic so i t does me no good at all … !
Posted by: Worried Webmaster | June 8, 2006 07:25 PM