Any problem with Document Archive in Bitsavers.org ?

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]

From: Patrick Finnegan <pat_at_computer-refuge.org>
Date: Tue Jun 29 12:58:39 2004

On Tuesday 29 June 2004 12:16, Paul Williams wrote:
> Al Kossow wrote:
> > As soon as bitsavers came on line again, google crawlers started
> > downloading EVERYTHING from multiple IP adrs.
>
> Put this in your robots.txt:
>
> User-agent: Googlebot
> Disallow: /*.pdf$

Grr. Don't do this. I really hate it when people disallow google to
index content. It always makes it harder to find stuff. The only time
I'd consider doing it is if the "webserver" is on a dialup connection
or something that won't stay at the same IP address.

Pat

-- 
Purdue University ITAP/RCS        ---  http://www.itap.purdue.edu/rcs/
The Computer Refuge               ---  http://computer-refuge.org

Received on Tue Jun 29 2004 - 12:58:39 BST

This archive was generated by hypermail 2.3.0 : Fri Oct 10 2014 - 23:37:01 BST