List of Googlebot IPs

By Deane Barker on September 11, 2008

google.txt: I don’t know how well this is kept up-to-date, but it purports to be a list of all the IPs the Googlebot comes from.

This is of interest after the last post, because Google says you can detect the Googlebot User Agent and bypass login pages to index content behind subscription walls. But it also means that anyone can bypass your login page as well by changing their User Agent.

But, if this is an accurate list of Googlebot IPs, then you could detect both the User Agent and the IP. The trick, of course, is to make sure you have a an up-to-date list of IPs. I imagine it changes a lot, but of the 281 lines in this file, several are Class C subnets, which would encompass ~250 other IPs, so the actual number of potential IPs in this list is into the thousands.

Gadgetopia
What This Links To

Comments

  1. The folks over at IP delivery (John Heard), they are probably the most aggressive in keeping track of all major search engine spider IP addresses. They use the IP addresses always save for their IP delivery software which is pretty potent and powerful stuff.

    They have a subscription-based service where you can receive immediate alerts when new IP addresses from search engines (such as Google) are detected.

Comments are closed. If you have something you really want to say, email editors@gadgetopia.com and we‘ll get it added for you.