Ian McAnerin and Mike Churchill
2005
Bogons Ate My Web Site
An unforeseen risk of changing ISPs
The Scenario
We were recently contacted by one of our customers wanting to know why their site was dropping in their search engine rankings. The site’s presence in Yahoo! was slowing eroding – every day a few more indexed pages would disappear. The site had not undergone any content changes, but it had recently changed ISPs. What could be causing the problem? It was only through the teamwork of networking staff at both Yahoo! and RackSpace that the issue was identified, isolated, and corrected. Curiously, neither was to blame, but both were instrumental in solving the problem.
There were a number of unusual aspects to the case, including the fact that only rankings on Yahoo seemed to be affected. More interestingly, Yahoo’s spiders were reaching the site according to the logs, but when we checked with Yahoo they were apparently reporting an error.
So we began a detailed analysis of what was going on. The analysis was made more difficult by the fact that running a traceroute from Yahoo to the website showed that the ping made it into RackSpace’s network before disappearing. RackSpace, doing the same trace in reverse, noted that the ping disappeared once Yahoo’s network was reached, thus prompting a scenario where each initially blamed the other based on the evidence presented.
We had no reason to believe that either party was misleading us, so we began a careful troubleshooting checklist. The site itself was clean and fast, used no risky tactics and was very spiderable. The robots.txt and other robot behavior control codes freely allowed spiders into the site, and there were no errors in the DNS or other related server technologies. Additionally, the site had been spidered by various search engines in the past, and was ranking well in several categories. We knew it wasn’t the site, and began to check the network.
First, we checked the IP address of the website. There was no indication of it being blacklisted for spam purposes.
Next, we checked for the website being blocked by Yahoo or the Yahoo spiders being blocked by RackSpace. Both parties assured us that this was not the case. Yahoo was able to confirm that the spiders were unable to access the site, however.
The only odd factor in the new hosting location was that the IP block was only recently allocated – previously it was part of a reserved block not in use anywhere. It was this oddity that allowed us to identify the culprit: a monster lurking in the internet known as a bogon.
After a few phone calls and some additional tests, we verified that a bogon had indeed eaten our website and Yahoo’s spiders, and could very well be lurking out there on the internet waiting for others, as well. Once we confirmed this to Yahoo they were able to deal with the matter and the client is now enjoying his previous rankings and traffic.
The Bogon
So, you are probably asking, what is a bogon?
Let’s back up a step and outline an issue that networks deal with all the time – spammers and hackers. Naturally these people do not wish to be traced or identified, so they often send fake return IP addresses to make it difficult to track them. It’s like sending a nasty letter to someone and using a fake return address.
There are billions of potential IP addresses in the world, including a lot of IP addresses that are not being used. They are either being held in reserve for future needs, or are allocated for testing or other purposes. It’s a fair assumption that anything coming to you with an address you know is fake is probably up to no good.
Network administrators have discovered that if they block all of these known unused addresses from moving through their networks, they can block upwards of 60% of spam and hackers. This makes blocking known unused addresses a very easy way to make your network run better and make your users happy.
These unused IP addresses are collectively known as a bogon, a contraction of “bogus logon”, or a logon from a place you know no one can actually logon from.
Smart network administrators block bogons at the router or firewall level, never letting them get into the system in the first place. But what happens when a previously unallocated IP address is released into public use?
In a perfect world, all network administrators everywhere would either manually or automatically update their bogon filters to allow the new IP’s to flow through their network. But it’s not a perfect world, and RackSpace reports that a large number of administrators either do not update their bogon filters automatically or are running systems that need to be manually updated and have not done so yet.
This means that if you are given an IP address that was a bogon until recently, you may find your website blocked from various ISP’s and networks.
The issue with our client was not that Yahoo was blocking the IP, but Yahoo’s ISP was. Since the main router for RackSpace was a known safe IP address, what would happen was that the trace from Yahoo would leave Yahoo, go through their ISP, enter RackSpace’s network and then get directed to the new IP, all the while reporting back at each step that it had successfully arrived.
Of course, the report back from the new IP was being blocked by Yahoo’s ISP, which made it look like the trace ping entered RackSpace but never arrived at the new IP, since the return packet was being blocked. This explains why Yahoo’s spiders were showing up in the logs, but were not reporting their visit back to Yahoo – they were visiting and reporting, but the reports were being intercepted by the bogon filter and discarded.
The lucky thing is that in this case there was only one intervening network between Yahoo and RackSpace, allowing us to narrow in quickly. Imagine the potential confusion if there had been several ISP’s. Worse, what if the spiders took a different path to the website sometimes? You would get an intermittent block to the spiders and visitors, which would be very hard to track down unless you knew exactly what you were looking for.
Yahoo’s ISP has since fixed this problem, but there are a great many ISP’s and networks out there that have not. There may be websites that are not accessible to some search engines and visitors, with the website owner not having any clue as to the reason why.
The Fix
So, how do you check to see if you are affected by a bogon filter? First, check your IP address. If it’s a recently allocated (within the last year) IP then there is a possibility that it may be affected by bogon filters put in place by various networks. Another symptom is that a traceroute will show a response at each step of the way until the new IP is reached, and it is consistently blocked by the same network node.
At this time, we recommend that you avoid newly released IP addresses unless you are certain that they are not being affected by bogon filters.
If you are a network administrator who uses bogon filters, we strongly recommend you use one of the many freely available automatic bogon updating scripts and services.
Bogon filters can be a powerful ally in fighting hackers and spammers, but it’s important to realize that a website or mail server can be inadvertently blocked by out-of-date bogon lists as a false positive. Without the help and support of both Yahoo and RackSpace, this would have been a very difficult issue to troubleshoot.
Useful Links:
http://www.cymru.com/Bogons/
http://www.completewhois.com/bogons/
http://www.antionline.com/jargon/bogon.php
© Ian McAnerin and Mike Churchill 2005
Ian McAnerin, founder of McAnerin Networks Inc, is a moderator for the High Rankings and the Search Engine Watch forums, and has been vetted by the standards watchdog SEOConsultants.com and SEOPros directories. He is the president of the Search Marketing Association of North America, with published articles in many international newsletters and web site news reports. He has a special interest in SEO legal issues due to his legal background.
Mike Churchill is the Tech Guru at KeyRelevance.com. He has been an Internet Junkie since 1992, cutting his Internet teeth on FTP, Archie, and Mosaic 1.0. He is the former Chief Technical Officer and co-founder of NetMechanic.com and is an avid developer of Internet applications.
First Published by Mike Grehan. Search engine marketing consultant, speaker and author. http://www.search-engine-book.co.uk
Associate Editor: Christine Churchill. KeyRelevance.com
e-marketing-news is published selectively on a when it’s ready basis. ©2005 Net Writer Publishing.
< http://www.e-marketing-news.co.uk >
Unless otherwise noted, all articles written by Ian McAnerin, BASc, LLB. Copyright © 2002-2004 All Rights Reserved. Permission must be specifically granted in writing for use or reprinting anywhere but on this site, but we do allow it and don’t charge for it, other than a backlink. Contact Us for more information.