Alexa 1 Million Top-Sites CSV (worth 2500$) download for free
Alexa seriously tries do defend Google’s recent data publishing attack (releasing Trends for Websites and Insights for Search) and stumbled across this ad on alexa.com:
The linked download is quite impressive. Not only the Top 500 (or country-based Top 100), but the top 1 million trafficked sites by Alexa ranking is now available for download (top-1m.csv.zip 9,5 MB Zipfile, ausgepackt 21,4 MB) and it appears to be recent data too (November 24th says the file-timestamp). This list would set you back 2500 US$ via Amazon’s Web Services and unlucky you are only if you recently bought it or you work with MS Excel 2003 where you’re limited to the first 65536 Entries…
Some observations about the content: Subdomains from wordpress.com, blogspot.com, blog.br, blogs.com and others are now counted seperately. While this might be old news to some I just observed it today as well as the fact that they do also seperately track it when your folder-structure contains a ~username (old UNIX standard used at some universities) or my.domain.tld/user/name. This also causes some errors, assome Domains using Feedburner MyBrand appear in the stats with their feeds.domain.tld/~r/ redirect URLs.
Beside these minor flaws just a sincere Thank You to the Alexa Team for this early christmas present ![]()
Also see my original german post about this christmas present from Alexa.
December 7th, 2008 at 15:30
[...] read my english translation on randolf.jorberg.com [...]
December 9th, 2008 at 07:23
Thanks for making everyone aware of this! I learned about your post today via DomainNameNews.com.
I just finished writing an browser based tool to help me search the data (since my version of Excel was only able to open the first 65,000 records). Instead of manually reviewing the list, I can now enter in a keyword phrase to show all sites in the
Alexa top million which contain this phrase.
The data is very interesting. In addition to checking on competitors,
it’s possible to get a list of top trafficked domains by domain extension
or country code.
For example, I found…
565,805 .com sites
70,640 .net sites
48,810 .org sites
54,309 .de sites
3,599 .za sites
If you would like to try the tool out, it’s located at
http://www.popularity.info and can search the entire list in about 5-10 seconds.
December 9th, 2008 at 10:37
hey Barry - THIS IS COOL! Thanks for that tool…
December 14th, 2008 at 19:00
[...] 10k…, while Alexa charged a whopping $2500k for pretty much the same data. Today I saw over a Randolph’s blog, that Alexa just opened up their database and offered the Alexa top million websites for free. [...]
March 3rd, 2009 at 08:24
Recently Alexa added new features in their Top http://www.alexa.com/data/details/main?url=www.fortunehotels.in Sites Lists. These features have come to be extremeley well recieved by the web professionals.
May 20th, 2009 at 05:10
thank you..