Find out who’s scraping you!
Sometimes you just need to know who is linking to a blog, or even better, your competitor’s blog, or who’s scraping it, and you’d actually like that data in a format you can use. I thought I’d make a list of the sources you can use to gather link-data from, outside of the “obvious” Yahoo SiteExplorer and Google Webmaster Tools.
- First of all, and most hardened SEO’s know this: Google Blog search. The link data in there is just fenomenal, as they’re not, like the normal Google link: command results, filtered. When you want a complete overview, make sure you disable the dupe content filter, that will show you all those pesky scrapers. If you need this data to use it in a tool, it offers RSS feeds too.
- Second: Technorati. If you click on a blog’s authority, you get to see it’s reactions, and you can also drag these out via the API.
- Blogpulse, you don’t even need to add link: here, you can just throw in a URL, just like the others, Blogpulse offers RSS feeds.
- IceRocket. Offers RSS feeds for the link: command queries as well.
- WASAlive, not too much data, but sometimes it offers some unique stuff. RSS feeds for the results as well.
Now you should also consider, when you’re querying those engines, to query for two things: the blog URL, and it’s feed URL. In my blog’s case: yoast.com and feeds.joostdevalk.nl. Especially scrapers will usually just link to your feed URL.
Now if you’ve read my post on the Technorati Authority booster, you know why I needed this myself…