The meta robots tag you can add to your pages has been around for quite a while, and search engines have slowly been adding features to it. This page gives an overview of all the different values you can use in the tag. First, the syntax: the tag goes in the head of the page, with robots as its name and a comma-separated list of values as its content.
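As a minimal sketch of that syntax, the tag could look like the snippet below. The values noindex and nofollow are only examples, and the page title is a placeholder.

```html
<head>
  <title>Example page</title>
  <!-- name="robots" addresses all robots; content holds a comma-separated list of values -->
  <meta name="robots" content="noindex, nofollow">
</head>
```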
Which search engine supports which robots meta tag values?
This table shows which search engines support which values:
|Robots value|Google|Yahoo!|MSN / Live|Ask|
|---|---|---|---|---|
|noydir|No use|Yes|No use|No use|
The different robots meta tag values
An explanation of all the different values you can use in the robots meta tags:
index: Allows search engine robots to index the page. You don’t have to add this to your pages, as it’s the default.
noindex: Disallows search engines from showing this page in their results.
noimageindex: Disallows search engines from spidering images on that page. Of course, if images are linked to directly from elsewhere, Google can still index them, so using an X-Robots-Tag HTTP header is a better idea.
none: A shortcut for noindex,nofollow, or basically saying to search engines: don’t do anything with this page at all.
follow: Tells search engine robots to follow the links on the page, whether they can index it or not.
nofollow: Tells search engine robots not to follow any links on the page at all.
noarchive: Prevents search engines from showing a cached copy of this page.
nocache: Same as noarchive, but only used by MSN/Live.
nosnippet: Prevents search engines from showing a snippet of this page in the search results and prevents them from caching the page.
noodp: Used to block search engines from using the description for this page in DMOZ (aka ODP) as the snippet for your page in the search results. However, DMOZ doesn’t exist anymore.
noydir: Blocks Yahoo! from using the description for this page in the Yahoo! directory as the snippet for your page in the search results. No other search engines use the Yahoo! directory for this purpose, so they don’t support the tag. Since Yahoo! closed its directory, this tag is deprecated, but you might come across it once in a while.
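To illustrate how the values above combine, here is a sketch of a few tags. Each one would go in the head of a different page, and the combinations are only examples, not recommendations.

```html
<!-- Keep the page out of the results, but let robots follow its links -->
<meta name="robots" content="noindex, follow">

<!-- Shortcut: equivalent to content="noindex, nofollow" -->
<meta name="robots" content="none">

<!-- Leave the page indexable, but show no cached copy and no snippet -->
<meta name="robots" content="noarchive, nosnippet">
```

Because index and follow are the defaults, the last tag only needs to list the restrictions.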
The resources from the search engines
The search engines themselves have pages about this subject as well:
- Google published a blog post on the Google Webmaster Central Blog
- Bing has a webmaster help page for it
- Yandex has a help page for HTML meta tags
You can address all robots at once by using robots as the name of the meta tag, or address just one robot by using that robot’s name instead:
- MSN / Live
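As a sketch of the difference, the first tag below addresses every robot and the second only Google’s crawler. The name googlebot is the one Google uses for its crawler. The names for the other engines are not shown here and should be checked against each engine’s own documentation.

```html
<!-- All robots: do not show this page in the results -->
<meta name="robots" content="noindex">

<!-- Only Google's crawler: do not show a cached copy of this page -->
<meta name="googlebot" content="noarchive">
```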