Crawl directives

There are multiple ways to tell search engines how to behave on your site. These are called “crawl directives”. They allow you to do the following (a short example follows the list):

  • tell a search engine not to crawl a page at all;
  • tell it not to use a page in its index after it has crawled it;
  • tell it whether or not to follow the links on that page;
  • give a number of “minor” directives.
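
The first item is handled in robots.txt, the second and third in a robots meta tag. A minimal sketch of both, with a placeholder path and page (not a recommendation for any real site):

    # robots.txt: keep all crawlers out of one section of the site
    User-agent: *
    Disallow: /example-private-section/

    <!-- In the <head> of an individual page: don't index it, don't follow its links -->
    <meta name="robots" content="noindex, nofollow">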

We write a lot about these crawl directives as they are a very important weapon in an SEO’s arsenal. We try to keep these articles up to date as standards and best practices evolve.


Must-read articles about Crawl directives


Crawl budget optimization

5 July 2016 by Joost de Valk - 3 Comments

Google doesn’t always spider every page on a site instantly. In fact, sometimes it can take weeks. This might get in the way of your SEO efforts. Your newly optimized landing page might not get indexed. At that point, it becomes time to optimize your crawl budget. Crawl budget is the time Google has in a given …

Category: Technical SEO
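
One common crawl budget tactic, sketched here only as an illustration (the article may settle on different advice, and these URL patterns are placeholders), is to keep crawlers away from low-value, near-infinite URL spaces such as internal search results and faceted filters:

    # robots.txt: stop crawlers from spending their budget on search and filter URLs
    User-agent: *
    Disallow: /search/
    Disallow: /*?filter=

The * wildcard is understood by the major search engines, though not by every crawler.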

robots.txt: the ultimate guide

17 May 2016 by Joost de Valk - 5 Comments

The robots.txt file is one of the primary ways of telling a search engine where it can and can’t go on your website. All major search engines support the basic functionality it offers. There are some extra rules, used by only a few search engines, that can be useful too. This guide covers all …

Category: Technical SEO
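
As a taste of what the guide covers, here is a small robots.txt sketch with a group for all crawlers, an Allow exception, a stricter group for one specific (made-up) crawler, and a sitemap reference; all paths, the “ExampleBot” name and the URL are placeholders:

    # Rules for all crawlers
    User-agent: *
    Disallow: /example-downloads/
    Allow: /example-downloads/public/

    # A stricter group for one specific crawler
    User-agent: ExampleBot
    Disallow: /

    Sitemap: https://www.example.com/sitemap_index.xml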

rel=canonical: the ultimate guide

10 May 2016 by Joost de Valk - 38 Comments

A canonical URL allows you to tell search engines that certain similar URLs are actually one and the same. Sometimes you have products or content that is accessible under multiple URLs, or even on multiple websites. With a canonical URL (an HTML link tag with the attribute rel=canonical), these can exist without harming your rankings. What …

Categories: Content SEO, Technical SEO
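
The tag itself is a single line in the <head> of the duplicate (or parameterised) URL, pointing at the version you want search engines to treat as the original; the URLs below are placeholders:

    <!-- On https://www.example.com/product/?color=green, point at the preferred URL -->
    <link rel="canonical" href="https://www.example.com/product/">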

WordPress robots.txt example for great SEO

26 April 2016 by Joost de Valk - 36 Comments

The robots.txt file is a very powerful file if you’re working on a site’s SEO. At the same time, it also has to be used with care. It allows you to deny search engines access to certain files and folders, but that’s very often not what you want to do. Over the years, Google especially has changed …

Categories: Technical SEO, WordPress
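
Purely as an illustration (the article’s actual recommendation may be leaner), a baseline many WordPress sites use keeps crawlers out of the admin area while leaving admin-ajax.php reachable:

    User-agent: *
    Disallow: /wp-admin/
    Allow: /wp-admin/admin-ajax.php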

hreflang: the ultimate guide

5 April 2016 by Joost de Valk - 31 Comments

hreflang is a technical solution for sites that have similar content in multiple languages. A site owner wants search engines to point people to the most “fitting” language. Say a user is Dutch and the page that ranks is in English, but there’s also a Dutch version. You would want Google to show the Dutch page in …

Category: Technical SEO
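
For the Dutch/English case in the teaser, the annotations could look like this (placeholder URLs); every language version lists itself and all of its alternates in its <head> or in the XML sitemap:

    <link rel="alternate" hreflang="en" href="https://www.example.com/page/" />
    <link rel="alternate" hreflang="nl" href="https://www.example.com/nl/pagina/" />
    <link rel="alternate" hreflang="x-default" href="https://www.example.com/page/" />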

Google Panda 4, and blocking your CSS & JS

19 June 2014 by Joost de Valk - 79 Comments

A month ago, Google introduced its Panda 4.0 update. Over the last few weeks we’ve been able to “fix” a couple of sites that were hit by it. These sites both lost more than 50% of their search traffic in that update. When they returned, their previous positions in the search results came back. Sounds too good to be …

Category: WordPress

Preventing your site from being indexed, the right way

17 December 2009 by Joost de Valk - 36 Comments

It keeps amazing me that I keep seeing people use robots.txt files to prevent sites from being indexed and thus showing up in the search engines. You know why it keeps amazing me? Because robots.txt doesn’t actually do the latter, even though it does prevent your site from being crawled. Let’s go through some terms …

Category: Technical SEO
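
The short version of “the right way”: use a noindex directive rather than robots.txt, and leave the page crawlable so the directive can actually be seen. A minimal sketch:

    <!-- In the <head>: allow crawling, but keep the page out of the index -->
    <meta name="robots" content="noindex">

Blocking the same URL in robots.txt would be counter-productive: the crawler would never see the noindex, and the blocked URL could still show up in the search results without a snippet.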

Playing with the X-Robots-Tag HTTP header

20 January 2008 by Joost de Valk - 31 Comments

Ever since the announcement on the Google Blog, and more recently Yahoo’s announcement that they’ve enhanced their support for it, I’ve been meaning to play with the X-Robots-Tag header. This HTTP header lets you do what you’d normally do with a robots meta tag, but at the HTTP level, which has some pretty cool applications. …

Category: Technical SEO
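
The classic use case is applying robots directives to files that cannot carry a meta tag, such as PDFs. A sketch for Apache with mod_headers enabled (the file pattern and directive values are just examples):

    <FilesMatch "\.pdf$">
        Header set X-Robots-Tag "noindex, noarchive"
    </FilesMatch>

The response for matching files then carries an X-Robots-Tag: noindex, noarchive header, which search engines treat like the equivalent meta robots values.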

The ultimate guide to the meta robots tag

12 October 2007 by Joost de Valk - 11 Comments