The Yoast SEO readability analysis: methodological choices explained

In Yoast SEO, you’ll find readability checks which will help you making your text easy to read. Here we’ll explain the choices we made while developing all these checks. What was our methodology, why did we include these checks in the readability analysis? And how did we set them up?
Want to know if the readability analysis works in your language? It’s available in these languages now.
Little research on what is ‘right’ in writing!
There are a lot of things we know about readability. Short sentences are easier to read than long sentences. Passive voice causes distant writing. However, linguistics is not at all an exact science. Still, we had to decide on some things in order to make readability checks. Which checks should we include? What should be the boundaries?
Our methodology
Why readability?
Readability is of great importance! It determines whether people understand the message of your text. Although correct spelling and grammar are important, these are most certainly not the only factors influencing the readability. A readable text is a text someone wants reads all the way through.
Deciding upon measurements
In the first stage of developing the assessments, we analyzed the competition. We decided upon the measurements of our assessments through thorough investigation of other tools and checks. Which measurements are commonly used in grammar and spelling checks? How did these other tools measure readability?
In the second phase, we analyzed the readability of text we considered to be readable and on text we considered to be a bad read. In this phase we also calculated readability scores of text we considered well-written, as well as readability scores of text we considered badly written. We used all the information of the two phases in developing our content checks to come to the first version of our assessments.
Finetuning measurements with some research
In the final phase of developing the assessment, we used our own readability analysis. In this phase, we put our content checks to the test. We analyzed text of lots of news sites and blogs using our own readability analysis tool. We checked out text of the Guardian, of Moz, but also articles of low-key mum and travel blogs. We selected a total of 75 articles from very different blogs and news sites.
The purpose of the analysis of these 75 different articles was to make sure that the assessments in the readability analysis are sufficiently distinctive. If the bullet of the readability analysis would be green all the time, that wouldn’t help our users. But, if it would be almost impossible to get a green bullet, that would lead to much frustration as well. Our research gave us a clear overview of the readability of many different blog post.
Our research initially showed that more than 40 % of the text scored an overall red bullet on the readability analysis. It turned out that very few articles had enough transition words. We decided to lower the demands on transition words a bit, as most articles weren’t able to meet our demands. In the end, about 35% of the articles scored a red bullet, 30% scored an orange bullet and 35% scored a green bullet.
After fine tuning the assessments in this last phase, we made the final measurements of all the readability checks. We feel our instrument is useful and distinctive enough. However, if you feel otherwise, please let us know!
Read more: The readability checks in Yoast SEO: a closer look with our linguists »
The readability analysis
Below we’ll discuss the different assessments. We’ll first explain the importance of a check and then describe the exact measurement of each assessment. Finally, we’ll discuss the measurement of the overall content score.
Subheadings
Most readers are lazy and quickly bored. You want to convince them to read your text in a matter of seconds. Before deciding to read your text, readers tend to scan your text. Research has shown that people generally scan a text in an F-shaped pattern. As a writer, you can guide your readers by providing them with clear subheadings. Good subheadings will not only give them a quick overview of the topics discussed, they also make the structure of your text clearly visible. Moreover, if readers decide to read your text, they’ll already know what your paragraphs will be about. This will make understanding the content much easier.
Subheadings should be equally distributed throughout your text. You should try to cover a topic in the text after each subheading. Not enough subheadings throughout your text could mean that you did not cover all your topics with a subheading. That’ll make the structure of your text less visible to your reader. Too many subheadings will make the text messy and cluttered on the other hand.
Read how the subheading check works here.
Paragraphs
Readers like bite-sized pieces of information. Long paragraphs are scary and discourage people from reading. You should therefore make sure that paragraphs remain rather short.
Read how the paragraphs length check works here
Sentences
Your sentences should not be too long either. The longer your sentences are, the harder they are to process, because readers have to keep all the words and relationships in their working memory. Therefore, in English, try to write no sentences longer than 20 words.
Read how the sentence length check works here
Transition words
Using transition words is like putting cement between your sentences. The relation between two sentences becomes apparent by the use of transition words. Readers will understand your content much better if you make proper use of these kinds of words.
With transition words, you indicate relationships both between paragraphs as well as within paragraphs. They indicate whether a conclusion is coming up, or maybe a comparison or an enumeration. When readers know what to expect next, they’ll be able to process your text more easily.
Read how the transition words check works here
Passive voice
Passive voice occurs if the noun or noun phrase that would be the object of an active sentence (such as Yoast SEO calculates your SEO score) appears as the subject of a sentence with passive voice (The SEO score is calculated by Yoast SEO).
Passive voice results in distant writing. Active voice is much more engaging. We discourage you from using passive voice altogether. However, some sentences just get really awkward when written in the active voice. That’s why we’ve set the recommended maximum percentage of passive sentences to 10%.
Read how the passive voice check works here
Flesch Reading Ease
Flesch Reading Ease measures textual difficulty of a reading passage in English (note: in languages other than English, Flesch is seriously unreliable). The lower the score, the more difficult the text is. The Flesch readability score uses the sentence length (number of words per sentence) and the number of syllables per word in an equation to calculate the reading ease.
Read how the Flesh Reading Ease check works here
Measurement of overall readability score
The present readability analysis of Yoast SEO contains 6 different content checks. These 6 checks all are equally important in the calculation of the overall content score. A red bullet equals 3 penalty points, while an orange bullet equals 2 penalty-points. If your article scores 7 or more penalty points, the overall content bullet will be red. If your article scores 5 or 6 penalty points, your article will receive an orange bullet. Articles with 0, 2, 3 or 4 penalty points will be rewarded with the much-wanted green bullet.
In order to score an overall green readability score, you are allowed to have one red bullet or two orange bullets.
Coming up next!
-
WordCamp Europe
June 08 - 10, 2023 Team Yoast is Attending, Organizing, Speaking, Sponsoring WordCamp Europe. Click through to see who we'll be there, what we'll do there and more! See where you can find us next » -
Yoast SEO news webinar - May 30, 2023
30 May 2023 Our head of SEO, Jono Alderson, will keep you up-to-date about everything that happens in the world of SEO and WordPress. All Yoast SEO webinars »
Very nice…But transition words does not work with Vietnamese. So please consider to update for Vietnamese transtion words.
Good to know, that you’ve also checked travel blogs. I’m curious what it will bring… I wonder how it will check foreign languages, like german…
Maybe it is a good idea to point out which languages are supported within Content analysis. Transition word count obviously depends on the language used.
Thanks for you suggestion. I actually did mention that the content analysis is only available in English (for passive, transitionwords and flesch reading ease) but it was somewhere in the middle of the article. I put it in the first paragraph now.
Thanks Marieke, good article. Small typo in the first paragraph: Englisch