Cohort Analysis in Google Analytics

Cohort Analysis in Google AnalyticsSince a few weeks Google Analytics has been rolling out a new feature in beta: Cohort Analysis. I was actually made aware of this new feature by my colleague Peter, who is working on our Google Analytics by Yoast Premium plugin. We started looking at the new analysis together. As is often the case with Google Analytics, it wasn’t really clear right away. So that’s why I thought I’d delve into it a bit more and try to explain what this actually does for all of you!

What’s a cohort?

Before I can decently explain what Google Analytics’ cohort analysis is about, it’s probably a smart idea to explain what a cohort is. When asking Google what the meaning of ‘cohort’ is, it gives me this explanation:

cohort meaning Google Search - cohort analysis google analytics

In this case, the second definition is obviously the one we’re looking for. A cohort is any group of people sharing a characteristic. In Google Analytics’ case, the only characteristic to be shared you can select, at the moment, is Acquisition Date.

Cohort analysis

So a cohort analysis is basically the analysis of a group of people, in this case people who interacted with your website at the same date or date range. When clicking Cohort Analysis in Google Analytics, it’ll look something like this:

Cohort Analysis Google Analytics

I don’t know about you, but this isn’t really immediately clear to me, so let me walk you through how to look at it. The chart at the top is a visualization of the average user rentention (percentage of returning visitors) for the date range, which is 7 days by default.

The most interesting, however, is the table below the chart. This actually gives us insight in what percentage of people returned to your site within 7 days of visiting it for the first time. Day 0 corresponds with the date in the first column. Day 1 is the first day after someone visited your website for the first time. So the 4.32% at Day 1 in the March 10th row means that 4.32% of the people who visited yoast.com for the first time on March 10th, visited yoast.com again on the next day (March 11th). Day 2 is the second day (March 12th) and so on.

Note: this is a breakdown of New Users, so although it says “All Sessions”, this only includes people having visited your site for the first time.

What can I do with this?

This is a question that I immediately asked myself. It wasn’t completely clear to me right away, so I might be a bit slow, or it’s just not that obvious. I’ll let you be the judge of that ;)

Let me give you an example (not yoast.com, by the way):

Cohort_Analysis_-_Google_Analytics

So what happened on March 14th or 15th that made people who visited this website for the first time on March 14th visit again the next day? The retention rate is about 2% higher there, and even on day 2 the retention rate is higher. Maybe they wrote a nice post? This can be a great way of figuring out whether what you’re trying (new content, new campaigns, etc.) is actually working.

Breaking down the cohort

If you need a more specific look on what’s happening, either because you don’t know why the retention rate was lower/higher, or because you’re just a data geek, you’re in luck. You can actually ‘break down’ your cohort analysis by using segments. For instance, if I were to use the Mobile and Tablet Traffic segment on the data above:

Cohort_Analysis_-_Google_Analytics

Google Analytics will give me this cohort report:

Cohort_Analysis_-_Google_Analytics

This shows the data for people who not only visited your website for the first time in the set timeframe, but were also on a smartphone or tablet when viewing the site. You can have up to 4 of such segments active at the same time. This way you can see whether the (expected) effect happened for all sorts of people, such as people on mobile phones, people from search engines or direct visitors, etc.

Other metrics

You can actually select quite a few metrics that will make the cohort analysis useful for a lot more than returning visitors:

Cohort_Analysis_-_Google_Analytics

Although the Cohort Type has a dropdown, it actually just has the one option. The Cohort Size can be set to ‘by day’, ‘by week’ or ‘by month’ and the Date Range will change accordingly. The most interesting though, is the Metric dropdown. You can select a lot of per user metrics (revenue, pageviews, transactions, etc.) or total metrics (again revenue, pageviews, etc.) apart from the Retention metric I used in the examples above.

This means you can actually see a lot of effects, such as whether your overall revenue or revenue per user has increased after a post or campaign. Of course, you can normally see your sales or revenue increase if you have a successful campaign, but this data is different.

You can now see how much revenue you got from people that visited your website for the first time on a specific date and see if these new visitors bought something on that date or in the days to follow. And since you can see this for an entire date range, you’ll also be able to see if that’s a higher or lower revenue than was to be expected.

Let me give you an example. Say you changed your landing page recently, which is tailored to just convincing new visitors of your site to buy a product. You could just be looking at the revenue from new visitors and see if it increases. However, if a visitor were to visit your website for the first time, only to return the next day to buy your product, Google Analytics wouldn’t show it as a new visitor anymore. And that’s why these cohorts actually work: the visitor was new at the set date, so even if they buy the product a day (or 2, or more) later, they’ll still show up in the cohort analysis. So you’re not just measuring direct effect anymore, you’re measuring delayed effects as well!

By the way, to be sure you have just the visitors that visited that specific landing page, you should create a segment for visitors who visited that page.

The downsides

While looking at the cohort analysis for yoast.com, I noticed that the Retention metric is quite difficult for our domain. Our traffic, even from the new visitors, is just too stable. The pattern was just the same all the time, no matter what date range I selected. This is probably because we have such a steady flow of new visitors, mainly from Google, that any lift here would only be a small change in percentage.

So, the changes in the percentages are too small; if everything between 3.5% and 4.5% is the same color, it’s pretty hard to distinguish any real differences. Of course, I could just look at the percentages, but that’s just not as convenient.

More importantly, though, we can only create cohorts based on Acquisition Date at the moment, which is a nice start, but I do really hope they’ll start adding more Cohort Types. Just the Acquisition Date is really not enough, for me at least. I’d love to see cohorts of people buying a specific product (category), for instance.

Summing up

The cohort analysis can definitely give you some insights that weren’t readily available before. However, it does still require more than just basic knowledge of Google Analytics and might be a little confusing in the beginning. So I’m not completely sold on this feature yet, but to be fair; it is still in beta, so who knows how much better it will get right?

Did you think of any other cool ways of using the new cohort analysis feature? Or do you think I said something utterly stupid in this post? Let me know!

Tags:


Check out our WordPress SEO Premium plugin

15 Responses

  1. DavidBy David on 18 March, 2015

    Hi Thijs,
    First, thanks for this overview – my comment is that it’s refreshing to see someone post about GA features but also be plain-spoken about the fact that much of GA can be confusing even for the GA veteran, and some posts go into great detail but forget that some people (yes, even veterans) need it broken down to the simple from the beginning (I’m looking at you, Jusin Cutroni!) – GA features (and Google products in general) tend to arrive half-baked and not really fully ready (it’s the Silicon Valley way – ship it now, iterate it later) and the fact that cohort reporting shows “all sessions” when it’s really just “new visitors” is misleading to beginners and veterans. I’ll end my comment/rant here ;) but just add that indeed, it’s hard to keep mental track of all of GA’s quirks (quick – what’s the lookback window in MCF reports? what’s the limit before GA starts sampling? you get the idea, there’s a huge number of quirks to keep track of, cohort report mislabeling is but one more).

    Lastly, maybe it would be cool if you could continue this series as GA continues to iterate the cohort analysis!?

    • Thijs de ValkBy Thijs de Valk on 19 March, 2015

      Thanks for the kind words! And I’ll definitely keep an eye on the cohort analysis and either update this post or write a new post if anything changes!

  2. JoshBy Josh on 18 March, 2015

    Great Post! Thanks for breaking that down in layman terms : )

  3. RobinBy Robin on 19 March, 2015

    Thanks for breaking it down so clearly.
    This was something I was bit unclear about using in GA.
    But now I have some idea about it. Hopefully you will be covering more.

  4. Jerry SBy Jerry S on 19 March, 2015

    I’ve been meaning to get around to figuring out what the cohort analysis does but now I have at least a basic understanding. Thanks!

  5. Ankur PratapBy Ankur Pratap on 19 March, 2015

    The Post is superb, at the beginning i was unable to understand Cohort but after your article its quite clear now

  6. DiwakarBy Diwakar on 19 March, 2015

    Just wanna say, WOW!
    The way you explained is really appreciable thanks and now this is clear to me.
    Thanks Thijs!

  7. PS Website DesignBy PS Website Design on 19 March, 2015

    Hi Thijs,
    Firstly I’d just like to say great article on the Cohort Analysis reports in Google Analytics – I’ve read many articles which say what it is but very few actually put it into perspective about what you can actually do with it.

    I think the tool in essence will be a valuable thing in the future. But I think one thing which lets it down is that you cannot apply a campaign to this. So if your company is running a 3 month campaign on a particular product or service it would be really valuable to track this in the cohort analysis. This would allow users to to see where the campaign is really working and things which need to be quickly adapted or changed.

    What are your thoughts on this Thijs?

    • Thijs de ValkBy Thijs de Valk on 19 March, 2015

      I completely agree. The potential for this feature is pretty big, but we definitely need more cohort types than the one we’re getting now. But since it’s in beta, I have good hopes that this will happen in the future :)

  8. ShawnBy Shawn on 19 March, 2015

    Thanks for the post, I’ve been looking at the Cohort Analysis for a while wondering what it means.

    One question I still have and you touched on it a little, the further you go out from Day 0 does that represent the % of visitors that come back each day for the given seven days or does it represent the % of people came back X days later after the first visit?

    Thanks again for the help in clarifying this new GA feature.

    • Thijs de ValkBy Thijs de Valk on 19 March, 2015

      Hi Shawn!

      If you’re looking at the Retention metric, then this percentage means the following: the percentage of people visiting your site again after having visited it for the first time on the date you’re looking at.

      Take the screenshot under the “What can I do with this?” heading; if you look at the “Mar 11, 2015″ row, Day 1 shows 4.14%. This means that 4.14% of the users that visited the website for the first time on March 11th, returned the next day (Day 1, equal to March 12th in this case).

      For Day 2 it shows 1.91%. This means that of the people who first visited the site on March 11th, 1.91% returned two days after (March 13th). These can be the same people, or completely different people, as the ones who visited on March 12th, as long as their first visit was on March 11th.

      And it goes on like that. Does that make it a bit more clear?

  9. Nigel AberyBy Nigel Abery on 21 March, 2015

    Thanks for the great explanation of Cohort Analysis. Google analytic tools are not that well explained in GA. Without a guide like this it takes a bit of figuring out what it all means.

  10. Chris BaldockBy Chris Baldock on 25 March, 2015

    I think this is huge. Well, it will be once you can look into different metrics. This information can help startups prove their retention and activity on the site. If your testing a certain new feature and no-one comes back to use it, it didn’t work; works the other way as well. It can help new companies learn and “pivot” quicker.

  11. JoshBy Josh on 29 March, 2015

    I have seen this feature in my GA account for a few weeks now here in the UK and have wondered what it was all about.

    Thanks for the fantastic insight as always guys!

  12. MikeBy Mike on 29 March, 2015

    Thanks for sharing this, very helpful.