Paginated Headache for SEOs

Pagination is usually overlooked because… well, I’m not sure why. My best guess is that the default system works as is, so why bother?

What are you trying to achieve?

The two obvious goals are:

  • Help users browse efficiently through a very large website (e.g. a forum)
  • Decrease your website depth (i.e. the distance in clicks from a starting point, usually the homepage, to other pages); it usually makes sense to have your most important pages within the first levels. A quick sketch of how depth is computed follows below.
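
To make “depth” concrete: it can be computed with a simple breadth-first search from the homepage. A minimal sketch (the link graph below is hypothetical):

```python
from collections import deque

def click_depth(homepage, links):
    """Breadth-first search: depth = minimum number of clicks from the homepage."""
    depth = {homepage: 0}
    queue = deque([homepage])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depth:  # first time we reach it = shortest path
                depth[target] = depth[page] + 1
                queue.append(target)
    return depth

# Hypothetical mini site: the homepage links to a section, which paginates.
links = {
    "/": ["/news/"],
    "/news/": ["/news/page-2", "/news/article-1"],
    "/news/page-2": ["/news/page-3", "/news/article-2"],
}
print(click_depth("/", links))
# {'/': 0, '/news/': 1, '/news/page-2': 2, '/news/article-1': 2,
#  '/news/page-3': 3, '/news/article-2': 3}
```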

Pagination is not boring

At least it doesn’t have to be; some solutions can be very creative. One of the best examples is ghostblock pagination, probably the best solution to decrease the depth of a site. (I strongly suggest reading the full article; it was an eye-opener for me. It might be dated, but it’s still relevant.)

source: https://audisto.com/guides/pagination/#ghostblock
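
The Audisto guide describes the exact scheme; purely as an illustration of the idea, here is a rough sketch of how ghostblock-style link targets could be generated: besides the usual neighbours, each paginated page links to “ghost” blocks of pages at increasing distances, so the distance to any page grows much more slowly than with simple prev/next links. (The block size and step values are made up, not Audisto’s.)

```python
def ghostblock_links(current, last, block_size=5):
    """Rough approximation of ghostblock-style pagination: link the usual
    neighbours plus 'ghost' blocks of pages at increasing distances
    (x10, x100, ...) in both directions."""
    targets = {1, last}
    # classic neighbour block around the current page
    for p in range(current - 2, current + 3):
        if 1 <= p <= last:
            targets.add(p)
    # ghost blocks: jumps of 10, 100, 1000, ... pages
    step = 10
    while step < last:
        for i in range(1, block_size + 1):
            for p in (current + i * step, current - i * step):
                if 1 <= p <= last:
                    targets.add(p)
        step *= 10
    return sorted(targets)

print(ghostblock_links(current=1, last=900))
# [1, 2, 3, 11, 21, 31, 41, 51, 101, 201, 301, 401, 501, 900]
```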

In my in-house SEO career, I’ve been dealing with a lot of large websites (from >100k pages to several million), and pagination offers interesting challenges if you want to improve UX and optimize your crawl. I often used Ghostblocks as a one-size-fits-all solution with great success until recently… when I had a very SEO kind of revelation:

IT DEPENDS.

For years now, I’ve been working as an SEO for big publishers, and by definition a publisher publishes… A LOT.

see also: How to fix index bloating issues (slideshare)

Most of the content stays under control and benefits from a predictable calendar for publication and updates (e.g. product reviews, event coverage, buying guides, etc.)

(GIF: Modern Times, 1936)

The rest of the website (e.g. the news) feels more like:

(GIF: Modern Times)

There’s no one-size-fits-all solution (aka it depends).

Probably not even one solution per website, but rather one per content type. It’s very rare to see different pagination logics on a single website, because most platforms handle one pagination system and apply it to the whole site. We should change that approach: not all pages are equal, and your pagination should depend on one very overlooked criterion: your page lifespan.

The question you should try to answer is: once a page is indexed, how often do you need it to be re-crawled? I see three major cases, with suggested solutions:

1 – A page that gets frequent updates will naturally be pushed again within the first levels every time it’s updated. (e.g. Best Shows on Netflix Now)

Nothing to do here, yay \o/

2 – A page that doesn’t require updates and has a short lifespan gains little from being crawled often once indexed (e.g. news)

  • Infinite scroll: users can still browse, but it is less easy for a crawler to follow
  • Archive section with a different URL structure (e.g. an /archive/ directory), so you give yourself the option to “bury” stale content deeper on your site and get better control of its crawl frequency. Important: make this archive section customizable (threshold by page and/or date; rules by section or article type; noindex? disallow? etc.). A rough sketch of such a rule follows after this list.
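
To illustrate what “customizable” could look like, here is a hypothetical sketch of an archiving rule; the section names, thresholds and field names are all made up:

```python
from datetime import datetime, timedelta, timezone

# Hypothetical per-section rules: after how many listing pages and/or how many
# days an article drops into the /archive/ structure. All values are examples.
ARCHIVE_RULES = {
    "news":    {"max_page": 5,  "max_age_days": 30},
    "reviews": {"max_page": 20, "max_age_days": 365},
}

def should_archive(section, page_number, published_at, now=None):
    """Return True when an article should move under /archive/."""
    rule = ARCHIVE_RULES.get(section)
    if rule is None:
        return False
    now = now or datetime.now(timezone.utc)
    too_deep = page_number > rule["max_page"]
    too_old = (now - published_at) > timedelta(days=rule["max_age_days"])
    return too_deep or too_old

print(should_archive("news", page_number=8,
                     published_at=datetime(2023, 1, 1, tzinfo=timezone.utc)))
# True: listed beyond page 5 and older than 30 days
```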

3 – A page that doesn’t require updates (or only rarely) but is evergreen (e.g. “How to clean your oven with baking soda”)

  • Ghostblocks: this is a typical case where they are very efficient at keeping content not too many clicks away
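
Put together, the decision logic behind these three cases is small enough to fit in a few lines. A purely illustrative sketch (the labels are mine, not a standard API):

```python
def pagination_strategy(update_frequency: str, evergreen: bool) -> str:
    """Map the three cases above to a pagination approach (illustrative labels)."""
    if update_frequency == "frequent":
        return "default pagination"        # case 1: updates resurface the page anyway
    if not evergreen:
        return "infinite scroll + archive"  # case 2: short lifespan, bury once indexed
    return "ghostblocks"                    # case 3: evergreen, keep it few clicks away

print(pagination_strategy("rare", evergreen=True))  # ghostblocks
```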

As a rule of thumb…

I would keep this in mind: The goal is to keep the important pages within the first levels, not ALL pages.


Some caveats and tactics not covered here, to keep the article focused on pagination:

  • Search engines are probably not blindly following deep pages, even when they are linked through Ghostblocks, because they are smart enough to understand that page-899522 will be less interesting to recrawl than page-2. Still, a link vs. no link makes a difference.
  • Sitemaps may or may not reflect that strategy: making a page harder or impossible to find on your website will make little difference (from a crawl frequency standpoint) if it still appears in your sitemap. See the sketch after this list.
  • Leverage internal linking: if a page is important, it needs to be “promoted” more than the others to prevent it from being buried too deep: related content, contextual navigation, etc.
  • Internal search: at some point, the best user experience for finding older content might be your internal search. It is rare to see a website with a performant one and advanced search filter options; it might be something to look at in some cases (e.g. forums).
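
On the sitemap point above: one simple way to keep the sitemap aligned with the archiving strategy is to leave archived URLs out of the main sitemap (or move them to a separate, low-priority one). A minimal sketch with hypothetical data:

```python
from xml.sax.saxutils import escape

# Hypothetical inventory: (url, archived?) pairs coming from the CMS.
pages = [
    ("https://example.com/news/article-1", False),
    ("https://example.com/archive/news/article-from-2014", True),
]

def sitemap(urls):
    """Build a minimal <urlset> for the given URLs."""
    entries = "\n".join(f"  <url><loc>{escape(u)}</loc></url>" for u in urls)
    return ('<?xml version="1.0" encoding="UTF-8"?>\n'
            '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">\n'
            f"{entries}\n</urlset>")

# Keep the main sitemap aligned with the crawl strategy: fresh URLs only.
print(sitemap(u for u, archived in pages if not archived))
```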

IRL examples:

NYTimes doesn’t paginate its sections (it uses infinite scroll, so not crawlable but still user friendly).
Techradar has an archive for each section. It arbitrarily starts after the 10th page and is indexed and crawlable, which seems suboptimal, but the idea is there.
Stack Overflow automatically links to the last page, surfacing 2008 content within the first level (in that example).

Read Also:

  • https://audisto.com/guides/pagination/
  • https://www.jcchouinard.com/pagination-seo/
  • https://builtvisible.com/solving-site-architecture-issues/
