Like any industry, the world of search engine optimization has a language and jargon all its own.
For those new to the business of SEO, particularly do it yourself webmasters, many of these terms and concepts can be somewhat confusing at first.
It takes time, and experience, to become well versed in the subtleties of SEO and on-page optimization.
There are basic technical SEO concepts and much more advanced concepts, such as understanding server configurations.
One of the more common areas of confusion for newly minted SEOs and inexperienced webmasters is the use of the rel=canonical tag, a medium level technical SEO concept that primarily deals with duplicate content issues.
Canonical tags are an important aspect of on-page optimization, and they are one of the fundamental tools that webmasters can use to communicate directly with search engines and their bots.
But just what are rel=canonical tags, and when and where should you use them?
First, Google’s Support on canonical tags is really good. So, take a look at that as well.
With this article, I’m hoping to help you better understand the canonical tag and how they should be deployed on an active website from a more simple explanation.
Canonical Tags – The Basics
In brief, the rel=canonical tag is information included in the header of your HTML that communicates directly with search engine bots, and explains how the individual page of content relates to the other pages within your website.
Sounds confusing, but it’s really not. Let’s keep going.
The rel=canonical tag is inserted into the <head> section of your pages to tell search engines where the original, or primary, page is located and looks like this:
<link rel="canonical" href=“http://www.domain.com/topic-page/” />
Why would you need this for a standard page?
Let’s say your website can render a non-www version, then the canonical version tells search engines which page to index.
Look at these pages:
http://www.domain.com/topic-page/ http://domain.com/topic-page/ http://www.domain.com/topic-page/index.html https://www.domain.com/topic-page/
All these pages are different right? Wrong. They’re all the same to Google. We need to tell them which to use.
In other words, this simple tag tells search engines to look at the href link as the primary page for that pages content.
In this way, link juice does not get diluted, and the search engine bots will only index, and ultimately give link credit, to this specific page. Well, in theory at least.
Hopefully, you’re using WordPress or another CMS platform that will properly redirect www or non-www versions and address secure pages properly.
Why Worry About Duplicate Content?
Search engines do not like to see duplicated content in their SERPs (search engine results pages).
When someone goes to Google to look something up, the search engine naturally works to return a diverse list of related, but individual, results.
When multiple pages with the same content are encountered, the search engine bots will evaluate them and will choose the best version, based on their algorithms.
By using the rel=canonical tag you can instruct the search engine bots to concentrate the link value on a single primary page, thus giving it a better chance to gain ground in a given search result.
Canonical tags can be useful in a number of different scenarios where duplicate content is integral to your website but possibly problematical from a search engine perspective.
Product Listings on E-commerce Sites
Let’s assume that your website sells men’s shoes. You may have several URLs that direct to a basic product, say a traditional wingtip, but with subtle variations – ie color, size, etc.
In this case you will want the primary product page to be indexed by the search engines, and the other pages to be ignored.
Let’s look at a scenario:
In this scenario, using “rewrite” rules should be implemented, but that is more of an advanced topic.
The solution? Yep, use the rel=canonical tag to clear things up for Google.
If you are running a blog, you will have paginated content. What is this?
Pagination is the term used to refer to the “next” and “previous” pages of your blog content or 1, 2, 3…and so on.
In this case, using rel=canonical is typically not the solution.
Here, you want to use the rel=prev and rel=next tags, unless you have a “view all” page.
This is a bit more advanced, and Ryan Huser wrote an excellent article on it here.
Websites with Mobile Access
If your website has a mobile interface that is kept on a sub-domain you will need to use rel=canonical tags to tell the search engines that it contains similar content to the primary site.
You should also include an rel=alt tag on the desktop version of your website that points back to the mobile interface.
For novice webmasters and SEOs the use of rel=canonical tags can be confusing at first, and it will take some time to become comfortable with them.
However, as Google and other search engines continue to strive to return unique and valuable results for their online users it will become increasingly important to learn how to communicate with the search engine bots and direct them to the content you want indexed and ranked.
This short article is simply an introduction to the use of rel=canonical tags, but should set inexperienced webmasters on the road to a better understanding of their use and value.
Not sure how to start using rel=canonical or have a specific question? Feel free to start a discussion below.