Contact Us

Imagine having to dedicate content resources to read/edit/proofread the content manually line by line. And, imagine the plight of a content writer or developer when the volume of content increases in numbers on a daily basis. In the 21st century, marketers are continually moving towards the age of automation where Artificial Intelligence(AI) and Machine Learning(ML) are becoming the best friends of the wealth of data collected by the xDB (Experience Database).

There are explicit benefits to having xDB. Historical data is important to create a full-fledged personalization experience in Sitecore that targets every stage of the customer journey.

Sitecore 9.1 introduced Sitecore CortexTM that enables implementation of ML/AI–based technology by any product/customer on experience data in the xDB. In version 9.1 it provides Content Tagging Out of Box that tags content items based on the content of the item. The Sitecore Cortex Content Tagging feature integrates the Sitecore CMS with Machine Learning (ML) based Natural Language Processing (NLP) engines. By default it provides integration with OpenCalais.

Did you know: Open Calais is a sophisticated web service that attaches intelligent metadata-tags to your unstructured content. The Open Calais natural language processing engine automatically analyzes and tags your input files.

Auto-Tagging Setup Using Cortex

2 (1)

Let’s understand what was missing with the default Sitecore Cortex Content Tagging feature-

Open Calais NLP Limitations: Sitecore Cortex takes the information from all the fields present in the item, which is sent to the third party where the Open Calais NLP Algorithm extracts the tags from the data and returns back to Sitecore.

This facility wasn’t really working in favor of the developers, as Sitecore Cortex was giving us the data or tags from all the fields (as against the specific fields) that are not even close to the context of the content. There used to be a lot of manual work at any developer’s end to implement tagging in all older versions.

3 (4)

Irrelevant Tags shown in Sitecore Cortex

Why Azure ML

Having faced the above-mentioned limitations of Sitecore Cortex Content Tagging, we have strategically overwritten two content pipelines - the retrieve content pipeline and get tags pipeline.

  • We collected the tokens from the data and removed stoppers. Stoppers are the most frequently used words such as ‘and’, ‘the’, etc.
  • Extracted Keywords – words that help us differentiate one item from another
  • Output the tags
Ebook
Upgrade to Sitecore 9.1 Using this Handy Guide
Need Help?

In Sitecore 9.1, we have the facility of tagging the content (as seen in the snapshot below). Upon clicking this button, we make the call to Azure ML.

Note: You should have Sitecore 9.1 installed on your local machine or in Azure PaaS.

4 (1) 

Using Azure ML NLP Algorithm, it turns the tag in the below-illustrated form:

5 (2) 

Screenshot of Azure ML NLP Algorithm

Automation of Content Tagging Completed in Sitecore Cortex

We now pass the input to the Azure ML to pre-process data, which helps in removing the stoppers or irrelevant data like nouns, adjectives, numbers, etc. Once the tagging process is finished, if you review the tagging field again, it will be populated with the tags.

The integration of Sitecore CMS with Machine Learning (ML) based Natural Language Processing (NLP) engines, i.e, Azure ML in this case, helps you in capturing omnichannel customer behaviour to uncover deeper insights and to automate personalization and segmentation.

Voilà! We’ve automated Content Tagging in Sitecore Cortex with the help of Azure ML, while making the tagging experience seamless so much quicker.