Technology & Engineering
Optimizing CloudFront from a Sitecore perspective
Generally people use CloudFront for serving images, other static assets, or even to serve a whole site through the platform. It allows for edge caching (which means geo-located servers), resulting in a quick and high-speed website.
Recently we’ve been doing an extensive amount of work with both AWS and Sitecore, with site performance and reliability being the main aim. This led us to unravel how we can use CloudFront as a CDN, by offloading as much as we can over to it. As such, I thought I’d write up a few of the tips I picked up along the way, relating not just to Sitecore, but CloudFront in general.
Have a test environment that matches your production environment
This is probably the golden rule when implementing systems like these, since you don’t want to be trying out new settings on a production instance. I’ve seen many implementations where people try to spend a bit less by not having the infrastructure of the test environments match that of production. While they don’t need to be of the same scale, the setup should be similar. This carries over to CloudFront, where a test distribution is probably the only way to reliably test new CloudFront behavior settings, as well as new functionality in your webapp coexisting with CloudFront.
GET FAMILIAR WITH THE CACHE STATISTICS PAGE
Get familiar with the cache statistics page
The cache statistics page should be your first port of call when testing or debugging your CloudFront distribution, as it provides very useful, detailed statistics on how well your distribution is working. With time, you want to be aiming for a hit rate above 99% for your assets to maximize the efficiency of the cache. Tweaking the behavior settings for your distribution to achieve this can be somewhat of a trial and error process, since every webapp has different requirements. As a starting point, I’d recommend leaning towards anything that improves caching, then adapting based on testing your implementation.
Here are some example settings from one of our projects, however you will need to tweak things based on your specific needs.
Another neat little trick you can do, is to set the header in the Chrome dev tools to show on network requests, so you can get a nice overview of your assets and what is and isn’t hitting CloudFront.
This is done by simply right clicking on the column names in the network tab, selecting “Manage Header Columns…” and adding a custom column called “x-cache” which is the header that CloudFront returns.
Once you’ve done this you should be able to see which assets are and are not hitting CloudFront. Bear in mind that this is just from your machine, and should be taken with a pinch of salt; people in other parts of the world, or with different setups or request headers could have a totally different experience.
Offload everything you can
This one might be obvious, but you want to be offloading everything you can to the CDN, not just images. If you can offload scripts, styles and fonts, then you should. You can even take it further and offload API endpoints if the data is unlikely to change. One setting we use, is to cache based on query string; this lets us bust the cache for new deployments of the site, as we have version query strings for our static assets. Caching based on query string also means that your images can pass through height and width parameters, or any custom image parameters you might have, caching the end result flawlessly.
A common gotcha for fonts is CORS, as since you most likely will be serving your static assets from another domain, CORS will block them without the correct configuration. We got around this by allowing for GET, HEAD and OPTIONS methods, as well as passing through the “Access-Control-Allow-Origin” header. Bear in mind this will also require a bit of work on your webapp, to pass through the relevant header. More information on this can be found in this stack overflow post here.
Sitecore has its own HTML caching functionality and you should lean on this heavily for pages instead of using CloudFront. Otherwise functionality such as A/B testing, personalization and even publishing won’t work correctly. But aside from this, practically everything is fair game.
The Sitecore changes
The Sitecore configuration changes are actually relatively simple, although if you’re using any computed fields or link generation outside of the standard out-of-the-box functionality, it can get a little complicated. Assuming you’re just looking at the default, the main point of integration is that of the MediaProvider. You’ll need a configuration like the following:
Then that provider should extend the default MediaProvider and override GetMediaUrl with your own implementation. This is so you can make sure that modifications and new versions of the image pull through. There’s a nice implementation from the incredibly knowledgeable JammyKam here, as well as a lot of really useful tips for CloudFront’s Azure counterpart.
An alternative approach, would be to hash the image and use that as a query string, but when looking into this bear in mind this is different from the current hash that Sitecore puts in the request query string (which is described here) and worth thinking about if you have any issues with resizing.
With this in place, you should see your images in preview using the new CDN URL you’ve specified, though note that in the example above, the role definition requires the role of content delivery, which I’d recommend to make sure there isn’t any crazy experience editor issues.