Practical Prometheus: Lessons learned from a year in production
At Fastly we’ve been running Prometheus as our primary application and infrastructure monitoring system for over 12 months. We’ve gained deep insight into the performance and reliability of our globe-spanning network and have built a system that our engineers love to work with.
Our move to Prometheus allowed us to build a monitoring system that scales more sympathetically with our infrastructure growth and has enabled us to refocus our observability culture across the engineering organisation.
Getting started with Prometheus is wonderfully simple but, as your deployment grows, it can pose a wide range of cultural and technical challenges.
In this presentation, we’ll share some of the lessons we’ve learnt operating Prometheus at scale. Whether you’re just getting started, or you have an established Prometheus infrastructure in place we hope you’ll find some practical advice to take back and apply in your environment.