Knowing things work instead of thinking things work

April 18, 2017

Production can be a scary environment. Sometimes things are working great and sometimes they’re a complete dumpster fire. How do you know your current status? How do you know where your problem is or which server is causing the degradation? In this talk we’ll discuss a journey from no application performance monitoring(apm) to “good enough to troubleshoot today” apm and where we continued after our tire fire was turned back into a normal day. We’ll focus on the abstract types of things to watch for and show how easy retrofitting these abilities can be, but for those curious the examples are built on influxdb, and grafana.

This will be presented by Dani Ames and/or Seth Larson.