From Reactive to Proactive: How SREs Can Optimize Their Application Services Before Users Are Affected
About this Session
Maintaining service reliability gets harder as environments scale and systems generate an ever-growing volume of telemetry data. Unfortunately, most platform and SRE teams treat an incident as the starting line for improvement work.
In this DASH Product Breakout Session, you’ll learn how OnePay developed a proactive approach to service optimization with Datadog. You’ll learn how they leveraged Datadog dashboards to track slow-burn SLO trends across their services, and see how they action off of recommendations surfaced through Datadog’s end-to-end APM products.
You’ll leave knowing how to better leverage Datadog’s suite of proactive features to understand where services are wasting time or capacity, how to triage automated recommendations with Bits AI, and how to action fixes that ensure your applications run reliably and efficiently at scale.