Bringing Observability to Azure: Instrument, Explore, Troubleshoot
About this Session
Your ecommerce application is running on Azure App Services backed by Azure SQL Database. Customers are reporting failed transactions, and your team is scrambling to find the root cause across logs, traces, and infrastructure metrics. Sound familiar? When applications span multiple Azure services, pinpointing performance issues requires more than just monitoring individual components—it requires unified observability that connects the dots across your entire stack.
In this hands-on workshop, you will instrument Azure App Services with Datadog using sidecar containers, explore out-of-the-box Azure integration dashboards for App Service and Azure SQL Database metrics, and verify data collection across the Service Catalog. You will then put these tools to work by investigating a real production incident: customers unable to add items to their shopping cart due to cascading SQL database timeouts.
Using Logs Explorer, Watchdog Insights, distributed tracing, and Error Tracking, you will trace the issue from customer-facing errors down to the offending SQL query. Finally, you will see how Bits AI SRE can autonomously investigate and surface root causes, turning what would be an hours-long manual investigation into minutes.
Related Sessions
From Reactive to Proactive: How SREs Can Optimize Their Application Services Before Users Are Affected
Speakers
Build with LLM Observability: From Setup to Signal
Datadog Core Skills for Developers - Pre-Day
Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day
Serverless Observability on AWS
From Ingestion to AI: Ensuring Data Reliability Across the Full Lifecycle
From Reactive to Proactive: How SREs Can Optimize Their Application Services Before Users Are Affected
Speakers
Build with LLM Observability: From Setup to Signal
Datadog Core Skills for Developers - Pre-Day
Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day
Serverless Observability on AWS
From Reactive to Proactive: How SREs Can Optimize Their Application Services Before Users Are Affected
Speakers
How AI Is Redefining the Datadog Experience—and How to Make the Most of It
The AI Engineering Playbook: How to Evaluate & Iterate at Every Phase of Development
Build with LLM Observability: From Setup to Signal
Datadog Core Skills for Developers - Pre-Day
Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day
From Ingestion to AI: Ensuring Data Reliability Across the Full Lifecycle
From Reactive to Proactive: How SREs Can Optimize Their Application Services Before Users Are Affected
Speakers