LLM Observability in Action: Monitor, Evaluate, and Optimize Agentic AI
About this Session
Agentic AI introduces a new class of observability challenges. When a LangGraph workflow routes a single customer request across multiple agents, tools, RAG pipelines, and LLMs, traditional APM cannot explain why an agent hallucinated, why latency doubled after a model change, or why your LLM bill spiked 10x overnight.
In this hands-on workshop, you'll use Datadog LLM Observability to instrument SwagBot, a multi-agent ecommerce chatbot powered by LangGraph. With just three environment variables, you'll unlock end-to-end visibility into every agent decision, LLM call, token cost, and retrieval step. You'll connect all the dots, from frontend user experience to backend LLM execution.
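For reference, here is a minimal sketch of what that setup can look like with the ddtrace LLM Observability SDK. The specific variable names, the `swagbot` app name, and the `app.py` entrypoint are assumptions and may differ from the workshop materials:

```python
# Minimal sketch (not the official workshop setup): enabling Datadog LLM
# Observability for a Python LangGraph app. Values are placeholders.
from ddtrace.llmobs import LLMObs

LLMObs.enable(
    ml_app="swagbot",        # logical application name shown in the LLM Observability UI
    agentless_enabled=True,  # send data directly to Datadog without a local Agent
    api_key="<DD_API_KEY>",
    site="datadoghq.com",
)

# Equivalent environment-variable approach (likely the "three variables" above):
#   DD_LLMOBS_ENABLED=1 DD_LLMOBS_ML_APP=swagbot DD_API_KEY=<key> ddtrace-run python app.py
```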
From there, you will extend that visibility into quality and security. You will configure managed evaluations for hallucination detection, failure to answer, and prompt injection, then build custom LLM-as-a-Judge evaluations tailored to your domain (sketched below). When a new version of SwagBot is deployed and issues appear, you will use monitors, traces, and evaluation results to diagnose root causes and fix them with confidence. Finally, you will run LLM Experiments to compare multiple models across latency, cost, and quality dimensions, replacing guesswork with data-driven model decisions that balance performance, reliability, and business impact.
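To make the custom LLM-as-a-Judge idea concrete, here is a rough, hypothetical sketch: a judge model rates a SwagBot answer and the verdict is attached to the traced span as a custom evaluation. The judge prompt, the `brand_tone` label, and the exact `submit_evaluation` parameters are assumptions that may differ from the SDK version and workflow used in the lab:

```python
# Hypothetical example of a domain-specific LLM-as-a-Judge evaluation.
# Assumes ddtrace's LLMObs is already enabled and an OpenAI client is available.
from ddtrace.llmobs import LLMObs
from openai import OpenAI

judge = OpenAI()

def judge_brand_tone(question: str, answer: str) -> None:
    """Ask a judge model whether a SwagBot answer matches the brand tone."""
    verdict = judge.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{
            "role": "user",
            "content": (
                "Rate this ecommerce support answer as 'on_brand' or 'off_brand'.\n"
                f"Question: {question}\nAnswer: {answer}"
            ),
        }],
    ).choices[0].message.content.strip()

    # Attach the verdict to the current LLM Observability span as a custom evaluation.
    LLMObs.submit_evaluation(
        span_context=LLMObs.export_span(),  # span being judged
        label="brand_tone",
        metric_type="categorical",
        value=verdict,
    )
```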
By the end of this lab, you will know how to observe, evaluate, and continuously optimize any agentic AI application running in production.