
The AI Engineering Playbook: How to Evaluate & Iterate at Every Phase of Development

About this Session

This session is about replacing “AI iteration by gut feel” with a playbook that scales across teams and releases. Models, prompts, and best practices move fast; without a repeatable approach, experiment history gets scattered, decisions become hard to defend, and teams miss edge cases that only surface in production.

We’ll outline an evaluation and iteration loop that works from early prototyping through launch and beyond, anchored in LLM Observability: capturing real execution traces, turning production behavior into reusable evaluation datasets, and comparing prompt and model variants with structured experiments that measure what matters to your users and business. We’ll also cover how to keep the loop healthy after deployment: detecting drift, catching regressions in cost or latency, and building safeguards that reduce operational risk as usage grows.

Attendees will leave able to design an evaluation strategy, run faster investigations when outputs look wrong, and ship changes with greater confidence. The session aligns with the Developer Autonomy theme by giving engineers a practical path to move quickly without breaking trust.
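To make the “structured experiments” idea concrete, here is a minimal sketch of comparing two prompt variants against an evaluation dataset and aggregating quality, latency, and a token-count cost proxy per variant. Everything in it is a hypothetical illustration: `call_model`, the dataset shape, the prompt templates, and the keyword-based scoring rule are stand-ins invented for this example, not the session’s tooling or any vendor’s LLM Observability API.

```python
# Hypothetical sketch of a structured prompt-variant experiment.
# Standard library only; the model call is stubbed so it runs offline.
import json
import statistics
import time

# In a real loop this dataset would be curated from production traces;
# it is inlined here to keep the example self-contained.
DATASET = [
    {"input": "Summarize: The cache misses spiked after the deploy.",
     "must_include": "cache"},
    {"input": "Summarize: Checkout latency doubled during the sale.",
     "must_include": "latency"},
]

PROMPT_VARIANTS = {
    "v1_terse": "Summarize in one sentence: {input}",
    "v2_structured": ("You are a concise analyst. Summarize in one "
                      "sentence, keeping key technical terms: {input}"),
}

def call_model(prompt: str) -> dict:
    """Stand-in for a real LLM call. Returns text plus the latency and
    token counts you would normally read from the provider response."""
    start = time.perf_counter()
    text = prompt.split(": ", 1)[-1]  # echo stub so the script runs offline
    return {
        "text": text,
        "latency_s": time.perf_counter() - start,
        "tokens": len(prompt.split()) + len(text.split()),
    }

def score(output: str, case: dict) -> float:
    """Toy quality metric: did the output keep the key term? Real
    experiments would use task-specific or judge-based metrics."""
    return 1.0 if case["must_include"].lower() in output.lower() else 0.0

def run_experiment() -> None:
    for name, template in PROMPT_VARIANTS.items():
        scores, latencies, tokens = [], [], []
        for case in DATASET:
            result = call_model(template.format(input=case["input"]))
            scores.append(score(result["text"], case))
            latencies.append(result["latency_s"])
            tokens.append(result["tokens"])
        # Aggregate per variant so quality, latency, and cost proxies
        # are compared side by side rather than eyeballed per output.
        print(json.dumps({
            "variant": name,
            "quality": statistics.mean(scores),
            "p50_latency_s": statistics.median(latencies),
            "avg_tokens": statistics.mean(tokens),
        }))

if __name__ == "__main__":
    run_experiment()
```

The design point is the shape of the loop, not the stub: every variant runs against the same dataset, and each run emits one structured record, which is what makes results comparable across experiments and defensible after the fact.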
