Ship Reliable AI Faster: How to Operate AI Agents with Control and Confidence

About this Session

Replace "AI shipped on hope" with an operating model that holds up once real users depend on it. AI quality is multi-dimensional, covering accuracy, tone, safety, and faithfulness to user data, and can't be debugged from outputs alone. Without visibility into what their AI actually did in production, teams miss regressions, reverse-engineer chains by hand, and watch a single bad answer erode trust built over hundreds of right ones.

We'll walk through how to operate AI with the same discipline you apply to any production system, anchored in LLM Observability. Start by tracing every prompt, retrieval, and tool call end-to-end so you can see what your agents did and why. Production traffic then becomes your evaluation dataset, replacing synthetic tests that age the moment users do something unexpected. Structured experiments let you compare prompt and model variants with confidence before changes reach users. You'll also see how to catch regressions in quality, latency, and cost before users feel them, connect AI behavior to the rest of your stack, and equip every team shipping agents to own their reliability.

Attendees will leave able to define quality for their AI, investigate faster when outputs look wrong, and ship updates engineering and reliability teams can trust.

Speakers

Rashel Hoover

Senior Product Manager Datadog

Viraj Patel

Senior Software Engineer WHOOP

Related Sessions

on-demand

Harnessing AI

all levels

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

Speakers

Willians Aguiar, Head of the Digital Channels Engineering, Banco BV

Rodrigo Moreno, Head of Cloud & SRE, Banco BV

on-demand

Developer Autonomy

all levels

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Speakers

Junghun Kim, DevOps Engineer, Krafton Inc.

End-to-End Observability

Harnessing AI

beginner

Build with Agent Observability: From Setup to Signal - Pre-Day

End-to-End Observability

Harnessing AI

Developer Autonomy

Datadog Core Skills for Developers - Pre-Day

End-to-End Observability

Harnessing AI

Scaling Systems

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

Developer Autonomy

intermediate

Workshop

Read Between the Stack Traces: Investigations with Continuous Profiler

on-demand

Harnessing AI

all levels

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

Speakers

Willians Aguiar, Head of the Digital Channels Engineering, Banco BV

Rodrigo Moreno, Head of Cloud & SRE, Banco BV

on-demand

Security & Compliance

Harnessing AI

BewAIre: Detecting Malicious Pull Requests at Scale with LLMs

Speakers

D Niu, Senior Software Engineer, Datadog

Kassen Qian, Senior Product Manager, Datadog

on-demand

Harnessing AI

all levels

The New Shape of Engineering

Speakers

Alexis Lê-Quôc, CTO & Co-Founder, Datadog

Thibault Sottiaux, Head of Product and Platform, OpenAI

End-to-End Observability

Harnessing AI

beginner

Build with Agent Observability: From Setup to Signal - Pre-Day

End-to-End Observability

Harnessing AI

Developer Autonomy

Datadog Core Skills for Developers - Pre-Day

End-to-End Observability

Harnessing AI

Scaling Systems

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

on-demand

Developer Autonomy

all levels

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Speakers

Junghun Kim, DevOps Engineer, Krafton Inc.

End-to-End Observability

Harnessing AI

Developer Autonomy

Datadog Core Skills for Developers - Day 1

End-to-End Observability

Harnessing AI

Developer Autonomy

Datadog Core Skills for Developers - Pre-Day

End-to-End Observability

Developer Autonomy

beginner

Delivering High Quality Software with APM and Distributed Tracing

End-to-End Observability

Security & Compliance

Harnessing AI

From Commit to Runtime: Secure Software Delivery and Automated Response with Datadog

Developer Autonomy

intermediate

Workshop

Read Between the Stack Traces: Investigations with Continuous Profiler

on-demand

Harnessing AI

all levels

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

Speakers

Willians Aguiar, Head of the Digital Channels Engineering, Banco BV

Rodrigo Moreno, Head of Cloud & SRE, Banco BV

on-demand

Developer Autonomy

all levels

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Speakers

Junghun Kim, DevOps Engineer, Krafton Inc.

End-to-End Observability

Harnessing AI

beginner

Build with Agent Observability: From Setup to Signal - Pre-Day

End-to-End Observability

Harnessing AI

Developer Autonomy

Datadog Core Skills for Developers - Pre-Day

End-to-End Observability

Harnessing AI

Scaling Systems

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

Developer Autonomy

intermediate

Workshop

Ship Reliable AI Faster: How to Operate AI Agents with Control and Confidence

About this Session

Speakers

Related Sessions

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Build with Agent Observability: From Setup to Signal - Pre-Day

Datadog Core Skills for Developers - Pre-Day

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

Read Between the Stack Traces: Investigations with Continuous Profiler

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

BewAIre: Detecting Malicious Pull Requests at Scale with LLMs

The New Shape of Engineering

Build with Agent Observability: From Setup to Signal - Pre-Day

Datadog Core Skills for Developers - Pre-Day

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Datadog Core Skills for Developers - Day 1

Datadog Core Skills for Developers - Pre-Day

Delivering High Quality Software with APM and Distributed Tracing

From Commit to Runtime: Secure Software Delivery and Automated Response with Datadog

Read Between the Stack Traces: Investigations with Continuous Profiler

Agent Observability at Scale: Governing, Monitoring, and Securing AI Agents in Production

From Alerts to Autonomy: Scaling Incident Management at PUBG with Automation and AI

Build with Agent Observability: From Setup to Signal - Pre-Day

Datadog Core Skills for Developers - Pre-Day

Datadog Core Skills for Site Reliability Engineers (SREs) - Pre-Day

Read Between the Stack Traces: Investigations with Continuous Profiler

DASH 2027 is coming—Be in the know

Thank you for your signing up