Build with LLM Observability: From Setup to Signal
About this Session
Large Language Models (LLMs) power modern AI applications, but their unpredictable behavior and complex workflows make it difficult to diagnose issues, optimize performance, and understand how they process data. Without visibility into each step of an LLM chain, troubleshooting and tuning are largely guesswork.
Datadog’s LLM Observability addresses this by exposing the operational performance of your LLM applications, helping you ensure their quality, safety, and security. End-to-end tracing captures inputs and outputs, latency, token usage, and errors for every request. By tracing each step in the LLM chain, including embedding, retrieval, and generation, teams can pinpoint the root causes of unexpected outputs, high latency, and errors, making it easier to troubleshoot performance issues and control costs.
In this hands-on workshop, you’ll build a chatbot application with a Retrieval-Augmented Generation (RAG) workflow, using the OpenAI Python SDK to call locally hosted models. You’ll instrument the application for Datadog’s LLM Observability, collecting traces through both auto-instrumentation and manual in-code setup. Then you’ll analyze those traces to connect application behavior to steps in the LLM chain, identify areas for improvement, apply changes, and observe the results.
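To make the pieces concrete, here is a minimal sketch of what the instrumented application might look like. It assumes a local OpenAI-compatible server (such as Ollama at http://localhost:11434/v1) and the ddtrace library; the model name, the retrieve_context helper, and the hard-coded document are illustrative placeholders rather than workshop materials.

```python
# Minimal sketch of a manually instrumented RAG chatbot (assumptions:
# a local OpenAI-compatible server such as Ollama on localhost:11434,
# the ddtrace package installed, and Datadog credentials configured via
# the environment or a running Agent; names below are illustrative).
from openai import OpenAI
from ddtrace.llmobs import LLMObs
from ddtrace.llmobs.decorators import retrieval, workflow

# Enable LLM Observability; by default this also turns on ddtrace's
# OpenAI integration, so each chat completion becomes an LLM span.
LLMObs.enable(ml_app="rag-chatbot")

# Point the OpenAI SDK at the local model server instead of api.openai.com.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="not-needed")

@retrieval
def retrieve_context(question: str) -> list[str]:
    # Stand-in for the embedding + vector-search step built in the workshop.
    docs = ["Datadog LLM Observability traces each step of an LLM chain."]
    # Attach the query and retrieved documents to the retrieval span.
    LLMObs.annotate(input_data=question,
                    output_data=[{"text": d} for d in docs])
    return docs

@workflow
def answer(question: str) -> str:
    # Appears in Datadog as a workflow span wrapping the retrieval span
    # and the auto-instrumented LLM span.
    context = "\n".join(retrieve_context(question))
    response = client.chat.completions.create(
        model="llama3.2",  # whichever model the local server exposes
        messages=[
            {"role": "system",
             "content": f"Answer using only this context:\n{context}"},
            {"role": "user", "content": question},
        ],
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    print(answer("What does end-to-end tracing capture?"))
```

The auto-instrumentation path needs no in-code setup at all: running the same script under `ddtrace-run` with `DD_LLMOBS_ENABLED=1` and `DD_LLMOBS_ML_APP` set captures the OpenAI calls as LLM spans automatically, while the decorators above add the surrounding workflow and retrieval structure that makes the full chain visible.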
By the end of this workshop, you’ll have practical experience using Datadog’s observability tools to understand LLM application behavior and improve performance.