Profiling Coding Agents via Tool Calls

About this Session

Across 656 runs on Claude Sonnet, Opus, and Haiku, the simple statistical shape of an agent's tool calls (how many, in what order, how varied) is enough to spot misbehavior with 100% precision on well-defined tasks, with no access to the model's reasoning. On vague tasks, the same signal falls apart, and a known 30% slice of misaligned runs looks completely normal until you compare outputs.

In this talk, we share how a runtime tripwire built from a baseline of clean runs flags drifting traces in structured agent workflows, where this approach breaks down and why output diffing is the only thing that catches the 30% blind spot, and which class of adversarial prompt (reward framing) was the only one to reliably break alignment in our runs, pointing to where defensive effort actually pays off.

Speakers

D Niu

Senior Software Engineer Datadog

Related Sessions

on-demand

Security & Compliance

Harnessing AI

BewAIre: Detecting Malicious Pull Requests at Scale with LLMs

Speakers

D Niu, Senior Software Engineer, Datadog

Kassen Qian, Senior Product Manager, Datadog

Security & Compliance

Expo Theater Talk

From Alert Triage to Autonomous Response: The Future of SecOps

Speakers

Rex Guo, Staff Product Manager, Datadog

on-demand

Security & Compliance

Panel

How to Innovate in Regulated Industries

Speakers

Sinthanai Natarajan, Senior Executive Director, JPMorgan Chase

Paul Richards, SVP, Technology Operations, Citizens Bank

Arlei Roberto Francioli Junior, Executive Manager of Technology, Elo

John Trapani, Field CTO, Financial Services, Datadog

Security & Compliance

Expo Theater Talk

Modern Threat Detection and Incident Response for the AI Era

Speakers

Vera Chan, Senior Product Marketing Manager, Datadog

Ron Feldman, Staff Product Manager, Datadog

Michael Li, Staff Detection Engineer, FanDuel

Nathan Pitchaikani, Senior Security Engineer, Riot Games

Security & Compliance

Expo Theater Talk

You Have Thousands of CVEs. What If You Only Had to Care About a Handful?

Speakers

Connor Plante, Product Manager, Datadog

End-to-End Observability

Security & Compliance

Harnessing AI

Profiling Coding Agents via Tool Calls

About this Session

Speakers

Related Sessions

BewAIre: Detecting Malicious Pull Requests at Scale with LLMs

From Alert Triage to Autonomous Response: The Future of SecOps

How to Innovate in Regulated Industries

Modern Threat Detection and Incident Response for the AI Era

You Have Thousands of CVEs. What If You Only Had to Care About a Handful?

From Commit to Runtime: Secure Software Delivery and Automated Response with Datadog

DASH 2027 is coming—Be in the know

Thank you for your signing up