Senior AI Engineer - APM Experiences
Datadog · New York City, NY · Dev Eng
About this role
Datadog is hiring a senior-level AI Engineer in the machine learning function based in New York City, NY. The posting calls out experience with Python, Java, LLMs, RAG. Compensation is listed at $187,000–$240,000 per year.
- Role
- AI Engineer
- Function
- machine learning
- Level
- senior
- Track
- Individual contributor
- Employment
- Full-time
- Location
- New York City, NY
- Department
- Dev Eng
More roles at Datadog
Job description
from Datadog careersThe opportunity
Datadog’s APM Experiences team owns the core product experience for Application Performance Monitoring — including distributed tracing, service representation, and more. We’re building a new wave of AI-powered capabilities that help customers detect, resolve, and prevent performance issues faster. In this role, you will lead end‑to‑end development of LLM- and Agent‑based features that can:
- Debug and investigate application performance issues down to the root cause, as both a developer assistant and a fully autonomous agent
- Proactively recommend performance and reliability-based optimizations to prevent the next incident
- Automatically create intelligent monitors and SLOs for the most important business flows and critical paths
This is a highly product‑minded engineering role: you’ll work from problem discovery and UX all the way to reliable, scalable production systems.
What you’ll do
- Shape AI experiences for APM. Design and ship LLM/agentic workflows that analyze traces, metrics, logs, and other telemetry to generate diagnoses, explanations, and guided fixes.
- Own the full loop. Prototype quickly, define success metrics and evals, run experiments, iterate, and ultimately productionize for scale and reliability.
- Build robust agent systems. Develop tools, retrieval and planning strategies, and guardrails; manage prompts/evals; design fallbacks and human‑in‑the‑loop paths.
- Integrate with Datadog’s platform. Leverage surfaces like Trace Explorer, Service Catalog, monitors, and workflows to deliver end‑to‑end value in the APM UI.