Datadog Watchdog™

Docs > Datadog Watchdog™

Overview

Watchdog is Datadog’s AI engine, providing you with automated alerts, insights, and root cause analyses that draw from observability data across the entire Datadog platform. Watchdog continuously monitors your infrastructure and calls attention to the signals that matter most, helping you to detect, troubleshoot, and resolve issues.

All Watchdog features come built-in, and do not require setup.

Proactive alerts

Watchdog proactively computes a baseline of expected behavior for your systems, applications, and deployments. This baseline is then used to detect anomalous behavior.

Additional helpful documentation, links, and articles:

Watchdog Alerts: How to view and interpret Watchdog Alerts: what information is provided by each alert, what alerts cover, and where to find Watchdog alerts throughout Datadog.

Faulty Deployment Detection: How Watchdog finds faulty code deployments.

To customize Watchdog algorithms:

Investigation assistance

To help investigation, Watchdog shows context-based insights in all explorers, searches for root causes, and determines user impact.

Additional helpful documentation, links, and articles:

Watchdog Insights: Watchdog Insights is a recommendation engine that helps you identify and resolve issues.

Root Cause Analysis: How Watchdog Root Cause Analysis (RCA) finds the root cause of an anomaly, and how to use the information provided.

Impact Analysis: How Watchdog identifies when an anomaly adversely impacts users.

Troubleshooting

Need help? Contact Datadog support.

Further Reading

Additional helpful documentation, links, and articles:

Check out the latest Datadog Watchdog releases! (App login required).RELEASE NOTES

Introducing Bits AI, your new DevOps copilotBLOG

Collect your logsDOCUMENTATION

Collect your tracesDOCUMENTATION

Automated root cause analysis with Watchdog RCABLOG

Understand user impact scope with Watchdog Impact AnalysisBLOG

Troubleshoot anomalies in workload performance with Watchdog Insights for Live ProcessesBLOG

Anomaly detection, predictive correlations - Using AI-assisted metrics monitoringBLOG