Start a Bits AI SRE investigation
You can launch a Bits AI SRE investigation from several entry points:
Manually start an investigation
Monitor alerts
You can invoke Bits on an individual monitor alert or warn event from several entry points:
Option 1: Bits AI SRE Monitors list
- Go to Bits AI SRE > Monitors > Supported.
- Click Investigate Recent Alerts and select an alert.
Option 2: Monitor status page
Navigate to the monitor status page of a Bits AI SRE-supported monitor and click Investigate with Bits AI SRE in the top-right corner.
Option 3: Monitor event side panel
In the monitor event side panel of a Bits AI SRE-supported monitor, click Investigate with Bits AI SRE.
Option 4: Slack
To use the Slack integration, connect your Slack workspace to Bits AI SRE.
In Slack, reply to a monitor notification with @Datadog Investigate this alert.
APM latency
Bits AI SRE investigations started from APM latency graphs and APM Watchdog stories are in Preview. Click Request Access to join the Preview program.
APM latency graphs on service pages
- In Datadog, navigate to APM and open the service or resource page you want to investigate. Next to the latency graph, click Investigate.
- Click and drag your cursor over the point plot visualization to make a rectangular selection over a region that shows unusual latency to seed the analysis. Initial diagnostics on the latency issue appear, including the observed user impact, anomalous tags contributing to the issue, and recent changes. For more information, see APM Investigator.
- Click Investigate with Bits AI SRE to run a deeper investigation.
APM latency Watchdog stories
On a Watchdog APM latency story, click Investigate with Bits AI SRE.
General prompt
Starting Bits AI SRE investigations from chat prompts is in Preview. Click Request Access to join the Preview program.
Click New Investigation and describe the issue you want to troubleshoot. Include as much relevant context as possible:
- Observed symptoms (for example, errors or latency), including links to any Datadog telemetry that shows them
- Relevant tags, such as service, application, or environment, that isolate the issue
- A time window (the default is the past 4 hours)
The more specific your prompt, the more accurate and useful the investigation will be.
Good examples:
Bad example:
- App is slow. What’s wrong?
You can also trigger an investigation from Slack. Mention Datadog in a message: @Datadog Investigate high CPU in ai-gateway in prod over the last 30 minutes. If invoked within a Slack thread, Bits AI SRE automatically uses the entire thread as investigation context.
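The prompt guidance above can be sketched as a small helper that assembles a prompt from the three recommended pieces of context. This is only an illustration: `build_investigation_prompt` and its parameters are hypothetical names, not part of any Datadog API, and Bits AI SRE accepts free-form text, so no particular sentence structure is required.

```python
from typing import Mapping, Optional


def build_investigation_prompt(
    symptom: str,
    tags: Mapping[str, str],
    time_window: Optional[str] = None,
    slack: bool = False,
) -> str:
    """Assemble a free-form investigation prompt (hypothetical helper).

    symptom     -- what was observed, e.g. "high CPU"
    tags        -- tags that isolate the issue, e.g. {"service": "ai-gateway"}
    time_window -- e.g. "the last 30 minutes"; omit it to rely on the
                   default window (the past 4 hours, per the text above)
    slack       -- prefix the @Datadog mention used when prompting from Slack
    """
    if not symptom.strip():
        raise ValueError("describe at least one observed symptom")

    parts = [f"Investigate {symptom.strip()}"]
    # Render tags as key:value pairs so the scope is unambiguous.
    if tags:
        scope = ", ".join(f"{key}:{value}" for key, value in tags.items())
        parts.append(f"scoped to {scope}")
    if time_window:
        parts.append(f"over {time_window}")

    prompt = " ".join(parts) + "."
    return f"@Datadog {prompt}" if slack else prompt


print(build_investigation_prompt(
    "high CPU",
    {"service": "ai-gateway", "env": "prod"},
    time_window="the last 30 minutes",
    slack=True,
))
```

Keeping symptom, scope, and time window as separate inputs makes it easy to notice when one of them is missing before the prompt is sent.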
Enable automatic investigations
In addition to manual investigations, you can configure Bits to run automatically when a monitor transitions to the alert state:
From the Bits AI SRE Monitors list
- Go to Bits AI SRE > Monitors > Supported.
- Toggle Auto-Investigate on for a single monitor, or select multiple monitors and click Auto-Investigate All to enable it in bulk.
For a single monitor
- Open the monitor’s status page and click Edit.
- Scroll to Configure notifications & automations and toggle Investigate with Bits AI SRE.
- Enabling automatic investigations using the Datadog API or Terraform is not supported.
- An investigation initiates when a monitor transitions to the alert state.
- Transitions to the warn or no data state, renotifications, and test notifications do not trigger automatic investigations.
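The trigger rules above can be summarized as a single predicate. The sketch below is a reading aid, not Datadog code: `MonitorState` and `triggers_auto_investigation` are hypothetical names, and it assumes that any move into the alert state from a different state counts as a transition.

```python
from enum import Enum


class MonitorState(Enum):
    OK = "ok"
    WARN = "warn"
    ALERT = "alert"
    NO_DATA = "no data"


def triggers_auto_investigation(
    previous: MonitorState,
    current: MonitorState,
    auto_investigate_enabled: bool,
    is_renotification: bool = False,
    is_test_notification: bool = False,
) -> bool:
    """Return True if this monitor event would start an automatic investigation."""
    if not auto_investigate_enabled:
        return False
    # Renotifications and test notifications never start an investigation.
    if is_renotification or is_test_notification:
        return False
    # Only a transition *into* the alert state qualifies; warn and no data do not.
    return current is MonitorState.ALERT and previous is not MonitorState.ALERT


print(triggers_auto_investigation(MonitorState.OK, MonitorState.ALERT, True))
print(triggers_auto_investigation(MonitorState.OK, MonitorState.WARN, True))
```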
Supported monitors
Bits can run investigations on the following monitor types:
- Metric
- Anomaly
- Forecast
- Integration
- Outlier
- Logs
- APM (APM Metrics type only; Trace Analytics is not supported)
- Synthetics API and Browser tests (Preview)
Starting Bits AI SRE investigations from Synthetic tests is now in Preview. Click Request Access to join the Preview program.
How Bits AI SRE investigates
When Bits AI SRE investigates an issue, it operates in a continuous loop of observation, reasoning, and action. It begins by forming hypotheses about the potential root cause, then uses its tools to query telemetry data to validate or invalidate those hypotheses. Each step builds on prior findings. As new evidence emerges, Bits AI SRE updates its understanding, refines its reasoning, and chains together additional investigative steps—adapting and course-correcting until it converges on the most likely root cause.
At the end of an investigation, Bits AI SRE either presents a clear, evidence-backed conclusion or marks the investigation as inconclusive when the available data is insufficient to support a defensible conclusion.
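The loop described above (form hypotheses, check them against evidence, refine, then conclude or stop as inconclusive) can be illustrated with a toy model. This is a conceptual sketch only and does not reflect how Bits AI SRE is implemented: `Finding`, `investigate`, the `check` callback, and the `max_steps` budget are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Optional, Tuple


@dataclass
class Finding:
    hypothesis: str
    supported: bool
    # Narrower hypotheses suggested by the evidence gathered for this one.
    follow_ups: List[str] = field(default_factory=list)


def investigate(
    initial_hypotheses: Iterable[str],
    check: Callable[[str], Finding],
    max_steps: int = 10,
) -> Tuple[Optional[str], List[Finding]]:
    """Return (conclusion, trace); conclusion is None when inconclusive."""
    pending = list(initial_hypotheses)
    trace: List[Finding] = []
    while pending and len(trace) < max_steps:
        hypothesis = pending.pop(0)
        finding = check(hypothesis)      # observe: query evidence
        trace.append(finding)            # record every step taken
        if finding.supported and not finding.follow_ups:
            return hypothesis, trace     # nothing left to refine: conclude
        pending.extend(finding.follow_ups)  # refine or course-correct
    return None, trace                   # evidence ran out: inconclusive


def toy_check(hypothesis: str) -> Finding:
    # Stand-in for telemetry queries; the evidence here is made up.
    evidence = {
        "recent deploy caused latency": Finding(
            "recent deploy caused latency", True, ["new query in deploy is slow"]
        ),
        "new query in deploy is slow": Finding("new query in deploy is slow", True),
    }
    return evidence.get(hypothesis, Finding(hypothesis, False))


conclusion, trace = investigate(
    ["database is saturated", "recent deploy caused latency"], toy_check
)
print(conclusion, len(trace))
```

In this model the recorded `trace` plays the role of a step-by-step log, and returning `None` corresponds to an inconclusive result.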
Supported data sources
Bits uses the following data sources during investigations:
Datadog products
- Metrics
- APM traces
- Logs
- Dashboards
- Events
- Change Tracking
- Source code (GitHub only)
- Watchdog
- Real User Monitoring
- Network Path
- Database Monitoring
- Continuous Profiler
Add service scoping: For monitors associated with a service, add a service tag to the monitor, or filter or group the monitor query by service. This helps Bits AI SRE correlate data more accurately.
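As a rough illustration of the service scoping advice above, the following sketch checks whether a monitor definition is scoped to a service through a tag, a query filter, or a query grouping. It assumes the monitor is represented as a dictionary with `tags` and `query` fields and only understands metric-style queries such as `avg(last_5m):avg:system.cpu.user{service:ai-gateway} > 90`; `is_service_scoped` is a hypothetical helper, and the example monitors are made up.

```python
import re
from typing import Any, Mapping


def is_service_scoped(monitor: Mapping[str, Any]) -> bool:
    """Return True if the monitor carries a service tag, filter, or grouping."""
    # 1. A service tag on the monitor itself, e.g. "service:ai-gateway".
    tags = monitor.get("tags") or []
    if any(str(tag).startswith("service:") for tag in tags):
        return True

    query = str(monitor.get("query", ""))
    # 2. The query filters by service inside a {...} scope.
    filtered = re.search(r"\{[^}]*\bservice:[^,}\s]+", query) is not None
    # 3. The query groups by service, e.g. "by {service}".
    grouped = re.search(r"\bby\s*\{[^}]*\bservice\b[^}]*\}", query) is not None
    return filtered or grouped


unscoped = {"query": "avg(last_5m):avg:system.cpu.user{env:prod} > 90", "tags": []}
tagged = {**unscoped, "tags": ["service:ai-gateway"]}
grouped = {"query": "avg(last_5m):avg:system.cpu.user{env:prod} by {service} > 90"}

print(is_service_scoped(unscoped), is_service_scoped(tagged), is_service_scoped(grouped))
```

A check like this could be run over exported monitor definitions to find monitors that would benefit from a service tag; other query syntaxes, such as log monitor queries, would need their own handling.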
Third-party integrations
- Grafana
- Dynatrace
- Sentry
- Splunk
- ServiceNow
- Confluence
A subset of third-party integrations is in Preview. Click Request Access to join the Preview program.
For best practices on maximizing the effectiveness of investigations, see Knowledge sources.
Investigation display modes
There are two display modes: Agent Trace and Investigation.
While an investigation is in progress, Bits captures every step it takes—including how it evaluates evidence and makes decisions—in the Agent Trace view. This provides a real-time, detailed record of the agent’s reasoning process.
Once the investigation is complete, you can switch to the Investigation view to explore a structured, tree-based visualization of the investigative path, making it easier to understand findings and conclusions at a glance.
Reports
The Reports tab enables you to track the number of investigations run over time by monitor, user, service, and team. You can also track the mean time to conclusion to assess the impact of Bits AI SRE on your on-call efficiency.
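To make the two report metrics concrete, here is a minimal sketch of how investigation counts per dimension and mean time to conclusion could be computed from a list of investigation records. The `InvestigationRecord` shape is hypothetical and is not an export format of the Reports tab; the sketch also assumes that investigations still in progress are excluded from the mean.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, Optional


@dataclass(frozen=True)
class InvestigationRecord:
    monitor: str
    service: str
    team: str
    user: str
    started_at: datetime
    concluded_at: Optional[datetime]  # None while still running


def count_by(records: Iterable[InvestigationRecord], dimension: str) -> Counter:
    """Count investigations per monitor, user, service, or team."""
    return Counter(getattr(record, dimension) for record in records)


def mean_time_to_conclusion(
    records: Iterable[InvestigationRecord],
) -> Optional[timedelta]:
    """Average duration of concluded investigations, or None if there are none."""
    durations = [
        record.concluded_at - record.started_at
        for record in records
        if record.concluded_at is not None
    ]
    if not durations:
        return None
    return sum(durations, timedelta()) / len(durations)


start = datetime(2025, 1, 1, 12, 0)
records = [
    InvestigationRecord("cpu-high", "ai-gateway", "platform", "alice",
                        start, start + timedelta(minutes=4)),
    InvestigationRecord("latency", "ai-gateway", "platform", "bob",
                        start, start + timedelta(minutes=6)),
    InvestigationRecord("errors", "checkout", "payments", "alice", start, None),
]

print(count_by(records, "service"))
print(mean_time_to_conclusion(records))
```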