The Monitor Status page contains two graphs, the History Graph and the Evaluation Graph, that provide insight into monitor evaluations. This guide explains what each graph shows, why their values can differ, and how to troubleshoot a specific evaluation value.
When you submit your raw data points to Datadog for monitoring, this information is visualized in the history graph. For example, you have the following data points over the past 5 minutes: [10, 15, 12, 8, 11]. The history graph shows each value.
When you configure the evaluation of the query, this adds another layer of aggregation on top of those metric values, and the monitor alerts on the aggregated result. For example, if you configure your monitor to evaluate the average over the past 5 minutes, the evaluation graph shows the value 11.2 as a single data point:
(10+15+12+8+11)/5 = 11.2
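As a minimal sketch of this aggregation step, the following Python snippet reproduces the arithmetic above using the same five data points; it only illustrates the calculation, not how Datadog performs it internally:

```python
# Raw data points submitted over the past 5 minutes (what the history graph plots)
raw_points = [10, 15, 12, 8, 11]

# The monitor evaluation reduces the window to a single value (here, the average),
# which is the data point plotted on the evaluation graph.
evaluation_value = sum(raw_points) / len(raw_points)

print(evaluation_value)  # 11.2
```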
Typically, the two graphs do not visualize the same data point values. In addition, several other factors can contribute to differences between the two graphs.
Queries with `as_count()` metrics in the formula use a different evaluation path: the evaluation applies any aggregation before the formula. For example, if you are using `A / B` and both metrics use the `as_count()` evaluation path, it is evaluated as:

(1+2+3+4+5) / (10+10+10+10+10)

For more information, see the as_count() in Monitor Evaluations guide.
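As a sketch using the hypothetical counts from the example above, the snippet below shows the `as_count()` evaluation path: each query is summed over the window first, and only then is the formula `A / B` applied:

```python
# Hypothetical per-interval counts for the two queries over the evaluation window
a = [1, 2, 3, 4, 5]       # query A, using as_count()
b = [10, 10, 10, 10, 10]  # query B, using as_count()

# as_count() evaluation path: aggregate each query first, then apply the formula A / B
evaluation_value = sum(a) / sum(b)   # (1+2+3+4+5) / (10+10+10+10+10)

print(evaluation_value)  # 0.3
```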
When using formulas, monitors apply the aggregation function for the monitor evaluation on the formula, not on the individual queries. This means that if you are using the AVG (`avg by`) aggregation function on your queries, but SUM (`sum by`) over the past X minutes in your evaluation configuration, the edit page and history graph values do not match the values of the evaluation graph. For an example, see the troubleshooting section.
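The following sketch uses invented values to show why the graphs diverge in that case: the history and edit page graphs show the per-point formula values built from `avg by` queries, while the monitor evaluation applies SUM over the window to that formula series:

```python
# Hypothetical per-minute values of a formula whose queries use avg by
# (this series is what the history and edit page graphs show)
formula_points = [2.0, 2.5, 3.0, 2.5, 2.0]

# Monitor evaluation configured as SUM over the last 5 minutes:
# the aggregation applies to the formula series, not to each query separately.
evaluation_value = sum(formula_points)

print(evaluation_value)  # 12.0, the single point on the evaluation graph
```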
When using an evaluation delay, the evaluation graph does not match the timing of the history graph one-to-one. For example, if you add a 5 minute evaluation delay, you need to look at the data point in the history graph from 5 minutes earlier to correlate it with the evaluation graph.
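As a rough sketch of that timing shift, assuming a 5-minute evaluation delay and a 5-minute evaluation window, the window evaluated "now" ends 5 minutes in the past:

```python
from datetime import datetime, timedelta

evaluation_delay = timedelta(minutes=5)
evaluation_window = timedelta(minutes=5)

now = datetime(2024, 1, 1, 10, 50)  # hypothetical evaluation time

# With an evaluation delay, the evaluated window ends `evaluation_delay` in the past,
# so the matching history graph points are older than the evaluation timestamp.
window_end = now - evaluation_delay
window_start = window_end - evaluation_window

print(window_start, "->", window_end)  # 2024-01-01 10:40:00 -> 2024-01-01 10:45:00
```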
You can also see different results depending on the aggregation method used in your query versus the one used in your evaluation. The history and edit page graphs use the aggregation method from your queries, while the evaluation graph uses the aggregation method set in the monitor's `Evaluate the` option. For example, if your monitor queries use AVG but you want to alert on the MINIMUM value over the last X minutes or hours, the evaluation graph shows the MIN value while your history and edit page graphs show the AVG values. This is because the monitor alerts on the aggregation method set in the monitor evaluation, not the aggregation method set in the metric query.
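A minimal sketch with invented values: the query rolls up data with AVG, producing the series shown on the history and edit page graphs, while a MIN evaluation reduces that series to the single value shown on the evaluation graph:

```python
# Hypothetical per-minute values of a query rolled up with AVG (avg by),
# as shown on the history and edit page graphs.
avg_series = [0.30, 0.26, 0.41, 0.22, 0.35]

# Monitor evaluation configured as MIN over the last 5 minutes:
# the evaluation graph plots the minimum of the AVG series, not the averages themselves.
evaluation_value = min(avg_series)

print(evaluation_value)  # 0.22
```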
You can visualize what the monitor is evaluating at a specific time point by using a Notebook Query Value Widget. Take the query in your monitor (with any formulas or functions) and then set your time frame for the graph to your evaluation window. This shows the data as it is aggregated to one single point.
In the following example, take a time frame from the evaluation graph that you want to investigate. Hover over the evaluation graph data point to see the value and the time. For example, you want to investigate why your evaluation graph shows a data point of `0.38` at 10:50:35, when your history graph shows `0.26` around the same time.
To troubleshoot this value, open the monitor edit page and transfer the monitor configuration to a Notebook Query Value widget.
Monitor edit page configuration fields:
- Metric query: `proc.test_process.cpu.total_pct`, aggregated with `p95 by (everything)`
- Monitor aggregation: `percentile (p95)` of the query
- Evaluation window: `last 5 minutes`
Transfer the same configuration to the Notebook Query Value widget:
1. Use the metric `proc.test_process.cpu.total_pct` and add the metric aggregation `p95 by`.
2. Select `percentile (p95)` as the aggregation.
3. Set the notebook time frame to the evaluation window, `last 5 minutes`.
Side-by-side screenshots compare the Metric Query, Monitor Aggregation, and Evaluation Window configuration between the monitor and the Query Value widget.
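As a hedged illustration of why a `percentile (p95)` evaluation can sit above the typical values on the history graph, the sketch below computes a nearest-rank p95 over an invented 5-minute series: most samples sit near 0.26, but a short spike pulls the p95 up to 0.38. The numbers are made up for illustration and are not the data behind this example:

```python
import math

# Invented per-second samples over a 5-minute window: most values sit near 0.26
# (roughly what the history graph shows), with a brief spike at the end.
samples = sorted([0.26] * 56 + [0.38, 0.40, 0.42, 0.45])

# Nearest-rank 95th percentile, which is what a percentile (p95) evaluation
# reduces the whole window to.
rank = math.ceil(0.95 * len(samples))   # 57th smallest of 60 samples
p95 = samples[rank - 1]

print(p95)  # 0.38
```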
In this example, troubleshoot a value in the monitor evaluation graph with multiple queries and a formula, in a Notebook Query Value widget. On the evaluation graph, hover over the data point you want to investigate; in this example, you want to troubleshoot the evaluation graph value of `9.17` at 13:55:29.
Monitor edit page configuration fields:
- Metric a: `proc.test_process.cpu.total_pct`, aggregated with `avg by (everything)`
- Metric b: `system.cpu.user`, aggregated with `avg by (everything)`
- Monitor aggregation: `min` of the query
- Evaluation window: `last 5 minutes`
Transfer the same configuration to the Notebook Query Value widget one metric at a time.
1. Metric a: `proc.test_process.cpu.total_pct`. Add the metric aggregation `avg`.
2. Metric b: `system.cpu.user`. Add the metric aggregation `avg`.

The monitor evaluation `Min` takes the minimum value of the queries over the past 5 minutes.
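As a final sketch with invented numbers (not the data behind this example), the evaluation value is the minimum of the combined query series over the past 5 minutes, which is why it can differ from the values you see on the history graph at the same timestamp:

```python
# Invented per-minute values of the monitor's formula over the last 5 minutes,
# with each query already rolled up using avg by (everything).
formula_points = [10.2, 9.8, 9.17, 10.5, 11.0]

# A min evaluation over the last 5 minutes reduces the series to its lowest value,
# which is the single point plotted on the evaluation graph.
evaluation_value = min(formula_points)

print(evaluation_value)  # 9.17
```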