Datadog Network Performance Monitoring (NPM) provides visibility into your network traffic between services, containers, availability zones, and any other tag in Datadog. After you enable NPM, you can create an NPM monitor and get alerted if a TCP network metric crosses a threshold that you have set. For example, you can monitor network throughput between a specific client/server and get alerted if that throughput crosses a threshold.
To create an NPM monitor in Datadog, use the main navigation: Monitors > New Monitor > Network Performance.
You can build NPM monitor queries with formulas and functions. For example, you can use them to monitor throughput between a client and a server.
The following example uses a formula to calculate percent retransmits from a client to a server.
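As a rough illustration of the arithmetic behind such a formula, the sketch below computes percent retransmits from two raw counts. It is not Datadog query syntax; the function name `percent_retransmits` and its parameters are hypothetical, and it assumes the percentage is taken over total packets sent.

```python
def percent_retransmits(retransmits: float, packets_sent: float) -> float:
    """Return retransmits as a percentage of packets sent.

    Assumption: the denominator is the total number of packets sent
    by the client over the same time window as the retransmit count.
    """
    if packets_sent <= 0:
        # No traffic in the window, so report 0% rather than dividing by zero.
        return 0.0
    return 100.0 * retransmits / packets_sent


if __name__ == "__main__":
    # 5 retransmits out of 200 packets sent
    print(percent_retransmits(5, 200))
```

Returning 0 for an empty window is a design choice for this sketch; a real monitor might instead treat the window as having no data.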
For more information, see the Functions documentation.
The following tables list the different NPM metrics you can create monitors on.
Metric name | Definition |
---|---|
Bytes Received | Bytes received from client. |
Bytes Sent | Bytes sent from client. |
Packets Sent | Packets sent from client. |
Metric name | Definition |
---|---|
Retransmits | Retransmits between client/server. |
Latency | Average time it takes to make the connection. |
RTT (Round-Trip Time) | Average time it takes to receive a response. |
Jitter | Average variance in RTT. |
TCP Timeouts | The number of TCP connections that timed out from the perspective of the operating system. This can indicate general connectivity and latency issues. |
TCP Refusals | The number of TCP connections that were refused by the server. Typically this indicates an attempt to connect to an IP/port that isn’t receiving connections, or a firewall/security misconfiguration. |
TCP Resets | The number of TCP connections that were reset by the server. |
Established Connections | Established connections between client/server. |
Closed Connections | Closed connections between client/server. |
Metric name | Definition |
---|---|
DNS Requests | Total number of DNS requests. |
DNS Failures | Total number of DNS failures. |
DNS Timeouts | Total number of DNS timeouts. |
DNS Failed Responses | Total number of DNS failed responses. |
DNS Successful Responses | Total number of DNS successful responses. |
DNS Failure Latency | Average DNS failure latency. |
DNS Success Latency | Average DNS success latency. |
NXDOMAIN Errors | Total number of NXDOMAIN errors. |
SERVFAIL Errors | Total number of SERVFAIL errors. |
Other Errors | Total number of other errors. |
Configure monitors to trigger if the query value crosses a threshold, and customize advanced alert options such as recovery thresholds and evaluation delays. For more information, see Configure Monitors.
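To make the relationship between an alert threshold and a recovery threshold concrete, here is a minimal sketch of the general idea. It does not reproduce Datadog's evaluation logic; the class `ThresholdMonitor`, its fields, and the exact comparison operators are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ThresholdMonitor:
    """Hypothetical monitor that alerts above one value and recovers below another."""

    alert_threshold: float
    recovery_threshold: float
    alerting: bool = False

    def evaluate(self, value: float) -> str:
        # Assumption: the monitor alerts when the value rises above the alert threshold.
        if value > self.alert_threshold:
            self.alerting = True
        # Assumption: once alerting, it recovers only when the value
        # falls to or below the (lower) recovery threshold.
        elif self.alerting and value <= self.recovery_threshold:
            self.alerting = False
        return "ALERT" if self.alerting else "OK"


if __name__ == "__main__":
    monitor = ThresholdMonitor(alert_threshold=100, recovery_threshold=80)
    for value in [50, 120, 90, 70]:
        print(value, monitor.evaluate(value))
```

In this sketch a value of 90 keeps the monitor in the alert state even though it is below the alert threshold, which is the point of a separate recovery threshold: it avoids flapping when a metric hovers near the alert value.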
For detailed instructions on the Configure notifications and automations section, see the Notifications page.
The following common monitors are a good starting point for tracking your network. They alert you when your network sees unusual traffic or behaves unexpectedly.
The throughput monitor alerts you if throughput between two endpoints specified in the query surpasses a threshold. Monitoring throughput can help you determine if your network is nearing capacity given your network bandwidth. Knowing this can give you enough time to make adjustments to your network to avoid bottlenecks and other effects downstream.
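The check a throughput monitor performs can be sketched as follows. This is only an illustration under stated assumptions: the helper names are hypothetical, throughput is derived from a byte count over an evaluation window, and the threshold is expressed in bits per second.

```python
def throughput_bits_per_second(bytes_transferred: int, interval_seconds: float) -> float:
    """Convert a byte count observed over a window into bits per second."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return bytes_transferred * 8 / interval_seconds


def exceeds_threshold(throughput_bps: float, threshold_bps: float) -> bool:
    """Return True if the measured throughput surpasses the threshold."""
    return throughput_bps > threshold_bps


if __name__ == "__main__":
    # 1.5 MB transferred during a 60-second window
    bps = throughput_bits_per_second(1_500_000, 60)
    print(bps, exceeds_threshold(bps, threshold_bps=150_000))
```

Setting the threshold somewhat below the known link bandwidth leaves time to react before the link saturates.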
Retransmission occurs when packets are damaged or lost, and it indicates an unreliable network. The percent retransmits monitor alerts you if the percentage of sent packets that result in retransmits passes a threshold.
The DNS failure monitor tracks DNS server performance to help you identify server-side and client-side DNS issues. Use this monitor to get alerted if the sum of DNS failures passes a threshold.