- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- Administrator's Guide
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
Amazon Managed Streaming for Apache Kafka(MSK)는 전체 관리형 서비스로, Apache Kafka를 이용해 스트리밍 데이터를 처리하는 애플리케이션을 빌드하고 실행하기 쉽게 도와줍니다.
이 통합에서는 CloudWatch에서 수집한 메트릭을 수집하는 크롤러를 사용합니다. Datadog 에이전트로 MSK를 모니터링하는 방법에 관한 자세한 내용을 보려면 Amazon MSK(에이전트) 페이지를 참고하세요.
Datadog에서 CloudWatch에서 수집한 메트릭을 보려면 Amazon MSK 크롤러를 활성화하세요.
아직 설정하지 않은 경우 먼저 Amazon Web Services 통합을 설정하세요.
AWS 통합 페이지의 Metric Collection
탭에서 Kafka
가 활성화되어 있는지 확인합니다.
Amazon MSK 통합을 설치하세요.
S3 버킷이나 CloudWatch로 로그를 전송하도록 Amazon MSK를 구성하세요.
참고:
amazon_msk
가 _Target prefix_로 지정되어야 합니다.msk
를 포함해야 합니다.아직 설정하지 않은 경우 Datadog Forwarder Lambda 함수를 설정하세요.
Lambda 함수를 설치한 후 AWS 콘솔에서 Amazon MSK 로그를 포함하는 S3 버킷이나 CloudWatch 로그 그룹에 수동으로 트리거를 추가하세요.
aws.kafka.active_controller_count (gauge) | Only one controller per cluster should be active at any given time. |
aws.kafka.active_controller_count.maximum (gauge) | Only one controller per cluster should be active at any given time. |
aws.kafka.bw_in_allowance_exceeded (count) | The number of packets shaped because the inbound aggregate bandwidth exceeded the maximum for the broker. |
aws.kafka.bw_out_allowance_exceeded (count) | The number of packets shaped because the outbound aggregate bandwidth exceeded the maximum for the broker. |
aws.kafka.bytes_in_per_sec (rate) | The number of bytes per second received from clients. Shown as byte |
aws.kafka.bytes_out_per_sec (rate) | The number of bytes per second sent to clients. Shown as byte |
aws.kafka.conn_track_allowance_exceeded (count) | The number of packets shaped because the connection tracking exceeded the maximum for the broker. Connection tracking is related to security groups that track each connection established to ensure that return packets are delivered as expected. |
aws.kafka.connection_close_rate (rate) | The number of connections closed per second per listener. This number is aggregated per listener and filtered for the client listeners. |
aws.kafka.connection_creation_rate (rate) | The number of new connections established per second per listener. This number is aggregated per listener and filtered for the client listeners. |
aws.kafka.consumer_response_send_time_ms_mean (gauge) | The mean time in milliseconds for the consumer to send a response. Shown as millisecond |
aws.kafka.cpu_credit_balance (gauge) | This metric can help you monitor CPU credit balance on the brokers. |
aws.kafka.cpu_credit_usage (gauge) | This metric can help you monitor CPU credit usage on the instances. If your CPU usage is sustained above the baseline level of 20% you can run out of the CPU credit balance which can have a negative impact on cluster performance. You can monitor and alarm on this metric to take corrective actions. |
aws.kafka.cpu_idle (gauge) | The percentage of CPU idle time. Shown as percent |
aws.kafka.cpu_system (gauge) | The percentage of CPU in kernel space. Shown as percent |
aws.kafka.cpu_user (gauge) | The percentage of CPU in user space. Shown as percent |
aws.kafka.estimated_max_time_lag (gauge) | Time estimate (in seconds) to drain MaxOffsetLag. Shown as second |
aws.kafka.estimated_time_lag (gauge) | Time estimate (in seconds) to drain the partition offset lag. Shown as second |
aws.kafka.fetch_consumer_local_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request is processed at the leader. Shown as millisecond |
aws.kafka.fetch_consumer_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request waits in the request queue. Shown as millisecond |
aws.kafka.fetch_consumer_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request waits in the response queue. Shown as millisecond |
aws.kafka.fetch_consumer_total_time_ms_mean (gauge) | The mean total time in milliseconds that consumers spend on fetching data from the broker. Shown as millisecond |
aws.kafka.fetch_follower_local_time_ms_mean (gauge) | The mean time in milliseconds that the follower request is processed at the leader. Shown as millisecond |
aws.kafka.fetch_follower_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that the follower request waits in the request queue. Shown as millisecond |
aws.kafka.fetch_follower_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that the follower request waits in the response queue. Shown as millisecond |
aws.kafka.fetch_follower_response_send_time_ms_mean (gauge) | The mean time in milliseconds for the follower to send a response. Shown as millisecond |
aws.kafka.fetch_follower_total_time_ms_mean (gauge) | The mean total time in milliseconds that followers spend on fetching data from the broker. Shown as millisecond |
aws.kafka.fetch_message_conversions_per_sec (rate) | The number of fetch message conversions per second for the broker. |
aws.kafka.fetch_throttle_byte_rate (gauge) | The number of throttled bytes per second. |
aws.kafka.fetch_throttle_queue_size (gauge) | The number of messages in the throttle queue. |
aws.kafka.fetch_throttle_time (gauge) | The average fetch throttle time in milliseconds. Shown as millisecond |
aws.kafka.global_partition_count (gauge) | Total number of partitions across all brokers in the cluster. |
aws.kafka.global_partition_count.maximum (gauge) | Maximum total number of partitions across all brokers in the cluster. |
aws.kafka.global_topic_count (gauge) | Total number of topics averaged by the number of brokers in the cluster. |
aws.kafka.global_topic_count.maximum (gauge) | Maximum total number of topics averaged by the number of brokers in the cluster. |
aws.kafka.kafka_app_logs_disk_used (gauge) | The percentage of disk space used for application logs. Shown as percent |
aws.kafka.kafka_data_logs_disk_used (gauge) | The percentage of disk space used for data logs. Shown as percent |
aws.kafka.leader_count (gauge) | The number of leader replicas. |
aws.kafka.max_offset_lag (gauge) | The maximum offset lag across all partitions in a topic. |
aws.kafka.memory_buffered (gauge) | The size in bytes of buffered memory for the broker. Shown as byte |
aws.kafka.memory_cached (gauge) | The size in bytes of cached memory for the broker. Shown as byte |
aws.kafka.memory_free (gauge) | The size in bytes of memory that is free and available for the broker. Shown as byte |
aws.kafka.memory_heap_after_gc (gauge) | The percentage of total heap memory available after garbage collection. Shown as percent |
aws.kafka.memory_used (gauge) | The size in bytes of memory that is in use for the broker. Shown as byte |
aws.kafka.messages_in_per_sec (rate) | The number of messages received from clients per second. |
aws.kafka.network_processor_avg_idle_percent (gauge) | The average percentage of the time the network processors are idle. Shown as percent |
aws.kafka.network_rx_dropped (count) | The number of dropped receive packages. |
aws.kafka.network_rx_errors (count) | The number of network receive errors for the broker. |
aws.kafka.network_rx_packets (count) | The number of packets recieved by the broker. |
aws.kafka.network_tx_dropped (count) | The number of dropped transmit packages. |
aws.kafka.network_tx_errors (count) | The number of network transmit errors for the broker. |
aws.kafka.network_tx_packets (count) | The number of packets transmitted by the broker. |
aws.kafka.offline_partitions_count (gauge) | Total number of partitions that are offline in the cluster. |
aws.kafka.offset_lag (gauge) | Partition-level consumer lag in numberofoffsets. |
aws.kafka.partition_count (gauge) | The number of partitions for the broker. |
aws.kafka.pps_allowance_exceeded (count) | The number of packets shaped because the bidirectional PPS exceeded the maximum for the broker. |
aws.kafka.produce_local_time_ms_mean (gauge) | The mean time in milliseconds for the follower to send a response. Shown as millisecond |
aws.kafka.produce_message_conversions_per_sec (rate) | The number of produce message conversions per second for the broker. |
aws.kafka.produce_message_conversions_time_ms_mean (gauge) | The mean time in milliseconds spent on message format conversions. Shown as millisecond |
aws.kafka.produce_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that request messages spend in the queue. Shown as millisecond |
aws.kafka.produce_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that response messages spend in the queue. Shown as millisecond |
aws.kafka.produce_response_send_time_ms_mean (gauge) | The mean time in milliseconds spent on sending response messages. Shown as millisecond |
aws.kafka.produce_throttle_byte_rate (gauge) | The number of throttled bytes per second. |
aws.kafka.produce_throttle_queue_size (gauge) | The number of messages in the throttle queue. |
aws.kafka.produce_throttle_time (gauge) | The average produce throttle time in milliseconds. Shown as millisecond |
aws.kafka.produce_total_time_ms_mean (gauge) | The mean produce time in milliseconds. Shown as millisecond |
aws.kafka.replication_bytes_in_per_sec (rate) | The number of bytes per second received from other brokers. Shown as byte |
aws.kafka.replication_bytes_out_per_sec (rate) | The number of bytes per second sent to other brokers. Shown as byte |
aws.kafka.request_bytes_mean (gauge) | The mean number of request bytes for the broker. |
aws.kafka.request_exempt_from_throttle_time (gauge) | The average time spent in broker network and I/O threads to process requests that are exempt from throttling. |
aws.kafka.request_handler_avg_idle_percent (gauge) | The average percentage of the time the request handler threads are idle. |
aws.kafka.request_throttle_queue_size (gauge) | The number of messages in the throttle queue. |
aws.kafka.request_throttle_time (gauge) | The average request throttle time in milliseconds. Shown as millisecond |
aws.kafka.request_time (gauge) | The average time spent in broker network and I/O threads to process requests that are exempt throttled. |
aws.kafka.root_disk_used (gauge) | The percentage of the root disk used by the broker. Shown as percent |
aws.kafka.sum_offset_lag (gauge) | The aggregated offset lag for all the partitions in a topic. |
aws.kafka.swap_free (gauge) | The size in bytes of swap memory that is available for the broker. Shown as byte |
aws.kafka.swap_used (gauge) | The size in bytes of swap memory that is in use for the broker. Shown as byte |
aws.kafka.tcp_connections (gauge) | Shows number of incoming and outgoing TCP segments with the SYN flag set. |
aws.kafka.traffic_bytes (gauge) | Shows network traffic in overall bytes between clients (producers and consumers) and brokers. Traffic between brokers isn't reported. Shown as byte |
aws.kafka.under_minlsr_partition_count (gauge) | The number of under minlsr partitions for the broker |
aws.kafka.under_replicated_partitions (gauge) | The number of under-replicated partitions for the broker. |
aws.kafka.volume_queue_length (gauge) | The number of read and write operation requests waiting to be completed in a specified time period. |
aws.kafka.volume_read_bytes (gauge) | The number of bytes read in a specified time period. Shown as byte |
aws.kafka.volume_read_ops (gauge) | The number of read operations in a specified time period. |
aws.kafka.volume_total_read_time (gauge) | The total number of seconds spent by all read operations that completed in a specified time period. Shown as second |
aws.kafka.volume_total_write_time (gauge) | The total number of seconds spent by all write operations that completed in a specified time period. Shown as second |
aws.kafka.volume_write_bytes (gauge) | The number of bytes written in a specified time period. Shown as byte |
aws.kafka.volume_write_ops (gauge) | The number of write operations in a specified time period. |
aws.kafka.zookeeper_request_latency_ms_mean (gauge) | Mean latency in milliseconds for ZooKeeper requests from broker. |
Amazon MSK 크롤러에는 이벤트가 포함되어 있지 않습니다.
Amazon MSK 통합에는 서비스 점검이 포함되어 있지 않습니다.
도움이 필요하신가요? Datadog 지원팀에 문의하세요.
추가 유용한 문서, 링크 및 기사: