aws.kafka.active_controller_count (gauge) | Only one controller per cluster should be active at any given time. |
aws.kafka.active_controller_count.maximum (gauge) | Only one controller per cluster should be active at any given time. |
aws.kafka.auth_error (count) | The number of connections with failed authentication per second. Check if the service execution role policy for the replicator is valid. Shown as error |
aws.kafka.burst_balance (gauge) | The remaining balance of input-output burst credits for EBS volumes in the cluster. Use it to investigate latency or decreased throughput. Shown as percent |
aws.kafka.bw_in_allowance_exceeded (count) | The number of packets shaped because the inbound aggregate bandwidth exceeded the maximum for the broker. |
aws.kafka.bw_out_allowance_exceeded (count) | The number of packets shaped because the outbound aggregate bandwidth exceeded the maximum for the broker. |
aws.kafka.bytes_in_per_sec (rate) | The number of bytes per second received from clients. Shown as byte |
aws.kafka.bytes_out_per_sec (rate) | The number of bytes per second sent to clients. Shown as byte |
aws.kafka.client_connection_count (count) | The number of active authenticated client connections. Shown as connection |
aws.kafka.client_connection_count.average (count) | The average number of active authenticated client connections. Shown as connection |
aws.kafka.conn_track_allowance_exceeded (count) | The number of packets shaped because the connection tracking exceeded the maximum for the broker. Connection tracking is related to security groups that track each connection established to ensure that return packets are delivered as expected. |
aws.kafka.connection_close_rate (rate) | The number of connections closed per second per listener. This number is aggregated per listener and filtered for the client listeners. |
aws.kafka.connection_count (count) | The total number of active authenticated, unauthenticated, and inter-broker connections. Shown as connection |
aws.kafka.connection_count.average (gauge) | The average number of active authenticated, unauthenticated, and inter-broker connections. Shown as connection |
aws.kafka.connection_count.maximum (gauge) | The maximum number of active authenticated, unauthenticated, and inter-broker connections. Shown as connection |
aws.kafka.connection_creation_rate (rate) | The number of new connections established per second per listener. This number is aggregated per listener and filtered for the client listeners. |
aws.kafka.cpu_credit_balance (gauge) | This metric can help you monitor CPU credit balance on the brokers. |
aws.kafka.cpu_credit_usage (gauge) | This metric can help you monitor CPU credit usage on the instances. If your CPU usage is sustained above the baseline level of 20% you can run out of the CPU credit balance which can have a negative impact on cluster performance. You can monitor and alarm on this metric to take corrective actions. |
aws.kafka.cpu_idle (gauge) | The percentage of CPU idle time. Shown as percent |
aws.kafka.cpu_io_wait (gauge) | The percentage of CPU idle time during a pending disk operation. Shown as percent |
aws.kafka.cpu_system (gauge) | The percentage of CPU in kernel space. Shown as percent |
aws.kafka.cpu_user (gauge) | The percentage of CPU in user space. Shown as percent |
aws.kafka.estimated_max_time_lag (gauge) | Time estimate (in seconds) to drain MaxOffsetLag. Shown as second |
aws.kafka.estimated_time_lag (gauge) | Time estimate (in seconds) to drain the partition offset lag. Shown as second |
aws.kafka.fetch_consumer_local_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request is processed at the leader. Shown as millisecond |
aws.kafka.fetch_consumer_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request waits in the request queue. Shown as millisecond |
aws.kafka.fetch_consumer_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that the consumer request waits in the response queue. Shown as millisecond |
aws.kafka.fetch_consumer_response_send_time_ms_mean (gauge) | The mean time in milliseconds for the consumer to send a response. Shown as millisecond |
aws.kafka.fetch_consumer_total_time_ms_mean (gauge) | The mean total time in milliseconds that consumers spend on fetching data from the broker. Shown as millisecond |
aws.kafka.fetch_follower_local_time_ms_mean (gauge) | The mean time in milliseconds that the follower request is processed at the leader. Shown as millisecond |
aws.kafka.fetch_follower_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that the follower request waits in the request queue. Shown as millisecond |
aws.kafka.fetch_follower_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that the follower request waits in the response queue. Shown as millisecond |
aws.kafka.fetch_follower_response_send_time_ms_mean (gauge) | The mean time in milliseconds for the follower to send a response. Shown as millisecond |
aws.kafka.fetch_follower_total_time_ms_mean (gauge) | The mean total time in milliseconds that followers spend on fetching data from the broker. Shown as millisecond |
aws.kafka.fetch_message_conversions_per_sec (rate) | The number of fetch message conversions per second for the broker. |
aws.kafka.fetch_message_conversions_time_ms_mean (gauge) | The mean time in milliseconds spent on fetch message conversions. Shown as millisecond |
aws.kafka.fetch_throttle_byte_rate (gauge) | The number of throttled bytes per second. |
aws.kafka.fetch_throttle_queue_size (gauge) | The number of messages in the throttle queue. |
aws.kafka.fetch_throttle_time (gauge) | The average fetch throttle time in milliseconds. Shown as millisecond |
aws.kafka.global_partition_count (gauge) | Total number of partitions across all brokers in the cluster. |
aws.kafka.global_partition_count.maximum (gauge) | Maximum total number of partitions across all brokers in the cluster. |
aws.kafka.global_topic_count (gauge) | Total number of topics averaged by the number of brokers in the cluster. |
aws.kafka.global_topic_count.maximum (gauge) | Maximum total number of topics averaged by the number of brokers in the cluster. |
aws.kafka.heap_memory_after_gc (gauge) | The percentage of total heap memory available after garbage collection. Shown as percent |
aws.kafka.iamnumber_of_connection_requests (count) | The total number of new connection requests that were configured with IAM authentication. Shown as connection |
aws.kafka.iamtoo_many_connections (count) | The number of connection requests rejected because they exceeded the maximum allowed connections per-principal. Shown as connection |
aws.kafka.kafka_app_logs_disk_used (gauge) | The percentage of disk space used for application logs. Shown as percent |
aws.kafka.kafka_cluster_ping_success_count (gauge) | Indicates the health of the replicator connection to the kafka cluster. If this value is 1, the connection is healthy. If the value is 0 or no datapoint, the connection is unhealthy. |
aws.kafka.kafka_data_logs_disk_used (gauge) | The percentage of disk space used for data logs. Shown as percent |
aws.kafka.leader_count (gauge) | The number of leader replicas. |
aws.kafka.linklocal_allowance_exceeded (count) | The number of packets shaped because the link local PPS exceeded the maximum for the broker. Shown as packet |
aws.kafka.max_offset_lag (gauge) | The maximum offset lag across all partitions in a topic. |
aws.kafka.memory_buffered (gauge) | The size in bytes of buffered memory for the broker. Shown as byte |
aws.kafka.memory_cached (gauge) | The size in bytes of cached memory for the broker. Shown as byte |
aws.kafka.memory_free (gauge) | The size in bytes of memory that is free and available for the broker. Shown as byte |
aws.kafka.memory_used (gauge) | The size in bytes of memory that is in use for the broker. Shown as byte |
aws.kafka.message_lag (count) | The lag between messages produced to the source cluster and messages consumed by the replicator. After an outage, MessageLag shows an increase indicating the number of messages the replicator is behind. Shown as message |
aws.kafka.messages_in_per_sec (rate) | The number of messages received from clients per second. |
aws.kafka.network_processor_avg_idle_percent (gauge) | The average percentage of the time the network processors are idle. Shown as percent |
aws.kafka.network_rx_dropped (count) | The number of dropped receive packages. |
aws.kafka.network_rx_errors (count) | The number of network receive errors for the broker. |
aws.kafka.network_rx_packets (count) | The number of packets recieved by the broker. |
aws.kafka.network_tx_dropped (count) | The number of dropped transmit packages. |
aws.kafka.network_tx_errors (count) | The number of network transmit errors for the broker. |
aws.kafka.network_tx_packets (count) | The number of packets transmitted by the broker. |
aws.kafka.offline_partitions_count (gauge) | Total number of partitions that are offline in the cluster. |
aws.kafka.offset_lag (gauge) | Partition-level consumer lag in numberofoffsets. |
aws.kafka.partition_count (gauge) | The number of partitions for the broker. |
aws.kafka.pps_allowance_exceeded (count) | The number of packets shaped because the bidirectional PPS exceeded the maximum for the broker. |
aws.kafka.produce_local_time_ms_mean (gauge) | The mean time in milliseconds for the follower to send a response. Shown as millisecond |
aws.kafka.produce_message_conversions_per_sec (rate) | The number of produce message conversions per second for the broker. |
aws.kafka.produce_message_conversions_time_ms_mean (gauge) | The mean time in milliseconds spent on message format conversions. Shown as millisecond |
aws.kafka.produce_request_queue_time_ms_mean (gauge) | The mean time in milliseconds that request messages spend in the queue. Shown as millisecond |
aws.kafka.produce_response_queue_time_ms_mean (gauge) | The mean time in milliseconds that response messages spend in the queue. Shown as millisecond |
aws.kafka.produce_response_send_time_ms_mean (gauge) | The mean time in milliseconds spent on sending response messages. Shown as millisecond |
aws.kafka.produce_throttle_byte_rate (gauge) | The number of throttled bytes per second. |
aws.kafka.produce_throttle_queue_size (gauge) | The number of messages in the throttle queue. |
aws.kafka.produce_throttle_time (gauge) | The average produce throttle time in milliseconds. Shown as millisecond |
aws.kafka.produce_total_time_ms_mean (gauge) | The mean produce time in milliseconds. Shown as millisecond |
aws.kafka.remote_copy_bytes_per_sec (rate) | The number of bytes transferred to tiered storage for the specified broker. Shown as byte |
aws.kafka.remote_copy_errors_per_sec (rate) | The rate of errors in response to write requests that the specified broker sends to tiered storage to transfer data upstream. Shown as error |
aws.kafka.remote_copy_lag_bytes (gauge) | The number of bytes a broker is behind the high-water mark for copying to tiered storage. Shown as byte |
aws.kafka.remote_fetch_bytes_per_sec (rate) | The number of bytes transferred from tiered storage in response to consumer fetches. Shown as byte |
aws.kafka.remote_fetch_errors_per_sec (rate) | The rate of errors in response to read requests that the specified broker sends to tiered storage to retrieve data in response to consumer fetches. Shown as error |
aws.kafka.remote_fetch_requests_per_sec (rate) | The number of read requests that the broker sends to tiered storage to retrieve data in response to consumer fetches. Shown as request |
aws.kafka.remote_log_manager_tasks_avg_idle_percent (gauge) | The average percentage of time the remote log manager tasks are idle. Shown as percent |
aws.kafka.remote_log_reader_avg_idle_percent (gauge) | The average percentage of time the remote log reader tasks are idle. Shown as percent |
aws.kafka.remote_log_reader_task_queue_size (gauge) | The number of tasks in the remote log reader task queue. Shown as task |
aws.kafka.remote_log_size_bytes (gauge) | The number of bytes stored on the remote tier for tiered storage. Shown as byte |
aws.kafka.replication_bytes_in_per_sec (rate) | The number of bytes per second received from other brokers. Shown as byte |
aws.kafka.replication_bytes_out_per_sec (rate) | The number of bytes per second sent to other brokers. Shown as byte |
aws.kafka.replication_latency (gauge) | Time it takes records to replicate from the source to target cluster. Duration between record produce time at source and replicated to target. Shown as millisecond |
aws.kafka.replicator_bytes_in_per_sec (rate) | Average number of bytes processed by the replicator per second. Includes all data received which includes data replicated to target and data filtered by MSK Replicator. Shown as byte |
aws.kafka.replicator_failure (count) | Number of failures that the replicator is experiencing. Shown as error |
aws.kafka.replicator_throughput (rate) | Average number of bytes replicated per second. If ReplicatorThroughput drops for a topic, check cluster connectivity and authentication. Shown as byte |
aws.kafka.request_bytes_mean (gauge) | The mean number of request bytes for the broker. Shown as byte |
aws.kafka.request_exempt_from_throttle_time (gauge) | The average time spent in broker network and I/O threads to process requests that are exempt from throttling. Shown as millisecond |
aws.kafka.request_handler_avg_idle_percent (gauge) | The average percentage of the time the request handler threads are idle. Shown as percent |
aws.kafka.request_throttle_queue_size (gauge) | The number of messages in the throttle queue. Shown as message |
aws.kafka.request_throttle_time (gauge) | The average request throttle time in milliseconds. Shown as millisecond |
aws.kafka.request_time (gauge) | The average time spent in broker network and I/O threads to process requests that are exempt throttled. Shown as millisecond |
aws.kafka.rolling_estimated_time_lag (gauge) | Time estimate (in seconds) to drain the partition offset lag at the current consumption rate. Shown as second |
aws.kafka.rolling_estimated_time_lag_max (gauge) | Maximum time estimate (in seconds) to drain the partition offset lag across all partitions. Shown as second |
aws.kafka.root_disk_used (gauge) | The percentage of the root disk used by the broker. Shown as percent |
aws.kafka.storage_used (gauge) | The total amount of storage used by the cluster in bytes. Shown as byte |
aws.kafka.sum_offset_lag (gauge) | The aggregated offset lag for all the partitions in a topic. |
aws.kafka.swap_free (gauge) | The size in bytes of swap memory that is available for the broker. Shown as byte |
aws.kafka.swap_used (gauge) | The size in bytes of swap memory that is in use for the broker. Shown as byte |
aws.kafka.tcp_connections (gauge) | Shows number of incoming and outgoing TCP segments with the SYN flag set. |
aws.kafka.throttle_time (gauge) | The average time in milliseconds a request was throttled by brokers on the cluster. Shown as millisecond |
aws.kafka.topic_count (gauge) | The total number of topics in the replicator. Shown as item |
aws.kafka.traffic_bytes (gauge) | Shows network traffic in overall bytes between clients (producers and consumers) and brokers. Traffic between brokers isn’t reported. Shown as byte |
aws.kafka.traffic_shaping (count) | The number of packets that are dropped by the broker due to traffic shaping. Shown as packet |
aws.kafka.under_min_isr_partition_count (gauge) | The number of under minlsr partitions for the broker |
aws.kafka.under_replicated_partitions (gauge) | The number of under-replicated partitions for the broker. |
aws.kafka.user_partition_exists (gauge) | Indicates whether a user-created partition exists on the broker. |
aws.kafka.volume_queue_length (gauge) | The number of read and write operation requests waiting to be completed in a specified time period. |
aws.kafka.volume_read_bytes (gauge) | The number of bytes read in a specified time period. Shown as byte |
aws.kafka.volume_read_ops (gauge) | The number of read operations in a specified time period. |
aws.kafka.volume_total_read_time (gauge) | The total number of seconds spent by all read operations that completed in a specified time period. Shown as second |
aws.kafka.volume_total_write_time (gauge) | The total number of seconds spent by all write operations that completed in a specified time period. Shown as second |
aws.kafka.volume_write_bytes (gauge) | The number of bytes written in a specified time period. Shown as byte |
aws.kafka.volume_write_ops (gauge) | The number of write operations in a specified time period. |
aws.kafka.zoo_keeper_request_latency_ms_mean (gauge) | Mean latency in milliseconds for ZooKeeper requests from broker. |
aws.kafka.zoo_keeper_session_state (gauge) | Connection status of the broker’s ZooKeeper session which can be one of the following - NOT_CONNECTED - 0.0, ASSOCIATING - 0.1, CONNECTING - 0.5, CONNECTEDREADONLY - 0.8, CONNECTED - 1.0, CLOSED - 5.0, AUTH_FAILED - 10.0. |