Strimzi

Supported OS: Linux, Windows, macOS

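Integration version 3.2.0

Overview

This check monitors Strimzi through the Datadog Agent.

Setup

Installation

The Strimzi check is included in the Datadog Agent package. No additional installation is needed on your server.

This check uses OpenMetrics and requires Python 3.

Configuration

The Strimzi check collects Prometheus-formatted metrics from the following operators: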

  • Cluster
  • Topic
  • User

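Note: To monitor Kafka and Zookeeper, use the Kafka, Kafka Consumer, and Zookeeper checks respectively.

Follow the instructions below to enable and configure this check for an Agent.

Host

  1. Edit the strimzi.d/conf.yaml file, in the conf.d/ folder at the root of your Agent's configuration directory, to start collecting your Strimzi performance data. See the sample strimzi.d/conf.yaml for all available configuration options.

  2. Restart the Agent.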
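
As a rough illustration of the host setup, a minimal strimzi.d/conf.yaml might look like the sketch below. The three endpoint option names and the ports are the ones used in the containerized examples on this page; the host placeholders are assumptions, and the sample strimzi.d/conf.yaml remains the authoritative list of options.

```yaml
# conf.d/strimzi.d/conf.yaml (sketch, not a complete configuration)
init_config:

instances:
  # One instance per operator endpoint the Agent should scrape.
  # The <..._HOST> placeholders are hypothetical; replace them with the
  # addresses where each operator exposes its metrics.
  - cluster_operator_endpoint: http://<CLUSTER_OPERATOR_HOST>:8080/metrics
  - topic_operator_endpoint: http://<TOPIC_OPERATOR_HOST>:8080/metrics
  - user_operator_endpoint: http://<USER_OPERATOR_HOST>:8081/metrics
```

Only the operators you actually run need an instance entry.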

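Containerized

For containerized environments, see the Autodiscovery Integration Templates for guidance on applying these instructions. The following examples show how to configure this on the different Operator manifests using pod annotations: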

Cluster Operator:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: strimzi-cluster-operator
  labels:
    app: strimzi
  namespace: kafka
spec:
  replicas: 1
  selector:
    matchLabels:
      name: strimzi-cluster-operator
      strimzi.io/kind: cluster-operator
  template:
    metadata:
      labels:
        name: strimzi-cluster-operator
        strimzi.io/kind: cluster-operator
      annotations:
        ad.datadoghq.com/strimzi-cluster-operator.checks: |
          {
            "strimzi": {
              "instances":[
                {
                  "cluster_operator_endpoint": "http://%%host%%:8080/metrics"
                }
              ]
            }
          }
    spec:
      containers:
      - name: strimzi-cluster-operator
...

Note: The template used for this example can be found here.

Topic and User Operators:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
...
  entityOperator:
    topicOperator: {}
    userOperator: {}
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/topic-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "topic_operator_endpoint": "http://%%host%%:8080/metrics"
                    }
                  ]
                }
              }
            ad.datadoghq.com/user-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "user_operator_endpoint": "http://%%host%%:8081/metrics"
                    }
                  ]
                }
              } 
...

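Note: The template used for this example can be found here.

See the sample strimzi.d/conf.yaml for all available configuration options.

Kafka and Zookeeper

The Kafka and Zookeeper components of Strimzi can be monitored using the Kafka, Kafka Consumer, and Zookeeper checks. Kafka metrics are collected through JMX. For more information on enabling JMX, see the Strimzi documentation on JMX options. The following example shows how to configure the Kafka, Kafka Consumer, and Zookeeper checks using pod annotations: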

apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
  kafka:
    jmxOptions: {}
    version: 3.4.0
    replicas: 1
    listeners:
      - name: plain
        port: 9092
        type: internal
        tls: false
    template:
      pod:
        metadata:  
          annotations:
            ad.datadoghq.com/kafka.checks: |
              {
                "kafka": {
                  "init_config": {
                    "is_jmx": true, 
                    "collect_default_metrics": true, 
                    "new_gc_metrics": true
                  },
                  "instances":[
                    {
                      "host": "%%host%%",
                      "port": "9999"
                    }
                  ]
                },
                "kafka_consumer": {
                  "init_config": {},
                  "instances": [
                    {
                      "kafka_connect_str": "%%host%%:9092",
                      "monitor_unlisted_consumer_groups": "true"
                    }
                  ]
                }
              }        
    config:
      offsets.topic.replication.factor: 1
      transaction.state.log.replication.factor: 1
      transaction.state.log.min.isr: 1
      inter.broker.protocol.version: "3.4"
    storage:
      type: ephemeral
  zookeeper:
    replicas: 1
    storage:
      type: ephemeral
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/zookeeper.checks: |
              {
                "zk": {
                  "instances":[
                    {
                      "host":"%%host%%","port":"2181"
                    }
                  ]
                }
              } 

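Note: The template used for this example can be found here.

Log collection

Available for Agent versions 6.0 and later

Strimzi logs can be collected from the different Strimzi pods through Kubernetes. Log collection is disabled by default in the Datadog Agent. To enable it, see Kubernetes Log Collection.

See the Autodiscovery Integration Templates for guidance on applying the parameters below.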

Parameter       Value
<LOG_CONFIG>    {"source": "strimzi", "service": "<SERVICE_NAME>"}
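
One way to apply this parameter is as a logs annotation on the pod template, mirroring the checks annotations used earlier. The sketch below assumes the Cluster Operator Deployment and its container name strimzi-cluster-operator from the example on this page; <SERVICE_NAME> is left as a placeholder for your own value.

```yaml
# Sketch: pod template metadata for the Cluster Operator Deployment.
# The container name in the annotation key must match the name of the
# container whose logs should be tagged.
template:
  metadata:
    annotations:
      ad.datadoghq.com/strimzi-cluster-operator.logs: |
        [{"source": "strimzi", "service": "<SERVICE_NAME>"}]
```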

Validation

Run the Agent's status subcommand and look for strimzi under the Checks section.

Data Collected

Metrics

strimzi.cluster_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.cluster_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.cluster_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.cluster_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.cluster_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.cluster_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.cluster_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.cluster_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.cluster_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.cluster_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.cluster_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.cluster_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.cluster_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.cluster_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.cluster_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.cluster_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.cluster_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java Virtual Machine process.
strimzi.cluster_operator.reconciliations.already_enqueued.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.cluster_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.cluster_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.cluster_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.cluster_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.cluster_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.cluster_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.cluster_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.cluster_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.cluster_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.cluster_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.cluster_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.cluster_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.cluster_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.cluster_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.cluster_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.cluster_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.cluster_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.cluster_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.cluster_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.cluster_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.cluster_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.cluster_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.cluster_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.cluster_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.cluster_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.cluster_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.topic_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.topic_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.topic_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.topic_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.topic_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.topic_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.topic_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.topic_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.topic_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.topic_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.topic_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.topic_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.topic_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.topic_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.topic_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.topic_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java Virtual Machine process.
strimzi.topic_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.topic_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.topic_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.topic_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.topic_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.topic_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.topic_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.topic_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.topic_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.topic_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.topic_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.topic_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.topic_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.topic_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.topic_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.topic_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.topic_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.topic_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.topic_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.topic_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.topic_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.topic_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.topic_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.topic_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.topic_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.topic_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.topic_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.user_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.user_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.user_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.user_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.user_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.user_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.user_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.user_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.user_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.user_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.user_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.user_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.user_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.user_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.user_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.user_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.user_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java Virtual Machine process.
strimzi.user_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.user_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.user_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.user_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.user_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.user_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.user_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.user_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.user_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.user_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.user_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.user_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.

Events

The Strimzi integration does not include any events.

Service Checks

strimzi.cluster_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Cluster Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.topic_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Topic Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.user_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the User Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

Troubleshooting

Need help? Contact Datadog support.

Further Reading

Additional helpful documentation, links, and articles: