Strimzi

Supported OS Linux Windows Mac OS

통합 버전3.2.0

To find out if this integration is available in your organization, see your Datadog Integrations page or ask your organization administrator.

To initiate an exception request to enable this integration for your organization, email support@ddog-gov.com.

개요

이 점검은 Datadog Agent를 통해 Strimzi를 모니터링합니다.

설정

설치

Strimzi 점검은 Datadog Agent 패키지에 포함되어 있습니다. 서버에 추가 설치가 필요하지 않습니다.

이 점검은OpenMetrics를 사용하며 Python 3이 필요합니다.

구성

Strimzi 점검은 다음 연산자에 관한 Prometheus 형식의 메트릭을 수집합니다.

  • Cluster
  • Topic
  • 사용자

참고: Kafka 및 Zookeeper를 모니터링하려면 각각 Kafka, Kafka ConsumerZookeeper 점검을 사용하세요.

다음 지침을 따라 Agent에 이 점검을 활성화하고 구성하세요.

호스트

  1. Agent의 구성 디렉터리 루트에 있는 conf.d/ 폴더의 strimzi.d/conf.yaml 파일을 편집하여 Strimzi 성능 데이터 수집을 시작하세요. 사용 가능한 모든 구성 옵션은 샘플 strimzi.d/conf.yaml을 참고하세요.

  2. Agent를 다시 시작합니다.

컨테이너화

컨테이너화된 환경에서 이 지침을 적용하려면 Autodiscovery 통합 템플릿을 참고하세요. 다음 예제는 포드 주석을 사용하여 다양한 Operator 매니페스트에서 이를 구성하는 방법을 보여줍니다.

Cluster Operator:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: strimzi-cluster-operator
  labels:
    app: strimzi
  namespace: kafka
spec:
  replicas: 1
  selector:
    matchLabels:
      name: strimzi-cluster-operator
      strimzi.io/kind: cluster-operator
  template:
    metadata:
      labels:
        name: strimzi-cluster-operator
        strimzi.io/kind: cluster-operator
      annotations:
        ad.datadoghq.com/strimzi-cluster-operator.checks: |
          {
            "strimzi": {
              "instances":[
                {
                  "cluster_operator_endpoint": "http://%%host%%:8080/metrics"
                }
              ]
            }
          }
      spec:
        containers:
        - name: strimzi-cluster-operator
...

참고: 이 예제에 사용된 템플릿은 여기에서 찾을 수 있습니다.

Topic 및 User Operators:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
...
  entityOperator:
    topicOperator: {}
    userOperator: {}
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/topic-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "topic_operator_endpoint": "http://%%host%%:8080/metrics"
                    }
                  ]
                }
              }
            ad.datadoghq.com/user-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "user_operator_endpoint": "http://%%host%%:8081/metrics"
                    }
                  ]
                }
              } 
...

참고: 이 예제에 사용된 템플릿은 여기에서 찾을 수 있습니다.

사용 가능한 모든 구성 옵션은 샘플 strimzi.d/conf.yaml을 참고하세요.

Kafka 및 Zookeeper

Strimzi의 Kafka 및 Zookeeper 구성 요소는 Kafka, Kafka Consumer, Zookeeper 점검을 사용하여 모니터링할 수 있습니다. Kafka 메트릭은 JMX를 통해 수집됩니다. JMX 활성화에 대한 자세한 내용은 Strimzi의 JMX 옵션 설명서를 참고하세요. 다음은 포드 주석을 사용하여 Kafka, Kafka Consumer, Zookeeper 점검을 구성하는 방법을 보여줍니다.

apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
  kafka:
    jmxOptions: {}
    version: 3.4.0
    replicas: 1
    listeners:
      - name: plain
        port: 9092
        type: internal
        tls: false
    template:
      pod:
        metadata:  
          annotations:
            ad.datadoghq.com/kafka.checks: |
              {
                "kafka": {
                  "init_config": {
                    "is_jmx": true, 
                    "collect_default_metrics": true, 
                    "new_gc_metrics": true
                  },
                  "instances":[
                    {
                      "host": "%%host%%",
                      "port": "9999"
                    }
                  ]
                },
                "kafka_consumer": {
                  "init_config": {},
                  "instances": [
                    {
                      "kafka_connect_str": "%%host%%:9092",
                      "monitor_unlisted_consumer_groups": "true"
                    }
                  ]
                }
              }        
    config:
      offsets.topic.replication.factor: 1
      transaction.state.log.replication.factor: 1
      transaction.state.log.min.isr: 1
      inter.broker.protocol.version: "3.4"
    storage:
      type: ephemeral
  zookeeper:
    replicas: 1
    storage:
      type: ephemeral
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/zookeeper.checks: |
              {
                "zk": {
                  "instances":[
                    {
                      "host":"%%host%%","port":"2181"
                    }
                  ]
                }
              } 

참고: 이 예제에 사용된 템플릿은 여기에서 찾을 수 있습니다.

로그 수집

Agent 버전 6.0 이상에서 사용 가능

Strimzi 로그는 Kubernetes를 통해 여러 Strimzi 포드에서 수집할 수 있습니다. Datadog Agent에는 로그 수집 기능이 기본적으로 비활성화되어 있습니다. 활성화하려면 Kubernetes 로그 수집을 참고하세요.

아래 파라미터를 적용하는 방법은 Autodiscovery 통합 템플릿을 참고하세요.

파라미터
<LOG_CONFIG>{"source": "strimzi", "service": "<SERVICE_NAME>"}

검증

Agent 상태 하위 명령을 실행하고 Checks 섹션에서 strimzi를 찾으세요.

수집한 데이터

메트릭

strimzi.cluster_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.cluster_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.cluster_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.cluster_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.cluster_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.cluster_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.cluster_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.cluster_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.cluster_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.cluster_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.cluster_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.cluster_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.cluster_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.cluster_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.cluster_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.cluster_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.cluster_operator.reconciliations.already_enqueued.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.cluster_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.cluster_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.cluster_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.cluster_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.cluster_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.cluster_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.cluster_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.cluster_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.cluster_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in.
strimzi.cluster_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.cluster_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.cluster_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.cluster_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.cluster_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.cluster_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.cluster_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.cluster_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.cluster_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.cluster_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.cluster_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.cluster_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.cluster_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.cluster_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.cluster_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.cluster_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.cluster_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.topic_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.topic_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.topic_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.topic_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.topic_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.topic_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.topic_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.topic_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.topic_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.topic_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.topic_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.topic_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.topic_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.topic_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.topic_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.topic_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.topic_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.topic_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.topic_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.topic_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.topic_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.topic_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.topic_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.topic_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.topic_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.topic_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.topic_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.topic_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.topic_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.topic_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.topic_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.topic_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.topic_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.topic_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.topic_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.topic_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.topic_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.topic_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.topic_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.topic_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.topic_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.topic_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.user_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.user_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.user_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.user_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.user_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.user_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.user_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.user_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.user_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.user_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.user_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.user_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.user_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.user_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.user_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.user_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.user_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.user_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.user_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.user_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.user_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.user_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.user_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.user_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.user_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.user_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.user_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.user_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.

이벤트

Strimzi 통합은 이벤트를 포함하지 않습니다.

서비스 점검

strimzi.cluster_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Cluster Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.topic_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Topic Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.user_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the User Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

트러블슈팅

도움이 필요하신가요? Datadog 지원팀에 문의해 주세요.

참고 자료

기타 유용한 문서, 링크 및 기사: