Supported OS Linux Windows Mac OS

インテグレーションバージョン3.2.0

概要

このチェックは、Datadog Agent を通じて Strimzi を監視します。

セットアップ

インストール

Strimzi チェックは Datadog Agent パッケージに含まれています。サーバーに追加でインストールする必要はありません。

このチェックでは OpenMetrics を使用しており、Python 3 が必要です。

構成

Strimzi チェックは、以下のオペレーターの Prometheus 形式メトリクスを収集します。

  • クラスター
  • トピック
  • ユーザー

: Kafka と Zookeeper の監視には、それぞれ KafkaKafka ConsumerZookeeper チェックを使用してください。

以下の手順に従って、Agent でこのチェックを有効化し、構成します。

ホスト

  1. Strimzi のパフォーマンス データの収集を開始するには、Agent の構成ディレクトリのルートにある conf.d/ フォルダー内の strimzi.d/conf.yaml ファイルを編集します。使用可能なすべての構成オプションの詳細については、サンプル strimzi.d/conf.yaml を参照してください。

  2. Agent を再起動します

コンテナ化

コンテナ化された環境の場合は、これらの手順の適用方法について オートディスカバリーのインテグレーション テンプレート を参照してください。以下は、ポッド アノテーションを使用して各種の Operator マニフェストで構成する例です。

クラスター オペレーター:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: strimzi-cluster-operator
  labels:
    app: strimzi
  namespace: kafka
spec:
  replicas: 1
  selector:
    matchLabels:
      name: strimzi-cluster-operator
      strimzi.io/kind: cluster-operator
  template:
    metadata:
      labels:
        name: strimzi-cluster-operator
        strimzi.io/kind: cluster-operator
      annotations:
        ad.datadoghq.com/strimzi-cluster-operator.checks: |
          {
            "strimzi": {
              "instances":[
                {
                  "cluster_operator_endpoint": "http://%%host%%:8080/metrics"
                }
              ]
            }
          }
      spec:
        containers:
        - name: strimzi-cluster-operator
...

: この例で使用したテンプレートは こちら にあります。

トピック オペレーターとユーザー オペレーター:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
...
  entityOperator:
    topicOperator: {}
    userOperator: {}
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/topic-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "topic_operator_endpoint": "http://%%host%%:8080/metrics"
                    }
                  ]
                }
              }
            ad.datadoghq.com/user-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "user_operator_endpoint": "http://%%host%%:8081/metrics"
                    }
                  ]
                }
              } 
...

: この例で使用したテンプレートは こちら にあります。

使用可能なすべての構成オプションの詳細については、サンプル strimzi.d/conf.yaml を参照してください。

Kafka と Zookeeper

Strimzi の Kafka コンポーネントと Zookeeper コンポーネントは、KafkaKafka ConsumerZookeeper チェックで監視できます。Kafka のメトリクスは JMX 経由で収集されます。JMX を有効化する方法は、JMX オプションに関する Strimzi のドキュメント を参照してください。以下は、ポッド アノテーションを使用して Kafka、Kafka Consumer、Zookeeper のチェックを構成する例です。

apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
  kafka:
    jmxOptions: {}
    version: 3.4.0
    replicas: 1
    listeners:
      - name: plain
        port: 9092
        type: internal
        tls: false
    template:
      pod:
        metadata:  
          annotations:
            ad.datadoghq.com/kafka.checks: |
              {
                "kafka": {
                  "init_config": {
                    "is_jmx": true, 
                    "collect_default_metrics": true, 
                    "new_gc_metrics": true
                  },
                  "instances":[
                    {
                      "host": "%%host%%",
                      "port": "9999"
                    }
                  ]
                },
                "kafka_consumer": {
                  "init_config": {},
                  "instances": [
                    {
                      "kafka_connect_str": "%%host%%:9092",
                      "monitor_unlisted_consumer_groups": "true"
                    }
                  ]
                }
              }        
    config:
      offsets.topic.replication.factor: 1
      transaction.state.log.replication.factor: 1
      transaction.state.log.min.isr: 1
      inter.broker.protocol.version: "3.4"
    storage:
      type: ephemeral
  zookeeper:
    replicas: 1
    storage:
      type: ephemeral
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/zookeeper.checks: |
              {
                "zk": {
                  "instances":[
                    {
                      "host":"%%host%%","port":"2181"
                    }
                  ]
                }
              } 

: この例で使用したテンプレートは こちら にあります。

ログ収集

Agent バージョン 6.0 以降で利用可能

Strimzi のログは、Kubernetes 経由で各種の Strimzi ポッドから収集できます。Datadog Agent では、ログ収集はデフォルトで無効です。有効化するには、Kubernetes ログ収集 を参照してください。

オートディスカバリーのインテグレーションテンプレートのガイドを参照して、次のパラメーターを適用してください。

パラメーター
<LOG_CONFIG>{"source": "strimzi", "service": "<SERVICE_NAME>"}

検証

Agent の status サブコマンドを実行 し、Checks セクションで strimzi を探します。

収集データ

メトリクス

strimzi.cluster_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.cluster_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.cluster_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.cluster_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.cluster_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.cluster_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.cluster_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.cluster_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.cluster_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.cluster_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.cluster_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.cluster_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.cluster_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.cluster_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.cluster_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.cluster_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.cluster_operator.reconciliations.already_enqueued.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.cluster_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.cluster_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.cluster_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.cluster_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.cluster_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.cluster_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.cluster_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.cluster_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.cluster_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in.
strimzi.cluster_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.cluster_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.cluster_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.cluster_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.cluster_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.cluster_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.cluster_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.cluster_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.cluster_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.cluster_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.cluster_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.cluster_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.cluster_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.cluster_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.cluster_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.cluster_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.cluster_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.topic_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.topic_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.topic_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.topic_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.topic_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.topic_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.topic_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.topic_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.topic_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.topic_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.topic_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.topic_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.topic_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.topic_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.topic_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.topic_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.topic_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.topic_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.topic_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.topic_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.topic_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.topic_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.topic_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.topic_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.topic_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.topic_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.topic_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.topic_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.topic_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.topic_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.topic_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.topic_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.topic_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.topic_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.topic_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.topic_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.topic_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.topic_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.topic_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.topic_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.topic_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.topic_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.user_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.user_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.user_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.user_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.user_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.user_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.user_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.user_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.user_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.user_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.user_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.user_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.user_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.user_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.user_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.user_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.user_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.user_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.user_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.user_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.user_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.user_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.user_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.user_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.user_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.user_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.user_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.user_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.

イベント

Strimzi インテグレーションには、イベントは含まれません。

サービスチェック

strimzi.cluster_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Cluster Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.topic_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Topic Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.user_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the User Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

トラブルシューティング

ご不明な点は、Datadog のサポートチームまでお問合せください。

その他の参考資料

お役に立つドキュメント、リンクや記事: