Strimzi

Supported OS Linux Windows Mac OS

Integration version2.2.2

Overview

This check monitors Strimzi through the Datadog Agent.

Setup

Installation

The Strimzi check is included in the Datadog Agent package. No additional installation is needed on your server.

This check uses OpenMetrics, which requires Python 3.

Configuration

The Strimzi check collects Prometheus-formatted metrics on the following operators:

  • Cluster
  • Topic
  • User

Note: For monitoring Kafka and Zookeeper, please use the Kafka, Kafka Consumer and Zookeeper checks respectively.

Follow the instructions below to enable and configure this check for an Agent.

Host

  1. Edit the strimzi.d/conf.yaml file, in the conf.d/ folder at the root of your Agent’s configuration directory to start collecting your Strimzi performance data. See the sample strimzi.d/conf.yaml for all available configuration options.

  2. Restart the Agent.

Containerized

For containerized environments, refer to the Autodiscovery Integration Templates for guidance on applying these instructions. Here’s an example of how to configure this on the different Operator manifests using pod annotations:

Cluster Operator:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: strimzi-cluster-operator
  labels:
    app: strimzi
  namespace: kafka
spec:
  replicas: 1
  selector:
    matchLabels:
      name: strimzi-cluster-operator
      strimzi.io/kind: cluster-operator
  template:
    metadata:
      labels:
        name: strimzi-cluster-operator
        strimzi.io/kind: cluster-operator
      annotations:
        ad.datadoghq.com/strimzi-cluster-operator.checks: |
          {
            "strimzi": {
              "instances":[
                {
                  "cluster_operator_endpoint": "http://%%host%%:8080/metrics"
                }
              ]
            }
          }          
      spec:
        containers:
        - name: strimzi-cluster-operator
...

Note: The template used for this example can be found here.

Topic and User Operators:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
...
  entityOperator:
    topicOperator: {}
    userOperator: {}
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/topic-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "topic_operator_endpoint": "http://%%host%%:8080/metrics"
                    }
                  ]
                }
              }              
            ad.datadoghq.com/user-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "user_operator_endpoint": "http://%%host%%:8081/metrics"
                    }
                  ]
                }
              }               
...

Note: The template used as for this example can be found here.

See the sample strimzi.d/conf.yaml for all available configuration options.

Kafka and Zookeeper

The Kafka and Zookeeper components of Strimzi can be monitored using the Kafka, Kafka Consumer and Zookeeper checks. Kafka metrics are collected through JMX. For more information on enabling JMX, see the Strimzi documentation on JMX options. Here’s an example of how to configure the Kafka, Kafka Consumer and Zookeeper checks using Pod annotations:

apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
  kafka:
    jmxOptions: {}
    version: 3.4.0
    replicas: 1
    listeners:
      - name: plain
        port: 9092
        type: internal
        tls: false
    template:
      pod:
        metadata:  
          annotations:
            ad.datadoghq.com/kafka.checks: |
              {
                "kafka": {
                  "init_config": {
                    "is_jmx": true, 
                    "collect_default_metrics": true, 
                    "new_gc_metrics": true
                  },
                  "instances":[
                    {
                      "host": "%%host%%",
                      "port": "9999"
                    }
                  ]
                },
                "kafka_consumer": {
                  "init_config": {},
                  "instances": [
                    {
                      "kafka_connect_str": "%%host%%:9092",
                      "monitor_unlisted_consumer_groups": "true"
                    }
                  ]
                }
              }                      
    config:
      offsets.topic.replication.factor: 1
      transaction.state.log.replication.factor: 1
      transaction.state.log.min.isr: 1
      inter.broker.protocol.version: "3.4"
    storage:
      type: ephemeral
  zookeeper:
    replicas: 1
    storage:
      type: ephemeral
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/zookeeper.checks: |
              {
                "zk": {
                  "instances":[
                    {
                      "host":"%%host%%","port":"2181"
                    }
                  ]
                }
              }               

Note: The template used for this example can be found here.

Log collection

Available for Agent versions >6.0

Strimzi logs can be collected from the different Strimzi pods through Kubernetes. Collecting logs is disabled by default in the Datadog Agent. To enable it, see Kubernetes Log Collection.

See the Autodiscovery Integration Templates for guidance on applying the parameters below.

ParameterValue
<LOG_CONFIG>{"source": "strimzi", "service": "<SERVICE_NAME>"}

Validation

Run the Agent’s status subcommand and look for strimzi under the Checks section.

Data Collected

Metrics

strimzi.cluster_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.cluster_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.cluster_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.cluster_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.cluster_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.cluster_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.cluster_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.cluster_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.cluster_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.cluster_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.cluster_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.cluster_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.cluster_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.cluster_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.cluster_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.cluster_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.cluster_operator.reconciliations.already_enqueued.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.cluster_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.cluster_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.cluster_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.cluster_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.cluster_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.cluster_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.cluster_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.cluster_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.cluster_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in.
strimzi.cluster_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.cluster_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.cluster_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.cluster_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.cluster_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.cluster_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.cluster_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.cluster_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.cluster_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.cluster_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.cluster_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.cluster_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.cluster_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.cluster_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.cluster_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.cluster_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.cluster_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.topic_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.topic_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.topic_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.topic_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.topic_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.topic_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.topic_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.topic_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.topic_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.topic_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.topic_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.topic_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.topic_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.topic_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.topic_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.topic_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.topic_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.topic_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.topic_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.topic_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.topic_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.topic_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.topic_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.topic_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.topic_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.topic_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.topic_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.topic_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.topic_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.topic_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.topic_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.topic_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.topic_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.topic_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.topic_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.topic_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.topic_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.topic_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.topic_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.topic_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.topic_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.topic_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.user_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.user_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.user_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.user_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.user_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.user_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.user_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC.
strimzi.user_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.user_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.user_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.user_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.user_operator.jvm.memory.used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.user_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.user_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.user_operator.jvm.threads.states_threads
(gauge)
The current number of threads
strimzi.user_operator.process.cpu_usage
(gauge)
The recent cpu usage for the Java Virtual Machine process.
strimzi.user_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources
strimzi.user_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.user_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.user_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.user_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.user_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.user_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.user_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.user_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.user_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine
strimzi.user_operator.system.cpu_usage
(gauge)
The recent cpu usage of the system the application is running in
strimzi.user_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.

Events

The Strimzi integration does not include any events.

Service Checks

strimzi.cluster_operator.openmetrics.health
Returns CRITICAL if the Agent is unable to connect to the Cluster Operator OpenMetrics endpoint, otherwise returns OK.
Statuses: ok, critical

strimzi.topic_operator.openmetrics.health
Returns CRITICAL if the Agent is unable to connect to the Topic Operator OpenMetrics endpoint, otherwise returns OK.
Statuses: ok, critical

strimzi.user_operator.openmetrics.health
Returns CRITICAL if the Agent is unable to connect to the User Operator OpenMetrics endpoint, otherwise returns OK.
Statuses: ok, critical

Troubleshooting

Need help? Contact Datadog support.

Further Reading

Additional helpful documentation, links, and articles: