Supported OS: Linux, Windows, macOS

Integration version: 3.2.0

Overview

This check monitors Strimzi through the Datadog Agent.

Setup

Installation

The Strimzi check is included in the Datadog Agent package. No additional installation is needed on your server.

This check uses OpenMetrics, which requires Python 3.

Configuration

The Strimzi check collects Prometheus-formatted metrics from the following operators:

  • Cluster
  • Topic
  • User

Note: To monitor Kafka and Zookeeper, use the Kafka, Kafka Consumer, and Zookeeper checks respectively.

Follow the instructions below to enable and configure this check for an Agent.

Host

  1. Edit the strimzi.d/conf.yaml file, in the conf.d/ folder at the root of your Agent's configuration directory, to start collecting your Strimzi performance data. See the sample strimzi.d/conf.yaml for all available configuration options.

  2. Restart the Agent.
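
As a rough illustration of step 1, a minimal strimzi.d/conf.yaml might look like the sketch below. The option names (cluster_operator_endpoint, topic_operator_endpoint, user_operator_endpoint) and the ports are the ones used in the annotation examples on this page. The angle-bracket hostnames are placeholders, and using one instance per operator is an assumption rather than a requirement. List only the operators you actually run.

```yaml
# strimzi.d/conf.yaml: illustrative sketch, not the full sample file
init_config:

instances:
  # Cluster Operator (placeholder host; port 8080 as in the annotation example)
  - cluster_operator_endpoint: "http://<CLUSTER_OPERATOR_HOST>:8080/metrics"
  # Topic Operator (placeholder host; port 8080 as in the annotation example)
  - topic_operator_endpoint: "http://<TOPIC_OPERATOR_HOST>:8080/metrics"
  # User Operator (placeholder host; port 8081 as in the annotation example)
  - user_operator_endpoint: "http://<USER_OPERATOR_HOST>:8081/metrics"
```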

Containerized

For containerized environments, see the Autodiscovery Integration Templates for guidance on applying these instructions. Here is an example of how to configure this on the different operator manifests using pod annotations:

Cluster Operator:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: strimzi-cluster-operator
  labels:
    app: strimzi
  namespace: kafka
spec:
  replicas: 1
  selector:
    matchLabels:
      name: strimzi-cluster-operator
      strimzi.io/kind: cluster-operator
  template:
    metadata:
      labels:
        name: strimzi-cluster-operator
        strimzi.io/kind: cluster-operator
      annotations:
        ad.datadoghq.com/strimzi-cluster-operator.checks: |
          {
            "strimzi": {
              "instances":[
                {
                  "cluster_operator_endpoint": "http://%%host%%:8080/metrics"
                }
              ]
            }
          }          
    spec:
      containers:
      - name: strimzi-cluster-operator
...

Note: The template used for this example can be found here.

Topic and User Operators:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
...
  entityOperator:
    topicOperator: {}
    userOperator: {}
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/topic-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "topic_operator_endpoint": "http://%%host%%:8080/metrics"
                    }
                  ]
                }
              }              
            ad.datadoghq.com/user-operator.checks: |
              {
                "strimzi": {
                  "instances":[
                    {
                      "user_operator_endpoint": "http://%%host%%:8081/metrics"
                    }
                  ]
                }
              }               
...

Note: The template used for this example can be found here.

See the sample strimzi.d/conf.yaml for all available configuration options.

Kafka and Zookeeper

The Kafka and Zookeeper components of Strimzi can be monitored using the Kafka, Kafka Consumer, and Zookeeper checks. Kafka metrics are collected through JMX. For more information on enabling JMX, see the Strimzi documentation on JMX options. Here is an example of how to configure the Kafka, Kafka Consumer, and Zookeeper checks using pod annotations:

apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: my-cluster
spec:
  kafka:
    jmxOptions: {}
    version: 3.4.0
    replicas: 1
    listeners:
      - name: plain
        port: 9092
        type: internal
        tls: false
    template:
      pod:
        metadata:  
          annotations:
            ad.datadoghq.com/kafka.checks: |
              {
                "kafka": {
                  "init_config": {
                    "is_jmx": true, 
                    "collect_default_metrics": true, 
                    "new_gc_metrics": true
                  },
                  "instances":[
                    {
                      "host": "%%host%%",
                      "port": "9999"
                    }
                  ]
                },
                "kafka_consumer": {
                  "init_config": {},
                  "instances": [
                    {
                      "kafka_connect_str": "%%host%%:9092",
                      "monitor_unlisted_consumer_groups": "true"
                    }
                  ]
                }
              }                      
    config:
      offsets.topic.replication.factor: 1
      transaction.state.log.replication.factor: 1
      transaction.state.log.min.isr: 1
      inter.broker.protocol.version: "3.4"
    storage:
      type: ephemeral
  zookeeper:
    replicas: 1
    storage:
      type: ephemeral
    template:
      pod:
        metadata:
          annotations:
            ad.datadoghq.com/zookeeper.checks: |
              {
                "zk": {
                  "instances":[
                    {
                      "host":"%%host%%","port":"2181"
                    }
                  ]
                }
              }               

Note: The template used for this example can be found here.

Log collection

Available for Agent versions >6.0

Strimzi logs can be collected from the different Strimzi pods through Kubernetes. Log collection is disabled by default in the Datadog Agent. To enable it, see Kubernetes Log Collection.
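
For example, if the Agent is deployed with the Datadog Helm chart (an assumption; other installation methods use different settings), log collection is typically switched on in the chart values roughly as follows. Treat this as a sketch and check the Kubernetes log collection documentation for your setup.

```yaml
# values.yaml for the Datadog Helm chart (assumed deployment method)
datadog:
  logs:
    # Turn on log collection in the Agent
    enabled: true
```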

See the Autodiscovery Integration Templates for guidance on applying the parameters below.

Parameter      Value
<LOG_CONFIG>   {"source": "strimzi", "service": "<SERVICE_NAME>"}
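
Applied as a pod annotation, the parameter above might look like the following sketch. It assumes the container is named strimzi-cluster-operator, as in the Cluster Operator manifest earlier on this page. <SERVICE_NAME> remains a placeholder to replace with your own value.

```yaml
# Under the Deployment's spec.template.metadata (container name is an assumption)
annotations:
  ad.datadoghq.com/strimzi-cluster-operator.logs: '[{"source": "strimzi", "service": "<SERVICE_NAME>"}]'
```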

Validation

Run the Agent's status subcommand and look for strimzi under the Checks section.

Data Collected

Metrics

strimzi.cluster_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.cluster_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.cluster_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.cluster_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.cluster_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.cluster_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.cluster_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.cluster_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC to before the next.
strimzi.cluster_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.cluster_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.cluster_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.cluster_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.cluster_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.cluster_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.cluster_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.cluster_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.cluster_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.cluster_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.cluster_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java virtual machine process.
strimzi.cluster_operator.reconciliations.already_enqueued.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.cluster_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.cluster_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.cluster_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.cluster_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.cluster_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.cluster_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.cluster_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.cluster_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.cluster_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.cluster_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.cluster_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.cluster_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.cluster_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.cluster_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.cluster_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.cluster_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.cluster_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.cluster_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.cluster_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.cluster_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.cluster_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.cluster_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.cluster_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.cluster_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.cluster_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.cluster_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.cluster_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.cluster_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.cluster_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.cluster_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.cluster_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.cluster_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.cluster_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.cluster_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.cluster_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.cluster_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.cluster_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.cluster_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.topic_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.topic_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.topic_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.topic_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.topic_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.topic_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.topic_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC to before the next.
strimzi.topic_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.topic_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.topic_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.topic_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.topic_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.topic_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.topic_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.topic_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.topic_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.topic_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.topic_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java virtual machine process.
strimzi.topic_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.topic_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.topic_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.topic_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.topic_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.topic_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.topic_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.topic_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.topic_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.topic_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.topic_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.topic_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.topic_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.
strimzi.topic_operator.vertx.http_client.active_connections
(gauge)
Number of connections to the remote host currently opened.
strimzi.topic_operator.vertx.http_client.active_requests
(gauge)
Number of requests being processed, waiting for a response.
strimzi.topic_operator.vertx.http_client.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_client.bytes_read.count
(count)
Number of bytes received from the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.bytes_written.count
(count)
Number of bytes sent to the remote host.
Shown as byte
strimzi.topic_operator.vertx.http_client.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_client.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.http_client.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_client.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_client.requests.count
(count)
Number of requests sent.
strimzi.topic_operator.vertx.http_client.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_client.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_client.response_time_seconds.count
(count)
Response time in seconds.
strimzi.topic_operator.vertx.http_client.response_time_seconds.max
(gauge)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.response_time_seconds.sum
(count)
Response time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_client.responses.count
(count)
Number of received responses.
strimzi.topic_operator.vertx.http_server.active_connections
(gauge)
Number of opened connections to the HTTP Server.
strimzi.topic_operator.vertx.http_server.active_requests
(gauge)
Number of requests being processed.
strimzi.topic_operator.vertx.http_server.active_ws_connections
(gauge)
Number of websockets currently opened.
strimzi.topic_operator.vertx.http_server.bytes_read.count
(count)
Number of bytes received by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.bytes_written.count
(count)
Number of bytes sent by the HTTP Server.
Shown as byte
strimzi.topic_operator.vertx.http_server.errors.count
(count)
Number of errors.
strimzi.topic_operator.vertx.http_server.request_bytes.count
(count)
Size in bytes of the requests.
strimzi.topic_operator.vertx.http_server.request_bytes.max
(gauge)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_bytes.sum
(count)
Size in bytes of the requests.
Shown as byte
strimzi.topic_operator.vertx.http_server.request_resets.count
(count)
Number of request resets.
strimzi.topic_operator.vertx.http_server.requests.count
(count)
Number of processed requests.
strimzi.topic_operator.vertx.http_server.response_bytes.count
(count)
Size in bytes of the responses.
strimzi.topic_operator.vertx.http_server.response_bytes.max
(gauge)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_bytes.sum
(count)
Size in bytes of the responses.
Shown as byte
strimzi.topic_operator.vertx.http_server.response_time_seconds.count
(count)
Request processing time in seconds.
strimzi.topic_operator.vertx.http_server.response_time_seconds.max
(gauge)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.http_server.response_time_seconds.sum
(count)
Request processing time in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.completed.count
(count)
Number of elements done with the resource (i.e. total number of tasks executed for worker pools).
strimzi.topic_operator.vertx.pool.in_use
(gauge)
Number of resources used.
strimzi.topic_operator.vertx.pool.queue_pending
(gauge)
Number of pending elements in queue.
strimzi.topic_operator.vertx.pool.queue_time_seconds.count
(count)
Time spent in queue before being processed, in seconds.
strimzi.topic_operator.vertx.pool.queue_time_seconds.max
(gauge)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.queue_time_seconds.sum
(count)
Time spent in queue before being processed, in seconds.
Shown as second
strimzi.topic_operator.vertx.pool.ratio
(gauge)
Pool usage ratio, only present if maximum pool size could be determined.
strimzi.topic_operator.vertx.pool.usage_seconds.count
(count)
Time using a resource (i.e. processing time for worker pools).
strimzi.topic_operator.vertx.pool.usage_seconds.max
(gauge)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.topic_operator.vertx.pool.usage_seconds.sum
(count)
Time using a resource (i.e. processing time for worker pools).
Shown as second
strimzi.user_operator.jvm.buffer.count_buffers
(gauge)
An estimate of the number of buffers in the pool.
strimzi.user_operator.jvm.buffer.memory_used_bytes
(gauge)
An estimate of the memory that the Java virtual machine is using for this buffer pool.
Shown as byte
strimzi.user_operator.jvm.buffer.total_capacity_bytes
(gauge)
An estimate of the total capacity of the buffers in this pool.
Shown as byte
strimzi.user_operator.jvm.classes.loaded_classes
(gauge)
The number of classes that are currently loaded in the Java virtual machine.
strimzi.user_operator.jvm.classes.unloaded_classes.count
(count)
The total number of classes unloaded since the Java virtual machine has started execution.
strimzi.user_operator.jvm.gc.live_data_size_bytes
(gauge)
Size of long-lived heap memory pool after reclamation.
Shown as byte
strimzi.user_operator.jvm.gc.max_data_size_bytes
(gauge)
Max size of long-lived heap memory pool.
Shown as byte
strimzi.user_operator.jvm.gc.memory_allocated_bytes.count
(count)
Incremented for an increase in the size of the (young) heap memory pool after one GC to before the next.
strimzi.user_operator.jvm.gc.memory_promoted_bytes.count
(count)
Count of positive increases in the size of the old generation memory pool before GC to after GC.
strimzi.user_operator.jvm.gc.pause_seconds.count
(count)
Time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.max
(gauge)
Max time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.quantile
(gauge)
Quantile of time spent in GC pause.
strimzi.user_operator.jvm.gc.pause_seconds.sum
(count)
Sum of time spent in GC pause.
strimzi.user_operator.jvm.memory.committed_bytes
(gauge)
The amount of memory in bytes that is committed for the Java virtual machine to use.
strimzi.user_operator.jvm.memory.max_bytes
(gauge)
The maximum amount of memory in bytes that can be used for memory management.
Shown as byte
strimzi.user_operator.jvm.memory.used_bytes
(gauge)
The amount of used memory.
Shown as byte
strimzi.user_operator.jvm.threads.daemon_threads
(gauge)
The current number of live daemon threads.
strimzi.user_operator.jvm.threads.live_threads
(gauge)
The current number of live threads including both daemon and non-daemon threads.
strimzi.user_operator.jvm.threads.peak_threads
(gauge)
The peak live thread count since the Java virtual machine started or peak was reset.
strimzi.user_operator.jvm.threads.states_threads
(gauge)
The current number of threads.
strimzi.user_operator.process.cpu_usage
(gauge)
The recent CPU usage for the Java virtual machine process.
strimzi.user_operator.reconciliations.count
(count)
Number of reconciliations done by the operator for individual resources.
strimzi.user_operator.reconciliations.duration_seconds.bucket
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.count
(count)
The time the reconciliation takes to complete.
strimzi.user_operator.reconciliations.duration_seconds.max
(gauge)
The maximum time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.duration_seconds.sum
(count)
The time the reconciliation takes to complete.
Shown as second
strimzi.user_operator.reconciliations.failed.count
(count)
Number of reconciliations done by the operator for individual resources which failed.
strimzi.user_operator.reconciliations.locked.count
(count)
Number of reconciliations skipped because another reconciliation for the same resource was still running.
strimzi.user_operator.reconciliations.periodical.count
(count)
Number of periodical reconciliations done by the operator.
strimzi.user_operator.reconciliations.successful.count
(count)
Number of reconciliations done by the operator for individual resources which were successful.
strimzi.user_operator.resource.state
(gauge)
Current state of the resource: 1 ready, 0 fail.
strimzi.user_operator.resources
(gauge)
Number of custom resources the operator sees.
strimzi.user_operator.resources.paused
(gauge)
Number of custom resources the operator sees but does not reconcile due to paused reconciliations.
strimzi.user_operator.system.cpu_count
(gauge)
The number of processors available to the Java virtual machine.
strimzi.user_operator.system.cpu_usage
(gauge)
The recent CPU usage of the system the application is running in.
strimzi.user_operator.system.load_average_1m
(gauge)
The sum of the number of runnable entities queued to available processors and the number of runnable entities running on the available processors averaged over a period of time.

Events

The Strimzi integration does not include any events.

Service Checks

strimzi.cluster_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Cluster Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.topic_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the Topic Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

strimzi.user_operator.openmetrics.health

Returns CRITICAL if the Agent is unable to connect to the User Operator OpenMetrics endpoint, otherwise returns OK.

Statuses: ok, critical

Troubleshooting

Need help? Contact Datadog support.

Further Reading

Additional helpful documentation, links, and articles: