Datadog Operator

Supported OS Linux Windows Mac OS

This page is not yet available in Spanish. We are working on its translation.
If you have any questions or feedback about our current translation project, feel free to reach out to us!

Overview

This check monitors the Datadog Operator through the Datadog Agent.

Setup

Installation

Please see the documentation for Datadog Operator.

Data Collected

Metrics

datadog.operator.admission_controller.feature.enabled
(gauge)
1 if the Admission Controller feature is enabled
datadog.operator.agent.deployment.success
(gauge)
1 if the desired number of Agent replicas equals the number of available Agent pods. 0 otherwise.
datadog.operator.cluster_checks.feature.enabled
(gauge)
1 if the Cluster checks feature is enabled
datadog.operator.clusteragent.deployment.success
(gauge)
1 if the desired number of Cluster Agent replicas equals the number of available Cluster Agent pods. 0 otherwise.
datadog.operator.clusterchecksrunner.deployment.success
(gauge)
1 if the desired number of Cluster Check Runner replicas equals the number of available Cluster Check Runner pods. 0 otherwise.
datadog.operator.controller_runtime_active_workers
(gauge)
Number of currently used workers per controller
Shown as worker
datadog.operator.controller_runtime_max_concurrent_reconciles
(gauge)
Maximum number of concurrent reconciles per controller
Shown as operation
datadog.operator.controller_runtime_reconcile_errors_total
(count)
Total number of reconciliation errors per controller
Shown as error
datadog.operator.controller_runtime_reconcile_time_seconds.count
(count)
Count of time per reconciliation per controller
Shown as second
datadog.operator.controller_runtime_reconcile_time_seconds.sum
(count)
Sum of time per reconciliation per controller
Shown as second
datadog.operator.controller_runtime_reconcile_total
(count)
Total number of reconciliations per controller
Shown as operation
datadog.operator.default.feature.enabled
(gauge)
datadog.operator.dogstatsd.feature.enabled
(gauge)
1 if the DogStatsD feature is enabled
datadog.operator.eds_controller_leader
(gauge)
datadog.operator.event_collection.feature.enabled
(gauge)
1 if the Event Collection feature is enabled
datadog.operator.go_gc_duration_seconds.count
(count)
Count of the GC invocation durations
Shown as second
datadog.operator.go_gc_duration_seconds.quantile
(gauge)
Quantiles of the GC invocation durations
Shown as second
datadog.operator.go_gc_duration_seconds.sum
(count)
Sum of the GC invocation durations
Shown as second
datadog.operator.go_goroutines
(gauge)
Number of goroutines that currently exist
Shown as thread
datadog.operator.go_info
(gauge)
Go version
datadog.operator.go_memstats_alloc_bytes
(gauge)
Number of bytes allocated and still in use
Shown as byte
datadog.operator.go_memstats_alloc_bytes_total
(count)
Total number of bytes allocated even if freed
Shown as byte
datadog.operator.go_memstats_buck_hash_sys_bytes
(gauge)
Number of bytes used by the profiling bucket hash table
Shown as byte
datadog.operator.go_memstats_frees_total
(count)
Total number of frees
datadog.operator.go_memstats_gc_cpu_fraction
(gauge)
The fraction of this program's available CPU time used by the GC since the program started
Shown as fraction
datadog.operator.go_memstats_gc_sys_bytes
(gauge)
Number of bytes used for garbage collection system metadata
Shown as byte
datadog.operator.go_memstats_heap_alloc_bytes
(gauge)
Number of heap bytes allocated and still in use
Shown as byte
datadog.operator.go_memstats_heap_idle_bytes
(gauge)
Number of heap bytes waiting to be used
Shown as byte
datadog.operator.go_memstats_heap_inuse_bytes
(gauge)
Number of heap bytes that are in use
Shown as byte
datadog.operator.go_memstats_heap_objects
(gauge)
Number of allocated objects
Shown as object
datadog.operator.go_memstats_heap_released_bytes
(gauge)
Number of heap bytes released to OS
Shown as byte
datadog.operator.go_memstats_heap_sys_bytes
(gauge)
Number of heap bytes obtained from system
Shown as byte
datadog.operator.go_memstats_last_gc_time_seconds
(gauge)
Number of seconds since 1970 of last garbage collection
datadog.operator.go_memstats_lookups_total
(count)
Total number of pointer lookups
datadog.operator.go_memstats_mallocs_total
(count)
TTotal number of mallocs
datadog.operator.go_memstats_mcache_inuse_bytes
(gauge)
Number of bytes in use by mcache structures
Shown as byte
datadog.operator.go_memstats_mcache_sys_bytes
(gauge)
Number of bytes used for mcache structures obtained from system
Shown as byte
datadog.operator.go_memstats_mspan_inuse_bytes
(gauge)
Number of bytes in use by mspan structures
Shown as byte
datadog.operator.go_memstats_mspan_sys_bytes
(gauge)
Number of bytes used for mspan structures obtained from system
datadog.operator.go_memstats_next_gc_bytes
(gauge)
Number of heap bytes when next garbage collection will take place
Shown as byte
datadog.operator.go_memstats_other_sys_bytes
(gauge)
Number of bytes used for other system allocations
Shown as byte
datadog.operator.go_memstats_stack_inuse_bytes
(gauge)
Number of bytes in use by the stack allocator
Shown as byte
datadog.operator.go_memstats_stack_sys_bytes
(gauge)
Number of bytes obtained from system for stack allocator
Shown as byte
datadog.operator.go_memstats_sys_bytes
(gauge)
Number of bytes obtained from system
Shown as byte
datadog.operator.go_threads
(gauge)
Number of OS threads created
Shown as thread
datadog.operator.ksm.feature.enabled
(gauge)
1 if the Kubernetes State Metrics Core feature is enabled
datadog.operator.live_container.feature.enabled
(gauge)
1 if the Live Containers feature is enabled
datadog.operator.log_collection.feature.enabled
(gauge)
1 if the Log Collection feature is enabled
datadog.operator.orchestrator_explorer.feature.enabled
(gauge)
1 if the Kubernetes Orchestrator Explorer feature is enabled
datadog.operator.process_cpu_seconds_total
(count)
Total user and system CPU time spent in seconds
Shown as second
datadog.operator.process_max_fds
(gauge)
Maximum number of open file descriptors
Shown as file
datadog.operator.process_open_fds
(gauge)
Number of open file descriptors
Shown as file
datadog.operator.process_resident_memory_bytes
(gauge)
Resident memory size in bytes
Shown as byte
datadog.operator.process_start_time_seconds
(gauge)
Start time of the process since unix epoch in seconds
datadog.operator.process_virtual_memory_bytes
(gauge)
Virtual memory size in bytes
Shown as byte
datadog.operator.process_virtual_memory_max_bytes
(gauge)
Maximum amount of virtual memory available in bytes
Shown as byte
datadog.operator.reconcile.success
(gauge)
1 if the last recorded reconcile error is null, 0 otherwise. The reconcile_err tag describes the last recorded error.
datadog.operator.remote_config.feature.enabled
(gauge)
1 if the Remote Configuration feature is enabled
datadog.operator.rest_client_requests_total
(count)
Number of HTTP requests, partitioned by status code, method, and host
Shown as request
datadog.operator.workqueue_adds_total
(count)
Total number of adds handled by workqueue
datadog.operator.workqueue_depth
(gauge)
Current depth of workqueue
datadog.operator.workqueue_longest_running_processor_seconds
(gauge)
How many seconds has the longest running processor for workqueue been running.
Shown as second
datadog.operator.workqueue_queue_duration_seconds.count
(count)
Count of how long in seconds an item stays in workqueue before being requested
Shown as second
datadog.operator.workqueue_queue_duration_seconds.sum
(count)
Sum of how long in seconds an item stays in workqueue before being requested
Shown as second
datadog.operator.workqueue_retries_total
(count)
Total number of retries handled by workqueue
datadog.operator.workqueue_unfinished_work_seconds
(gauge)
How many seconds of work has been done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads. One can deduce the number of stuck threads by observing the rate at which this increases.
Shown as second
datadog.operator.workqueue_work_duration_seconds.count
(count)
Count of how long in seconds processing an item from workqueue takes
Shown as second
datadog.operator.workqueue_work_duration_seconds.sum
(count)
Sum of how long in seconds processing an item from workqueue takes
Shown as second

Events

The Datadog Operator integration does not include any events.

Service Checks

Troubleshooting

Need help? Contact Datadog support.