Cluster Checks Runner

While the regular Datadog Agent executes endpoint checks for your node and application pods, the Cluster Checks Runner is specifically dedicated to running cluster checks, which monitor internal Kubernetes services, as well as external services like managed databases and network devices. Using both endpoint checks and cluster checks enables you to monitor both your cluster’s services and any external services your application relies on.

Note: When using the Cluster Checks Runner, it is not necessary to enable cluster checks for the regular Datadog Agent.

Set up

First, deploy the Cluster Agent.

Then, deploy the Cluster Checks Runner using either the Datadog Operator or Helm.

Using the Operator, you can launch and manage all of these resources with a single manifest like the one shown in this example.
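For orientation, a single-manifest setup might look roughly like the sketch below. This is not the contents of the example file referenced above. It assumes the Operator's v2alpha1 DatadogAgent API, and the resource name, Secret name, and key name are hypothetical placeholders. Check the field names against the Operator version you are running.

```yaml
# Minimal sketch, assuming the datadoghq.com/v2alpha1 DatadogAgent schema.
apiVersion: datadoghq.com/v2alpha1
kind: DatadogAgent
metadata:
  name: datadog                      # hypothetical resource name
spec:
  global:
    credentials:
      apiSecret:
        secretName: datadog-secret   # hypothetical Secret holding the API key
        keyName: api-key             # hypothetical key inside that Secret
  features:
    clusterChecks:
      enabled: true                  # turn on cluster checks dispatching
      useClusterChecksRunners: true  # run them on dedicated runners
  override:
    clusterChecksRunner:
      replicas: 2                    # number of Cluster Checks Runner pods
```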

Deploy these resources into your cluster:

kubectl apply -f datadog-agent-with-dca-clusterchecksrunner.yaml

If the configuration was applied successfully, kubectl reports the resource as created.

See the Datadog Operator repo for more information about the Datadog Operator.

Using Helm, you can update the relevant sections of the chart's values, as shown below, to enable cluster checks, the Cluster Agent, and the Cluster Checks Runners at the same time:

datadog:
  clusterChecks:
    enabled: true
clusterAgent:
  enabled: true
clusterChecksRunner:
  enabled: true
  replicas: 2

Use podAntiAffinity to avoid having multiple Cluster Checks Runners on the same node.

Note: The Datadog Operator and the Helm chart use podAntiAffinity to avoid having multiple Cluster Checks Runners on the same node. This is important because the Cluster Agent identifies the Cluster Checks Runners by their node names, so two runners on the same node would cause a name collision.
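If you manage the runner Deployment yourself, a generic Kubernetes anti-affinity rule along these lines keeps two runner pods off the same node. This is a sketch of the standard Kubernetes pod spec fields, not the exact rule the Operator or the Helm chart generates. The label used in the selector is a hypothetical placeholder and must match whatever label your runner pods actually carry.

```yaml
# Fragment of a pod template spec (for example, spec.template.spec of a Deployment).
affinity:
  podAntiAffinity:
    # Hard requirement: do not schedule onto a node that already runs a matching pod.
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: cluster-checks-runner   # hypothetical label on the runner pods
        # Treat each node (hostname) as a separate placement domain.
        topologyKey: kubernetes.io/hostname
```

With a hard requirement like this, a replica stays Pending when there are fewer eligible nodes than replicas, which is preferable here to two runners sharing a node name.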
