While the regular Datadog Agent executes endpoint checks for your node and application pods, the Cluster Checks Runner is specifically dedicated to running cluster checks, which monitor internal Kubernetes services, as well as external services like managed databases and network devices. Using both endpoint checks and cluster checks enables you to monitor both your cluster’s services and any external services your application relies on.
Note: When using the Cluster Checks Runner, it is not necessary to enable cluster checks for the regular Datadog Agent.
First, deploy the Cluster Agent.
Using the Operator, you can launch and manage all of these resources with a single manifest like the one shown in this example.
Deploy these resources into your cluster:
kubectl apply -f datadog-agent-with-dca-clusterchecksrunner.yaml
You’ll see the following output, confirming that the configuration was applied successfully:
See the Datadog Operator repo for more information about the Datadog Operator.
You can update the relevant sections of the chart, as shown below, to enable cluster checks, the Cluster Agent, and the Cluster Checks Runners at the same time:
[...] clusterChecks: enabled: true [...] clusterAgent: enabled: true [...] clusterChecksRunner: enabled: true replicas: 2
podAntiAffinity to avoid having multiple Cluster Checks Runners on the same node.
Note: The Datadog Operator and the Helm chart use
podAntiAffinity to avoid having multiple Cluster Checks Runners on the same node. This is important because the Cluster Agent identifies the Cluster Checks Runners by their node names, using
podAntiAffinity avoids having name collisions.