AWS Batch

Integration version: 1.0.0

Overview

AWS Batch is a fully managed batch processing service that dynamically provisions compute resources based on workload requirements.

Enable this integration to see your Batch metrics in Datadog and receive events for job failures.

Setup

Installation

If you haven’t already, set up the Amazon Web Services integration.

Metric collection

  1. Add the following permissions to your Datadog IAM policy to collect AWS Batch metrics. For more information, see the Batch policies on the AWS website.

    AWS Permission            Description
    batch:DescribeJobs        Used to get detailed descriptions of jobs
    batch:ListJobs            Used to list all jobs within a job queue
    batch:DescribeJobQueues   Used to get detailed descriptions of job queues
  2. Install the Datadog - AWS Batch integration.
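
As a minimal sketch, the three permissions from step 1 could be expressed as an IAM policy statement like the one generated below. The statement ID `DatadogBatchReadOnly`, the helper name `build_batch_policy`, and the unscoped `"Resource": "*"` are assumptions made for illustration; adapt them to your account's conventions, and merge the statement into your existing Datadog policy rather than replacing it.

```python
import json

# The three Batch actions listed in the permissions table in step 1.
BATCH_ACTIONS = [
    "batch:DescribeJobs",
    "batch:ListJobs",
    "batch:DescribeJobQueues",
]


def build_batch_policy(actions=BATCH_ACTIONS):
    """Return an IAM policy document (as a dict) that allows the given actions."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "DatadogBatchReadOnly",  # hypothetical statement ID
                "Effect": "Allow",
                "Action": list(actions),
                "Resource": "*",  # assumption: permissions are not scoped to specific resources
            }
        ],
    }


if __name__ == "__main__":
    # Print the policy as JSON so it can be pasted into the IAM console.
    print(json.dumps(build_batch_policy(), indent=2))
```

Running the script prints a JSON document that can be reviewed before it is attached to the role used by the Datadog AWS integration.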

Data Collected

Metrics

aws.batch.job_started (count)
Count of batch jobs started.
Shown as event

aws.batch.job_succeeded (count)
Count of batch jobs that successfully executed.
Shown as event

aws.batch.job_failed (count)
Count of batch jobs that failed execution.
Shown as event

aws.batch.retry_count (count)
Count of retries for a batch job.
Shown as event

Events

The AWS Batch integration sends events to Datadog when jobs fail, including job name, job ID, exit code, and failure reason.
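
To illustrate where these four fields typically originate, the sketch below extracts them from a dictionary shaped like an AWS Batch DescribeJobs response. It is not how the integration builds its events, which this page does not specify. The function names and the sample job values are hypothetical, and the fallback from `statusReason` to the container-level `reason` is an assumption.

```python
def summarize_failed_job(job):
    """Pick out the fields named above from one DescribeJobs job entry."""
    container = job.get("container") or {}
    return {
        "job_name": job.get("jobName"),
        "job_id": job.get("jobId"),
        "exit_code": container.get("exitCode"),
        # Assumption: prefer the job-level reason, fall back to the container's.
        "failure_reason": job.get("statusReason") or container.get("reason"),
    }


def failed_job_summaries(response):
    """Return summaries for every job in the response whose status is FAILED."""
    return [
        summarize_failed_job(job)
        for job in response.get("jobs", [])
        if job.get("status") == "FAILED"
    ]


# Hypothetical sample data in the shape of a DescribeJobs response.
SAMPLE_RESPONSE = {
    "jobs": [
        {
            "jobName": "nightly-report",
            "jobId": "example-job-id-1",
            "status": "FAILED",
            "statusReason": "Essential container in task exited",
            "container": {"exitCode": 1},
        },
        {
            "jobName": "thumbnail-render",
            "jobId": "example-job-id-2",
            "status": "SUCCEEDED",
            "container": {"exitCode": 0},
        },
    ]
}

if __name__ == "__main__":
    for summary in failed_job_summaries(SAMPLE_RESPONSE):
        print(summary)
```

With real data, the response passed to `failed_job_summaries` would come from a DescribeJobs call, which requires the `batch:DescribeJobs` permission listed under Metric collection.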

Service Checks

The AWS Batch integration does not include any service checks.

Troubleshooting

Need help? Contact Datadog support.