Overview
AWS Batch is a fully managed batch processing service that dynamically provisions compute resources based on workload requirements.
Enable this integration to see your Batch metrics in Datadog and receive events for job failures.
Setup
Installation
If you haven’t already, set up the Amazon Web Services integration.
Metric collection
Add the following permissions to your Datadog IAM policy in order to collect AWS Batch metrics. For more information, see the Batch policies on the AWS website.
| AWS Permission | Description | 
|---|
batch:DescribeJobs | Used to get detailed descriptions of jobs | 
batch:ListJobs | Used to list all jobs within a job queue | 
batch:DescribeJobQueues | Used to get detailed descriptions of job queues | 
Install the Datadog - AWS Batch integration.
Data Collected
Metrics
 |  | 
|---|
aws.batch.job_started (count) | Count of batch jobs started. Shown as event | 
aws.batch.job_succeeded (count) | Count of batch jobs that successfully executed. Shown as event | 
aws.batch.job_failed (count) | Count of batch jobs that failed execution. Shown as event | 
aws.batch.retry_count (count) | Count of retries for a batch job. Shown as event | 
Events
The AWS Batch integration sends events to Datadog when jobs fail, including job name, job ID, exit code, and failure reason.
Service Checks
The AWS Batch integration does not include any service checks.
Troubleshooting
Need help? Contact Datadog support.