---
title: Amazon EMR
description: Quickly and cost-effectively process vast amounts of data.
breadcrumbs: Docs > Integrations > Amazon EMR
---

> For the complete documentation index, see [llms.txt](https://docs.datadoghq.com/llms.txt).

# Amazon EMR
Integration version1.0.0
{% callout %}
# Important note for users on the following Datadog sites: us2.ddog-gov.com

{% alert level="info" %}
To find out if this integration is available in your organization, see your [Datadog Integrations](https://app.datadoghq.com/integrations) page or ask your organization administrator.

To initiate an exception request to enable this integration for your organization, email [support@ddog-gov.com](mailto:support@ddog-gov.com).
{% /alert %}

{% /callout %}

## Overview{% #overview %}

[Data Observability: Jobs Monitoring](https://docs.datadoghq.com/data_jobs.md) helps you observe, troubleshoot, and cost-optimize your Spark jobs on your EMR clusters.

Amazon EMR is a web service that makes it easy to quickly and cost-effectively process vast amounts of data.

Enable this integration to see EMR metrics in Datadog.

## Setup{% #setup %}

### Installation{% #installation %}

If you haven't already, set up the [Amazon Web Services integration](https://docs.datadoghq.com/integrations/amazon_web_services.md) first.

### Metric collection{% #metric-collection %}

1. In the [AWS integration page](https://app.datadoghq.com/integrations/amazon-web-services), ensure that `EMR` is enabled under the `Metric Collection` tab.

1. Add the following permissions to your [Datadog IAM policy](https://docs.datadoghq.com/integrations/amazon_web_services.md#installation) to collect Amazon EMR metrics. For more information, see the [EMR policies](https://docs.aws.amazon.com/elasticloadbalancing/latest/userguide/load-balancer-authentication-access-control.html) on the AWS website.

| AWS Permission                     | Description                         |
| ---------------------------------- | ----------------------------------- |
| `elasticmapreduce:ListClusters`    | List available clusters.            |
| `elasticmapreduce:DescribeCluster` | Add tags to CloudWatch EMR metrics. |

1. Install the [Datadog - Amazon EMR integration](https://app.datadoghq.com/integrations/amazon-emr).

### Log collection{% #log-collection %}

#### Enable logging{% #enable-logging %}

Configure Amazon EMR to send logs either to a S3 bucket or to CloudWatch.

**Note**: If you log to a S3 bucket, make sure that `amazon_emr` is set as *Target prefix*.

#### Send logs to Datadog{% #send-logs-to-datadog %}

1. If you haven't already, set up the [Datadog Forwarder Lambda function](https://docs.datadoghq.com/logs/guide/forwarder.md).

1. Once the Lambda function is installed, manually add a trigger on the S3 bucket or CloudWatch log group that contains your Amazon EMR logs in the AWS console:

   - [Add a manual trigger on the S3 bucket](https://docs.datadoghq.com/logs/guide/send-aws-services-logs-with-the-datadog-lambda-function.md#collecting-logs-from-s3-buckets)
   - [Add a manual trigger on the CloudWatch Log Group](https://docs.datadoghq.com/logs/guide/send-aws-services-logs-with-the-datadog-lambda-function.md#collecting-logs-from-cloudwatch-log-group)

## Data Collected{% #data-collected %}

### Metrics{% #metrics %}

|  |
|  |
| **aws.elasticmapreduce.apps\_completed**(gauge)                                            | The average number of applications submitted to YARN that have completed. (Hadoop v2 only)                                                                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.apps\_completed.sum**(gauge)                                        | The sum of the number of applications submitted to YARN that have completed. (Hadoop v2 only)                                                                                                                                                                                                                                                                    |
| **aws.elasticmapreduce.apps\_failed**(gauge)                                               | The average number of applications submitted to YARN that have failed to complete. (Hadoop v2 only)                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.apps\_failed.sum**(gauge)                                           | The sum of the number of applications submitted to YARN that have failed to complete. (Hadoop v2 only)                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.apps\_killed**(gauge)                                               | The average number of applications submitted to YARN that have been killed. (Hadoop v2 only)                                                                                                                                                                                                                                                                     |
| **aws.elasticmapreduce.apps\_killed.sum**(gauge)                                           | The sum of the number of applications submitted to YARN that have been killed. (Hadoop v2 only)                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.apps\_pending**(gauge)                                              | The average number of applications submitted to YARN that are in a pending state. (Hadoop v2 only)                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.apps\_pending.sum**(gauge)                                          | The sum of the number of applications submitted to YARN that are in a pending state. (Hadoop v2 only)                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.apps\_running**(gauge)                                              | The average number of applications submitted to YARN that are running. (Hadoop v2 only)                                                                                                                                                                                                                                                                          |
| **aws.elasticmapreduce.apps\_running.sum**(gauge)                                          | The sum of the number of applications submitted to YARN that are running. (Hadoop v2 only)                                                                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.apps\_submitted**(gauge)                                            | The average number of applications submitted to YARN. (Hadoop v2 only)                                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.apps\_submitted.sum**(gauge)                                        | The sum of the number of applications submitted to YARN. (Hadoop v2 only)                                                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.capacity\_remaining\_gb**(gauge)                                    | The average amount of remaining HDFS disk capacity. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.capacity\_remaining\_gb.sum**(gauge)                                | The sum of the amount of remaining HDFS disk capacity. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.container\_allocated**(gauge)                                       | The average number of resource containers allocated by the ResourceManager. (Hadoop v2 only)                                                                                                                                                                                                                                                                     |
| **aws.elasticmapreduce.container\_allocated.sum**(gauge)                                   | The sum of the number of resource containers allocated by the ResourceManager. (Hadoop v2 only)                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.container\_pending**(gauge)                                         | The average number of containers in the queue that have not yet been allocated. (Hadoop v2 only)                                                                                                                                                                                                                                                                 |
| **aws.elasticmapreduce.container\_pending.sum**(gauge)                                     | The sum of the number of containers in the queue that have not yet been allocated. (Hadoop v2 only)                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.container\_pending\_ratio**(gauge)                                  | The average percentage of containers in the queue that have not yet been allocated. (Hadoop v2 only)*Shown as percent*                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.container\_pending\_ratio.sum**(gauge)                              | The sum of the percentage of containers in the queue that have not yet been allocated. (Hadoop v2 only)*Shown as percent*                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.container\_reserved**(gauge)                                        | The average number of containers reserved. (Hadoop v2 only)                                                                                                                                                                                                                                                                                                      |
| **aws.elasticmapreduce.container\_reserved.sum**(gauge)                                    | The sum of the number of containers reserved. (Hadoop v2 only)                                                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.core\_nodes\_pending**(gauge)                                       | The average number of core nodes waiting to be assigned. All of the core nodes requested may not be immediately available; this metric reports the pending requests. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                    |
| **aws.elasticmapreduce.core\_nodes\_pending.sum**(gauge)                                   | The sum of the number of core nodes waiting to be assigned. All of the core nodes requested may not be immediately available; this metric reports the pending requests. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                 |
| **aws.elasticmapreduce.core\_nodes\_requested**(gauge)                                     | The average number of core nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.core\_nodes\_requested.sum**(gauge)                                 | The sum of the number of core nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                |
| **aws.elasticmapreduce.core\_nodes\_running**(gauge)                                       | The average number of core nodes working. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                                                                                                                                               |
| **aws.elasticmapreduce.core\_nodes\_running.sum**(gauge)                                   | The sum of the number of core nodes working. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                                                                                                                                            |
| **aws.elasticmapreduce.core\_units\_requested**(gauge)                                     | The average number of core units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.core\_units\_requested.sum**(gauge)                                 | The sum of the number of core units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                |
| **aws.elasticmapreduce.core\_units\_running**(gauge)                                       | The target number of core units working. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                                                                                                                                                |
| **aws.elasticmapreduce.core\_units\_running.sum**(gauge)                                   | The sum of the number of core units working. Data points for this metric are reported only when a corresponding instance group exists.*Shown as node*                                                                                                                                                                                                            |
| **aws.elasticmapreduce.corrupt\_blocks**(gauge)                                            | The average number of blocks that HDFS reports as corrupted. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                                    |
| **aws.elasticmapreduce.corrupt\_blocks.sum**(gauge)                                        | The sum of the number of blocks that HDFS reports as corrupted. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                                 |
| **aws.elasticmapreduce.dfs\_fsnamesystem\_pending\_replication\_blocks**(gauge)            | The status of block replication: blocks being replicated, age of replication requests, and unsuccessful replication requests. (Hadoop v2 only)                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.hbase\_backup\_failed**(gauge)                                      | Whether the last backup failed. This is set to 0 by default and updated to 1 if the previous backup attempt failed. This metric is only reported for HBase clusters. (Hadoop v2 only)                                                                                                                                                                            |
| **aws.elasticmapreduce.hbase\_most\_recent\_backup\_duration**(gauge)                      | The amount of time it took the previous backup to complete. This metric is set regardless of whether the last completed backup succeeded or failed. While the backup is ongoing, this metric returns the number of minutes after the backup started. This metric is only reported for HBase clusters.*Shown as minute*                                           |
| **aws.elasticmapreduce.hbase\_time\_since\_last\_successful\_backup**(gauge)               | The number of elapsed minutes after the last successful HBase backup started on your cluster. This metric is only reported for HBase clusters.*Shown as minute*                                                                                                                                                                                                  |
| **aws.elasticmapreduce.hdfsbytes\_read**(gauge)                                            | The average number of bytes read from HDFS.*Shown as byte*                                                                                                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.hdfsbytes\_read.sum**(gauge)                                        | The sum of the number of bytes read from HDFS.*Shown as byte*                                                                                                                                                                                                                                                                                                    |
| **aws.elasticmapreduce.hdfsbytes\_written**(gauge)                                         | The average number of bytes written to HDFS.*Shown as byte*                                                                                                                                                                                                                                                                                                      |
| **aws.elasticmapreduce.hdfsbytes\_written.sum**(gauge)                                     | The sum of the number of bytes written to HDFS.*Shown as byte*                                                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.hdfsutilization**(gauge)                                            | The percentage of HDFS storage currently used.*Shown as percent*                                                                                                                                                                                                                                                                                                 |
| **aws.elasticmapreduce.is\_idle**(gauge)                                                   | Indicates that a cluster is no longer performing work, but is still alive and accruing charges. It is set to 1 if no tasks are running and no jobs are running, and set to 0 otherwise. This value is checked at five-minute intervals and a value of 1 indicates only that the cluster was idle when checked, not that it was idle for the entire five minutes. |
| **aws.elasticmapreduce.jobs\_failed**(gauge)                                               | The average number of jobs in the cluster that have failed. (Hadoop v1 only)                                                                                                                                                                                                                                                                                     |
| **aws.elasticmapreduce.jobs\_failed.sum**(gauge)                                           | The sum of the number of jobs in the cluster that have failed. (Hadoop v1 only)                                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.jobs\_running**(gauge)                                              | The average number of jobs in the cluster that are currently running. (Hadoop v1 only)                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.jobs\_running.sum**(gauge)                                          | The sum of the number of jobs in the cluster that are currently running. (Hadoop v1 only)                                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.live\_data\_nodes**(gauge)                                          | The percentage of data nodes that are receiving work from Hadoop.*Shown as percent*                                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.live\_task\_trackers**(gauge)                                       | The percentage of task trackers that are functional. (Hadoop v1 only)*Shown as percent*                                                                                                                                                                                                                                                                          |
| **aws.elasticmapreduce.map\_slots\_open**(gauge)                                           | The average unused map task capacity. This is calculated as the maximum number of map tasks for a given cluster, less the total number of map tasks currently running in that cluster. (Hadoop v1 only)                                                                                                                                                          |
| **aws.elasticmapreduce.map\_slots\_open.sum**(gauge)                                       | The sum of the unused map task capacity. This is calculated as the maximum number of map tasks for a given cluster, less the total number of map tasks currently running in that cluster. (Hadoop v1 only)                                                                                                                                                       |
| **aws.elasticmapreduce.memory\_allocated\_mb**(gauge)                                      | The average amount of memory allocated to the cluster. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.memory\_allocated\_mb.sum**(gauge)                                  | The sum of the amount of memory allocated to the cluster. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.memory\_available\_mb**(gauge)                                      | The average amount of memory available to be allocated. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                          |
| **aws.elasticmapreduce.memory\_available\_mb.sum**(gauge)                                  | The sum of the amount of memory available to be allocated. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.memory\_reserved\_mb**(gauge)                                       | The average amount of memory reserved. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.memory\_reserved\_mb.sum**(gauge)                                   | The sum of the amount of memory reserved. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.memory\_total\_mb**(gauge)                                          | The average total amount of memory in the cluster. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.memory\_total\_mb.sum**(gauge)                                      | The sum of the total amount of memory in the cluster. (Hadoop v2 only)*Shown as byte*                                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.missing\_blocks**(gauge)                                            | The average number of blocks in which HDFS has no replicas. These might be corrupt blocks.*Shown as block*                                                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.missing\_blocks.sum**(gauge)                                        | The sum of the number of blocks in which HDFS has no replicas. These might be corrupt blocks.*Shown as block*                                                                                                                                                                                                                                                    |
| **aws.elasticmapreduce.mractive\_nodes**(gauge)                                            | The average number of nodes presently running MapReduce tasks or jobs. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.mractive\_nodes.sum**(gauge)                                        | The sum of the number of nodes presently running MapReduce tasks or jobs. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.mrdecommissioned\_nodes**(gauge)                                    | The average number of nodes allocated to MapReduce applications that have been marked in a DECOMMISSIONED state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                 |
| **aws.elasticmapreduce.mrdecommissioned\_nodes.sum**(gauge)                                | The sum of the number of nodes allocated to MapReduce applications that have been marked in a DECOMMISSIONED state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                              |
| **aws.elasticmapreduce.mrlost\_nodes**(gauge)                                              | The average number of nodes allocated to MapReduce that have been marked in a LOST state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.mrlost\_nodes.sum**(gauge)                                          | The sum of the number of nodes allocated to MapReduce that have been marked in a LOST state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                     |
| **aws.elasticmapreduce.mrrebooted\_nodes**(gauge)                                          | The average number of nodes available to MapReduce that have been rebooted and marked in a REBOOTED state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                       |
| **aws.elasticmapreduce.mrrebooted\_nodes.sum**(gauge)                                      | The sum of the number of nodes available to MapReduce that have been rebooted and marked in a REBOOTED state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                    |
| **aws.elasticmapreduce.mrtotal\_nodes**(gauge)                                             | The average number of nodes presently available to MapReduce jobs. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.mrtotal\_nodes.sum**(gauge)                                         | The sum of the number of nodes presently available to MapReduce jobs. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.mrunhealthy\_nodes**(gauge)                                         | The average number of nodes available to MapReduce jobs marked in an UNHEALTHY state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.mrunhealthy\_nodes.sum**(gauge)                                     | The sum of the number of nodes available to MapReduce jobs marked in an UNHEALTHY state. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                         |
| **aws.elasticmapreduce.multi\_master\_instance\_group\_nodes\_requested**(count)           | The number of requested master nodes. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.multi\_master\_instance\_group\_nodes\_running**(count)             | The number of running master nodes. (Hadoop v2 only)*Shown as node*                                                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.multi\_master\_instance\_group\_nodes\_running\_percentage**(gauge) | The percentage of master nodes that are running over the requested master node instance count. (Hadoop v2 only)*Shown as percent*                                                                                                                                                                                                                                |
| **aws.elasticmapreduce.no\_of\_black\_listed\_task\_trackers**(gauge)                      | The average number of blacklisted TaskTracker nodes.*Shown as node*                                                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.no\_of\_black\_listed\_task\_trackers.sum**(gauge)                  | The sum of the number of blacklisted TaskTracker nodes.*Shown as node*                                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.no\_of\_gray\_listed\_task\_trackers**(gauge)                       | The average number of graylisted TaskTracker nodes.*Shown as node*                                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.no\_of\_gray\_listed\_task\_trackers.sum**(gauge)                   | The sum of the number of graylisted TaskTracker nodes.*Shown as node*                                                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.pending\_deletion\_blocks**(gauge)                                  | The average number of blocks marked for deletion. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.pending\_deletion\_blocks.sum**(gauge)                              | The sum of the number of blocks marked for deletion. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.reduce\_slots\_open**(gauge)                                        | Average unused reduce task capacity. This is calculated as the maximum reduce task capacity for a given cluster, less the number of reduce tasks currently running in that cluster. (Hadoop v1 only)                                                                                                                                                             |
| **aws.elasticmapreduce.reduce\_slots\_open.sum**(gauge)                                    | The sum of unused reduce task capacity. This is calculated as the maximum reduce task capacity for a given cluster, less the number of reduce tasks currently running in that cluster. (Hadoop v1 only)                                                                                                                                                          |
| **aws.elasticmapreduce.remaining\_map\_tasks**(gauge)                                      | The average number of remaining map tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. A remaining map task is one that is not in any of the following states: Running, Killed, or Completed. (Hadoop v1 only)*Shown as task*                                                                       |
| **aws.elasticmapreduce.remaining\_map\_tasks.sum**(gauge)                                  | The sum of the number of remaining map tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. A remaining map task is one that is not in any of the following states: Running, Killed, or Completed. (Hadoop v1 only)*Shown as task*                                                                    |
| **aws.elasticmapreduce.remaining\_map\_tasks\_per\_slot**(gauge)                           | The ratio of the total map tasks remaining to the total map slots available in the cluster. (Hadoop v1 only)                                                                                                                                                                                                                                                     |
| **aws.elasticmapreduce.remaining\_reduce\_tasks**(gauge)                                   | The average number of remaining reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                           |
| **aws.elasticmapreduce.remaining\_reduce\_tasks.sum**(gauge)                               | The sum of the number of remaining reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                        |
| **aws.elasticmapreduce.running\_map\_tasks**(gauge)                                        | The average number of running map tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                                |
| **aws.elasticmapreduce.running\_map\_tasks.sum**(gauge)                                    | The sum of the number of running map tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                             |
| **aws.elasticmapreduce.running\_reduce\_tasks**(gauge)                                     | The average number of running reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                             |
| **aws.elasticmapreduce.running\_reduce\_tasks.sum**(gauge)                                 | The sum of the number of running reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated. (Hadoop v1 only)*Shown as task*                                                                                                                                                                          |
| **aws.elasticmapreduce.s\_3bytes\_read**(gauge)                                            | The average number of bytes read from Amazon S3.*Shown as byte*                                                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.s\_3bytes\_read.sum**(gauge)                                        | The sum of the number of bytes read from Amazon S3.*Shown as byte*                                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.s\_3bytes\_written**(gauge)                                         | The average number of bytes written to Amazon S3.*Shown as byte*                                                                                                                                                                                                                                                                                                 |
| **aws.elasticmapreduce.s\_3bytes\_written.sum**(gauge)                                     | The sum of the number of bytes written to Amazon S3.*Shown as byte*                                                                                                                                                                                                                                                                                              |
| **aws.elasticmapreduce.task\_nodes\_pending**(gauge)                                       | The average number of task nodes waiting to be assigned. All of the task nodes requested may not be immediately available; this metric reports the pending requests. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                   |
| **aws.elasticmapreduce.task\_nodes\_pending.sum**(gauge)                                   | The sum of the number of task nodes waiting to be assigned. All of the task nodes requested may not be immediately available; this metric reports the pending requests. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                |
| **aws.elasticmapreduce.task\_nodes\_requested**(gauge)                                     | The average number of task nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.task\_nodes\_requested.sum**(gauge)                                 | The sum of the number of task nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                |
| **aws.elasticmapreduce.task\_nodes\_running**(gauge)                                       | The average number of task nodes working. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                                                                                                                                              |
| **aws.elasticmapreduce.task\_nodes\_running.sum**(gauge)                                   | The sum of the number of task nodes working. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                                                                                                                                           |
| **aws.elasticmapreduce.task\_units\_requested**(gauge)                                     | The average number of task units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.task\_units\_requested.sum**(gauge)                                 | The sum of the number of task units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                |
| **aws.elasticmapreduce.task\_units\_running**(gauge)                                       | The average number of task units working. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                                                                                                                                              |
| **aws.elasticmapreduce.task\_units\_running.sum**(gauge)                                   | The sum of the number of task units working. Data points for this metric are reported only when a corresponding instance group exists. (Hadoop v1 only)*Shown as node*                                                                                                                                                                                           |
| **aws.elasticmapreduce.total\_load**(gauge)                                                | The average total number of concurrent data transfers.                                                                                                                                                                                                                                                                                                           |
| **aws.elasticmapreduce.total\_load.sum**(gauge)                                            | The sum of the total number of concurrent data transfers.                                                                                                                                                                                                                                                                                                        |
| **aws.elasticmapreduce.total\_map\_tasks**(gauge)                                          | The average total number of map tasks.*Shown as task*                                                                                                                                                                                                                                                                                                            |
| **aws.elasticmapreduce.total\_map\_tasks.sum**(gauge)                                      | The sum of the total number of map tasks.*Shown as task*                                                                                                                                                                                                                                                                                                         |
| **aws.elasticmapreduce.total\_nodes\_requested**(gauge)                                    | The sum total number of nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                      |
| **aws.elasticmapreduce.total\_nodes\_requested.average**(gauge)                            | The average of total number of nodes in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.total\_nodes\_running**(gauge)                                      | The current average number of nodes available in a running cluster.*Shown as node*                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.total\_reduce\_tasks**(gauge)                                       | The average total number of reduce tasks.*Shown as task*                                                                                                                                                                                                                                                                                                         |
| **aws.elasticmapreduce.total\_reduce\_tasks.sum**(gauge)                                   | The sum of the total number of reduce tasks.*Shown as task*                                                                                                                                                                                                                                                                                                      |
| **aws.elasticmapreduce.total\_units\_requested**(gauge)                                    | The average total number of units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.total\_units\_requested.sum**(gauge)                                | The sum of total number of units in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.total\_units\_running**(gauge)                                      | The current average number of units available in a running cluster.*Shown as node*                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.total\_units\_running.sum**(gauge)                                  | The current sum of units available in a running cluster.*Shown as node*                                                                                                                                                                                                                                                                                          |
| **aws.elasticmapreduce.total\_vcpurequested**(gauge)                                       | The average total number of vCPUs in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                  |
| **aws.elasticmapreduce.total\_vcpurequested.sum**(gauge)                                   | The sum of total number of vCPUs in a cluster as determined by managed scaling.*Shown as node*                                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.total\_vcpurunning**(gauge)                                         | The current average number of vCPUs available in a running cluster.*Shown as node*                                                                                                                                                                                                                                                                               |
| **aws.elasticmapreduce.total\_vcpurunning.sum**(gauge)                                     | The current sum of vCPUs available in a running cluster.*Shown as node*                                                                                                                                                                                                                                                                                          |
| **aws.elasticmapreduce.under\_replicated\_blocks**(gauge)                                  | The average number of blocks that need to be replicated one or more times. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                      |
| **aws.elasticmapreduce.under\_replicated\_blocks.sum**(gauge)                              | The sum of the number of blocks that need to be replicated one or more times. (Hadoop v2 only)*Shown as block*                                                                                                                                                                                                                                                   |
| **aws.elasticmapreduce.yarnmemory\_available\_percentage**(gauge)                          | The percentage of remaining memory available to YARN. (Hadoop v2 only)*Shown as percent*                                                                                                                                                                                                                                                                         |
| **aws.elasticmapreduce.backup\_failed**(count)                                             | Whether the last backup failed. (Hadoop v1 only)                                                                                                                                                                                                                                                                                                                 |

Each of the metrics retrieved from AWS is assigned the same tags that appear in the AWS console, including but not limited to host name, security-groups, and more.

### Events{% #events %}

The Amazon EMR integration does not include any events.

### Service Checks{% #service-checks %}

The Amazon EMR integration does not include any service checks.

## Troubleshooting{% #troubleshooting %}

Need help? Contact [Datadog support](https://docs.datadoghq.com/help/).