This page discusses metrics for monitoring serverless applications on AWS Lambda. There are 3 ways to get metrics from AWS Lambda:
Collect metrics from non-Lambda resources
Datadog can also help you collect metrics for AWS managed resources—such as API Gateway, AppSync, and SQS—to help you monitor your entire serverless application. These metrics are enriched with corresponding AWS resource tags.
To collect these metrics, set up the Datadog AWS integration.
Enhanced Lambda metrics
Datadog generates enhanced Lambda metrics from your Lambda runtime out-of-the-box with low latency, several second granularity, and detailed metadata for cold starts and custom tags.
Enhanced Lambda metrics are in addition to the default Lambda metrics enabled with the AWS Lambda integration. Enhanced metrics are distinguished by being in the aws.lambda.enhanced.* namespace. You can view these metrics on the Enhanced Lambda Metrics default dashboard.
The following real-time enhanced Lambda metrics are available, and they are tagged with corresponding aws_account, region, functionname, cold_start, memorysize, executedversion, resource and runtime tags.
These metrics are distributions: you can query them using the count, min, max, sum, and avg aggregations. Enhanced metrics are enabled automatically with Serverless Monitoring but can be disabled by setting the DD_ENHANCED_METRICS environment variable to false on your Lambda function.
- aws.lambda.enhanced.invocations
- Measures the number of times a function is invoked in response to an event or an invocation of an API call.
- aws.lambda.enhanced.errors
- Measures the number of invocations that failed due to errors in the function.
- aws.lambda.enhanced.max_memory_used
- Measures the maximum amount of memory (mb) used by the function.
- aws.lambda.enhanced.duration
- Measures the elapsed seconds from when the function code starts executing as a result of an invocation to when it stops executing.
- aws.lambda.enhanced.billed_duration
- Measures the billed amount of time the function ran for (100ms increments).
- aws.lambda.enhanced.init_duration
- Measures the initialization time (second) of a function during a cold start.
- aws.lambda.enhanced.runtime_duration
- Measures the elapsed milliseconds from when the function code starts executing to when it returns the response back to the client, excluding the post-runtime duration added by Lambda extension executions.
- aws.lambda.enhanced.post_runtime_duration
- Measures the elapsed milliseconds from when the function code returns the response back to the client to when the function stops executing, representing the duration added by Lambda extension executions.
- aws.lambda.enhanced.response_latency
- Measures the elapsed time in milliseconds from when the invocation request is received to when the first byte of response is sent to the client.
- aws.lambda.enhanced.response_duration
- Measures the elapsed time in milliseconds from when the first byte of response to the last byte of response is sent to the client.
- aws.lambda.enhanced.produced_bytes
- Measures the number of bytes returned by a function.
- aws.lambda.enhanced.estimated_cost
- Measures the total estimated cost of the function invocation (US dollars).
- aws.lambda.enhanced.timeouts
- Measures the number of times a function times out.
- aws.lambda.enhanced.out_of_memory
- Measures the number of times a function runs out of memory.
- Because there are many variations of out-of-memory errors, some cases may not be handled well despite best efforts. If you encounter such a case, create an issue in the Datadog Lambda Extension GitHub repo.
- aws.lambda.enhanced.cpu_total_utilization
- Measures the total CPU utilization of the function as a number of cores.
- aws.lambda.enhanced.cpu_total_utilization_pct
- Measures the total CPU utilization of the function as a percent.
- aws.lambda.enhanced.cpu_max_utilization
- Measures the CPU utilization on the most utilized core.
- aws.lambda.enhanced.cpu_min_utilization
- Measures the CPU utilization on the least utilized core.
- aws.lambda.enhanced.cpu_system_time
- Measures the amount of time the CPU spent running in kernel mode.
- aws.lambda.enhanced.cpu_user_time
- Measures the amount of time the CPU spent running in user mode.
- aws.lambda.enhanced.cpu_total_time
- Measures the total amount of time the CPU spent running.
- aws.lambda.enhanced.num_cores
- Measures the number of cores available.
- aws.lambda.enhanced.rx_bytes
- Measures the bytes received by the function.
- aws.lambda.enhanced.tx_bytes
- Measures the bytes sent by the function.
- aws.lambda.enhanced.total_network
- Measures the bytes sent and received by the function.
- aws.lambda.enhanced.tmp_max
- Measures the total available space in the /tmp directory.
- aws.lambda.enhanced.tmp_used
- Measures the space used in the /tmp directory.
- aws.lambda.enhanced.fd_max
- Measures the total number of file descriptors available for use.
- aws.lambda.enhanced.fd_use
- Measures the maximum number of file descriptors used over the duration of the function invocation.
- aws.lambda.enhanced.threads_max
- Measures the total number of threads available for use.
- aws.lambda.enhanced.threads_use
- Measures the maximum number of threads used over the duration of the function invocation.
Submit custom metrics
Create custom metrics from logs or traces
If your Lambda functions are already sending trace or log data to Datadog, and the data you want to query is captured in an existing log or trace, you can generate custom metrics from logs and traces without re-deploying or making any changes to your application code.
With log-based metrics, you can record a count of logs that match a query or summarize a numeric value contained in a log, such as a request duration. Log-based metrics are a cost-efficient way to summarize log data from the entire ingest stream. Learn more about creating log-based metrics.
You can also generate metrics from all ingested spans, regardless of whether they are indexed by a retention filter. Learn more about creating span-based metrics.
Submit custom metrics directly from a Lambda function
All custom metrics are submitted as distributions.
Note: Distribution metrics must be submitted with a new name, do not re-use a name of a previously submitted metric.
- Install Serverless Monitoring for AWS Lambda and ensure that you have installed the Datadog Lambda extension. 
- Choose your runtime: 
from datadog_lambda.metric import lambda_metric
def lambda_handler(event, context):
    lambda_metric(
        "coffee_house.order_value",             # Metric name
        12.45,                                  # Metric value
        tags=['product:latte', 'order:online']  # Associated tags
    )
const { sendDistributionMetric } = require('datadog-lambda-js');
async function myHandler(event, context) {
    sendDistributionMetric(
        'coffee_house.order_value', // Metric name
        12.45,                      // Metric value
        'product:latte',            // First tag
        'order:online'              // Second tag
    );
}
package main
import (
  "github.com/aws/aws-lambda-go/lambda"
  "github.com/DataDog/datadog-lambda-go"
)
func main() {
  lambda.Start(ddlambda.WrapFunction(myHandler, nil))
}
func myHandler(ctx context.Context, event MyEvent) (string, error) {
  ddlambda.Distribution(
    "coffee_house.order_value",     // Metric name
    12.45,                          // Metric value
    "product:latte", "order:online" // Associated tags
  )
}
Install the latest version of java-dogstatsd-client.
package com.datadog.lambda.sample.java;
import com.amazonaws.services.lambda.runtime.Context;
import com.amazonaws.services.lambda.runtime.RequestHandler;
import com.amazonaws.services.lambda.runtime.events.APIGatewayV2ProxyRequestEvent;
import com.amazonaws.services.lambda.runtime.events.APIGatewayV2ProxyResponseEvent;
// import the statsd client builder
import com.timgroup.statsd.NonBlockingStatsDClientBuilder;
import com.timgroup.statsd.StatsDClient;
public class Handler implements RequestHandler<APIGatewayV2ProxyRequestEvent, APIGatewayV2ProxyResponseEvent> {
    // instantiate the statsd client
    private static final StatsDClient Statsd = new NonBlockingStatsDClientBuilder().hostname("localhost").build();
    @Override
    public APIGatewayV2ProxyResponseEvent handleRequest(APIGatewayV2ProxyRequestEvent request, Context context) {
        // submit a distribution metric
        Statsd.recordDistributionValue("my.custom.java.metric", 1, new String[]{"tag:value"});
        APIGatewayV2ProxyResponseEvent response = new APIGatewayV2ProxyResponseEvent();
        response.setStatusCode(200);
        return response;
    }
    static {
        // ensure all metrics are flushed before shutdown
        Runtime.getRuntime().addShutdownHook(new Thread() {
            @Override
            public void run() {
                System.out.println("[runtime] shutdownHook triggered");
                try {
                    Thread.sleep(300);
                } catch (InterruptedException e) {
                    System.out.println("[runtime] sleep interrupted");
                }
                System.out.println("[runtime] exiting");
            }
        });
    }
}
Install the latest version of dogstatsd-csharp-client.
using System.IO;
// import the statsd client
using StatsdClient;
namespace Example
{
  public class Function
  {
    static Function()
    {
        // instantiate the statsd client
        var dogstatsdConfig = new StatsdConfig
        {
            StatsdServerName = "127.0.0.1",
            StatsdPort = 8125,
        };
        if (!DogStatsd.Configure(dogstatsdConfig))
            throw new InvalidOperationException("Cannot initialize DogstatsD. Set optionalExceptionHandler argument in the `Configure` method for more information.");
    }
    public Stream MyHandler(Stream stream)
    {
        // submit a distribution metric
        DogStatsd.Distribution("my.custom.dotnet.metric", 1, tags: new[] { "tag:value" });
        // your function logic
    }
  }
}
- Install the DogStatsD client for your runtime
- Follow the sample code to submit your custom metrics.
Submit historical metrics
Use the Datadog Lambda Extension to submit historical metrics. These metrics can have timestamps up to one hour in the past.
Start by installing Serverless Monitoring for AWS Lambda. Ensure that you have installed the latest Datadog Lambda Extension.
Then, choose your runtime:
from datadog_lambda.metric import lambda_metric
def lambda_handler(event, context):
    lambda_metric(
        "coffee_house.order_value",             # Metric name
        12.45,                                  # Metric value
        tags=['product:latte', 'order:online']  # Associated tags
    )
    # Submit a metric with a timestamp that is within the last 20 minutes
    lambda_metric(
        "coffee_house.order_value",             # Metric name
        12.45,                                  # Metric value
        timestamp=int(time.time()),             # Unix epoch in seconds
        tags=['product:latte', 'order:online']  # Associated tags
    )
const { sendDistributionMetric } = require('datadog-lambda-js');
async function myHandler(event, context) {
    sendDistributionMetric(
        'coffee_house.order_value', // Metric name
        12.45,                      // Metric value
        'product:latte',            // First tag
        'order:online'              // Second tag
    );
    // Submit a metric with a timestamp that is within the last 20 minutes
    sendDistributionMetricWithDate(
        'coffee_house.order_value', // Metric name
        12.45,                      // Metric value
        new Date(Date.now()),       // date
        'product:latte',            // First tag
        'order:online',             // Second tag
    );
}
package main
import (
  "github.com/aws/aws-lambda-go/lambda"
  "github.com/DataDog/datadog-lambda-go"
)
func main() {
  lambda.Start(ddlambda.WrapFunction(myHandler, nil))
}
func myHandler(ctx context.Context, event MyEvent) (string, error) {
  ddlambda.Distribution(
    "coffee_house.order_value",     // Metric name
    12.45,                          // Metric value
    "product:latte", "order:online" // Associated tags
  )
  // Submit a metric with a timestamp that is within the last 20 minutes
  ddlambda.MetricWithTimestamp(
    "coffee_house.order_value",     // Metric name
    12.45,                          // Metric value
    time.Now(),                     // Timestamp
    "product:latte", "order:online" // Associated tags
  )
}
require 'datadog/lambda'
def handler(event:, context:)
    # You only need to wrap your function handler (Not helper functions).
    Datadog::Lambda.wrap(event, context) do
        Datadog::Lambda.metric(
          'coffee_house.order_value',         # Metric name
          12.45,                              # Metric value
          "product":"latte", "order":"online" # Associated tags
        )
        # Submit a metric with a timestamp that is within the last 20 minutes
        Datadog::Lambda.metric(
          'coffee_house.order_value',         # Metric name
          12.45,                              # Metric value
          time: Time.now.utc,                 # Timestamp
          "product":"latte", "order":"online" # Associated tags
        )
    end
end
public class Handler implements RequestHandler<APIGatewayV2ProxyRequestEvent, APIGatewayV2ProxyResponseEvent> {
    public Integer handleRequest(APIGatewayV2ProxyRequestEvent request, Context context){
        DDLambda dd = new DDLambda(request, lambda);
        Map<String,String> myTags = new HashMap<String, String>();
            myTags.put("product", "latte");
            myTags.put("order", "online");
        dd.metric(
            "coffee_house.order_value", // Metric name
            12.45,                      // Metric value
            myTags);                    // Associated tags
    }
}
Write a reusable function that logs your custom metrics in the following format:
{
    "m": "Metric name",
    "v": "Metric value",
    "e": "Unix timestamp (seconds)",
    "t": "Array of tags"
}
For example:
{
    "m": "coffee_house.order_value",
    "v": 12.45,
    "e": 1572273854,
    "t": ["product:latte", "order:online"]
}
Understanding distribution metrics
When Datadog receives multiple count or gauge metric points that share the same timestamp and set of tags, only the most recent one counts. This works for host-based applications because metric points get aggregated by the Datadog agent and tagged with a unique host tag.
A Lambda function may launch many concurrent execution environments when traffic increases. The function may submit count or gauge metric points that overwrite each other and cause undercounted results. To avoid this problem, custom metrics generated by Lambda functions are submitted as distributions because distribution metric points are aggregated on the Datadog backend, and every metric point counts.
Distributions provide avg, sum, max, min, count aggregations by default. On the Metric Summary page, you can enable percentile aggregations (p50, p75, p90, p95, p99) and also manage tags. To monitor a distribution for a gauge metric type, use avg for both the time and space aggregations. To monitor a distribution for a count metric type, use sum for both the time and space aggregations. Refer to the guide Query to the Graph for how time and space aggregations work.
Understanding your metrics usage, volume, and pricing in Datadog
Datadog provides granular information about the custom metrics you’re ingesting, the tag cardinality, and management tools for your custom metrics within the Metrics Summary page of the Datadog app. You can view all serverless custom metrics under the ‘Serverless’ tag in the Distribution Metric Origin facet panel. You can also control custom metrics volumes and costs with Metrics without Limits™.