The Service Map for APM is here!

Envoy

Agent Check Agent Check

Supported OS: Linux Mac OS Windows

Overview

This check collects distributed system observability metrics from Envoy.

Setup

Installation

The Envoy check is included in the Datadog Agent package, so you don’t need to install anything else on your server.

via Istio

If you are using Envoy as part of Istio, to access Envoy’s admin endpoint you need to set Istio’s proxyAdminPort.

Standard

There are 2 ways to setup the /stats endpoint:

Unsecured stats endpoint

Here’s an example Envoy admin configuration:

admin:
  access_log_path: "/dev/null"
  address:
    socket_address:
      address: 0.0.0.0
      port_value: 8001
Secured stats endpoint

Create a listener/vhost that routes to the admin endpoint (Envoy connecting to itself), but only has a route for /stats; all other routes get a static/error response. Additionally, this allows nice integration with L3 filters for auth, for example.

Here’s an example config (from this gist):

admin:
  access_log_path: /dev/null
  address:
    socket_address:
      protocol: TCP
      address: 127.0.0.1
      port_value: 8081
static_resources:
  listeners:
    - address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 80
      filter_chains:
        - filters:
            - name: envoy.http_connection_manager
              config:
                codec_type: AUTO
                stat_prefix: ingress_http
                route_config:
                  virtual_hosts:
                    - name: backend
                      domains:
                        - "*"
                      routes:
                        - match:
                            prefix: /stats
                          route:
                            cluster: service_stats
                http_filters:
                  - name: envoy.router
                    config:
  clusters:
    - name: service_stats
      connect_timeout: 0.250s
      type: LOGICAL_DNS
      lb_policy: ROUND_ROBIN
      hosts:
        - socket_address:
            protocol: TCP
            address: 127.0.0.1
            port_value: 8001

Configuration

  1. Edit the envoy.d/conf.yaml file, in the conf.d/ folder at the root of your Agent’s configuration directory to start collecting your Envoy performance data. See the sample envoy.d/conf.yaml for all available configuration options.

  2. Check if the Datadog Agent can access Envoy’s admin endpoint.

  3. Restart the Agent

Setting Description
stats_url (REQUIRED) The admin stats endpoint, e.g. http://localhost:80/stats. Add a ?usedonly on the end if you wish to ignore unused metrics instead of reporting them as 0.
tags A list of custom tags to apply to this instance.
metric_whitelist A list of regular expressions.
metric_blacklist A list of regular expressions.
cache_metrics Cache results of whitelist/blacklist to decrease CPU utilization, at the expense of some memory (default is true).
username The username to authenticate with if behind basic auth.
password The password to authenticate with if behind basic auth.
verify_ssl This instructs the check to validate SSL certificates when connecting to Envoy. Defaulting to true, set to false if you want to disable SSL certificate validation.
skip_proxy If true, the check bypasses any proxy settings enabled and attempt to reach Envoy directly.
timeout A custom timeout for network requests in seconds (default is 20).

Metric filtering

Metrics can be filtered using a regular expression metric_whitelist or metric_blacklist. If both are used, then whitelist is applied first, and then blacklist is applied on the resulting set.

The filtering occurs before tag extraction, so you have the option to have certain tags decide whether or not to keep or ignore metrics. An exhaustive list of all metrics and tags can be found in metrics.py. Let’s walk through an example of Envoy metric tagging!

...
'cluster.grpc.success': {
    'tags': (
        ('cluster_name', ),
        ('grpc_service', 'grpc_method', ),
        (),
    ),
    ...
},
...

Here there are 3 tag sequences: ('cluster_name'), ('grpc_service', 'grpc_method'), and empty (). The number of sequences corresponds exactly to how many metric parts there are. For this metric, there are 3 parts: cluster, grpc, and success. Envoy separates everything with a ., hence the final metric name would be:

cluster.<cluster_name>.grpc.<grpc_service>.<grpc_method>.success

If you care only about the cluster name and grpc service, you would add this to your whitelist:

^cluster\.(cluster5|cluster7)\.grpc\.serviceXYZ\.

Validation

Run the Agent’s status subcommand and look for envoy under the Checks section.

Data Collected

Metrics

envoy.runtime.load_error
(count)
Total number of load attempts that resulted in an error
shown as error
envoy.runtime.override_dir_not_exists
(count)
Total number of loads that did not use an override directory
shown as occurrence
envoy.runtime.override_dir_exists
(count)
Total number of loads that did use an override directory
shown as occurrence
envoy.runtime.load_success
(count)
Total number of load attempts that were successful
shown as success
envoy.runtime.num_keys
(gauge)
Number of keys currently loaded
shown as location
envoy.cluster_manager.cds.config_reload
(count)
Total API fetches that resulted in a config reload due to a different config
shown as request
envoy.cluster_manager.cds.update_attempt
(count)
Total API fetches attempted
shown as request
envoy.cluster_manager.cds.update_success
(count)
Total API fetches completed successfully
shown as request
envoy.cluster_manager.cds.update_failure
(count)
Total API fetches that failed (either network or schema errors)
shown as request
envoy.cluster_manager.cds.version
(gauge)
Hash of the contents from the last successful API fetch
shown as item
envoy.http.no_route
(count)
Total requests that had no route and resulted in a 404
shown as request
envoy.http.no_cluster
(count)
Total requests in which the target cluster did not exist and resulted in a 404
shown as request
envoy.http.rq_redirect
(count)
Total requests that resulted in a redirect response
shown as request
envoy.http.rq_total
(count)
Total routed requests
shown as request
envoy.vhost.vcluster.upstream_rq_1xx
(count)
Aggregate HTTP 1xx response codes
shown as response
envoy.vhost.vcluster.upstream_rq_2xx
(count)
Aggregate HTTP 2xx response codes
shown as response
envoy.vhost.vcluster.upstream_rq_3xx
(count)
Aggregate HTTP 3xx response codes
shown as response
envoy.vhost.vcluster.upstream_rq_4xx
(count)
Aggregate HTTP 4xx response codes
shown as response
envoy.vhost.vcluster.upstream_rq_5xx
(count)
Aggregate HTTP 5xx response codes
shown as response
envoy.cluster.ratelimit.ok
(count)
Total under limit responses from the rate limit service
shown as response
envoy.cluster.ratelimit.error
(count)
Total errors contacting the rate limit service
shown as response
envoy.cluster.ratelimit.over_limit
(count)
Total over limit responses from the rate limit service
shown as response
envoy.http.ip_tagging.hit
(count)
Total number of requests that have the tag_name tag applied to it
shown as request
envoy.http.ip_tagging.no_hit
(count)
Total number of requests with no applicable IP tags
shown as request
envoy.http.ip_tagging.total
(count)
Total number of requests the IP Tagging Filter operated on
shown as request
envoy.cluster.grpc.success
(count)
Total successful service/method calls
shown as operation
envoy.cluster.grpc.failure
(count)
Total failed service/method calls
shown as operation
envoy.cluster.grpc.total
(count)
Total service/method calls
shown as operation
envoy.http.dynamodb.operation.upstream_rq_total
(count)
Total number of requests with operation_name tag
shown as request
envoy.http.dynamodb.table.upstream_rq_total
(count)
Total number of requests on table_name tag table
shown as request
envoy.http.dynamodb.error
(count)
Total number of specific error_type tag for a given table_name tag
shown as error
envoy.http.dynamodb.error.BatchFailureUnprocessedKeys
(count)
Total number of partial batch failures for a given table_name tag
shown as error
envoy.http.buffer.rq_timeout
(count)
Total requests that timed out waiting for a full request
shown as timeout
envoy.http.rds.config_reload
(count)
Total API fetches that resulted in a config reload due to a different config
shown as request
envoy.http.rds.update_attempt
(count)
Total API fetches attempted
shown as request
envoy.http.rds.update_success
(count)
Total API fetches completed successfully
shown as request
envoy.http.rds.update_failure
(count)
Total API fetches that failed (either network or schema errors)
shown as request
envoy.http.rds.version
(gauge)
Hash of the contents from the last successful API fetch
shown as item
envoy.tcp.downstream_cx_total
(count)
Total number of connections handled by the filter
shown as connection
envoy.tcp.downstream_cx_no_route
(count)
Number of connections for which no matching route was found
shown as connection
envoy.tcp.downstream_cx_tx_bytes_total
(count)
Total bytes written to the downstream connection
shown as byte
envoy.tcp.downstream_cx_tx_bytes_buffered
(gauge)
Total bytes currently buffered to the downstream connection
shown as byte
envoy.tcp.downstream_flow_control_paused_reading_total
(count)
Total number of times flow control paused reading from downstream
shown as occurrence
envoy.tcp.downstream_flow_control_resumed_reading_total
(count)
Total number of times flow control resumed reading from downstream
shown as occurrence
envoy.auth.clientssl.update_success
(count)
Total principal update successes
shown as success
envoy.auth.clientssl.update_failure
(count)
Total principal update failures
shown as error
envoy.auth.clientssl.auth_no_ssl
(count)
Total connections ignored due to no TLS
shown as connection
envoy.auth.clientssl.auth_ip_white_list
(count)
Total connections allowed due to the IP white list
shown as connection
envoy.auth.clientssl.auth_digest_match
(count)
Total connections allowed due to certificate match
shown as connection
envoy.auth.clientssl.auth_digest_no_match
(count)
Total connections denied due to no certificate match
shown as connection
envoy.auth.clientssl.total_principals
(gauge)
Total loaded principals
shown as item
envoy.ratelimit.total
(count)
Total requests to the rate limit service
shown as response
envoy.ratelimit.error
(count)
Total errors contacting the rate limit service
shown as response
envoy.ratelimit.over_limit
(count)
Total over limit responses from the rate limit service
shown as response
envoy.ratelimit.ok
(count)
Total under limit responses from the rate limit service
shown as response
envoy.ratelimit.cx_closed
(count)
Total connections closed due to an over limit response from the rate limit service
shown as connection
envoy.ratelimit.active
(gauge)
Total active requests to the rate limit service
shown as request
envoy.redis.downstream_cx_active
(gauge)
Total active connections
shown as connection
envoy.redis.downstream_cx_protocol_error
(count)
Total protocol errors
shown as error
envoy.redis.downstream_cx_rx_bytes_buffered
(gauge)
Total received bytes currently buffered
shown as byte
envoy.redis.downstream_cx_rx_bytes_total
(count)
Total bytes received
shown as byte
envoy.redis.downstream_cx_total
(count)
Total connections
shown as connection
envoy.redis.downstream_cx_tx_bytes_buffered
(gauge)
Total sent bytes currently buffered
shown as byte
envoy.redis.downstream_cx_tx_bytes_total
(count)
Total bytes sent
shown as byte
envoy.redis.downstream_cx_drain_close
(count)
Number of connections closed due to draining
shown as connection
envoy.redis.downstream_rq_active
(gauge)
Total active requests
shown as request
envoy.redis.downstream_rq_total
(count)
Total requests
shown as request
envoy.redis.splitter.invalid_request
(count)
Number of requests with an incorrect number of arguments
shown as request
envoy.redis.splitter.unsupported_command
(count)
Number of commands issued which are not recognized by the command splitter
shown as operation
envoy.redis.command.total
(count)
Number of commands
shown as operation
envoy.mongo.decoding_error
(count)
Number of MongoDB protocol decoding errors
shown as error
envoy.mongo.delay_injected
(count)
Number of times the delay is injected
shown as occurrence
envoy.mongo.op_get_more
(count)
Number of OP_GET_MORE messages
shown as message
envoy.mongo.op_insert
(count)
Number of OP_INSERT messages
shown as message
envoy.mongo.op_kill_cursors
(count)
Number of OP_KILL_CURSORS messages
shown as message
envoy.mongo.op_query
(count)
Number of OP_QUERY messages
shown as message
envoy.mongo.op_query_tailable_cursor
(count)
Number of OP_QUERY with tailable cursor flag set
shown as message
envoy.mongo.op_query_no_cursor_timeout
(count)
Number of OP_QUERY with no cursor timeout flag set
shown as message
envoy.mongo.op_query_await_data
(count)
Number of OP_QUERY with await data flag set
shown as message
envoy.mongo.op_query_exhaust
(count)
Number of OP_QUERY with exhaust flag set
shown as message
envoy.mongo.op_query_no_max_time
(count)
Number of queries without maxTimeMS set
shown as query
envoy.mongo.op_query_scatter_get
(count)
Number of scatter get queries
shown as query
envoy.mongo.op_query_multi_get
(count)
Number of multi get queries
shown as query
envoy.mongo.op_query_active
(gauge)
Number of active queries
shown as query
envoy.mongo.op_reply
(count)
Number of OP_REPLY messages
shown as message
envoy.mongo.op_reply_cursor_not_found
(count)
Number of OP_REPLY with cursor not found flag set
shown as message
envoy.mongo.op_reply_query_failure
(count)
Number of OP_REPLY with query failure flag set
shown as message
envoy.mongo.op_reply_valid_cursor
(count)
Number of OP_REPLY with a valid cursor
shown as message
envoy.mongo.cx_destroy_local_with_active_rq
(count)
Connections destroyed locally with an active query
shown as connection
envoy.mongo.cx_destroy_remote_with_active_rq
(count)
Connections destroyed remotely with an active query
shown as connection
envoy.mongo.cx_drain_close
(count)
Connections gracefully closed on reply boundaries during server drain
shown as connection
envoy.mongo.cmd.total
(count)
Number of commands
shown as command
envoy.mongo.collection.query.total
(count)
Number of queries
shown as query
envoy.mongo.collection.query.scatter_get
(count)
Number of scatter gets
shown as query
envoy.mongo.collection.query.multi_get
(count)
Number of multi gets
shown as query
envoy.mongo.collection.callsite.query.total
(count)
Number of queries for the callsite tag
shown as query
envoy.mongo.collection.callsite.query.scatter_get
(count)
Number of scatter gets for the callsite tag
shown as query
envoy.mongo.collection.callsite.query.multi_get
(count)
Number of multi gets for the callsite tag
shown as query
envoy.listener.downstream_cx_total
(count)
Total connections
shown as connection
envoy.listener.downstream_cx_destroy
(count)
Total destroyed connections
shown as connection
envoy.listener.downstream_cx_active
(gauge)
Total active connections
shown as connection
envoy.listener.ssl.connection_error
(count)
Total TLS connection errors not including failed certificate verifications
shown as error
envoy.listener.ssl.handshake
(count)
Total successful TLS connection handshakes
shown as success
envoy.listener.ssl.session_reused
(count)
Total successful TLS session resumptions
shown as success
envoy.listener.ssl.no_certificate
(count)
Total successful TLS connections with no client certificate
shown as success
envoy.listener.ssl.fail_no_sni_match
(count)
Total TLS connections that were rejected because of missing SNI match
shown as connection
envoy.listener.ssl.fail_verify_no_cert
(count)
Total TLS connections that failed because of missing client certificate
shown as connection
envoy.listener.ssl.fail_verify_error
(count)
Total TLS connections that failed CA verification
shown as connection
envoy.listener.ssl.fail_verify_san
(count)
Total TLS connections that failed SAN verification
shown as connection
envoy.listener.ssl.fail_verify_cert_hash
(count)
Total TLS connections that failed certificate pinning verification
shown as connection
envoy.listener.ssl.cipher
(count)
Total TLS connections that used cipher tag
shown as connection
envoy.listener_manager.listener_added
(count)
Total listeners added (either via static config or LDS)
shown as host
envoy.listener_manager.listener_modified
(count)
Total listeners modified (via LDS)
shown as host
envoy.listener_manager.listener_removed
(count)
Total listeners removed (via LDS)
shown as host
envoy.listener_manager.listener_create_success
(count)
Total listener objects successfully added to workers
shown as host
envoy.listener_manager.listener_create_failure
(count)
Total failed listener object additions to workers
shown as host
envoy.listener_manager.total_listeners_warming
(gauge)
Number of currently warming listeners
shown as host
envoy.listener_manager.total_listeners_active
(gauge)
Number of currently active listeners
shown as host
envoy.listener_manager.total_listeners_draining
(gauge)
Number of currently draining listeners
shown as host
envoy.http.downstream_cx_total
(count)
Total connections
shown as connection
envoy.http.downstream_cx_ssl_total
(count)
Total TLS connections
shown as connection
envoy.http.downstream_cx_http1_total
(count)
Total HTTP/1.1 connections
shown as connection
envoy.http.downstream_cx_websocket_total
(count)
Total WebSocket connections
shown as connection
envoy.http.downstream_cx_http2_total
(count)
Total HTTP/2 connections
shown as connection
envoy.http.downstream_cx_destroy
(count)
Total connections destroyed
shown as connection
envoy.http.downstream_cx_destroy_remote
(count)
Total connections destroyed due to remote close
shown as connection
envoy.http.downstream_cx_destroy_local
(count)
Total connections destroyed due to local close
shown as connection
envoy.http.downstream_cx_destroy_active_rq
(count)
Total connections destroyed with active requests
shown as connection
envoy.http.downstream_cx_destroy_local_active_rq
(count)
Total connections destroyed locally with active requests
shown as connection
envoy.http.downstream_cx_destroy_remote_active_rq
(count)
Total connections destroyed remotely with active requests
shown as connection
envoy.http.downstream_cx_active
(gauge)
Total active connections
shown as connection
envoy.http.downstream_cx_ssl_active
(gauge)
Total active TLS connections
shown as connection
envoy.http.downstream_cx_http1_active
(gauge)
Total active HTTP/1.1 connections
shown as connection
envoy.http.downstream_cx_websocket_active
(gauge)
Total active WebSocket connections
shown as connection
envoy.http.downstream_cx_http2_active
(gauge)
Total active HTTP/2 connections
shown as connection
envoy.http.downstream_cx_protocol_error
(count)
Total protocol errors
shown as error
envoy.http.downstream_cx_rx_bytes_total
(count)
Total bytes received
shown as byte
envoy.http.downstream_cx_rx_bytes_buffered
(gauge)
Total received bytes currently buffered
shown as byte
envoy.http.downstream_cx_tx_bytes_total
(count)
Total bytes sent
shown as byte
envoy.http.downstream_cx_tx_bytes_buffered
(gauge)
Total sent bytes currently buffered
shown as byte
envoy.http.downstream_cx_drain_close
(count)
Total connections closed due to draining
shown as connection
envoy.http.downstream_cx_idle_timeout
(count)
Total connections closed due to idle timeout
shown as connection
envoy.http.downstream_flow_control_paused_reading_total
(count)
Total number of times reads were disabled due to flow control
shown as occurrence
envoy.http.downstream_flow_control_resumed_reading_total
(count)
Total number of times reads were enabled on the connection due to flow control
shown as occurrence
envoy.http.downstream_rq_total
(count)
Total requests
shown as request
envoy.http.downstream_rq_http1_total
(count)
Total HTTP/1.1 requests
shown as request
envoy.http.downstream_rq_http2_total
(count)
Total HTTP/2 requests
shown as request
envoy.http.downstream_rq_active
(gauge)
Total active requests
shown as request
envoy.http.downstream_rq_response_before_rq_complete
(count)
Total responses sent before the request was complete
shown as response
envoy.http.downstream_rq_rx_reset
(count)
Total request resets received
shown as request
envoy.http.downstream_rq_tx_reset
(count)
Total request resets sent
shown as request
envoy.http.downstream_rq_non_relative_path
(count)
Total requests with a non-relative HTTP path
shown as request
envoy.http.downstream_rq_too_large
(count)
Total requests resulting in a 413 due to buffering an overly large body
shown as request
envoy.http.downstream_rq_1xx
(count)
Total 1xx responses
shown as response
envoy.http.downstream_rq_2xx
(count)
Total 2xx responses
shown as response
envoy.http.downstream_rq_3xx
(count)
Total 3xx responses
shown as response
envoy.http.downstream_rq_4xx
(count)
Total 4xx responses
shown as response
envoy.http.downstream_rq_5xx
(count)
Total 5xx responses
shown as response
envoy.http.downstream_rq_ws_on_non_ws_route
(count)
Total WebSocket upgrade requests rejected by non WebSocket routes
shown as request
envoy.http.rs_too_large
(count)
Total response errors due to buffering an overly large body
shown as error
envoy.http.user_agent.downstream_cx_total
(count)
Total connections
shown as connection
envoy.http.user_agent.downstream_cx_destroy_remote_active_rq
(count)
Total connections destroyed remotely with active requests
shown as connection
envoy.http.user_agent.downstream_rq_total
(count)
Total requests
shown as request
envoy.listener.http.downstream_rq_1xx
(count)
Total 1xx responses
shown as response
envoy.listener.http.downstream_rq_2xx
(count)
Total 2xx responses
shown as response
envoy.listener.http.downstream_rq_3xx
(count)
Total 3xx responses
shown as response
envoy.listener.http.downstream_rq_4xx
(count)
Total 4xx responses
shown as response
envoy.listener.http.downstream_rq_5xx
(count)
Total 5xx responses
shown as response
envoy.http2.rx_reset
(count)
Total number of reset stream frames received by Envoy
shown as message
envoy.http2.tx_reset
(count)
Total number of reset stream frames transmitted by Envoy
shown as message
envoy.http2.header_overflow
(count)
Total number of connections reset due to the headers being larger than 63 K
shown as connection
envoy.http2.trailers
(count)
Total number of trailers seen on requests coming from downstream
shown as item
envoy.http2.headers_cb_no_stream
(count)
Total number of errors where a header callback is called without an associated stream. This tracks an unexpected occurrence due to an as yet undiagnosed bug.
shown as error
envoy.http2.too_many_header_frames
(count)
Total number of times an HTTP2 connection is reset due to receiving too many headers frames. Envoy currently supports proxying at most one header frame for 100-Continue one non-100 response code header frame and one frame with trailers.
shown as occurrence
envoy.cluster_manager.cluster_added
(count)
Total clusters added (either via static config or CDS)
shown as node
envoy.cluster_manager.cluster_modified
(count)
Total clusters modified (via CDS)
shown as node
envoy.cluster_manager.cluster_removed
(count)
Total clusters removed (via CDS)
shown as node
envoy.cluster_manager.active_clusters
(gauge)
Number of currently active (warmed) clusters
shown as node
envoy.cluster_manager.warming_clusters
(gauge)
Number of currently warming (not active) clusters
shown as node
envoy.cluster.upstream_cx_total
(count)
Total connections
shown as connection
envoy.cluster.upstream_cx_active
(gauge)
Total active connections
shown as connection
envoy.cluster.upstream_cx_http1_total
(count)
Total HTTP/1.1 connections
shown as connection
envoy.cluster.upstream_cx_http2_total
(count)
Total HTTP/2 connections
shown as connection
envoy.cluster.upstream_cx_connect_fail
(count)
Total connection failures
shown as error
envoy.cluster.upstream_cx_connect_timeout
(count)
Total connection timeouts
shown as timeout
envoy.cluster.upstream_cx_connect_attempts_exceeded
(count)
Total consecutive connection failures exceeding configured connection attempts
shown as error
envoy.cluster.upstream_cx_overflow
(count)
Total times that the cluster’s connection circuit breaker overflowed
shown as occurrence
envoy.cluster.upstream_cx_destroy
(count)
Total destroyed connections
shown as connection
envoy.cluster.upstream_cx_destroy_local
(count)
Total connections destroyed locally
shown as connection
envoy.cluster.upstream_cx_destroy_remote
(count)
Total connections destroyed remotely
shown as connection
envoy.cluster.upstream_cx_destroy_with_active_rq
(count)
Total connections destroyed with active requests
shown as connection
envoy.cluster.upstream_cx_destroy_local_with_active_rq
(count)
Total connections destroyed locally with active requests
shown as connection
envoy.cluster.upstream_cx_destroy_remote_with_active_rq
(count)
Total connections destroyed remotely with active requests
shown as connection
envoy.cluster.upstream_cx_close_notify
(count)
Total connections closed via HTTP/1.1 connection close header or HTTP/2 GOAWAY
shown as connection
envoy.cluster.upstream_cx_rx_bytes_total
(count)
Total received connection bytes
shown as byte
envoy.cluster.upstream_cx_rx_bytes_buffered
(gauge)
Received connection bytes currently buffered
shown as byte
envoy.cluster.upstream_cx_tx_bytes_total
(count)
Total sent connection bytes
shown as byte
envoy.cluster.upstream_cx_tx_bytes_buffered
(gauge)
Send connection bytes currently buffered
shown as byte
envoy.cluster.upstream_cx_protocol_error
(count)
Total connection protocol errors
shown as error
envoy.cluster.upstream_cx_max_requests
(count)
Total connections closed due to maximum requests
shown as connection
envoy.cluster.upstream_cx_none_healthy
(count)
Total times connection not established due to no healthy hosts
shown as connection
envoy.cluster.upstream_rq_total
(count)
Total requests
shown as request
envoy.cluster.upstream_rq_active
(gauge)
Total active requests
shown as request
envoy.cluster.upstream_rq_pending_total
(count)
Total requests pending a connection pool connection
shown as request
envoy.cluster.upstream_rq_pending_overflow
(count)
Total requests that overflowed connection pool circuit breaking and were failed
shown as request
envoy.cluster.upstream_rq_pending_failure_eject
(count)
Total requests that were failed due to a connection pool connection failure
shown as request
envoy.cluster.upstream_rq_pending_active
(gauge)
Total active requests pending a connection pool connection
shown as request
envoy.cluster.upstream_rq_cancelled
(count)
Total requests cancelled before obtaining a connection pool connection
shown as request
envoy.cluster.upstream_rq_maintenance_mode
(count)
Total requests that resulted in an immediate 503 due to maintenance mode
shown as request
envoy.cluster.upstream_rq_timeout
(count)
Total requests that timed out waiting for a response
shown as request
envoy.cluster.upstream_rq_per_try_timeout
(count)
Total requests that hit the per try timeout
shown as request
envoy.cluster.upstream_rq_rx_reset
(count)
Total requests that were reset remotely
shown as request
envoy.cluster.upstream_rq_tx_reset
(count)
Total requests that were reset locally
shown as request
envoy.cluster.upstream_rq_retry
(count)
Total request retries
shown as request
envoy.cluster.upstream_rq_retry_success
(count)
Total request retry successes
shown as request
envoy.cluster.upstream_rq_retry_overflow
(count)
Total requests not retried due to circuit breaking
shown as request
envoy.cluster.upstream_flow_control_paused_reading_total
(count)
Total number of times flow control paused reading from upstream
shown as occurrence
envoy.cluster.upstream_flow_control_resumed_reading_total
(count)
Total number of times flow control resumed reading from upstream
shown as occurrence
envoy.cluster.upstream_flow_control_backed_up_total
(count)
Total number of times the upstream connection backed up and paused reads from downstream
shown as occurrence
envoy.cluster.upstream_flow_control_drained_total
(count)
Total number of times the upstream connection drained and resumed reads from downstream
shown as occurrence
envoy.cluster.membership_change
(count)
Total cluster membership changes
shown as event
envoy.cluster.membership_healthy
(gauge)
Current cluster healthy total (inclusive of both health checking and outlier detection)
shown as node
envoy.cluster.membership_total
(gauge)
Current cluster membership total
shown as node
envoy.cluster.retry_or_shadow_abandoned
(count)
Total number of times shadowing or retry buffering was canceled due to buffer limits
shown as occurrence
envoy.cluster.config_reload
(count)
Total API fetches that resulted in a config reload due to a different config
shown as request
envoy.cluster.update_attempt
(count)
Total cluster membership update attempts
shown as occurrence
envoy.cluster.update_success
(count)
Total cluster membership update successes
shown as success
envoy.cluster.update_failure
(count)
Total cluster membership update failures
shown as error
envoy.cluster.version
(gauge)
Hash of the contents from the last successful API fetch
shown as item
envoy.cluster.max_host_weight
(gauge)
Maximum weight of any host in the cluster
shown as item
envoy.cluster.bind_errors
(count)
Total errors binding the socket to the configured source address
shown as error
envoy.cluster.health_check.attempt
(count)
Number of health checks
shown as check
envoy.cluster.health_check.success
(count)
Number of successful health checks
shown as check
envoy.cluster.health_check.failure
(count)
Number of immediately failed health checks (e.g. HTTP 503) as well as network failures
shown as check
envoy.cluster.health_check.passive_failure
(count)
Number of health check failures due to passive events (e.g. x-envoy-immediate-health-check-fail)
shown as check
envoy.cluster.health_check.network_failure
(count)
Number of health check failures due to network error
shown as check
envoy.cluster.health_check.verify_cluster
(count)
Number of health checks that attempted cluster name verification
shown as check
envoy.cluster.health_check.healthy
(gauge)
Number of healthy members
shown as check
envoy.cluster.outlier_detection.ejections_enforced_total
(count)
Number of enforced ejections due to any outlier type
envoy.cluster.outlier_detection.ejections_active
(gauge)
Number of currently ejected hosts
envoy.cluster.outlier_detection.ejections_overflow
(count)
Number of ejections aborted due to the max ejection %
envoy.cluster.outlier_detection.ejections_enforced_consecutive_5xx
(count)
Number of enforced consecutive 5xx ejections
envoy.cluster.outlier_detection.ejections_detected_consecutive_5xx
(count)
Number of detected consecutive 5xx ejections (even if unenforced)
envoy.cluster.outlier_detection.ejections_enforced_success_rate
(count)
Number of enforced success rate outlier ejections
envoy.cluster.outlier_detection.ejections_detected_success_rate
(count)
Number of detected success rate outlier ejections (even if unenforced)
envoy.cluster.outlier_detection.ejections_enforced_consecutive_gateway_failure
(count)
Number of enforced consecutive gateway failure ejections
envoy.cluster.outlier_detection.ejections_detected_consecutive_gateway_failure
(count)
Number of detected consecutive gateway failure ejections (even if unenforced)
envoy.cluster.upstream_rq_1xx
(count)
Aggregate HTTP 1xx response codes
shown as response
envoy.cluster.upstream_rq_2xx
(count)
Aggregate HTTP 2xx response codes
shown as response
envoy.cluster.upstream_rq_3xx
(count)
Aggregate HTTP 3xx response codes
shown as response
envoy.cluster.upstream_rq_4xx
(count)
Aggregate HTTP 4xx response codes
shown as response
envoy.cluster.upstream_rq_5xx
(count)
Aggregate HTTP 5xx response codes
shown as response
envoy.cluster.canary.upstream_rq_1xx
(count)
Upstream canary aggregate HTTP 1xx response codes
shown as response
envoy.cluster.canary.upstream_rq_2xx
(count)
Upstream canary aggregate HTTP 2xx response codes
shown as response
envoy.cluster.canary.upstream_rq_3xx
(count)
Upstream canary aggregate HTTP 3xx response codes
shown as response
envoy.cluster.canary.upstream_rq_4xx
(count)
Upstream canary aggregate HTTP 4xx response codes
shown as response
envoy.cluster.canary.upstream_rq_5xx
(count)
Upstream canary aggregate HTTP 5xx response codes
shown as response
envoy.cluster.internal.upstream_rq_1xx
(count)
Internal origin aggregate HTTP 1xx response codes
shown as response
envoy.cluster.internal.upstream_rq_2xx
(count)
Internal origin aggregate HTTP 2xx response codes
shown as response
envoy.cluster.internal.upstream_rq_3xx
(count)
Internal origin aggregate HTTP 3xx response codes
shown as response
envoy.cluster.internal.upstream_rq_4xx
(count)
Internal origin aggregate HTTP 4xx response codes
shown as response
envoy.cluster.internal.upstream_rq_5xx
(count)
Internal origin aggregate HTTP 5xx response codes
shown as response
envoy.cluster.external.upstream_rq_1xx
(count)
External origin aggregate HTTP 1xx response codes
shown as response
envoy.cluster.external.upstream_rq_2xx
(count)
External origin aggregate HTTP 2xx response codes
shown as response
envoy.cluster.external.upstream_rq_3xx
(count)
External origin aggregate HTTP 3xx response codes
shown as response
envoy.cluster.external.upstream_rq_4xx
(count)
External origin aggregate HTTP 4xx response codes
shown as response
envoy.cluster.external.upstream_rq_5xx
(count)
External origin aggregate HTTP 5xx response codes
shown as response
envoy.cluster.zone.upstream_rq_1xx
(count)
Aggregate HTTP 1xx response codes
shown as response
envoy.cluster.zone.upstream_rq_2xx
(count)
Aggregate HTTP 2xx response codes
shown as response
envoy.cluster.zone.upstream_rq_3xx
(count)
Aggregate HTTP 3xx response codes
shown as response
envoy.cluster.zone.upstream_rq_4xx
(count)
Aggregate HTTP 4xx response codes
shown as response
envoy.cluster.zone.upstream_rq_5xx
(count)
Aggregate HTTP 5xx response codes
shown as response
envoy.cluster.lb_healthy_panic
(count)
Total requests load balanced with the load balancer in panic mode
shown as request
envoy.cluster.lb_zone_cluster_too_small
(count)
No zone aware routing because of small upstream cluster size
envoy.cluster.lb_zone_routing_all_directly
(count)
Sending all requests directly to the same zone
envoy.cluster.lb_zone_routing_sampled
(count)
Sending some requests to the same zone
envoy.cluster.lb_zone_routing_cross_zone
(count)
Zone aware routing mode but have to send cross zone
envoy.cluster.lb_local_cluster_not_ok
(count)
Local host set is not set or it is panic mode for local cluster
envoy.cluster.lb_zone_number_differs
(count)
Number of zones in local and upstream cluster different
envoy.cluster.lb_subsets_active
(gauge)
Number of currently available subsets
envoy.cluster.lb_subsets_created
(count)
Number of subsets created
envoy.cluster.lb_subsets_removed
(count)
Number of subsets removed due to no hosts
envoy.cluster.lb_subsets_selected
(count)
Number of times any subset was selected for load balancing
shown as occurrence
envoy.cluster.lb_subsets_fallback
(count)
Number of times the fallback policy was invoked
shown as occurrence
envoy.cluster.update_empty
(count)
Total cluster membership updates ending with empty cluster load assignment and continuing with previous config
shown as occurrence
envoy.cluster.lb_recalculate_zone_structures
(count)
The number of times locality aware routing structures are regenerated for fast decisions on upstream locality selection
shown as occurrence
envoy.cluster.lb_zone_no_capacity_left
(count)
Total number of times ended with random zone selection due to rounding error
shown as occurrence
envoy.http.tracing.random_sampling
(count)
Total number of traceable decisions by random sampling
shown as occurrence
envoy.http.tracing.service_forced
(count)
Total number of traceable decisions by server runtime flag tracing.global_enabled
shown as occurrence
envoy.http.tracing.client_enabled
(count)
Total number of traceable decisions by request header x-envoy-force-trace
shown as occurrence
envoy.http.tracing.not_traceable
(count)
Total number of non-traceable decisions by request id
shown as occurrence
envoy.http.tracing.health_check
(count)
Total number of non-traceable decisions by health check
shown as occurrence
envoy.http.rq_direct_response
(count)
Total requests that resulted in a direct response
shown as request
envoy.stats.overflow
(count)
Total number of times Envoy cannot allocate a statistic due to a shortage of shared memory
shown as error
envoy.server.uptime
(gauge)
Current server uptime in seconds
shown as second
envoy.server.memory_allocated
(gauge)
Current amount of allocated memory in bytes
shown as byte
envoy.server.memory_heap_size
(gauge)
Current reserved heap size in bytes
shown as byte
envoy.server.live
(gauge)
1 if the server is not currently draining, 0 otherwise
shown as occurrence
envoy.server.parent_connections
(gauge)
Total connections of the old Envoy process on hot restart
shown as connection
envoy.server.total_connections
(gauge)
Total connections of both new and old Envoy processes
shown as connection
envoy.server.version
(gauge)
Integer represented version number based on SCM revision
shown as item
envoy.server.days_until_first_cert_expiring
(gauge)
Number of days until the next certificate being managed will expire
shown as day
envoy.filesystem.write_buffered
(count)
Total number of times file data is moved to Envoy’s internal flush buffer
shown as occurrence
envoy.filesystem.write_completed
(count)
Total number of times a file was written
shown as occurrence
envoy.filesystem.flushed_by_timer
(count)
Total number of times internal flush buffers are written to a file due to flush timeout
shown as occurrence
envoy.filesystem.reopen_failed
(count)
Total number of times a file was failed to be opened
shown as occurrence
envoy.filesystem.write_total_buffered
(gauge)
Current total size of internal flush buffer in bytes
shown as byte
envoy.vhost.vcluster.upstream_rq_time.0percentile
(gauge)
Request time milliseconds 0-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.25percentile
(gauge)
Request time milliseconds 25-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.50percentile
(gauge)
Request time milliseconds 50-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.75percentile
(gauge)
Request time milliseconds 75-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.90percentile
(gauge)
Request time milliseconds 90-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.95percentile
(gauge)
Request time milliseconds 95-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.99percentile
(gauge)
Request time milliseconds 99-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.99_9percentile
(gauge)
Request time milliseconds 99.9-percentile
shown as millisecond
envoy.vhost.vcluster.upstream_rq_time.100percentile
(gauge)
Request time milliseconds 100-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.0percentile
(gauge)
Time spent on operation_name tag 0-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.25percentile
(gauge)
Time spent on operation_name tag 25-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.50percentile
(gauge)
Time spent on operation_name tag 50-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.75percentile
(gauge)
Time spent on operation_name tag 75-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.90percentile
(gauge)
Time spent on operation_name tag 90-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.95percentile
(gauge)
Time spent on operation_name tag 95-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.99percentile
(gauge)
Time spent on operation_name tag 99-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.99_9percentile
(gauge)
Time spent on operation_name tag 99.9-percentile
shown as millisecond
envoy.http.dynamodb.operation.upstream_rq_time.100percentile
(gauge)
Time spent on operation_name tag 100-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.0percentile
(gauge)
Time spent on table_name tag table 0-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.25percentile
(gauge)
Time spent on table_name tag table 25-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.50percentile
(gauge)
Time spent on table_name tag table 50-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.75percentile
(gauge)
Time spent on table_name tag table 75-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.90percentile
(gauge)
Time spent on table_name tag table 90-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.95percentile
(gauge)
Time spent on table_name tag table 95-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.99percentile
(gauge)
Time spent on table_name tag table 99-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.99_9percentile
(gauge)
Time spent on table_name tag table 99.9-percentile
shown as millisecond
envoy.http.dynamodb.table.upstream_rq_time.100percentile
(gauge)
Time spent on table_name tag table 100-percentile
shown as millisecond
envoy.mongo.cmd.reply_num_docs.0percentile
(gauge)
Number of documents in reply 0-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.25percentile
(gauge)
Number of documents in reply 25-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.50percentile
(gauge)
Number of documents in reply 50-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.75percentile
(gauge)
Number of documents in reply 75-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.90percentile
(gauge)
Number of documents in reply 90-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.95percentile
(gauge)
Number of documents in reply 95-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.99percentile
(gauge)
Number of documents in reply 99-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.99_9percentile
(gauge)
Number of documents in reply 99.9-percentile
shown as document
envoy.mongo.cmd.reply_num_docs.100percentile
(gauge)
Number of documents in reply 100-percentile
shown as document
envoy.mongo.cmd.reply_size.0percentile
(gauge)
Size of the reply in bytes 0-percentile
shown as byte
envoy.mongo.cmd.reply_size.25percentile
(gauge)
Size of the reply in bytes 25-percentile
shown as byte
envoy.mongo.cmd.reply_size.50percentile
(gauge)
Size of the reply in bytes 50-percentile
shown as byte
envoy.mongo.cmd.reply_size.75percentile
(gauge)
Size of the reply in bytes 75-percentile
shown as byte
envoy.mongo.cmd.reply_size.90percentile
(gauge)
Size of the reply in bytes 90-percentile
shown as byte
envoy.mongo.cmd.reply_size.95percentile
(gauge)
Size of the reply in bytes 95-percentile
shown as byte
envoy.mongo.cmd.reply_size.99percentile
(gauge)
Size of the reply in bytes 99-percentile
shown as byte
envoy.mongo.cmd.reply_size.99_9percentile
(gauge)
Size of the reply in bytes 99.9-percentile
shown as byte
envoy.mongo.cmd.reply_size.100percentile
(gauge)
Size of the reply in bytes 100-percentile
shown as byte
envoy.mongo.cmd.reply_time_ms.0percentile
(gauge)
Command time in milliseconds 0-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.25percentile
(gauge)
Command time in milliseconds 25-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.50percentile
(gauge)
Command time in milliseconds 50-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.75percentile
(gauge)
Command time in milliseconds 75-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.90percentile
(gauge)
Command time in milliseconds 90-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.95percentile
(gauge)
Command time in milliseconds 95-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.99percentile
(gauge)
Command time in milliseconds 99-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.99_9percentile
(gauge)
Command time in milliseconds 99.9-percentile
shown as millisecond
envoy.mongo.cmd.reply_time_ms.100percentile
(gauge)
Command time in milliseconds 100-percentile
shown as millisecond
envoy.mongo.collection.query.reply_num_docs.0percentile
(gauge)
Number of documents in reply 0-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.25percentile
(gauge)
Number of documents in reply 25-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.50percentile
(gauge)
Number of documents in reply 50-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.75percentile
(gauge)
Number of documents in reply 75-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.90percentile
(gauge)
Number of documents in reply 90-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.95percentile
(gauge)
Number of documents in reply 95-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.99percentile
(gauge)
Number of documents in reply 99-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.99_9percentile
(gauge)
Number of documents in reply 99.9-percentile
shown as document
envoy.mongo.collection.query.reply_num_docs.100percentile
(gauge)
Number of documents in reply 100-percentile
shown as document
envoy.mongo.collection.query.reply_size.0percentile
(gauge)
Size of the reply in bytes 0-percentile
shown as byte
envoy.mongo.collection.query.reply_size.25percentile
(gauge)
Size of the reply in bytes 25-percentile
shown as byte
envoy.mongo.collection.query.reply_size.50percentile
(gauge)
Size of the reply in bytes 50-percentile
shown as byte
envoy.mongo.collection.query.reply_size.75percentile
(gauge)
Size of the reply in bytes 75-percentile
shown as byte
envoy.mongo.collection.query.reply_size.90percentile
(gauge)
Size of the reply in bytes 90-percentile
shown as byte
envoy.mongo.collection.query.reply_size.95percentile
(gauge)
Size of the reply in bytes 95-percentile
shown as byte
envoy.mongo.collection.query.reply_size.99percentile
(gauge)
Size of the reply in bytes 99-percentile
shown as byte
envoy.mongo.collection.query.reply_size.99_9percentile
(gauge)
Size of the reply in bytes 99.9-percentile
shown as byte
envoy.mongo.collection.query.reply_size.100percentile
(gauge)
Size of the reply in bytes 100-percentile
shown as byte
envoy.mongo.collection.query.reply_time_ms.0percentile
(gauge)
Query time in milliseconds 0-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.25percentile
(gauge)
Query time in milliseconds 25-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.50percentile
(gauge)
Query time in milliseconds 50-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.75percentile
(gauge)
Query time in milliseconds 75-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.90percentile
(gauge)
Query time in milliseconds 90-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.95percentile
(gauge)
Query time in milliseconds 95-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.99percentile
(gauge)
Query time in milliseconds 99-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.99_9percentile
(gauge)
Query time in milliseconds 99.9-percentile
shown as millisecond
envoy.mongo.collection.query.reply_time_ms.100percentile
(gauge)
Query time in milliseconds 100-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_num_docs.0percentile
(gauge)
Number of documents in reply for the callsite tag 0-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.25percentile
(gauge)
Number of documents in reply for the callsite tag 25-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.50percentile
(gauge)
Number of documents in reply for the callsite tag 50-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.75percentile
(gauge)
Number of documents in reply for the callsite tag 75-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.90percentile
(gauge)
Number of documents in reply for the callsite tag 90-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.95percentile
(gauge)
Number of documents in reply for the callsite tag 95-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.99percentile
(gauge)
Number of documents in reply for the callsite tag 99-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.99_9percentile
(gauge)
Number of documents in reply for the callsite tag 99.9-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_num_docs.100percentile
(gauge)
Number of documents in reply for the callsite tag 100-percentile
shown as document
envoy.mongo.collection.callsite.query.reply_size.0percentile
(gauge)
Size of the reply in bytes for the callsite tag 0-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.25percentile
(gauge)
Size of the reply in bytes for the callsite tag 25-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.50percentile
(gauge)
Size of the reply in bytes for the callsite tag 50-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.75percentile
(gauge)
Size of the reply in bytes for the callsite tag 75-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.90percentile
(gauge)
Size of the reply in bytes for the callsite tag 90-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.95percentile
(gauge)
Size of the reply in bytes for the callsite tag 95-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.99percentile
(gauge)
Size of the reply in bytes for the callsite tag 99-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.99_9percentile
(gauge)
Size of the reply in bytes for the callsite tag 99.9-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_size.100percentile
(gauge)
Size of the reply in bytes for the callsite tag 100-percentile
shown as byte
envoy.mongo.collection.callsite.query.reply_time_ms.0percentile
(gauge)
Query time in milliseconds for the callsite tag 0-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.25percentile
(gauge)
Query time in milliseconds for the callsite tag 25-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.50percentile
(gauge)
Query time in milliseconds for the callsite tag 50-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.75percentile
(gauge)
Query time in milliseconds for the callsite tag 75-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.90percentile
(gauge)
Query time in milliseconds for the callsite tag 90-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.95percentile
(gauge)
Query time in milliseconds for the callsite tag 95-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.99percentile
(gauge)
Query time in milliseconds for the callsite tag 99-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.99_9percentile
(gauge)
Query time in milliseconds for the callsite tag 99.9-percentile
shown as millisecond
envoy.mongo.collection.callsite.query.reply_time_ms.100percentile
(gauge)
Query time in milliseconds for the callsite tag 100-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.0percentile
(gauge)
Connection length in milliseconds 0-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.25percentile
(gauge)
Connection length in milliseconds 25-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.50percentile
(gauge)
Connection length in milliseconds 50-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.75percentile
(gauge)
Connection length in milliseconds 75-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.90percentile
(gauge)
Connection length in milliseconds 90-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.95percentile
(gauge)
Connection length in milliseconds 95-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.99percentile
(gauge)
Connection length in milliseconds 99-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.99_9percentile
(gauge)
Connection length in milliseconds 99.9-percentile
shown as millisecond
envoy.listener.downstream_cx_length_ms.100percentile
(gauge)
Connection length in milliseconds 100-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.0percentile
(gauge)
Connection length in milliseconds 0-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.25percentile
(gauge)
Connection length in milliseconds 25-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.50percentile
(gauge)
Connection length in milliseconds 50-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.75percentile
(gauge)
Connection length in milliseconds 75-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.90percentile
(gauge)
Connection length in milliseconds 90-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.95percentile
(gauge)
Connection length in milliseconds 95-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.99percentile
(gauge)
Connection length in milliseconds 99-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.99_9percentile
(gauge)
Connection length in milliseconds 99.9-percentile
shown as millisecond
envoy.http.downstream_cx_length_ms.100percentile
(gauge)
Connection length in milliseconds 100-percentile
shown as millisecond
envoy.http.downstream_rq_time.0percentile
(gauge)
Request time in milliseconds 0-percentile
shown as millisecond
envoy.http.downstream_rq_time.25percentile
(gauge)
Request time in milliseconds 25-percentile
shown as millisecond
envoy.http.downstream_rq_time.50percentile
(gauge)
Request time in milliseconds 50-percentile
shown as millisecond
envoy.http.downstream_rq_time.75percentile
(gauge)
Request time in milliseconds 75-percentile
shown as millisecond
envoy.http.downstream_rq_time.90percentile
(gauge)
Request time in milliseconds 90-percentile
shown as millisecond
envoy.http.downstream_rq_time.95percentile
(gauge)
Request time in milliseconds 95-percentile
shown as millisecond
envoy.http.downstream_rq_time.99percentile
(gauge)
Request time in milliseconds 99-percentile
shown as millisecond
envoy.http.downstream_rq_time.99_9percentile
(gauge)
Request time in milliseconds 99.9-percentile
shown as millisecond
envoy.http.downstream_rq_time.100percentile
(gauge)
Request time in milliseconds 100-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.0percentile
(gauge)
Connection establishment in milliseconds 0-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.25percentile
(gauge)
Connection establishment in milliseconds 25-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.50percentile
(gauge)
Connection establishment in milliseconds 50-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.75percentile
(gauge)
Connection establishment in milliseconds 75-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.90percentile
(gauge)
Connection establishment in milliseconds 90-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.95percentile
(gauge)
Connection establishment in milliseconds 95-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.99percentile
(gauge)
Connection establishment in milliseconds 99-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.99_9percentile
(gauge)
Connection establishment in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.upstream_cx_connect_ms.100percentile
(gauge)
Connection establishment in milliseconds 100-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.0percentile
(gauge)
Connection length in milliseconds 0-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.25percentile
(gauge)
Connection length in milliseconds 25-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.50percentile
(gauge)
Connection length in milliseconds 50-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.75percentile
(gauge)
Connection length in milliseconds 75-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.90percentile
(gauge)
Connection length in milliseconds 90-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.95percentile
(gauge)
Connection length in milliseconds 95-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.99percentile
(gauge)
Connection length in milliseconds 99-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.99_9percentile
(gauge)
Connection length in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.upstream_cx_length_ms.100percentile
(gauge)
Connection length in milliseconds 100-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.0percentile
(gauge)
Request time in milliseconds 0-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.25percentile
(gauge)
Request time in milliseconds 25-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.50percentile
(gauge)
Request time in milliseconds 50-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.75percentile
(gauge)
Request time in milliseconds 75-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.90percentile
(gauge)
Request time in milliseconds 90-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.95percentile
(gauge)
Request time in milliseconds 95-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.99percentile
(gauge)
Request time in milliseconds 99-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.99_9percentile
(gauge)
Request time in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.upstream_rq_time.100percentile
(gauge)
Request time in milliseconds 100-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.0percentile
(gauge)
Upstream canary request time in milliseconds 0-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.25percentile
(gauge)
Upstream canary request time in milliseconds 25-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.50percentile
(gauge)
Upstream canary request time in milliseconds 50-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.75percentile
(gauge)
Upstream canary request time in milliseconds 75-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.90percentile
(gauge)
Upstream canary request time in milliseconds 90-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.95percentile
(gauge)
Upstream canary request time in milliseconds 95-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.99percentile
(gauge)
Upstream canary request time in milliseconds 99-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.99_9percentile
(gauge)
Upstream canary request time in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.canary.upstream_rq_time.100percentile
(gauge)
Upstream canary request time in milliseconds 100-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.0percentile
(gauge)
Internal origin request time in milliseconds 0-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.25percentile
(gauge)
Internal origin request time in milliseconds 25-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.50percentile
(gauge)
Internal origin request time in milliseconds 50-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.75percentile
(gauge)
Internal origin request time in milliseconds 75-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.90percentile
(gauge)
Internal origin request time in milliseconds 90-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.95percentile
(gauge)
Internal origin request time in milliseconds 95-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.99percentile
(gauge)
Internal origin request time in milliseconds 99-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.99_9percentile
(gauge)
Internal origin request time in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.internal.upstream_rq_time.100percentile
(gauge)
Internal origin request time in milliseconds 100-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.0percentile
(gauge)
External origin request time in milliseconds 0-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.25percentile
(gauge)
External origin request time in milliseconds 25-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.50percentile
(gauge)
External origin request time in milliseconds 50-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.75percentile
(gauge)
External origin request time in milliseconds 75-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.90percentile
(gauge)
External origin request time in milliseconds 90-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.95percentile
(gauge)
External origin request time in milliseconds 95-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.99percentile
(gauge)
External origin request time in milliseconds 99-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.99_9percentile
(gauge)
External origin request time in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.external.upstream_rq_time.100percentile
(gauge)
External origin request time in milliseconds 100-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.0percentile
(gauge)
Zone request time in milliseconds 0-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.25percentile
(gauge)
Zone request time in milliseconds 25-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.50percentile
(gauge)
Zone request time in milliseconds 50-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.75percentile
(gauge)
Zone request time in milliseconds 75-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.90percentile
(gauge)
Zone request time in milliseconds 90-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.95percentile
(gauge)
Zone request time in milliseconds 95-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.99percentile
(gauge)
Zone request time in milliseconds 99-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.99_9percentile
(gauge)
Zone request time in milliseconds 99.9-percentile
shown as millisecond
envoy.cluster.zone.upstream_rq_time.100percentile
(gauge)
Zone request time in milliseconds 100-percentile
shown as millisecond

See metrics.py for a list of tags sent by each metric.

Events

The Envoy check does not include any events at this time.

Service Checks

envoy.can_connect:

Returns CRITICAL if the Agent cannot connect to Envoy to collect metrics, otherwise OK.

Troubleshooting

Need help? Contact Datadog Support.


Mistake in the docs? Feel free to contribute!