aws.bedrock.content_filtered_count (count) | The total number of times the text output content was filtered. Shown as time |
aws.bedrock.input_token_count (gauge) | The average number of input tokens used in prompts invoked for a model. Shown as token |
aws.bedrock.input_token_count.maximum (gauge) | The maximum number of input tokens used in prompts invoked for a model. Shown as token |
aws.bedrock.input_token_count.minimum (gauge) | The minimum number of input tokens used in prompts invoked for a model. Shown as token |
aws.bedrock.input_token_count.sum (count) | The total number of input tokens used in prompts invoked for a model. Shown as token |
aws.bedrock.invocation_client_errors (count) | The number of client invocation errors. Shown as error |
aws.bedrock.invocation_latency (gauge) | The average invocation latency. Shown as millisecond |
aws.bedrock.invocation_latency.maximum (gauge) | The maximum invocation latency over a one-minute period. Shown as millisecond |
aws.bedrock.invocation_latency.minimum (gauge) | The minimum invocation latency over a one-minute period. Shown as millisecond |
aws.bedrock.invocation_latency.p90 (gauge) | The 90th percentile of invocation latency over a one-minute period. Shown as millisecond |
aws.bedrock.invocation_latency.p95 (gauge) | The 95th percentile of invocation latency over a one-minute period. Shown as millisecond |
aws.bedrock.invocation_latency.p99 (gauge) | The 99th percentile of invocation latency over a one-minute period. Shown as millisecond |
aws.bedrock.invocation_server_errors (count) | The number of server invocation errors. Shown as error |
aws.bedrock.invocation_throttles (count) | The number of invocation throttles. Shown as throttle |
aws.bedrock.invocations (count) | The number of invocations sent to a model endpoint. Shown as invocation |
aws.bedrock.output_image_count (gauge) | The average number of output images returned by model invocations over a one-minute period. Shown as item |
aws.bedrock.output_token_count (gauge) | The average number of output tokens returned by model invocations over a one-minute period. Shown as token |
aws.bedrock.output_token_count.maximum (gauge) | The maximum number of output tokens returned by model invocations over a one-minute period. Shown as token |
aws.bedrock.output_token_count.minimum (gauge) | The minimum number of output tokens returned by model invocations over a one-minute period. Shown as token |
aws.bedrock.output_token_count.sum (count) | The total number of output tokens returned by all model invocations. Shown as token |
aws.bedrockagentcore.active_streaming_connections (gauge) | The current number of active WebSocket connections per agent. Shown as connection |
aws.bedrockagentcore.allow_decisions (count) | The number of policy decisions that resulted in ALLOW. |
aws.bedrockagentcore.api_key_fetch_failures (count) | The number of failed API key fetch operations by exception type. Shown as error |
aws.bedrockagentcore.api_key_fetch_success (count) | The number of successful API key fetch operations. |
aws.bedrockagentcore.connection_duration (gauge) | The amount of time that a connection stays open. Shown as millisecond |
aws.bedrockagentcore.connection_duration.maximum (gauge) | The maximum amount of time that a connection stays open. Shown as millisecond |
aws.bedrockagentcore.connection_duration.minimum (gauge) | The minimum amount of time that a connection stays open. Shown as millisecond |
aws.bedrockagentcore.connection_duration.p90 (gauge) | The 90th percentile of the amount of time that a connection stays open. Shown as millisecond |
aws.bedrockagentcore.connection_duration.p99 (gauge) | The 99th percentile of the amount of time that a connection stays open. Shown as millisecond |
aws.bedrockagentcore.cpu_used_vcpu_hours (count) | The total amount of virtual CPU consumed in vCPU-hours. Shown as hour |
aws.bedrockagentcore.creation_count (count) | The number of created memory events and memory records. |
aws.bedrockagentcore.deny_decisions (count) | The number of policy decisions that resulted in DENY. |
aws.bedrockagentcore.determining_policies (count) | The number of determining policies for a request. |
aws.bedrockagentcore.duration (gauge) | The time elapsed between receiving the request and sending the final response token. Shown as millisecond |
aws.bedrockagentcore.duration.maximum (gauge) | The maximum time elapsed between receiving the request and sending the final response token. Shown as millisecond |
aws.bedrockagentcore.duration.minimum (gauge) | The minimum time elapsed between receiving the request and sending the final response token. Shown as millisecond |
aws.bedrockagentcore.duration.p90 (gauge) | The 90th percentile time elapsed between receiving the request and sending the final response token. Shown as millisecond |
aws.bedrockagentcore.duration.p99 (gauge) | The 99th percentile time elapsed between receiving the request and sending the final response token. Shown as millisecond |
aws.bedrockagentcore.errors (count) | The total number of system and user errors. Shown as error |
aws.bedrockagentcore.failed_extraction (count) | The number of failed extraction jobs. Shown as error |
aws.bedrockagentcore.inbound_authorization_failure (count) | The total number of inbound authorization failures. Shown as error |
aws.bedrockagentcore.inbound_authorization_success (count) | The total number of successful inbound authorization operations. |
aws.bedrockagentcore.inbound_streaming_bytes_processed (count) | The total number of bytes processed in WebSocket frames received from clients to agent containers. Shown as byte |
aws.bedrockagentcore.invocations (count) | The total number of requests made to the Data Plane API. Shown as invocation |
aws.bedrockagentcore.latency (gauge) | The time elapsed between receiving the request and sending the first response token. Shown as millisecond |
aws.bedrockagentcore.latency.maximum (gauge) | The maximum time elapsed between receiving the request and sending the first response token. Shown as millisecond |
aws.bedrockagentcore.latency.minimum (gauge) | The minimum time elapsed between receiving the request and sending the first response token. Shown as millisecond |
aws.bedrockagentcore.latency.p90 (gauge) | The 90th percentile time elapsed between receiving the request and sending the first response token. Shown as millisecond |
aws.bedrockagentcore.latency.p99 (gauge) | The 99th percentile time elapsed between receiving the request and sending the first response token. Shown as millisecond |
aws.bedrockagentcore.memory_used_gb_hours (count) | The total amount of memory consumed in GB-hours. Shown as hour |
aws.bedrockagentcore.no_determining_policies (count) | The number of requests denied due to no determining policies. |
aws.bedrockagentcore.outbound_streaming_bytes_processed (count) | The total number of bytes processed in WebSocket frames sent from agent containers to clients. Shown as byte |
aws.bedrockagentcore.resource_access_token_fetch_failures (count) | The number of failed OAuth2 token fetch operations by exception type. Shown as error |
aws.bedrockagentcore.resource_access_token_fetch_success (count) | The number of successful OAuth2 token fetch operations from credential providers. |
aws.bedrockagentcore.sessions (gauge) | The number of agent sessions. Shown as connection |
aws.bedrockagentcore.sessions.maximum (gauge) | The maximum number of agent sessions. Shown as connection |
aws.bedrockagentcore.sessions.minimum (gauge) | The minimum number of agent sessions. Shown as connection |
aws.bedrockagentcore.stream_publishing_failure (count) | The number of memory record events that failed to publish to Kinesis Data Streams. Shown as error |
aws.bedrockagentcore.system_errors (count) | The number of server-side errors encountered during request processing. Shown as error |
aws.bedrockagentcore.take_over_count (count) | The total number of browser user takeover events. |
aws.bedrockagentcore.take_over_release_count (count) | The total number of browser user control release events. |
aws.bedrockagentcore.target_execution_time (gauge) | The time taken to execute the target over Lambda, OpenAPI, or other integrations. Shown as millisecond |
aws.bedrockagentcore.target_execution_time.maximum (gauge) | The maximum time taken to execute the target over Lambda, OpenAPI, or other integrations. Shown as millisecond |
aws.bedrockagentcore.target_execution_time.minimum (gauge) | The minimum time taken to execute the target over Lambda, OpenAPI, or other integrations. Shown as millisecond |
aws.bedrockagentcore.target_execution_time.p90 (gauge) | The 90th percentile time taken to execute the target over Lambda, OpenAPI, or other integrations. Shown as millisecond |
aws.bedrockagentcore.target_execution_time.p99 (gauge) | The 99th percentile time taken to execute the target over Lambda, OpenAPI, or other integrations. Shown as millisecond |
aws.bedrockagentcore.target_type_apigateway (count) | The total number of requests served by API Gateway targets. |
aws.bedrockagentcore.target_type_lambda (count) | The total number of requests served by Lambda targets. |
aws.bedrockagentcore.target_type_mcp (count) | The total number of requests served by MCP targets. |
aws.bedrockagentcore.target_type_openapi (count) | The total number of requests served by OpenAPI targets. |
aws.bedrockagentcore.target_type_smithy (count) | The total number of requests served by Smithy targets. |
aws.bedrockagentcore.throttles (count) | The number of requests throttled by the service due to exceeding allowed rate limits. Shown as throttle |
aws.bedrockagentcore.token_count (count) | The number of tokens used. Shown as token |
aws.bedrockagentcore.total_mismatched_policies (count) | The number of failed policies for a request due to missing attribute or type mismatch. |
aws.bedrockagentcore.user_errors (count) | The number of client-side errors resulting from invalid requests. Shown as error |
aws.bedrockagentcore.workload_access_token_fetch_failures (count) | The number of failed workload access token fetch operations by exception type. Shown as error |
aws.bedrockagentcore.workload_access_token_fetch_success (count) | The number of successful workload access token fetch operations. |
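
As an illustration of how the count metrics above can be combined, the following minimal Python sketch derives average tokens per invocation from `aws.bedrock.input_token_count.sum`, `aws.bedrock.output_token_count.sum`, and `aws.bedrock.invocations`. The function name is hypothetical, and the sketch assumes all three values were summed over the same time window; fetching the values from a monitoring backend is deliberately left out.

```python
from typing import Dict, Optional


def tokens_per_invocation(
    input_token_sum: float,
    output_token_sum: float,
    invocations: float,
) -> Optional[Dict[str, float]]:
    """Average input, output, and total tokens per invocation.

    Hypothetical helper: the arguments are assumed to be the values of
    aws.bedrock.input_token_count.sum, aws.bedrock.output_token_count.sum,
    and aws.bedrock.invocations over the same time window.
    Returns None when there were no invocations, to avoid dividing by zero.
    """
    if invocations <= 0:
        return None
    avg_input = input_token_sum / invocations
    avg_output = output_token_sum / invocations
    return {
        "input": avg_input,
        "output": avg_output,
        "total": avg_input + avg_output,
    }


if __name__ == "__main__":
    # Example values chosen for illustration only, not real measurements.
    print(tokens_per_invocation(1200, 300, 10))
```

Returning `None` for an empty window keeps a quiet period distinguishable from a period with genuinely zero token usage.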