SageMaker Endpoint

An AWS SageMaker Endpoint is a fully managed service that hosts trained machine learning models for real-time inference. Once a model is deployed to an endpoint, it can receive input data and return predictions with low latency. Endpoints are scalable, allowing automatic adjustment of compute resources based on traffic. They support multiple instance types and configurations to balance performance and cost.

aws.sagemaker_endpoint

Fields

TitleIDTypeData TypeDescription
_keycorestring
account_idcorestring
async_inference_configcorejsonReturns the description of an endpoint configuration created using the CreateEndpointConfig API.
creation_timecoretimestampA timestamp that shows when the endpoint was created.
data_capture_configcorejsonThe currently active data capture configuration used by your Endpoint.
endpoint_arncorestringThe Amazon Resource Name (ARN) of the endpoint.
endpoint_config_namecorestringThe name of the endpoint configuration associated with this endpoint.
endpoint_namecorestringName of the endpoint.
endpoint_statuscorestringThe status of the endpoint. OutOfService: Endpoint is not available to take incoming requests. Creating: CreateEndpoint is executing. Updating: UpdateEndpoint or UpdateEndpointWeightsAndCapacities is executing. SystemUpdating: Endpoint is undergoing maintenance and cannot be updated or deleted or re-scaled until it has completed. This maintenance operation does not change any customer-specified values such as VPC config, KMS encryption, model, instance type, or instance count. RollingBack: Endpoint fails to scale up or down or change its variant weight and is in the process of rolling back to its previous configuration. Once the rollback completes, endpoint returns to an InService status. This transitional status only applies to an endpoint that has autoscaling enabled and is undergoing variant weight or capacity changes as part of an UpdateEndpointWeightsAndCapacities call or when the UpdateEndpointWeightsAndCapacities operation is called explicitly. InService: Endpoint is available to process incoming requests. Deleting: DeleteEndpoint is executing. Failed: Endpoint could not be created, updated, or re-scaled. Use the FailureReason value returned by DescribeEndpoint for information about the failure. DeleteEndpoint is the only operation that can be performed on a failed endpoint. UpdateRollbackFailed: Both the rolling deployment and auto-rollback failed. Your endpoint is in service with a mix of the old and new endpoint configurations. For information about how to remedy this issue and restore the endpoint's status to InService, see Rolling Deployments.
explainer_configcorejsonThe configuration parameters for an explainer.
failure_reasoncorestringIf the status of the endpoint is Failed, the reason why it failed.
last_deployment_configcorejsonThe most recent deployment configuration for the endpoint.
last_modified_timecoretimestampA timestamp that shows when the endpoint was last modified.
pending_deployment_summarycorejsonReturns the summary of an in-progress deployment. This field is only returned when the endpoint is creating or updating with a new endpoint configuration.
production_variantscorejsonAn array of ProductionVariantSummary objects, one for each model hosted behind this endpoint.
shadow_production_variantscorejsonAn array of ProductionVariantSummary objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on ProductionVariants.
tagscorehstore