SageMaker Cluster

SageMaker Cluster in AWS is a managed resource that provides a group of compute instances for running machine learning workloads. It allows you to scale training and inference jobs across multiple nodes, offering high performance and flexibility for distributed ML tasks. This resource is designed to simplify cluster management, enabling users to focus on model development and deployment without handling infrastructure complexity.

aws.sagemaker_cluster

Fields

TitleIDTypeData TypeDescription
_keycorestring
account_idcorestring
cluster_arncorestringThe Amazon Resource Name (ARN) of the SageMaker HyperPod cluster.
cluster_namecorestringThe name of the SageMaker HyperPod cluster.
cluster_statuscorestringThe status of the SageMaker HyperPod cluster.
creation_timecoretimestampThe time when the SageMaker Cluster is created.
failure_messagecorestringThe failure message of the SageMaker HyperPod cluster.
instance_groupscorejsonThe instance groups of the SageMaker HyperPod cluster.
node_recoverycorestringThe node recovery mode configured for the SageMaker HyperPod cluster.
orchestratorcorejsonThe type of orchestrator used for the SageMaker HyperPod cluster.
tagscorehstore
vpc_configcorejsonSpecifies an Amazon Virtual Private Cloud (VPC) that your SageMaker jobs, hosted models, and compute resources have access to. You can control access to and from your resources by configuring a VPC. For more information, see Give SageMaker Access to Resources in your Amazon VPC.