- 필수 기능
- 시작하기
- Glossary
- 표준 속성
- Guides
- Agent
- 통합
- 개방형텔레메트리
- 개발자
- Administrator's Guide
- API
- Datadog Mobile App
- CoScreen
- Cloudcraft
- 앱 내
- 서비스 관리
- 인프라스트럭처
- 애플리케이션 성능
- APM
- Continuous Profiler
- 스팬 시각화
- 데이터 스트림 모니터링
- 데이터 작업 모니터링
- 디지털 경험
- 소프트웨어 제공
- 보안
- AI Observability
- 로그 관리
- 관리
",t};e.buildCustomizationMenuUi=t;function n(e){let t='
",t}function s(e){let n=e.filter.currentValue||e.filter.defaultValue,t='${e.filter.label}
`,e.filter.options.forEach(s=>{let o=s.id===n;t+=``}),t+="${e.filter.label}
`,t+=`account_id
Type: STRING
algorithm_specification
Type: STRUCT
Provider name: AlgorithmSpecification
Description: Information about the algorithm used for training, and algorithm metadata.
algorithm_name
STRING
AlgorithmName
AlgorithmName
parameter or the image URI of the algorithm container to the TrainingImage
parameter. Note that the AlgorithmName
parameter is mutually exclusive with the TrainingImage
parameter. If you specify a value for the AlgorithmName
parameter, you can’t specify a value for TrainingImage
, and vice versa. If you specify values for both parameters, the training job might break; if you don’t specify any value for both parameters, the training job might raise a null
error.container_arguments
UNORDERED_LIST_STRING
ContainerArguments
container_entrypoint
UNORDERED_LIST_STRING
ContainerEntrypoint
enable_sage_maker_metrics_time_series
BOOLEAN
EnableSageMakerMetricsTimeSeries
true
. The default is false
and time-series metrics aren’t generated except in the following cases:metric_definitions
UNORDERED_LIST_STRUCT
MetricDefinitions
name
STRING
Name
regex
STRING
Regex
training_image
STRING
TrainingImage
registry/repository[:tag]
and registry/repository[@digest]
image path formats. For more information about using your custom training container, see Using Your Own Algorithms with Amazon SageMaker. AlgorithmName
parameter or the image URI of the algorithm container to the TrainingImage
parameter. For more information, see the note in the AlgorithmName
parameter description.training_image_config
STRUCT
TrainingImageConfig
training_repository_access_mode
STRING
TrainingRepositoryAccessMode
Vpc
.training_repository_auth_config
STRUCT
TrainingRepositoryAuthConfig
training_repository_credentials_provider_arn
STRING
TrainingRepositoryCredentialsProviderArn
training_input_mode
STRING
TrainingInputMode
auto_ml_job_arn
Type: STRING
Provider name: AutoMLJobArn
Description: The Amazon Resource Name (ARN) of an AutoML job.
billable_time_in_seconds
Type: INT32
Provider name: BillableTimeInSeconds
Description: The billable time in seconds. Billable time refers to the absolute wall-clock time. Multiply BillableTimeInSeconds
by the number of instances (InstanceCount
) in your training cluster to get the total compute time SageMaker bills you if you run distributed training. The formula is as follows: BillableTimeInSeconds * InstanceCount
. You can calculate the savings from using managed spot training using the formula (1 - BillableTimeInSeconds / TrainingTimeInSeconds) * 100
. For example, if BillableTimeInSeconds
is 100 and TrainingTimeInSeconds
is 500, the savings is 80%.
checkpoint_config
Type: STRUCT
Provider name: CheckpointConfig
local_path
STRING
LocalPath
/opt/ml/checkpoints/
.s3_uri
STRING
S3Uri
s3://bucket-name/key-name-prefix
.creation_time
Type: TIMESTAMP
Provider name: CreationTime
Description: A timestamp that indicates when the training job was created.
debug_hook_config
Type: STRUCT
Provider name: DebugHookConfig
collection_configurations
UNORDERED_LIST_STRUCT
CollectionConfigurations
CollectionConfiguration
parameter, see Use the SageMaker and Debugger Configuration API Operations to Create, Update, and Debug Your Training Job.collection_name
STRING
CollectionName
collection_parameters
MAP_STRING_STRING
CollectionParameters
“name”
, “include_regex”
, “reduction_config”
, “save_config”
, “tensor_names”
, and “save_histogram”
.hook_parameters
MAP_STRING_STRING
HookParameters
local_path
STRING
LocalPath
/opt/ml/output/tensors/
.s3_output_path
STRING
S3OutputPath
debug_rule_configurations
Type: UNORDERED_LIST_STRUCT
Provider name: DebugRuleConfigurations
Description: Configuration information for Amazon SageMaker Debugger rules for debugging output tensors.
instance_type
STRING
InstanceType
local_path
STRING
LocalPath
/opt/ml/processing/output/rule/
.rule_configuration_name
STRING
RuleConfigurationName
rule_evaluator_image
STRING
RuleEvaluatorImage
rule_parameters
MAP_STRING_STRING
RuleParameters
s3_output_path
STRING
S3OutputPath
volume_size_in_gb
INT32
VolumeSizeInGB
debug_rule_evaluation_statuses
Type: UNORDERED_LIST_STRUCT
Provider name: DebugRuleEvaluationStatuses
Description: Evaluation status of Amazon SageMaker Debugger rules for debugging on a training job.
last_modified_time
TIMESTAMP
LastModifiedTime
rule_configuration_name
STRING
RuleConfigurationName
rule_evaluation_job_arn
STRING
RuleEvaluationJobArn
rule_evaluation_status
STRING
RuleEvaluationStatus
status_details
STRING
StatusDetails
enable_inter_container_traffic_encryption
Type: BOOLEAN
Provider name: EnableInterContainerTrafficEncryption
Description: To encrypt all communications between ML compute instances in distributed training, choose True
. Encryption provides greater security for distributed training, but training might take longer. How long it takes depends on the amount of communication between compute instances, especially if you use a deep learning algorithms in distributed training.
enable_managed_spot_training
Type: BOOLEAN
Provider name: EnableManagedSpotTraining
Description: A Boolean indicating whether managed spot training is enabled (True
) or not (False
).
enable_network_isolation
Type: BOOLEAN
Provider name: EnableNetworkIsolation
Description: If you want to allow inbound or outbound network calls, except for calls between peers within a training cluster for distributed training, choose True
. If you enable network isolation for training jobs that are configured to use a VPC, SageMaker downloads and uploads customer data and model artifacts through the specified VPC, but the training container does not have network access.
environment
Type: MAP_STRING_STRING
Provider name: Environment
Description: The environment variables to set in the Docker container.
experiment_config
Type: STRUCT
Provider name: ExperimentConfig
experiment_name
STRING
ExperimentName
run_name
STRING
RunName
trial_component_display_name
STRING
TrialComponentDisplayName
trial_name
STRING
TrialName
failure_reason
Type: STRING
Provider name: FailureReason
Description: If the training job failed, the reason it failed.
final_metric_data_list
Type: UNORDERED_LIST_STRUCT
Provider name: FinalMetricDataList
Description: A collection of MetricData
objects that specify the names, values, and dates and times that the training algorithm emitted to Amazon CloudWatch.
metric_name
STRING
MetricName
timestamp
TIMESTAMP
Timestamp
value
FLOAT
Value
hyper_parameters
Type: MAP_STRING_STRING
Provider name: HyperParameters
Description: Algorithm-specific parameters.
infra_check_config
Type: STRUCT
Provider name: InfraCheckConfig
Description: Contains information about the infrastructure health check configuration for the training job.
enable_infra_check
BOOLEAN
EnableInfraCheck
input_data_config
Type: UNORDERED_LIST_STRUCT
Provider name: InputDataConfig
Description: An array of Channel
objects that describes each data input channel.
channel_name
STRING
ChannelName
compression_type
STRING
CompressionType
None
. CompressionType
is used only in Pipe input mode. In File mode, leave this field unset or set it to None.content_type
STRING
ContentType
data_source
STRUCT
DataSource
file_system_data_source
STRUCT
FileSystemDataSource
directory_path
STRING
DirectoryPath
file_system_access_mode
STRING
FileSystemAccessMode
ro
(read-only) or rw
(read-write) mode.file_system_id
STRING
FileSystemId
file_system_type
STRING
FileSystemType
s3_data_source
STRUCT
S3DataSource
attribute_names
UNORDERED_LIST_STRING
AttributeNames
hub_access_config
STRUCT
HubAccessConfig
hub_content_arn
STRING
HubContentArn
ModelReference
resource type that points to a SageMaker JumpStart public hub model.instance_group_names
UNORDERED_LIST_STRING
InstanceGroupNames
model_access_config
STRUCT
ModelAccessConfig
accept_eula
BOOLEAN
AcceptEula
AcceptEula
value must be explicitly defined as True
in order to accept the EULA that this model requires. You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.s3_data_distribution_type
STRING
S3DataDistributionType
FullyReplicated
. If you want SageMaker to replicate a subset of data on each ML compute instance that is launched for model training, specify ShardedByS3Key
. If there are n ML compute instances launched for a training job, each instance gets approximately 1/n of the number of S3 objects. In this case, model training on each machine uses only the subset of training data. Don’t choose more ML compute instances for training than available S3 objects. If you do, some nodes won’t get any data and you will pay for nodes that aren’t getting any training data. This applies in both File and Pipe modes. Keep this in mind when developing algorithms. In distributed training, where you use multiple ML compute EC2 instances, you might choose ShardedByS3Key
. If the algorithm requires copying training data to the ML storage volume (when TrainingInputMode
is set to File
), this copies 1/n of the number of objects.s3_data_type
STRING
S3DataType
S3Prefix
, S3Uri
identifies a key name prefix. SageMaker uses all objects that match the specified key name prefix for model training. If you choose ManifestFile
, S3Uri
identifies an object that is a manifest file containing a list of object keys that you want SageMaker to use for model training. If you choose AugmentedManifestFile
, S3Uri
identifies an object that is an augmented manifest file in JSON lines format. This file contains the data you want to use for model training. AugmentedManifestFile
can only be used if the Channel’s input mode is Pipe
.s3_uri
STRING
S3Uri
S3DataType
, identifies either a key name prefix or a manifest. For example:s3://bucketname/exampleprefix/
s3://bucketname/example.manifest
A manifest is an S3 object which is a JSON file consisting of an array of elements. The first element is a prefix which is followed by one or more suffixes. SageMaker appends the suffix elements to the prefix to get a full set of S3Uri
. Note that the prefix must be a valid non-empty S3Uri
that precludes users from specifying a manifest whose individual S3Uri
is sourced from different S3 buckets. The following code example shows a valid manifest format: [ {“prefix”: “s3://customer_bucket/some/prefix/”},
“relative/path/to/custdata-1”,
“relative/path/custdata-2”,
…
“relative/path/custdata-N”
]
This JSON is equivalent to the following S3Uri
list: s3://customer_bucket/some/prefix/relative/path/to/custdata-1
s3://customer_bucket/some/prefix/relative/path/custdata-2
…
s3://customer_bucket/some/prefix/relative/path/custdata-N
The complete set of S3Uri
in this manifest is the input data for the channel for this data source. The object that each S3Uri
points to must be readable by the IAM role that SageMaker uses to perform tasks on your behalf.input_mode
STRING
InputMode
InputMode
, SageMaker uses the value set for TrainingInputMode
. Use this parameter to override the TrainingInputMode
setting in a AlgorithmSpecification request when you have a channel that needs a different input mode from the training job’s general setting. To download the data from Amazon Simple Storage Service (Amazon S3) to the provisioned ML storage volume, and mount the directory to a Docker volume, use File
input mode. To stream data directly from Amazon S3 to the container, choose Pipe
input mode. To use a model for incremental training, choose File
input model.record_wrapper_type
STRING
RecordWrapperType
shuffle_config
STRUCT
ShuffleConfig
S3Prefix
for S3DataType
, this shuffles the results of the S3 key prefix matches. If you use ManifestFile
, the order of the S3 object references in the ManifestFile
is shuffled. If you use AugmentedManifestFile
, the order of the JSON lines in the AugmentedManifestFile
is shuffled. The shuffling order is determined using the Seed
value. For Pipe input mode, shuffling is done at the start of every epoch. With large datasets this ensures that the order of the training data is different for each epoch, it helps reduce bias and possible overfitting. In a multi-node training job when ShuffleConfig is combined with S3DataDistributionType
of ShardedByS3Key
, the data is shuffled across nodes so that the content sent to a particular node on the first epoch might be sent to a different node on the second epoch.seed
INT64
Seed
ShuffleConfig
value.labeling_job_arn
Type: STRING
Provider name: LabelingJobArn
Description: The Amazon Resource Name (ARN) of the SageMaker Ground Truth labeling job that created the transform or training job.
last_modified_time
Type: TIMESTAMP
Provider name: LastModifiedTime
Description: A timestamp that indicates when the status of the training job was last modified.
model_artifacts
Type: STRUCT
Provider name: ModelArtifacts
Description: Information about the Amazon S3 location that is configured for storing model artifacts.
s3_model_artifacts
STRING
S3ModelArtifacts
s3://bucket-name/keynameprefix/model.tar.gz
.output_data_config
Type: STRUCT
Provider name: OutputDataConfig
Description: The S3 path where model artifacts that you configured when creating the job are stored. SageMaker creates subfolders for model artifacts.
compression_type
STRING
CompressionType
None
to output an uncompressed model, recommended for large model outputs. Defaults to gzip.kms_key_id
STRING
KmsKeyId
KmsKeyId
can be any of the following formats:“1234abcd-12ab-34cd-56ef-1234567890ab”
“arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab”
“alias/ExampleAlias”
“arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias”
kms:Encrypt
. If you don’t provide a KMS key ID, SageMaker uses the default KMS key for Amazon S3 for your role’s account. For more information, see KMS-Managed Encryption Keys in the Amazon Simple Storage Service Developer Guide. If the output data is stored in Amazon S3 Express One Zone, it is encrypted with server-side encryption with Amazon S3 managed keys (SSE-S3). KMS key is not supported for Amazon S3 Express One Zone The KMS key policy must grant permission to the IAM role that you specify in your CreateTrainingJob
, CreateTransformJob
, or CreateHyperParameterTuningJob
requests. For more information, see Using Key Policies in Amazon Web Services KMS in the Amazon Web Services Key Management Service Developer Guide.s3_output_path
STRING
S3OutputPath
s3://bucket-name/key-name-prefix
.profiler_config
Type: STRUCT
Provider name: ProfilerConfig
disable_profiler
BOOLEAN
DisableProfiler
True
.profiling_interval_in_milliseconds
INT64
ProfilingIntervalInMilliseconds
profiling_parameters
MAP_STRING_STRING
ProfilingParameters
DetailedProfilingConfig
, PythonProfilingConfig
, and DataLoaderProfilingConfig
. The following codes are configuration structures for the ProfilingParameters
parameter. To learn more about how to configure the ProfilingParameters
parameter, see Use the SageMaker and Debugger Configuration API Operations to Create, Update, and Debug Your Training Job.s3_output_path
STRING
S3OutputPath
profiler_rule_configurations
Type: UNORDERED_LIST_STRUCT
Provider name: ProfilerRuleConfigurations
Description: Configuration information for Amazon SageMaker Debugger rules for profiling system and framework metrics.
instance_type
STRING
InstanceType
local_path
STRING
LocalPath
/opt/ml/processing/output/rule/
.rule_configuration_name
STRING
RuleConfigurationName
rule_evaluator_image
STRING
RuleEvaluatorImage
rule_parameters
MAP_STRING_STRING
RuleParameters
s3_output_path
STRING
S3OutputPath
volume_size_in_gb
INT32
VolumeSizeInGB
profiler_rule_evaluation_statuses
Type: UNORDERED_LIST_STRUCT
Provider name: ProfilerRuleEvaluationStatuses
Description: Evaluation status of Amazon SageMaker Debugger rules for profiling on a training job.
last_modified_time
TIMESTAMP
LastModifiedTime
rule_configuration_name
STRING
RuleConfigurationName
rule_evaluation_job_arn
STRING
RuleEvaluationJobArn
rule_evaluation_status
STRING
RuleEvaluationStatus
status_details
STRING
StatusDetails
profiling_status
Type: STRING
Provider name: ProfilingStatus
Description: Profiling status of a training job.
remote_debug_config
Type: STRUCT
Provider name: RemoteDebugConfig
Description: Configuration for remote debugging. To learn more about the remote debugging functionality of SageMaker, see Access a training container through Amazon Web Services Systems Manager (SSM) for remote debugging.
enable_remote_debug
BOOLEAN
EnableRemoteDebug
resource_config
Type: STRUCT
Provider name: ResourceConfig
Description: Resources, including ML compute instances and ML storage volumes, that are configured for model training.
instance_count
INT32
InstanceCount
instance_groups
UNORDERED_LIST_STRUCT
InstanceGroups
instance_count
INT32
InstanceCount
instance_group_name
STRING
InstanceGroupName
instance_type
STRING
InstanceType
instance_type
STRING
InstanceType
ml.p4de.24xlarge
) to reduce model training time. The ml.p4de.24xlarge
instances are available in the following Amazon Web Services Regions.keep_alive_period_in_seconds
INT32
KeepAlivePeriodInSeconds
training_plan_arn
STRING
TrainingPlanArn
volume_kms_key_id
STRING
VolumeKmsKeyId
VolumeKmsKeyId
when using an instance type with local storage. For a list of instance types that support local instance storage, see Instance Store Volumes. For more information about local instance storage encryption, see SSD Instance Store Volumes. VolumeKmsKeyId
can be in any of the following formats:“1234abcd-12ab-34cd-56ef-1234567890ab”
“arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab”
volume_size_in_gb
INT32
VolumeSizeInGB
File
as the TrainingInputMode
in the algorithm specification. When using an ML instance with NVMe SSD volumes, SageMaker doesn’t provision Amazon EBS General Purpose SSD (gp2) storage. Available storage is fixed to the NVMe-type instance’s storage capacity. SageMaker configures storage paths for training datasets, checkpoints, model artifacts, and outputs to use the entire capacity of the instance storage. For example, ML instance families with the NVMe-type instance storage include ml.p4d
, ml.g4dn
, and ml.g5
. When using an ML instance with the EBS-only storage option and without instance storage, you must define the size of EBS volume through VolumeSizeInGB
in the ResourceConfig
API. For example, ML instance families that use EBS volumes include ml.c5
and ml.p2
. To look up instance types and their instance storage types and volumes, see Amazon EC2 Instance Types. To find the default local paths defined by the SageMaker training platform, see Amazon SageMaker Training Storage Folders for Training Datasets, Checkpoints, Model Artifacts, and Outputs.retry_strategy
Type: STRUCT
Provider name: RetryStrategy
Description: The number of times to retry the job when the job fails due to an InternalServerError
.
maximum_retry_attempts
INT32
MaximumRetryAttempts
SecondaryStatus
is changed to STARTING
.role_arn
Type: STRING
Provider name: RoleArn
Description: The Amazon Web Services Identity and Access Management (IAM) role configured for the training job.
secondary_status
Type: STRING
Provider name: SecondaryStatus
Description: Provides detailed information about the state of the training job. For detailed information on the secondary status of the training job, see StatusMessage
under SecondaryStatusTransition. SageMaker provides primary statuses and secondary statuses that apply to each of them:
Starting
- Starting the training job.Downloading
- An optional stage for algorithms that support File
training input mode. It indicates that data is being downloaded to the ML storage volumes.Training
- Training is in progress.Interrupted
- The job stopped because the managed spot training instances were interrupted.Uploading
- Training is complete and the model artifacts are being uploaded to the S3 location.Completed
- The training job has completed.Failed
- The training job has failed. The reason for the failure is returned in the FailureReason
field of DescribeTrainingJobResponse
.MaxRuntimeExceeded
- The job stopped because it exceeded the maximum allowed runtime.MaxWaitTimeExceeded
- The job stopped because it exceeded the maximum allowed wait time.Stopped
- The training job has stopped.Stopping
- Stopping the training job.SecondaryStatus
are subject to change. LaunchingMLInstances
PreparingTraining
DownloadingTrainingImage
secondary_status_transitions
Type: UNORDERED_LIST_STRUCT
Provider name: SecondaryStatusTransitions
Description: A history of all of the secondary statuses that the training job has transitioned through.
end_time
TIMESTAMP
EndTime
start_time
TIMESTAMP
StartTime
status
STRING
Status
Starting
- Starting the training job.Downloading
- An optional stage for algorithms that support File
training input mode. It indicates that data is being downloaded to the ML storage volumes.Training
- Training is in progress.Uploading
- Training is complete and the model artifacts are being uploaded to the S3 location.Completed
- The training job has completed.Failed
- The training job has failed. The reason for the failure is returned in the FailureReason
field of DescribeTrainingJobResponse
.MaxRuntimeExceeded
- The job stopped because it exceeded the maximum allowed runtime.Stopped
- The training job has stopped.Stopping
- Stopping the training job.LaunchingMLInstances
PreparingTrainingStack
DownloadingTrainingImage
status_message
STRING
StatusMessage
TrainingJobStatus
and SecondaryStatus
in DescribeTrainingJob, and StatusMessage
together. For example, at the start of a training job, you might see the following:TrainingJobStatus
- InProgressSecondaryStatus
- TrainingStatusMessage
- Downloading the training imagestopping_condition
Type: STRUCT
Provider name: StoppingCondition
Description: Specifies a limit to how long a model training job can run. It also specifies how long a managed Spot training job has to complete. When the job reaches the time limit, SageMaker ends the training job. Use this API to cap model training costs. To stop a job, SageMaker sends the algorithm the SIGTERM
signal, which delays job termination for 120 seconds. Algorithms can use this 120-second window to save the model artifacts, so the results of training are not lost.
max_pending_time_in_seconds
INT32
MaxPendingTimeInSeconds
Pending
job states count against the MaxPendingTimeInSeconds
limit. The following scenarios do not increment the MaxPendingTimeInSeconds
counter:Scheduled
state: Jobs queued (in Pending
status) before a plan’s start date (waiting for scheduled start time)Pending
status between two capacity reservation periodsMaxPendingTimeInSeconds
only increments when jobs are actively waiting for capacity in an Active
plan.max_runtime_in_seconds
INT32
MaxRuntimeInSeconds
TimeOut
error is generated. We recommend starting with 900 seconds and increasing as necessary based on your model. For all other jobs, if the job does not complete during this time, SageMaker ends the job. When RetryStrategy
is specified in the job request, MaxRuntimeInSeconds
specifies the maximum time for all of the attempts in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days. The maximum time that a TrainingJob
can run in total, including any time spent publishing metrics or archiving and uploading models after it has been stopped, is 30 days.max_wait_time_in_seconds
INT32
MaxWaitTimeInSeconds
MaxRuntimeInSeconds
. If the job does not complete during this time, SageMaker ends the job. When RetryStrategy
is specified in the job request, MaxWaitTimeInSeconds
specifies the maximum time for all of the attempts in total, not each individual attempt.tags
Type: UNORDERED_LIST_STRING
tensor_board_output_config
Type: STRUCT
Provider name: TensorBoardOutputConfig
local_path
STRING
LocalPath
/opt/ml/output/tensorboard
.s3_output_path
STRING
S3OutputPath
training_end_time
Type: TIMESTAMP
Provider name: TrainingEndTime
Description: Indicates the time when the training job ends on training instances. You are billed for the time interval between the value of TrainingStartTime
and this time. For successful jobs and stopped jobs, this is the time after model artifacts are uploaded. For failed jobs, this is the time when SageMaker detects a job failure.
training_job_arn
Type: STRING
Provider name: TrainingJobArn
Description: The Amazon Resource Name (ARN) of the training job.
training_job_name
Type: STRING
Provider name: TrainingJobName
Description: Name of the model training job.
training_job_status
Type: STRING
Provider name: TrainingJobStatus
Description: The status of the training job. SageMaker provides the following training job statuses:
InProgress
- The training is in progress.Completed
- The training job has completed.Failed
- The training job has failed. To see the reason for the failure, see the FailureReason
field in the response to a DescribeTrainingJobResponse
call.Stopping
- The training job is stopping.Stopped
- The training job has stopped.SecondaryStatus
.training_start_time
Type: TIMESTAMP
Provider name: TrainingStartTime
Description: Indicates the time when the training job starts on training instances. You are billed for the time interval between this time and the value of TrainingEndTime
. The start time in CloudWatch Logs might be later than this time. The difference is due to the time it takes to download the training data and to the size of the training container.
training_time_in_seconds
Type: INT32
Provider name: TrainingTimeInSeconds
Description: The training time in seconds.
tuning_job_arn
Type: STRING
Provider name: TuningJobArn
Description: The Amazon Resource Name (ARN) of the associated hyperparameter tuning job if the training job was launched by a hyperparameter tuning job.
vpc_config
Type: STRUCT
Provider name: VpcConfig
Description: A VpcConfig object that specifies the VPC that this training job has access to. For more information, see Protect Training Jobs by Using an Amazon Virtual Private Cloud.
security_group_ids
UNORDERED_LIST_STRING
SecurityGroupIds
sg-xxxxxxxx
. Specify the security groups for the VPC that is specified in the Subnets
field.subnets
UNORDERED_LIST_STRING
Subnets
warm_pool_status
Type: STRUCT
Provider name: WarmPoolStatus
Description: The status of the warm pool associated with the training job.
resource_retained_billable_time_in_seconds
INT32
ResourceRetainedBillableTimeInSeconds
ResourceRetainedBillableTimeInSeconds
by the number of instances (InstanceCount
) in your training cluster to get the total compute time SageMaker bills you if you run warm pool training. The formula is as follows: ResourceRetainedBillableTimeInSeconds * InstanceCount
.reused_by_job
STRING
ReusedByJob
status
STRING
Status
InUse
: The warm pool is in use for the training job.Available
: The warm pool is available to reuse for a matching training job.Reused
: The warm pool moved to a matching training job for reuse.Terminated
: The warm pool is no longer available. Warm pools are unavailable if they are terminated by a user, terminated for a patch update, or terminated for exceeding the specified KeepAlivePeriodInSeconds
.