SageMaker Inference Recommender Job

SageMaker Inference Recommender Job is an AWS resource that helps optimize machine learning model deployment by automatically testing different instance types and configurations. It evaluates performance, cost, and latency to recommend the best inference setup for a given model. This reduces the manual effort of benchmarking and ensures efficient, cost-effective model serving in production.

aws.sagemaker_inference_recommendations_job

Fields

TitleIDTypeData TypeDescription
_keycorestring
account_idcorestring
completion_timecoretimestampA timestamp that shows when the job completed.
creation_timecoretimestampA timestamp that shows when the job was created.
endpoint_performancescorejsonThe performance results from running an Inference Recommender job on an existing endpoint.
failure_reasoncorestringIf the job fails, provides information why the job failed.
inference_recommendationscorejsonThe recommendations made by Inference Recommender.
input_configcorejsonReturns information about the versioned model package Amazon Resource Name (ARN), the traffic pattern, and endpoint configurations you provided when you initiated the job.
job_arncorestringThe Amazon Resource Name (ARN) of the job.
job_descriptioncorestringThe job description that you provided when you initiated the job.
job_namecorestringThe name of the job. The name must be unique within an Amazon Web Services Region in the Amazon Web Services account.
job_typecorestringThe job type that you provided when you initiated the job.
last_modified_timecoretimestampA timestamp that shows when the job was last modified.
role_arncorestringThe Amazon Resource Name (ARN) of the Amazon Web Services Identity and Access Management (IAM) role you provided when you initiated the job.
statuscorestringThe status of the job.
stopping_conditionscorejsonThe stopping conditions that you provided when you initiated the job.
tagscorehstore