aws-cdk-lib.aws_sagemaker.CfnModelPackage.InferenceSpecificationProperty

interface InferenceSpecificationProperty

Language    Type name
.NET        Amazon.CDK.AWS.Sagemaker.CfnModelPackage.InferenceSpecificationProperty
Go          github.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnModelPackage_InferenceSpecificationProperty
Java        software.amazon.awscdk.services.sagemaker.CfnModelPackage.InferenceSpecificationProperty
Python      aws_cdk.aws_sagemaker.CfnModelPackage.InferenceSpecificationProperty
TypeScript  aws-cdk-lib » aws_sagemaker » CfnModelPackage » InferenceSpecificationProperty

Defines how to perform inference generation after a training job is run.

Example

// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';

declare const modelInput: any;
const inferenceSpecificationProperty: sagemaker.CfnModelPackage.InferenceSpecificationProperty = {
  containers: [{
    image: 'image',

    // the properties below are optional
    containerHostname: 'containerHostname',
    environment: {
      environmentKey: 'environment',
    },
    framework: 'framework',
    frameworkVersion: 'frameworkVersion',
    imageDigest: 'imageDigest',
    modelDataUrl: 'modelDataUrl',
    modelInput: modelInput,
    nearestModelName: 'nearestModelName',
  }],
  supportedContentTypes: ['supportedContentTypes'],
  supportedResponseMimeTypes: ['supportedResponseMimeTypes'],

  // the properties below are optional
  supportedRealtimeInferenceInstanceTypes: ['supportedRealtimeInferenceInstanceTypes'],
  supportedTransformInstanceTypes: ['supportedTransformInstanceTypes'],
};

Properties

Name                                      Type                                                                     Description
containers                                IResolvable | (IResolvable | ModelPackageContainerDefinitionProperty)[]  The Amazon ECR registry path of the Docker image that contains the inference code.
supportedContentTypes                     string[]                                                                 The supported MIME types for the input data.
supportedResponseMimeTypes                string[]                                                                 The supported MIME types for the output data.
supportedRealtimeInferenceInstanceTypes?  string[]                                                                 A list of the instance types that are used to generate inferences in real time.
supportedTransformInstanceTypes?          string[]                                                                 A list of the instance types on which a transformation job can be run or on which an endpoint can be deployed.

containers

Type: IResolvable | (IResolvable | ModelPackageContainerDefinitionProperty)[]

The containers that hold the inference code. Each entry's image property gives the Amazon ECR registry path of the Docker image that contains the inference code.
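
The type of containers allows either a single IResolvable standing in for the whole list, or an array whose entries are each a container definition or an IResolvable. The following is a minimal sketch of telling those cases apart. It assumes local stand-in types (Resolvable, ContainerDefinition) and hypothetical helpers (isResolvable, literalContainers) so that it compiles without aws-cdk-lib; none of these names come from the library.

```typescript
// Hypothetical local stand-ins, declared here so the sketch compiles on its own.
// They only mirror the parts of the real shapes that this example needs.
interface Resolvable {
  resolve(context: unknown): unknown;
}

interface ContainerDefinition {
  image: string;          // required, as in the example above
  modelDataUrl?: string;  // optional
}

type Containers = Resolvable | Array<Resolvable | ContainerDefinition>;

// True when the value looks like a deferred value rather than a literal object.
function isResolvable(value: unknown): value is Resolvable {
  return typeof value === 'object' && value !== null
    && typeof (value as { resolve?: unknown }).resolve === 'function';
}

// Returns only the container definitions that are written out literally.
function literalContainers(containers: Containers): ContainerDefinition[] {
  if (!Array.isArray(containers)) {
    return []; // the whole list is deferred, so nothing can be inspected yet
  }
  return containers.filter((c): c is ContainerDefinition => !isResolvable(c));
}

const token: Resolvable = { resolve: () => [] };
const mixed: Containers = [{ image: 'image' }, token];
const images = literalContainers(mixed).map((c) => c.image);
```

Here images holds only the image of the literal entry; the deferred entry is skipped because its value is not known at this point.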


supportedContentTypes

Type: string[]

The supported MIME types for the input data.


supportedResponseMimeTypes

Type: string[]

The supported MIME types for the output data.
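
As an illustration of how the two MIME type lists relate to a request, the sketch below checks a request's content type against supportedContentTypes and a desired response type against supportedResponseMimeTypes. The MimeSupport type, the helper names, and the sample values are assumptions made for this example. The comparison rule (case-insensitive, parameters such as charset ignored) is also an assumption and is not a statement of how SageMaker matches types.

```typescript
// Hypothetical local type mirroring the two required MIME type lists.
interface MimeSupport {
  supportedContentTypes: string[];
  supportedResponseMimeTypes: string[];
}

// Keeps only the media type: drops parameters such as "; charset=utf-8"
// and lowercases the result. This matching rule is an assumption.
function normalizeMime(value: string): string {
  const [mediaType = ''] = value.split(';');
  return mediaType.trim().toLowerCase();
}

// True when the given input content type appears in supportedContentTypes.
function acceptsContentType(spec: MimeSupport, contentType: string): boolean {
  const wanted = normalizeMime(contentType);
  return spec.supportedContentTypes.some((t) => normalizeMime(t) === wanted);
}

// True when the given response type appears in supportedResponseMimeTypes.
function producesMimeType(spec: MimeSupport, mimeType: string): boolean {
  const wanted = normalizeMime(mimeType);
  return spec.supportedResponseMimeTypes.some((t) => normalizeMime(t) === wanted);
}

// Illustrative values only.
const mimeSpec: MimeSupport = {
  supportedContentTypes: ['text/csv'],
  supportedResponseMimeTypes: ['application/json'],
};

const inputOk = acceptsContentType(mimeSpec, 'Text/CSV; charset=utf-8');
const outputOk = producesMimeType(mimeSpec, 'text/csv');
```

With these sample values, inputOk is true and outputOk is false, since text/csv is listed only as an input type.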


supportedRealtimeInferenceInstanceTypes?

Type: string[] (optional)

A list of the instance types that are used to generate inferences in real time.

This parameter is required for unversioned models, and optional for versioned models.


supportedTransformInstanceTypes?

Type: string[] (optional)

A list of the instance types on which a transformation job can be run or on which an endpoint can be deployed.

This parameter is required for unversioned models, and optional for versioned models.
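
The rule that both instance type lists are required for unversioned models and optional for versioned models can be checked before synthesis. The sketch below is one way to do that. The InstanceTypeLists type, the missingInstanceTypeLists helper, and the isVersioned flag are hypothetical names introduced for this example, and treating an empty list as missing is an assumption.

```typescript
// Hypothetical local type mirroring the two optional instance type lists.
interface InstanceTypeLists {
  supportedRealtimeInferenceInstanceTypes?: string[];
  supportedTransformInstanceTypes?: string[];
}

// Returns the names of the lists that are required but absent.
// For versioned models both lists are optional, so nothing is reported.
function missingInstanceTypeLists(spec: InstanceTypeLists, isVersioned: boolean): string[] {
  if (isVersioned) {
    return [];
  }
  const missing: string[] = [];
  // An empty list is treated the same as an absent one (an assumption).
  if (!spec.supportedRealtimeInferenceInstanceTypes?.length) {
    missing.push('supportedRealtimeInferenceInstanceTypes');
  }
  if (!spec.supportedTransformInstanceTypes?.length) {
    missing.push('supportedTransformInstanceTypes');
  }
  return missing;
}

// Illustrative value only: one list is set, the other is left out.
const partialSpec: InstanceTypeLists = {
  supportedTransformInstanceTypes: ['ml.m5.large'],
};

const missingForUnversioned = missingInstanceTypeLists(partialSpec, false);
const missingForVersioned = missingInstanceTypeLists(partialSpec, true);
```

For partialSpec, the unversioned check reports supportedRealtimeInferenceInstanceTypes as missing, while the versioned check reports nothing.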