@aws-cdk_aws-sagemaker-alpha.InstanceProductionVariantProps

interface InstanceProductionVariantProps 🔹

Language             Type name
.NET                 Amazon.CDK.AWS.Sagemaker.Alpha.InstanceProductionVariantProps
Go                   github.com/aws/aws-cdk-go/awscdksagemakeralpha/v2#InstanceProductionVariantProps
Java                 software.amazon.awscdk.services.sagemaker.alpha.InstanceProductionVariantProps
Python               aws_cdk.aws_sagemaker_alpha.InstanceProductionVariantProps
TypeScript (source)  @aws-cdk/aws-sagemaker-alpha » InstanceProductionVariantProps

Construction properties for an instance production variant.

Example

// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import * as sagemaker_alpha from '@aws-cdk/aws-sagemaker-alpha';

declare const acceleratorType: sagemaker_alpha.AcceleratorType;
declare const instanceType: sagemaker_alpha.InstanceType;
declare const model: sagemaker_alpha.Model;
const instanceProductionVariantProps: sagemaker_alpha.InstanceProductionVariantProps = {
  model: model,
  variantName: 'variantName',

  // the properties below are optional
  acceleratorType: acceleratorType,
  initialInstanceCount: 123,
  initialVariantWeight: 123,
  instanceType: instanceType,
};

Properties

Name                      Type             Description
model 🔹                  IModel           The model to host.
variantName 🔹            string           Name of the production variant.
acceleratorType? 🔹       AcceleratorType  The size of the Elastic Inference (EI) instance to use for the production variant.
initialInstanceCount? 🔹  number           Number of instances to launch initially.
initialVariantWeight? 🔹  number           Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
instanceType? 🔹          InstanceType     Instance type of the production variant.

model 🔹

Type: IModel

The model to host.


variantName 🔹

Type: string

Name of the production variant.


acceleratorType? 🔹

Type: AcceleratorType (optional, default: none)

The size of the Elastic Inference (EI) instance to use for the production variant.

EI instances provide on-demand GPU computing for inference.


initialInstanceCount? 🔹

Type: number (optional, default: 1)

Number of instances to launch initially.


initialVariantWeight? 🔹

Type: number (optional, default: 1.0)

Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.

The traffic to a production variant is determined by the ratio of the variant weight to the sum of all variant weight values across all production variants.
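As an illustration of the ratio described above, the following is a minimal sketch of how the traffic share of each variant could be computed from its weight. The `VariantWeight` interface and the `trafficShares` function are hypothetical helpers written for this example and are not part of @aws-cdk/aws-sagemaker-alpha; the sketch assumes an omitted weight falls back to the documented default of 1.0.

```typescript
// Hypothetical shape for this example only; it mirrors two fields of
// InstanceProductionVariantProps but is not a library type.
interface VariantWeight {
  variantName: string;
  initialVariantWeight?: number;
}

// Hypothetical helper: share of a variant = its weight / sum of all weights.
function trafficShares(variants: VariantWeight[]): Record<string, number> {
  // Assumption: an omitted weight uses the documented default of 1.0.
  const total = variants.reduce(
    (sum, v) => sum + (v.initialVariantWeight ?? 1.0),
    0,
  );
  if (total <= 0) {
    throw new Error('The sum of variant weights must be positive');
  }

  const shares: Record<string, number> = {};
  for (const v of variants) {
    shares[v.variantName] = (v.initialVariantWeight ?? 1.0) / total;
  }
  return shares;
}

const exampleShares = trafficShares([
  { variantName: 'variantA', initialVariantWeight: 3 },
  { variantName: 'variantB' }, // weight omitted, treated as 1.0
]);
console.log(exampleShares['variantA']); // 0.75
console.log(exampleShares['variantB']); // 0.25
```

With weights of 3 and 1, the sum is 4, so the first variant would receive three quarters of the initial traffic and the second one quarter.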


instanceType? 🔹

Type: InstanceType (optional, default: InstanceType.T2_MEDIUM)

Instance type of the production variant.