aws-cdk-lib.aws_sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty

interface AsyncInferenceClientConfigProperty

LanguageType name
.NETAmazon.CDK.AWS.Sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty
Gogithub.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnEndpointConfig_AsyncInferenceClientConfigProperty
Javasoftware.amazon.awscdk.services.sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty
Pythonaws_cdk.aws_sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty
TypeScript aws-cdk-lib » aws_sagemaker » CfnEndpointConfig » AsyncInferenceClientConfigProperty

Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.

Example

// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';
const asyncInferenceClientConfigProperty: sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty = {
  maxConcurrentInvocationsPerInstance: 123,
};

Properties

NameTypeDescription
maxConcurrentInvocationsPerInstance?numberThe maximum number of concurrent requests sent by the SageMaker client to the model container.

maxConcurrentInvocationsPerInstance?

Type: number (optional)

The maximum number of concurrent requests sent by the SageMaker client to the model container.

If no value is provided, SageMaker will choose an optimal value for you.