MLA-C01 Deployment and Orchestration of ML Workflows • 20 Questions
20 MLA-C01 Deployment and Orchestration of ML Workflows practice questions with answers and explanations. Free, no signup.
A data science team has trained a PyTorch model using Amazon SageMaker and wants to deploy it with a custom inference container that includes a pre-processing step. The team needs to minimize latency and ensure the pre-processing runs only once per request. Which SageMaker real-time inference option should they use?