MLA-C01 Deployment and Orchestration of ML Workflows • Set 5
MLA-C01 Deployment and Orchestration of ML Workflows Practice Test 5 — 15 questions with explanations. Free, no signup.
A data science team has trained a PyTorch model for real-time inference and needs to deploy it on AWS with GPU acceleration while minimizing cold-start latency. Which SageMaker inference option should they choose?