MLS-C01 • Practice Test 23
Free MLS-C01 practice test — 15 questions with explanations. Set 23. No signup required.
A company is using Amazon SageMaker to deploy a real-time inference endpoint for a computer vision model. The endpoint receives unpredictable bursts of traffic that peak at up to 500 requests per second. Which scaling strategy is MOST cost-effective while maintaining low latency?