PMLE • Practice Test 24
Free PMLE practice test — 15 questions with explanations. Set 24. No signup required.
A team is scaling their prototype inference model to handle high-throughput requests with low latency. They serve it from a custom container on Vertex AI Prediction and notice latency spikes under heavy load. What is the most effective strategy for reducing these spikes?