PMLE Serving and scaling models • Set 5
PMLE Serving and scaling models Practice Test 5 — 15 questions with explanations. Free, no signup.
A data engineer is troubleshooting a Vertex AI Endpoint that serves a large BERT model. After deployment, many prediction requests fail with 'Out of Memory' errors. The machine type is n1-standard-8 (30 GB memory) with no accelerator. Which action will most likely resolve the issue?