A startup has developed a prototype ML model using scikit-learn on a single machine. They now need to scale it to handle larger datasets and deploy it for real-time predictions. The team is small and wants minimal operational overhead. Which Google Cloud service should they use?
Vertex AI provides managed training, deployment, and autoscaling with minimal operational overhead.
Why this answer
Vertex AI (option B) is the correct choice because it is a unified, fully managed ML platform that covers training, deployment, and scaling with minimal operational overhead. It serves scikit-learn models through prebuilt containers, autoscales the endpoints that handle real-time predictions, and removes the need to manage infrastructure, which makes it a good fit for a small team moving beyond a single-machine prototype.
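To make the deployment path concrete, here is a minimal sketch using the Vertex AI Python SDK (`google-cloud-aiplatform`). It assumes the trained estimator has already been saved as `model.joblib` in a Cloud Storage folder. The `DeployConfig` class, the `autoscaling_kwargs` helper, the default machine type, and the replica counts are illustrative choices rather than anything stated in the question, and the prebuilt container URI should be checked against the current list of Vertex AI prebuilt prediction containers.

```python
from dataclasses import dataclass

# Assumed prebuilt scikit-learn serving image; verify the tag against the
# current Vertex AI prebuilt container list before using it.
SKLEARN_IMAGE = "us-docker.pkg.dev/vertex-ai/prediction/sklearn-cpu.1-0:latest"


@dataclass
class DeployConfig:
    """Hypothetical settings holder for this sketch (not part of the SDK)."""
    project: str
    location: str
    artifact_uri: str  # GCS folder that contains model.joblib
    display_name: str = "sklearn-prototype"
    machine_type: str = "n1-standard-2"
    min_replicas: int = 1
    max_replicas: int = 3


def autoscaling_kwargs(cfg: DeployConfig) -> dict:
    """Validate replica bounds and build the keyword arguments for Model.deploy."""
    if cfg.min_replicas < 1 or cfg.max_replicas < cfg.min_replicas:
        raise ValueError("need 1 <= min_replicas <= max_replicas")
    return {
        "machine_type": cfg.machine_type,
        "min_replica_count": cfg.min_replicas,
        "max_replica_count": cfg.max_replicas,
    }


def deploy(cfg: DeployConfig):
    """Upload the model artifact and deploy it to an autoscaling endpoint."""
    # Imported lazily so the helpers above can be used without the SDK installed.
    from google.cloud import aiplatform

    aiplatform.init(project=cfg.project, location=cfg.location)
    model = aiplatform.Model.upload(
        display_name=cfg.display_name,
        artifact_uri=cfg.artifact_uri,
        serving_container_image_uri=SKLEARN_IMAGE,
    )
    # Vertex AI adds or removes replicas between the two bounds based on load.
    return model.deploy(**autoscaling_kwargs(cfg))
```

With credentials configured, calling `deploy(DeployConfig(project="my-project", location="us-central1", artifact_uri="gs://my-bucket/model/"))` would return an endpoint whose `predict(instances=[...])` method serves online requests. The project, region, and bucket names here are placeholders.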
Exam trap
The exam often tests the misconception that any serverless option (such as Cloud Functions) is suitable for ML inference. The trap here is that Cloud Functions imposes resource and timeout limits and incurs cold starts, which make it a poor fit for real-time model serving, whereas Vertex AI is purpose-built for this workload.
How to eliminate wrong answers
Option A (AI Platform Prediction) is wrong because it is a legacy service that has been superseded by Vertex AI. It could technically serve predictions, but it lacks the unified workflow and newer features of Vertex AI, and choosing it would add deprecation risk and a likely migration later.
Option C (Cloud Functions) is wrong because it is a serverless compute service designed for event-driven, short-lived tasks (a 9-minute maximum timeout on 1st gen, with memory capped per function), not for hosting a persistent ML model that must answer real-time requests with low latency and large payloads.
Option D (Compute Engine with TensorFlow Serving) is wrong because it requires manual setup, scaling, and maintenance of virtual machines, which contradicts the team's goal of minimal operational overhead. TensorFlow Serving is also built for TensorFlow models, so it adds an extra layer of complexity for a scikit-learn model that could be served more simply through Vertex AI's prebuilt containers.