CCNA Architecting Low-Code ML Solutions Questions

59 questions · Architecting Low-Code ML Solutions · All types, answers revealed

1
MCQmedium

A company needs to extract key fields from scanned invoices, such as invoice number and total amount, with high accuracy. They want a managed service and plan to use human review for low-confidence results. Which combination of services should they use?

A.Vision API and Natural Language API
B.Document AI and Human-in-the-Loop
C.Translation API and AutoML Vision
D.BigQuery ML and Vertex AI Prediction
AnswerB

Why this answer

Document AI provides specialized processors like invoice parser, and Human-in-the-Loop (HITL) can be integrated for low-confidence predictions. Vision API is generic OCR, Natural Language API is for text analysis, and Translation API is for language translation.

2
Multi-Selectmedium

A company wants to analyze videos to detect objects and track their movement over time. Which TWO Google Cloud services are suitable for this task?

Select 2 answers
A.AutoML Vision
B.Speech-to-Text
C.AutoML Video
D.Video Intelligence API
E.Natural Language API
AnswersC, D

Why this answer

AutoML Video supports object tracking, and Video Intelligence API provides both object detection and tracking. AutoML Vision is for images only, Natural Language for text, and Speech-to-Text for audio.

3
MCQhard

A company uses BigQuery ML with a remote model calling Vertex AI's pre-trained image classification model. They need to classify images stored in Cloud Storage buckets. What is the correct approach?

A.Create a remote model with model_type='VERTEX_AI' and use ML.PREDICT with image URIs.
B.Train a custom model in BigQuery ML with image data.
C.Export images to BigQuery as base64 and then use ML.PREDICT.
D.Use ML.PREDICT with IMAGE data type directly.
AnswerA

Why this answer

Option A is correct because BigQuery ML remote models allow you to invoke Vertex AI pre-trained models via the `model_type='VERTEX_AI'` setting. You can then use `ML.PREDICT` directly on Cloud Storage image URIs without needing to export or transform the image data, as BigQuery ML handles the URI resolution and passes the image to Vertex AI for classification.

Exam trap

Google often tests the misconception that BigQuery ML can handle unstructured data like images natively, leading candidates to choose options that involve base64 encoding or direct IMAGE data types, when the correct approach is to use remote models with URI references.

How to eliminate wrong answers

Option B is wrong because BigQuery ML does not support training custom image classification models natively; it is designed for tabular and structured data, not raw image pixel data. Option C is wrong because exporting images as base64 is unnecessary and inefficient; BigQuery ML remote models accept Cloud Storage URIs directly, and base64 encoding adds overhead without benefit. Option D is wrong because BigQuery ML does not have an IMAGE data type; image data is referenced via URIs, not stored as a native column type.

4
MCQmedium

A data engineer wants to use BigQuery ML to train a model that predicts customer churn using a table with customer features and a label column. They want to use a deep neural network. Which model type should they specify?

A.BOOSTED_TREE_CLASSIFIER
B.LOGISTIC_REG
C.DNN_CLASSIFIER
D.DNN_REGRESSOR
AnswerC

Why this answer

Option C is correct because the DNN_CLASSIFIER model type in BigQuery ML is specifically designed for classification tasks using a deep neural network architecture. Since the problem is predicting customer churn (a binary classification problem) and the data engineer explicitly wants to use a deep neural network, DNN_CLASSIFIER is the appropriate choice.

Exam trap

The trap is that candidates may confuse DNN_REGRESSOR with DNN_CLASSIFIER, but BigQuery ML uses different model types for regression vs. classification. DNN_CLASSIFIER is used for classification tasks like churn prediction, while DNN_REGRESSOR is for continuous values.

How to eliminate wrong answers

Option A is wrong because BOOSTED_TREE_CLASSIFIER uses gradient-boosted decision trees, not a deep neural network, so it does not meet the requirement for a DNN model. Option B is wrong because LOGISTIC_REG is a logistic regression model, which is a linear classifier and not a deep neural network. Option D is wrong because DNN_REGRESSOR is used for regression tasks (predicting continuous values), not for classification tasks like churn prediction.

5
MCQeasy

A data analyst wants to build a binary classification model to predict customer churn using SQL queries in BigQuery. Which BigQuery ML model type should they use?

A.MATRIX_FACTORIZATION
B.LINEAR_REG
C.LOGISTIC_REG
D.K_MEANS
AnswerC

Why this answer

BigQuery ML supports LOGISTIC_REG for binary classification via SQL. LINEAR_REG is for regression, K_MEANS for clustering, and MATRIX_FACTORIZATION for recommendations.

6
MCQmedium

A company wants to analyze customer reviews for sentiment (positive, negative, neutral) using a pre-trained model with no training. They have text data stored in BigQuery. Which Google Cloud service should they use?

A.Translation API
B.Speech-to-Text API
C.Natural Language API
D.AutoML NLP
AnswerC

Why this answer

Natural Language API provides pre-trained sentiment analysis. AutoML NLP is for custom models. Speech-to-Text is for audio transcription.

Translation API is for language translation.

7
MCQmedium

A retail company wants to build a recommendation system for their e-commerce website. They have user purchase history and product metadata. Which Google Cloud service is most suitable for building a 'frequently bought together' recommendation model with minimal custom ML development?

A.Vertex AI Prediction with a custom TensorFlow model
B.BigQuery ML with MATRIX_FACTORIZATION model type
C.Recommendations AI with the 'frequently bought together' model type
D.Vertex AI AutoML Tables
AnswerC

Why this answer

Recommendations AI provides pre-built models for retail recommendations, including 'frequently bought together'. AutoML Tables could be used but requires more customisation. Vertex AI Prediction and BigQuery ML are not purpose-built for this use case and would require more development.

8
Multi-Selecteasy

A company wants to transcribe audio from customer service calls and then analyze the sentiment of the transcribed text. Which TWO Google Cloud services should they use?

Select 2 answers
A.Natural Language API
B.Document AI
C.Speech-to-Text
D.Translation API
E.Vision API
AnswersA, C

Why this answer

Speech-to-Text transcribes audio to text, and Natural Language API can analyze sentiment from text. Vision API is for images, Translation for language translation, and Document AI for document processing.

9
Multi-Selectmedium

A data scientist wants to use AutoML Tables to build a binary classification model for loan default prediction. The dataset has 200 features and 1 million rows, with highly imbalanced classes. Which TWO options should they consider? (Choose 2)

Select 2 answers
A.Enable automatic feature engineering
B.Set the target column to 'default'
C.Disable hyperparameter tuning to save cost
D.Use Manual data split
E.Use the 'maximize AUC' optimisation objective
AnswersB, E

Why this answer

For binary classification, the target column must be set. For imbalanced classes, 'maximize AUC' is recommended. AutoML Tables automatically handles feature engineering and hyperparameter tuning.

Manual split is optional.

10
Multi-Selecthard

A retail company wants to implement a recommendation system using Recommendations AI. They need to generate personalized recommendations for users based on their browsing history and purchase behavior. Which THREE recommendation types are available in Recommendations AI?

Select 3 answers
A.trending-now
B.recommended-for-you
C.others-you-may-like
D.frequently-bought-together
E.most-popular
AnswersB, C, D

Why this answer

Recommendations AI offers five recommendation types: recommended-for-you (personalized), others-you-may-like (item similarity), frequently-bought-together (complementary), recently-viewed (session-based), and also-viewed. 'Most popular' is not a built-in type.

11
MCQhard

A company has an existing TensorFlow model for fraud detection that they want to use for predictions in BigQuery. They want to call the model from SQL queries without moving data out of BigQuery. How should they deploy the model?

A.Import the TensorFlow model directly into BigQuery ML
B.Deploy the model to Vertex AI Prediction and use a remote model in BigQuery ML
C.Export BigQuery data to Cloud Storage and use AI Platform Prediction
D.Use AutoML Tables to retrain the model in BigQuery ML
AnswerA

Why this answer

BigQuery ML (BQML) natively supports importing TensorFlow models directly, allowing you to use them for predictions via SQL without moving data out of BigQuery. This is the simplest and most efficient approach because it eliminates the need for external services or data export, leveraging BQML's built-in `CREATE MODEL` statement with the `OPTIONS(model_type = 'TENSORFLOW')` clause.

Exam trap

The trap here is that candidates often overcomplicate the solution by choosing Vertex AI or AI Platform, not realizing that BigQuery ML has native TensorFlow support, which is the most direct and low-code way to meet the requirement of keeping data in BigQuery.

How to eliminate wrong answers

Option B is wrong because deploying to Vertex AI Prediction and using a remote model adds unnecessary complexity and latency; while it works, it requires setting up a remote model and a connection, which is not the simplest or most direct method when BQML directly supports TensorFlow imports. Option C is wrong because exporting BigQuery data to Cloud Storage and using AI Platform Prediction moves data out of BigQuery, violating the requirement to keep data in BigQuery and adding extra steps and cost. Option D is wrong because AutoML Tables retrains the model from scratch, which does not reuse the existing TensorFlow model and may produce different results, whereas the requirement is to use the existing model as-is.

12
Multi-Selecthard

A company wants to use Document AI to process a large volume of invoices. They need to extract line items and also have a human review the extracted data for accuracy. Which THREE features should they use? (Choose 3)

Select 3 answers
A.Online Processing
B.Custom Extractor
C.Invoice Parser Processor
D.Human-in-the-Loop (HITL)
E.Batch Processing
AnswersC, D, E

Why this answer

The Invoice Parser Processor (C) is a specialized Document AI processor designed to extract structured data from invoices, including line items, which directly meets the requirement for extracting line items from a large volume of invoices. It is pre-trained on invoice layouts and can handle complex table structures, making it the correct choice for this task.

Exam trap

Google often tests the distinction between pre-built processors and custom extractors, where candidates mistakenly choose Custom Extractor (B) thinking they need a tailored solution, even though a pre-built Invoice Parser Processor (C) is available and more appropriate for standard invoice extraction.

13
Multi-Selecthard

A data engineer is using BigQuery ML with a BOOSTED_TREE_CLASSIFIER model. After training, they want to evaluate the model and understand which features contribute most to predictions. Which THREE BigQuery ML functions should they use?

Select 3 answers
A.ML.EVALUATE
B.ML.GLOBAL_EXPLAIN
C.ML.TRAIN
D.ML.FEATURE_IMPORTANCE
E.ML.PREDICT
AnswersA, D, E

Why this answer

ML.EVALUATE evaluates model performance, ML.FEATURE_IMPORTANCE returns feature importance scores, and ML.PREDICT makes predictions. ML.TRAIN is for training, ML.GLOBAL_EXPLAIN is not a standard function (ML.EXPLAIN_PREDICT exists but is not listed as an option).

14
MCQmedium

An organisation wants to use Document AI to process contracts but requires human review for high-risk clauses. Which feature should they enable?

A.Human-in-the-Loop (HITL)
B.Batch Processing
C.Online Prediction
D.AutoML Training
AnswerA

Why this answer

Human-in-the-Loop (HITL) is the correct feature because it allows Document AI to automatically process contracts while routing high-risk clauses to human reviewers for validation. This balances automation efficiency with the need for expert oversight on sensitive content, which is a core requirement for compliance-driven document processing.

Exam trap

The trap here is that candidates confuse HITL with AutoML Training, thinking that training a model with human-labeled data is the same as having a human review live predictions, but HITL is a runtime workflow, not a training process.

How to eliminate wrong answers

Option B (Batch Processing) is wrong because it handles large volumes of documents asynchronously but does not include any mechanism for human review or intervention on specific clauses. Option C (Online Prediction) is wrong because it provides real-time predictions on individual documents but lacks the built-in workflow to pause and escalate high-risk clauses to a human. Option D (AutoML Training) is wrong because it is used to train custom models on labeled data, not to manage human review workflows during inference.

15
Multi-Selectmedium

A company needs to classify images of products into categories (e.g., electronics, clothing, food). They have labeled images and want to use a low-code solution on Google Cloud. Which TWO services are suitable for this task?

Select 1 answer
A.AutoML Tables
B.AutoML Vision
C.Cloud Vision API using product search
D.Vertex AI Workbench with a custom TensorFlow model
E.Document AI custom extractor
AnswersB

Why this answer

AutoML Vision is a low-code service that allows you to train custom image classification models using labeled images without writing code. It directly matches the requirement to classify product images into categories like electronics, clothing, and food, making it one of the two suitable services.

Exam trap

Google often tests the distinction between AutoML Vision (custom image classification) and Cloud Vision API (pre-trained models for general tasks like label detection or product search), causing candidates to mistakenly choose the API for custom classification needs.

16
MCQhard

A team trained a TensorFlow model locally and wants to deploy it to BigQuery ML for predictions without retraining. They have exported the SavedModel to Cloud Storage. Which statement is correct?

A.They need to convert the model to a BigQuery ML native format first.
B.They can create a model using CREATE MODEL with model_type='tensorflow' and the path to the SavedModel.
C.They must first retrain the model using ML.TRAIN on BigQuery.
D.They can use ML.PREDICT directly on the SavedModel in Cloud Storage.
AnswerB

Why this answer

BigQuery ML supports importing TensorFlow SavedModels with model_type='tensorflow'. ML.TRAIN is for training, not importing. ML.PREDICT requires a model in BigQuery, not directly on Cloud Storage.

No conversion is needed.

17
Multi-Selectmedium

A retail company uses Recommendations AI to power personalized product recommendations on their website. They notice that the 'frequently-bought-together' model is not capturing complementary items that are often purchased in the same session but not necessarily in the same transaction. Which TWO actions should they take to improve the model?

Select 2 answers
A.Decrease the event retention period to focus on recent purchases
B.Enable the 'others-you-may-like' recommendation type in addition to 'frequently-bought-together'
C.Use AutoML Tables to build a custom recommendation model
D.Set the recommendation type to only 'frequently-bought-together'
E.Ingest session-level event data (e.g., product views in the same session) into Recommendations AI
AnswersB, E

'Others-you-may-like' uses co-viewed and co-purchased signals, capturing session-level patterns.

Why this answer

To capture cross-session patterns, the company should ensure that user events (including session-level co-occurrence) are properly tracked and ingested. Enabling session-level events and using the 'others-you-may-like' model (which uses co-viewed behavior) can help. Setting the recommendation type to 'frequently-bought-together' does not address session-level data.

Reducing event retention would harm model quality.

18
MCQmedium

A data analyst wants to train a binary classification model in BigQuery ML on a dataset of 10 million rows with 50 features. They need to evaluate the model's performance on a held-out test set. Which sequence of SQL statements should they run?

A.CREATE MODEL then ML.FEATURE_IMPORTANCE
B.ML.TRAIN then ML.EVALUATE
C.CREATE MODEL then ML.PREDICT
D.CREATE MODEL then ML.EVALUATE
AnswerD

CREATE MODEL trains the model, and ML.EVALUATE returns evaluation metrics on the test set.

Why this answer

First, create the model using CREATE MODEL. Then, evaluate it using ML.EVALUATE, which uses the test split defined in the model options. ML.PREDICT is for predictions, not evaluation.

ML.TRAIN is not a valid function; model training is done via CREATE MODEL.

19
MCQhard

A healthcare organization wants to build a model to predict patient readmission risk using structured electronic health record (EHR) data. They need to train a model using SQL in BigQuery, but they also want to leverage AutoML's ability to automatically search for the best architecture. Which approach should they take?

A.Use a pre-built Vision API model via BigQuery ML remote model
B.Use BigQuery ML with the AUTO model type
C.Use AutoML Tables with Vertex AI and export predictions
D.Use BigQuery ML with a DNN_CLASSIFIER and manual hyperparameter tuning
AnswerB

Why this answer

BigQuery ML's AUTO model type automatically performs architecture search and hyperparameter tuning, making it ideal for users who want to leverage AutoML capabilities directly within SQL on structured EHR data. This approach avoids manual model selection while staying entirely within BigQuery's SQL interface, which is the stated requirement.

Exam trap

The trap here is that candidates confuse AutoML Tables (a separate Vertex AI service) with BigQuery ML's built-in AUTO model type, assuming they must export data to use AutoML, when in fact BigQuery ML provides AutoML capabilities directly within SQL.

How to eliminate wrong answers

Option A is wrong because Vision API is designed for image analysis, not structured EHR data, and BigQuery ML remote models require a pre-built API endpoint, not AutoML architecture search. Option C is wrong because AutoML Tables (now Vertex AI Tabular) is a separate service that requires exporting data out of BigQuery and does not allow training via SQL in BigQuery. Option D is wrong because DNN_CLASSIFIER with manual hyperparameter tuning contradicts the requirement to 'automatically search for the best architecture' — it requires explicit user-specified parameters and does not perform automated architecture search.

20
MCQhard

A media company wants to automatically moderate user-uploaded videos by detecting explicit content (e.g., violence, adult material). They need a solution that integrates with their video processing pipeline and scales to millions of videos. Which approach should they take?

A.Use Video Intelligence API with explicit content detection
B.Use AutoML Video to train a custom explicit content detection model
C.Use Natural Language API on video transcripts
D.Use Vision API to analyze each video frame
AnswerA

Video Intelligence API has built-in explicit content detection, suitable for this use case.

Why this answer

Video Intelligence API provides explicit content detection as a pre-built feature. It can analyze video content and flag inappropriate material. AutoML Video would require custom training, which is unnecessary.

Vision API is for images. Natural Language API is for text.

21
MCQhard

A company has a TensorFlow model trained outside of Google Cloud and wants to use it for online predictions on Vertex AI. They have saved the model in SavedModel format. What is the most efficient way to deploy this model?

A.Import the model into BigQuery ML using CREATE MODEL with model_type='TENSORFLOW'
B.Use Vertex AI AutoML Tables to retrain the model
C.Use Cloud Functions to run the model for each prediction request
D.Upload the saved model to Vertex AI and create an endpoint for online predictions
AnswerD

Why this answer

Vertex AI supports importing SavedModel directly without retraining. BigQuery ML can import TensorFlow models but is for batch predictions. Vertex AI Prediction is the standard for online predictions.

AutoML Tables is for training new models.

22
MCQmedium

A logistics company wants to classify shipping documents into categories (invoice, packing slip, bill of lading) using a custom model with minimal code. They have labeled training images. Which Google Cloud service is most appropriate?

A.Vertex AI AutoML Tables
B.AutoML Vision for image classification
C.Document AI custom extractor
D.Cloud Vision API with label detection
AnswerB

Why this answer

Document AI provides custom extractors but not classification. AutoML Vision can train a custom image classification model. Vertex AI AutoML Tables is for tabular data, not images.

Cloud Vision API is for pre-built image analysis, not custom classification.

23
MCQhard

A financial institution needs to extract structured data from scanned PDFs of loan applications, including text fields and tables. They require a human review step for high-risk applications. Which Google Cloud service and configuration should they use?

A.Document AI with a form parser processor and enable Human-in-the-Loop for high-risk applications
B.Document AI with a custom extractor processor and use Cloud Functions for human review
C.Cloud Vision API to detect text and tables, then send to Cloud Dataflow for processing
D.Vertex AI AutoML Vision to train a custom model for document parsing
AnswerA

Why this answer

Document AI provides specialised processors for document parsing (including form parser). Human-in-the-Loop (HITL) is a feature of Document AI that allows human review for high-risk documents. Cloud Vision API is for image analysis, not document parsing.

Vertex AI AutoML Vision is for image classification/object detection.

24
Multi-Selecthard

A company is building a document processing pipeline using Document AI to extract data from invoices. They want to ensure high accuracy and handle edge cases where the model may be uncertain. Which THREE steps should they include in their pipeline?

Select 3 answers
A.Regularly retrain the processor using human-verified data
B.Use the pre-built invoice parser without any modifications
C.Use AutoML Vision to classify invoice types
D.Enable Human-in-the-Loop (HITL) to review documents with low confidence scores
E.Use a custom processor trained on their specific invoice format
AnswersA, D, E

Continuous improvement with human-labeled data helps the model learn from edge cases.

Why this answer

To improve accuracy, use human review for low-confidence documents, create a custom processor (or use a pre-built one) tailored to their invoice format, and continuously improve the model with human-labeled data. Using the form parser is a good start but not enough; they need to enable HITL and potentially train a custom processor.

25
MCQmedium

A retailer wants to implement a recommendation engine that suggests products based on a user's current cart. They have limited ML expertise and want a quick deployment. Which Recommendations AI model type should they use?

A.Others you may like
B.Recently viewed
C.Recommended for you
D.Frequently bought together
AnswerD

Why this answer

The 'Frequently bought together' model type is the correct choice because it directly leverages collaborative filtering based on co-purchase patterns in historical transaction data, enabling the retailer to recommend items commonly purchased alongside the current cart contents. This model requires minimal ML expertise and can be quickly deployed using pre-built Recommendations AI templates, as it does not require user-level personalization or real-time session data.

Exam trap

In Google PMLE exams, candidates often mistakenly select 'Recommended for you' thinking it is the default or most versatile model, but the trap here is that they overlook the specific requirement of 'based on a user's current cart' and the need for quick deployment with limited ML expertise, which points to the simpler, cart-focused 'Frequently bought together' model instead.

How to eliminate wrong answers

Option A is wrong because 'Others you may like' is a generic model that relies on item similarity or user-based collaborative filtering, which typically requires more extensive user interaction history and ML tuning, not just the current cart context. Option B is wrong because 'Recently viewed' is a session-based model that recommends items based on the user's browsing history, not the contents of the current cart, and it does not capture co-purchase relationships. Option C is wrong because 'Recommended for you' is a personalized model that requires user-level behavioral data and often uses deep learning techniques, making it more complex to deploy and less suitable for a quick, low-ML-expertise solution focused on cart-based suggestions.

26
MCQeasy

A data scientist wants to evaluate the performance of a BigQuery ML classification model on a test dataset. Which function should they use?

A.ML.PREDICT
B.ML.FEATURE_IMPORTANCE
C.ML.EVALUATE
D.ML.TRAIN
AnswerC

Why this answer

ML.EVALUATE is the correct function because it computes classification metrics (e.g., precision, recall, accuracy, F1 score, ROC AUC) directly on a trained BigQuery ML model using a provided test dataset or evaluation input. This is the dedicated function for assessing model performance after training, aligning with the task of evaluating a classification model on held-out test data.

Exam trap

Google often tests the distinction between prediction (ML.PREDICT) and evaluation (ML.EVALUATE), trapping candidates who confuse generating outputs with measuring performance, especially when the question mentions 'evaluate performance' but the candidate fixates on 'predict' as the primary ML function.

How to eliminate wrong answers

Option A is wrong because ML.PREDICT is used to generate predictions (class labels or probabilities) on new data, not to compute evaluation metrics like accuracy or precision. Option B is wrong because ML.FEATURE_IMPORTANCE is used to retrieve feature weights or importance scores from a trained model (e.g., for interpretability), not to evaluate overall model performance on a test set. Option D is wrong because ML.TRAIN is used to initiate the training process of a BigQuery ML model, not to evaluate an already trained model on test data.

27
MCQmedium

A company wants to transcribe customer service calls in real-time. The audio is telephony quality (8 kHz). Which Speech-to-Text model should they specify?

A.latest_short
B.latest_long
C.telephony
D.default
AnswerC

Why this answer

The 'telephony' model is optimised for audio from phone calls (8 kHz). 'latest_long' is for long-form non-telephony audio, 'latest_short' for short utterances, and there is no 'default' model.

28
MCQmedium

A company needs to analyze customer feedback from app reviews to identify common themes and sentiment. They have millions of reviews in multiple languages. Which combination of pre-built APIs should they use?

A.Speech-to-Text then Natural Language API
B.Natural Language API only
C.Translation API then Natural Language API
D.Vision API then Natural Language API
AnswerC

Translate reviews to a common language, then analyze sentiment and entities.

Why this answer

Natural Language API can perform sentiment analysis and entity extraction. Translation API can translate non-English reviews to English before analysis. Speech-to-Text is for audio.

Vision API is for images.

29
MCQeasy

A company needs to detect objects in real-time from a live video feed. They want to use a pre-trained model with minimal setup. Which Google Cloud service should they use?

A.Vertex AI AutoML Video
B.Video Intelligence API
C.Document AI
D.Cloud Vision API
AnswerB

Why this answer

The Video Intelligence API is the correct choice because it provides pre-trained models for real-time video analysis, including object detection, with minimal setup. It is designed to extract metadata from video streams without requiring custom training, making it ideal for real-time live video feeds. Vertex AI AutoML Video, in contrast, requires custom training and is not pre-trained for immediate use.

Exam trap

The trap here is that candidates confuse Cloud Vision API (static images) with video analysis, or assume Vertex AI AutoML Video is pre-trained, when in fact it requires custom model training and is not a 'minimal setup' solution.

How to eliminate wrong answers

Option A is wrong because Vertex AI AutoML Video requires custom training on labeled data, not a pre-trained model, and involves more setup for real-time detection. Option C is wrong because Document AI is specialized for document processing (e.g., OCR, form parsing), not video object detection. Option D is wrong because Cloud Vision API is designed for static image analysis, not real-time video streams, and lacks native video frame processing capabilities.

30
MCQeasy

A company wants to classify customer support emails into categories like 'billing', 'technical', or 'account'. They have labeled email text data. Which AutoML solution should they use?

A.AutoML Tables
B.AutoML Natural Language
C.AutoML Video
D.AutoML Vision
AnswerB

Why this answer

AutoML Natural Language is designed for text classification. AutoML Vision is for images, AutoML Video for video, and AutoML Tables for tabular data.

31
MCQmedium

A data scientist needs to forecast daily sales for the next 30 days using historical sales data stored in BigQuery. They want to use BigQuery ML. Which model type should they choose?

A.LINEAR_REG
B.BOOSTED_TREE_REGRESSOR
C.K_MEANS
D.ARIMA_PLUS
AnswerD

Why this answer

ARIMA_PLUS is the correct choice because it is specifically designed for time-series forecasting, such as predicting daily sales over a future horizon. BigQuery ML's ARIMA_PLUS model automatically handles seasonality, trend, and holiday effects, making it ideal for 30-day sales forecasts from historical data.

Exam trap

The trap here is that candidates often confuse regression models (like LINEAR_REG or BOOSTED_TREE_REGRESSOR) with time-series forecasting, not realizing that standard regression assumes independent observations and cannot inherently model temporal dependencies or extrapolate beyond the training period.

How to eliminate wrong answers

Option A is wrong because LINEAR_REG is a linear regression model for predicting a continuous target from input features, but it does not inherently model time-series dependencies like autocorrelation or seasonality, making it unsuitable for forecasting sequential daily sales. Option B is wrong because BOOSTED_TREE_REGRESSOR is an ensemble tree-based model for regression tasks, but it treats each row independently and cannot capture temporal patterns or extrapolate into the future without explicit feature engineering of time lags. Option C is wrong because K_MEANS is an unsupervised clustering algorithm used to partition data into groups, not for forecasting numerical values over time.

32
MCQmedium

A data engineer wants to use BigQuery ML to train a model for predicting customer churn (binary classification) using a large dataset. They want the model to be automatically tuned. Which model type should they choose?

A.LOGISTIC_REG
B.BOOSTED_TREE_CLASSIFIER
C.DNN_CLASSIFIER
D.AUTOML_CLASSIFIER
AnswerD

Why this answer

Option D (AUTOML_CLASSIFIER) is correct because it automatically performs architecture search and hyperparameter tuning to find the best model for binary classification tasks, such as customer churn prediction. This is ideal when the data engineer wants the model to be automatically tuned without manual intervention, as AutoML handles feature engineering, model selection, and tuning under the hood.

Exam trap

The trap here is that candidates often confuse 'automatically tuned' with models that have default hyperparameters (like LOGISTIC_REG or BOOSTED_TREE_CLASSIFIER), but only AUTOML_CLASSIFIER performs automated hyperparameter tuning and architecture search without requiring manual specification.

How to eliminate wrong answers

Option A (LOGISTIC_REG) is wrong because logistic regression does not support automatic tuning; it requires manual specification of hyperparameters like learning rate or regularization, and it is a simpler linear model that may not capture complex patterns in large datasets. Option B (BOOSTED_TREE_CLASSIFIER) is wrong because while it can be tuned, it does not offer fully automatic tuning; the user must manually set parameters such as tree depth, learning rate, and number of iterations. Option C (DNN_CLASSIFIER) is wrong because deep neural network classifiers require manual tuning of architecture (e.g., number of layers, neurons) and hyperparameters (e.g., learning rate, batch size), and they do not automatically search for the optimal configuration.

33
MCQeasy

A data analyst wants to train a binary classification model on a BigQuery table without moving data out of BigQuery. They have limited ML expertise. Which approach should they take?

A.Use BigQuery ML with CREATE MODEL and LOGISTIC_REG model type.
B.Use Cloud Datalab to train an XGBoost model on BigQuery data.
C.Train a model using Vertex AI Workbench with a custom container.
D.Export the data to Cloud Storage and use Vertex AI AutoML Tables.
AnswerA

Why this answer

Option A is correct because BigQuery ML allows users to create and train binary classification models directly on data in BigQuery using SQL, with no need to move data or have deep ML expertise. The LOGISTIC_REG model type implements logistic regression, a standard algorithm for binary classification, and the CREATE MODEL statement handles all the underlying training infrastructure, making it ideal for a data analyst with limited ML skills.

Exam trap

Google often tests the distinction between low-code/no-code solutions (like BigQuery ML) and more advanced, infrastructure-heavy approaches (like custom containers or AutoML with data export), expecting candidates to recognize that the simplest, most integrated option is correct when the user has limited ML expertise and wants to avoid data movement.

How to eliminate wrong answers

Option B is wrong because Cloud Datalab is a deprecated interactive notebook service that requires users to write custom code and manage infrastructure, which is not suitable for someone with limited ML expertise and does not leverage BigQuery's native ML capabilities. Option C is wrong because Vertex AI Workbench with a custom container demands advanced knowledge of containerization, model training pipelines, and infrastructure management, far beyond the scope of a low-code solution for a data analyst. Option D is wrong because exporting data to Cloud Storage and using Vertex AI AutoML Tables, while low-code, introduces unnecessary data movement and additional complexity compared to the simpler, fully integrated BigQuery ML approach that keeps data in place.

34
MCQhard

A company wants to use BigQuery ML to train a DNN_CLASSIFIER model on a dataset with 100 million rows. They are concerned about training time and cost. Which approach can help optimize training performance while staying within BigQuery ML?

A.Use OPTIONS('MAX_ITERATIONS' = 10) to limit training iterations
B.Use Vertex AI AutoML Tables instead of BigQuery ML
C.Train on a 10% random sample of the data to reduce cost
D.BigQuery ML automatically optimizes training; no additional configuration needed
AnswerD

BigQuery ML handles optimization internally, adjusting training parameters for efficiency.

Why this answer

BigQuery ML automatically selects the optimal training configuration. However, using OPTIONS like 'MAX_ITERATIONS' with a smaller number can reduce training time, but setting it too low may harm accuracy. Using a filtered subset for training is not standard.

Using Vertex AI AutoML Tables would require data export. Early stopping is not directly configurable in BigQuery ML DNN.

35
MCQhard

A financial services company uses Document AI to process loan applications. They want to ensure that any documents the model cannot process with high confidence are reviewed by a human before finalizing the decision. Which Document AI feature should they enable?

A.AutoML Tables model retraining
B.Cloud DLP for data inspection
C.Increase the number of processors
D.Human-in-the-Loop (HITL)
AnswerD

HITL enables human review for low-confidence documents, ensuring accuracy.

Why this answer

Human-in-the-Loop (HITL) allows documents with low confidence scores to be routed to human reviewers. This is a built-in feature of Document AI. AutoML Tables is not directly related.

Cloud DLP is for data loss prevention.

36
MCQmedium

A company needs to forecast product demand for the next 12 months using historical sales data. They want to use BigQuery ML with minimal coding. Which model type is most suitable?

A.K_MEANS
B.MATRIX_FACTORIZATION
C.ARIMA_PLUS
D.LINEAR_REG
AnswerC

Why this answer

ARIMA_PLUS is designed for time-series forecasting and automatically handles seasonality, trends, and holidays. K_MEANS is for clustering, MATRIX_FACTORIZATION for recommendations, and LINEAR_REG for simple regression.

37
MCQeasy

A developer wants to add text translation to a mobile app. They need to translate user-generated content into multiple languages, and latency is critical. Which pre-built API should they use?

A.Translation API
B.Vision API
C.Text-to-Speech API
D.Natural Language API
AnswerA

Translation API is designed for text translation across languages.

Why this answer

Translation API provides fast, real-time translation for text. Natural Language API is for analysis. Text-to-Speech is for audio.

Vision API is for images.

38
Multi-Selectmedium

A company wants to build a model to predict housing prices using BigQuery ML. They have a dataset with features like area, number of bedrooms, and location. Which TWO model types are appropriate for this regression task?

Select 2 answers
A.LOGISTIC_REG
B.K_MEANS
C.MATRIX_FACTORIZATION
D.BOOSTED_TREE_REGRESSOR
E.LINEAR_REG
AnswersD, E

Why this answer

BOOSTED_TREE_REGRESSOR (D) is appropriate because it is a tree-based ensemble method specifically designed for regression tasks, and BigQuery ML supports it via the `CREATE MODEL` statement with `model_type='BOOSTED_TREE_REGRESSOR'`. It handles non-linear relationships and interactions between features like area, bedrooms, and location, making it suitable for predicting continuous housing prices.

Exam trap

The PMLE exam often tests the distinction between regression and classification models, leading candidates to mistakenly choose LOGISTIC_REG for regression tasks because of the word 'regression' in its name, but it is actually a classification algorithm.

39
Multi-Selectmedium

A company needs to detect objects in live video streams from security cameras. They require low-latency predictions and want to minimise operational overhead. Which TWO services should they use? (Choose 2)

Select 2 answers
A.Live Stream API
B.Vertex AI Predictions
C.Cloud Run
D.AutoML Video
E.Video Intelligence API
AnswersC, E

Why this answer

Video Intelligence API can analyse live streams with object detection. Cloud Run provides serverless compute to host the streaming pipeline. AutoML Video requires custom training, Vertex AI Predictions is for deployed models, Live Stream API is for ingestion.

40
Multi-Selectmedium

A data scientist wants to use BigQuery ML for time-series forecasting. They need to evaluate model accuracy and compare different models. Which THREE BigQuery ML functions should they use?

Select 3 answers
A.ML.EXPLAIN_PREDICT
B.ML.PREDICT
C.ML.FEATURE_IMPORTANCE
D.ML.EVALUATE
E.ML.TRAIN
AnswersB, D, E

Why this answer

ML.EVALUATE is used to evaluate model performance. ML.PREDICT generates forecasts. ML.TRAIN is used to retrain models.

ML.FEATURE_IMPORTANCE is for tree models. ML.EXPLAIN_PREDICT provides local explanations.

41
MCQmedium

A company has a large dataset of labeled images (e.g., different species of plants). They want to train a custom image classification model with minimal effort and no prior ML experience. Which Google Cloud service should they use?

A.Cloud TPU
B.AutoML Vision
C.Vertex AI Workbench with a custom TensorFlow model
D.Vision API
AnswerB

AutoML Vision allows training custom image classification models with a simple UI, no coding required.

Why this answer

AutoML Vision is the correct choice because it allows users with no prior ML experience to train a custom image classification model using a simple graphical interface, requiring only labeled images as input. It automates model architecture search, hyperparameter tuning, and deployment, minimizing manual effort while delivering a production-ready model.

Exam trap

The trap here is that candidates confuse AutoML Vision (custom model training with minimal effort) with Vision API (pre-trained, no custom training), often picking D because both involve 'Vision' and seem low-code, but Vision API cannot be retrained on custom data.

How to eliminate wrong answers

Option A is wrong because Cloud TPU is a hardware accelerator for training custom models, requiring users to write and manage their own ML code (e.g., TensorFlow/PyTorch), which demands significant ML expertise and effort. Option C is wrong because Vertex AI Workbench with a custom TensorFlow model requires users to write, debug, and train a model from scratch using notebooks, which is not minimal effort and assumes ML experience. Option D is wrong because Vision API is a pre-trained API for general image recognition tasks (e.g., label detection, OCR) and cannot be trained on custom labeled datasets like plant species; it offers no customization for specific classification needs.

42
MCQeasy

A company wants to transcribe customer service calls in real-time to detect sentiment and identify urgent issues. They need a solution with low latency. Which combination of pre-built APIs should they use?

A.Text-to-Speech and Natural Language API
B.Speech-to-Text and Translation API
C.Video Intelligence API
D.Speech-to-Text and Natural Language API
AnswerD

Speech-to-Text transcribes audio, then Natural Language API performs sentiment analysis on the text.

Why this answer

Speech-to-Text can transcribe audio in real-time, and Natural Language API can analyze sentiment from the transcribed text. Text-to-Speech is for generating speech. Video Intelligence is for video content.

43
MCQmedium

A company wants to build a product recommendation engine for their e-commerce website. They have historical purchase data and user interaction logs. They want a managed service that can quickly generate personalized recommendations without building custom models. Which service should they use?

A.Dataflow with TensorFlow
B.BigQuery ML with MATRIX_FACTORIZATION
C.AutoML Tables
D.Recommendations AI
AnswerD

Recommendations AI provides pre-built models for personalized recommendations, ideal for e-commerce.

Why this answer

Recommendations AI is a managed service specifically for retail recommendation use cases. It offers pre-built models like 'recommended-for-you' and 'frequently-bought-together'. BigQuery ML would require custom model building, and AutoML Tables is for general tabular data, not specifically for recommendations.

44
MCQhard

A company uses BigQuery ML to train a boosted tree classifier on a large dataset. After training, they want to understand which features most influence predictions. Which BigQuery ML function should they use?

A.ML.EXPLAIN_PREDICT
B.ML.FEATURE_IMPORTANCE
C.ML.EVALUATE
D.ML.PREDICT
AnswerB

Why this answer

ML.FEATURE_IMPORTANCE returns feature importance for tree-based models in BigQuery ML. ML.EVALUATE gives evaluation metrics, ML.PREDICT gives predictions, ML.EXPLAIN_PREDICT gives local explanations with SHAP values.

45
MCQeasy

A retail company wants to build a recommendation system to show 'frequently bought together' items. Which Recommendations AI model type should they use?

A.recently-viewed
B.frequently-bought-together
C.recommended-for-you
D.others-you-may-like
AnswerB

Why this answer

Option B is correct because the 'frequently-bought-together' model type in Google Cloud Recommendations AI is specifically designed to identify items that are commonly purchased in the same transaction, using co-occurrence analysis of historical purchase data. This directly matches the requirement to show items that are frequently bought together, leveraging association rule mining (e.g., Apriori algorithm) to generate recommendations.

Exam trap

In Google Cloud Recommendations AI, the key distinction is between transaction-based co-purchase models ('frequently-bought-together') and personalized recommendation models ('recommended-for-you'). Candidates may confuse 'frequently-bought-together' with 'others-you-may-like' because both relate to item similarity, but only 'frequently-bought-together' uses co-occurrence analysis of actual transactions.

How to eliminate wrong answers

Option A is wrong because 'recently-viewed' is a model type that surfaces items a user has recently browsed, not items that are frequently purchased together, and it relies on session-based user behavior rather than transaction co-occurrence. Option C is wrong because 'recommended-for-you' is a personalized model that uses user-item interaction history (e.g., collaborative filtering) to suggest items tailored to an individual, not cross-item purchase patterns. Option D is wrong because 'others-you-may-like' is a similarity-based model that recommends items similar to a given product based on content or metadata, not on transactional co-purchase frequency.

46
MCQhard

An engineer wants to use BigQuery ML to explain predictions from a trained boosted tree classifier for a specific set of input rows. Which function should they use?

A.ML.EVALUATE
B.ML.FEATURE_IMPORTANCE
C.ML.PREDICT
D.ML.EXPLAIN_PREDICT
AnswerD

Why this answer

ML.EXPLAIN_PREDICT provides feature attributions for each prediction. ML.EVALUATE returns aggregate metrics, ML.FEATURE_IMPORTANCE gives global importance, and ML.PREDICT only returns predictions.

47
Multi-Selecthard

A company is building a document processing pipeline for invoices. They need to extract key fields (invoice number, date, total amount) and allow human review for invoices over $10,000. Which TWO Google Cloud services/features should they combine?

Select 2 answers
A.Cloud Vision API for OCR
B.Human-in-the-Loop (HITL) on Document AI
C.AutoML Tables to predict missing fields
D.Document AI with invoice parser processor
E.Cloud Translation API to translate invoices
AnswersB, D

Why this answer

Document AI with a specialized invoice parser can extract fields. Human-in-the-Loop (HITL) is a feature of Document AI that enables human review. Cloud Vision API is not specialized for invoices.

AutoML Tables is for tabular models. Cloud Translation is not needed.

48
MCQeasy

A data analyst wants to train a linear regression model to predict house prices using only SQL queries on BigQuery. Which BigQuery ML model type should they use?

A.BOOSTED_TREE_REGRESSOR
B.LOGISTIC_REG
C.LINEAR_REG
D.DNN_REGRESSOR
AnswerC

Why this answer

The question specifies a linear regression model for predicting house prices, which is a regression task with a continuous target variable. BigQuery ML's LINEAR_REG model type is explicitly designed for linear regression, making it the correct choice for this use case.

Exam trap

Google often tests the distinction between regression and classification model types, and the trap here is that candidates might confuse LOGISTIC_REG (classification) with linear regression due to the word 'logistic' sounding similar to 'linear', or they might overcomplicate the solution by choosing a tree or neural network model when a simple linear model suffices.

How to eliminate wrong answers

Option A is wrong because BOOSTED_TREE_REGRESSOR is a tree-based ensemble method, not a linear model, and is overkill for a simple linear regression task. Option B is wrong because LOGISTIC_REG is used for binary classification, not regression (predicting continuous values like house prices). Option D is wrong because DNN_REGRESSOR is a deep neural network regressor, which is unnecessarily complex and not a linear model.

49
MCQmedium

A developer needs to transcribe phone calls with high accuracy for a call center analytics application. The audio is in English and has background noise. Which Speech-to-Text model should they choose?

A.telephony
B.latest_short
C.latest_long
D.Any model; they are all equivalent
AnswerA

Why this answer

The telephony model is optimized for phone call audio with background noise. latest_long is for longer audio (without phone-specific optimization), and latest_short is for short utterances.

50
MCQmedium

A data scientist wants to use AutoML to classify images of retail products into categories. There are 50 categories and the dataset has 100,000 labelled images. Which Vertex AI AutoML service is most appropriate?

A.AutoML Tables
B.AutoML Video
C.AutoML Vision
D.AutoML NLP
AnswerC

Why this answer

AutoML Vision is designed for image classification, object detection, and segmentation. AutoML Tables is for tabular data, AutoML NLP for text, and AutoML Video for video.

51
MCQeasy

A data scientist needs to train a time-series forecasting model on historical sales data stored in BigQuery to predict future demand. The data has strong seasonal patterns. Which BigQuery ML model type should they use?

A.MATRIX_FACTORIZATION
B.BOOSTED_TREE_REGRESSOR
C.ARIMA_PLUS
D.K_MEANS
AnswerC

ARIMA_PLUS is the correct model for time-series forecasting in BigQuery ML.

Why this answer

ARIMA_PLUS is the correct choice because it is specifically designed for time-series forecasting in BigQuery ML, handling seasonal patterns, trend decomposition, and automatic hyperparameter tuning. It models autoregressive (AR) and moving average (MA) components with seasonal differencing, making it ideal for historical sales data with strong seasonal cycles.

Exam trap

Google often tests the misconception that any regression model (like BOOSTED_TREE_REGRESSOR) can be naively applied to time-series data, ignoring the need for specialized models that handle temporal dependencies and seasonality natively.

How to eliminate wrong answers

Option A is wrong because MATRIX_FACTORIZATION is used for recommendation systems (e.g., collaborative filtering) and cannot model temporal dependencies or seasonality in time-series data. Option B is wrong because BOOSTED_TREE_REGRESSOR is a tree-based ensemble method for regression tasks but does not inherently capture time-series structures like seasonality, trend, or autocorrelation without extensive feature engineering. Option D is wrong because K_MEANS is an unsupervised clustering algorithm that groups data points by similarity and has no mechanism for forecasting future values or modeling sequential patterns.

52
MCQmedium

A retail company wants to predict customer churn using historical purchase data stored in BigQuery. The data includes customer demographics, transaction history, and support interactions. The team is comfortable writing SQL and wants to avoid moving data to a separate environment. Which approach should they take?

A.Use the Cloud Natural Language API to analyze customer support interactions and combine results with purchase data in BigQuery.
B.Export the data to a CSV file and use Vertex AI AutoML Tables to train a classification model.
C.Use BigQuery ML to create a logistic regression model (LOGISTIC_REG) on the data directly in BigQuery.
D.Create a Dataflow pipeline to stream data to Cloud SQL and use Cloud SQL's built-in ML functions.
AnswerC

BigQuery ML supports logistic regression for binary classification and runs entirely in BigQuery using SQL.

Why this answer

Option C is correct because BigQuery ML allows the team to build and train a logistic regression model directly on data stored in BigQuery using SQL syntax, without moving data to a separate environment. The LOGISTIC_REG model type is specifically designed for binary classification tasks like churn prediction, and it runs entirely within BigQuery's serverless infrastructure, satisfying the team's requirement to avoid data movement.

Exam trap

This question tests the misconception that ML requires moving data to a separate platform (like Vertex AI or Cloud SQL), when in fact BigQuery ML provides a low-code, SQL-based solution that keeps data in place and meets the stated constraints.

How to eliminate wrong answers

Option A is wrong because the Cloud Natural Language API is used for text analysis (e.g., sentiment extraction), not for training a predictive churn model; it would require additional steps to combine results and does not provide a built-in classification model. Option B is wrong because exporting data to a CSV file and using Vertex AI AutoML Tables violates the requirement to avoid moving data to a separate environment, and it introduces unnecessary data egress and manual steps. Option D is wrong because Cloud SQL does not have built-in ML functions for training classification models; it is a relational database service, and streaming data through Dataflow to Cloud SQL adds complexity and does not leverage BigQuery's native ML capabilities.

53
MCQmedium

A healthcare provider needs to extract structured information from incoming PDF forms (e.g., patient intake forms). They want to automate data extraction without writing custom models. Which Google Cloud service should they use?

A.Document AI with a form parser processor
B.Natural Language API for entity extraction
C.Vision API
D.AutoML Vision for object detection
AnswerA

Document AI's form parser is designed to extract key-value pairs and tables from forms.

Why this answer

Document AI with a form parser processor is the correct choice because it is purpose-built for extracting structured data from PDF forms, including key-value pairs and tables, without requiring custom model development. It uses pre-trained models specifically for form understanding, making it ideal for automating intake form processing.

Exam trap

A common pitfall is choosing the Vision API for form parsing because it performs OCR, but it lacks the specialized form-field extraction capabilities of Document AI, which is designed specifically for structured data extraction from forms.

How to eliminate wrong answers

Option B is wrong because Natural Language API is designed for extracting entities and sentiment from unstructured text, not for parsing structured form fields from PDF documents. Option C is wrong because Vision API provides optical character recognition (OCR) to extract raw text from images but lacks the form-specific parsing logic to identify key-value pairs and table structures. Option D is wrong because AutoML Vision for object detection is used for identifying objects within images, not for extracting structured data from forms.

54
MCQmedium

A retail company wants to generate product recommendations on their website using Google Cloud. They have historical transaction data and need a managed service that provides personalized recommendations like 'frequently bought together'. Which service should they use?

A.Recommendations AI
B.BigQuery ML
C.Vertex AI Prediction
D.AutoML Tables
AnswerA

Why this answer

Recommendations AI offers pre-built retail models including frequently-bought-together. AutoML Tables would require custom training, BigQuery ML is for SQL-based models, and Vertex AI Prediction is for deploying custom models.

55
Multi-Selectmedium

A company needs to build a custom model to classify images of products into categories. They have a large labeled dataset. They want to use AutoML but are unsure which options support image classification. Which TWO AutoML products support image classification?

Select 1 answer
A.AutoML Natural Language
B.AutoML Video
C.AutoML Vision
D.AutoML Translation
E.AutoML Tables
AnswersC

Why this answer

AutoML Vision (Option C) is the correct choice because it is specifically designed for image classification tasks, allowing users to train custom models on labeled image datasets without writing code. It supports features like object detection and image segmentation, making it ideal for the described use case.

Exam trap

Google often tests the distinction between AutoML services by focusing on the data modality (text, image, video, tabular), leading candidates to mistakenly choose AutoML Video for image tasks due to its visual nature, but it is strictly for video sequences, not static images.

56
MCQeasy

A company needs to extract text from scanned invoices and parse key fields like invoice number and total amount. Which Document AI processor should they use?

A.OCR Processor
B.Contract Parser
C.Form Parser
D.Invoice Parser
AnswerD

Why this answer

The Invoice Parser is specialised for parsing invoice documents. OCR Processor extracts text only, Form Parser extracts form fields, and Contract Parser is for legal contracts.

57
MCQeasy

A media company wants to transcribe audio files from customer support calls into text for analysis. The audio is in English with clear speech and no background noise. They want a quick solution with no ML model training. Which Google Cloud service should they use?

A.Translation API to translate the audio
B.AutoML NLP to train a transcription model
C.Vertex AI Workbench to train a custom speech recognition model
D.Speech-to-Text API with the latest_long model
AnswerD

Why this answer

Speech-to-Text is a pre-built API for transcribing audio to text. It is ready to use without training. AutoML NLP is for text classification, not transcription.

Vertex AI Workbench and Translation API are not relevant.

58
Multi-Selecteasy

A data analyst wants to use BigQuery ML to train a linear regression model (LINEAR_REG) to predict house prices. They have a table with features like square footage, number of bedrooms, and location. Which TWO statements about the training process are correct?

Select 2 answers
A.The analyst must call ML.TRAIN after CREATE MODEL to start training
B.The trained model is stored in Cloud Storage
C.The model must be exported to Vertex AI for prediction
D.The model is automatically evaluated on a held-out test set if data splitting is enabled
E.Training is performed using the CREATE MODEL statement
AnswersD, E

By default, BigQuery ML splits data into training and evaluation sets.

Why this answer

Option D is correct because when data splitting is enabled in BigQuery ML, the `CREATE MODEL` statement automatically reserves a portion of the input data as a held-out test set. After training completes, BigQuery ML evaluates the model on this test set and reports metrics like mean absolute error and R², without requiring any manual split or separate evaluation step.

Exam trap

A common misconception is that BigQuery ML requires an explicit training command (like `ML.TRAIN`) or that models are stored in Cloud Storage by default, when in fact training is fully encapsulated in `CREATE MODEL` and models reside in BigQuery's internal storage.

59
MCQmedium

An engineer needs to perform sentiment analysis on customer reviews. They have a large volume of text and need a solution that requires minimal customisation. Which option is most efficient?

A.Use Vertex AI Prediction with a pre-trained model
B.Use BigQuery ML with LOGISTIC_REG
C.Train a custom model using AutoML NLP
D.Use the Natural Language API
AnswerD

Why this answer

The Natural Language API provides pre-built sentiment analysis with minimal setup. AutoML NLP would require custom training, BigQuery ML is for tabular data, and Vertex AI Prediction needs a deployed model.

Ready to test yourself?

Try a timed practice session using only Architecting Low-Code ML Solutions questions.

CCNA Architecting Low-Code ML Solutions Questions | Courseiva