A retail company wants to predict customer churn using historical purchase data stored in BigQuery. The data includes customer demographics, transaction history, and support interactions. The team is comfortable writing SQL and wants to avoid moving data to a separate environment. Which approach should they take?
Trap 1: Use the Cloud Natural Language API to analyze customer support…
This does not build a predictive churn model; it only analyzes text.
Trap 2: Export the data to a CSV file and use Vertex AI AutoML Tables to…
This requires moving data out of BigQuery, which the scenario wants to avoid.
Trap 3: Create a Dataflow pipeline to stream data to Cloud SQL and use…
Cloud SQL does not have built-in ML functions like BigQuery ML.
- A
Use the Cloud Natural Language API to analyze customer support interactions and combine results with purchase data in BigQuery.
Why wrong: This does not build a predictive churn model; it only analyzes text.
- B
Export the data to a CSV file and use Vertex AI AutoML Tables to train a classification model.
Why wrong: This requires moving data out of BigQuery, which the scenario wants to avoid.
- C
Use BigQuery ML to create a logistic regression model (LOGISTIC_REG) on the data directly in BigQuery.
BigQuery ML supports logistic regression for binary classification and runs entirely in BigQuery using SQL.
- D
Create a Dataflow pipeline to stream data to Cloud SQL and use Cloud SQL's built-in ML functions.
Why wrong: Cloud SQL does not have built-in ML functions like BigQuery ML.