Practice PMLE Collaborating Within and Across Teams to Manage Data and Models questions with full explanations on every answer.
Start practicing
Collaborating Within and Across Teams to Manage Data and Models — choose a session length
Free · No account required
Click any question to see the full explanation and answer options, or start a focused practice session above.
A data science team uses Vertex AI Experiments to track training runs. They want to automatically log parameters, metrics, and artifacts for all runs with minimal code changes. Which approach should they take?
2A machine learning team wants to share features across multiple models to reduce training-serving skew and ensure consistency. Which Vertex AI service should they use?
3An organization uses Vertex AI Pipelines and wants to track the lineage of datasets, models, and metrics across pipeline runs. They need to query upstream and downstream dependencies of an artifact. Which service should they use?
4A team uses Vertex AI Feature Store with an online store for low-latency serving. They need to support frequent updates to features (e.g., every minute) and require high write throughput (thousands of writes per second). Which online store type should they choose?
5A machine learning team wants to implement champion/challenger model deployment. They have two model versions: v1 (champion) and v2 (challenger). They deploy both to the same endpoint with traffic splitting. How should they manage model versions in Vertex AI Model Registry to reflect this?
6A data engineer needs to version large datasets (multiple TB) in a Data Lake on Google Cloud. They require ACID transactions to ensure consistency when multiple jobs read/write concurrently. Which solution should they use?
7A team wants to use Vertex AI Workbench for collaborative notebook development. They need a persistent environment that can be stopped and restarted without losing installed packages and data. Which instance type should they choose?
8A team is building ML pipelines with Vertex AI. They want to reuse standard pipeline components across teams and enforce governance. What approach should they take?
9A company uses Vertex AI Feature Store for feature engineering. They need to ensure point-in-time correctness to avoid data leakage during training. Which feature retrieval method should they use?
10A machine learning engineer needs to deploy a model to an endpoint for real-time predictions. The model is registered in Vertex AI Model Registry. Which command should they use to create an endpoint and deploy the model with the alias 'champion'?
11A data scientist wants to automatically generate model documentation that includes model purpose, training data, evaluation results, and intended use. Which tool should they use?
12A team monitors features in Vertex AI Feature Store for drift. They want to set up automated alerts when a feature's distribution deviates significantly from the baseline. Which feature monitoring configuration should they use?
13A company wants to implement a central model governance strategy using Vertex AI. They need to track model lineage, store evaluation metrics, and manage model versions across teams. Which THREE Vertex AI services should they use? (Choose 3)
14A team uses Vertex AI Feature Store with an online store for real-time predictions. They notice that the online store queries are taking longer than expected. Which TWO actions could improve online store performance? (Choose 2)
15A data science team collaborates using Vertex AI Workbench user-managed notebooks. They want to version control their notebook code and share it with team members. Which TWO tools should they use? (Choose 2)
16A data science team wants to share engineered features across multiple projects while ensuring low-latency serving for online predictions. Which Google Cloud service should they use to store and serve these features?
17You are configuring a Vertex AI Feature Store online store for a real-time recommendation system that requires single-digit millisecond latency and high throughput. The feature values are updated frequently. Which online store type should you use?
18A team is training a model using historical data and wants to avoid data leakage when joining feature values from a feature store. The features include time-varying data like user activity counts. Which retrieval method should they use when creating a training dataset?
19You are setting up feature monitoring in Vertex AI Feature Store to detect drift in a numerical feature. The monitoring job should run daily and alert if the Jensen-Shannon divergence exceeds 0.1. Which configuration should you use?
20A company uses Vertex AI Model Registry to manage multiple model versions. They want to designate a model version as 'champion' for production deployment and another as 'challenger' for A/B testing. Which feature of the registry should they use?
21A data scientist is using Vertex AI Experiments to track training runs. They want to automatically log all hyperparameters, metrics, and model artifacts without modifying their training code. Which approach should they use?
22A machine learning pipeline in Vertex AI produces a dataset artifact, a trained model, and evaluation metrics. The team wants to query the lineage to find all downstream artifacts that depend on a particular dataset. Which Vertex AI service should they use?
23A team of data scientists is collaborating on notebooks in Vertex AI Workbench. They need to use Git for version control and share notebooks with real-time editing. Which type of Workbench instance should they choose?
24You are using DVC for data versioning in an ML project on Google Cloud. Your training data is stored in Cloud Storage. You want to track a new version of the dataset after preprocessing. Which DVC command should you use to register the changes?
25An organisation uses Delta Lake on Dataproc to manage a data lake for ML training. They need ACID transactions for concurrent reads and writes. Which file format does Delta Lake use as the underlying storage?
26You need to create a reproducible snapshot of a BigQuery table as of a specific timestamp for ML model training. The snapshot should be queryable without copying the entire dataset. Which BigQuery feature should you use?
27A team wants to enforce governance and compliance for all ML models across the organisation. They need a centralised repository that tracks model versions, deployment history, and evaluation metrics. Which service should they use?
28A company uses Vertex AI Pipelines for ML workflows. They want to standardize pipeline templates across teams to ensure consistency. Which TWO approaches should they use?
29A machine learning team needs to ensure that the same features used for training are used for serving in production to avoid training-serving skew. They use Vertex AI Feature Store. Which THREE actions should they take?
30A data science team uses Vertex AI Experiments to compare multiple model training runs. They want to capture and compare hyperparameters, metrics, and code versions for each run. Which TWO steps should they take?
31A data scientist wants to track machine learning experiments, including parameters, metrics, and artifacts, and compare runs. Which Vertex AI service should they use?
32A team uses Vertex AI Feature Store with an online store. They need low-latency serving for millions of features with high write throughput. Which online store type should they choose?
33A company trains a model using features from Vertex AI Feature Store. They notice training-serving skew because the feature values used at training time differ from those served online. How should they address this?
34A data science team wants to version control their datasets along with code using Git. They need a tool that integrates with Git and tracks changes to large data files. Which tool should they use?
35An ML engineer needs to deploy a model from Vertex AI Model Registry to an endpoint. The model has multiple versions. They want to designate one version as the 'champion' for production traffic. How should they do this?
36A company uses Vertex AI Pipelines to train and deploy models. They want to automatically generate model documentation that includes model details, intended use, and evaluation results. What should they use?
37A team wants to share feature definitions across multiple projects in their organization using Vertex AI Feature Store. What is the recommended approach?
38A data scientist is using Vertex AI Workbench notebooks and wants to collaborate with team members in real-time on the same notebook. Which notebook type supports real-time collaboration?
39A team uses Vertex AI Metadata to track pipeline runs. They need to identify all artifacts that were generated by a particular pipeline execution. Which API method should they use?
40A company uses Delta Lake on Dataproc for their data lake. They need to ensure ACID transactions and schema enforcement for data ingested from streaming sources. Which Delta Lake feature should they enable?
41An organization needs to implement MLOps with standardized pipeline templates across multiple teams. Which Vertex AI feature should they use to create reusable pipeline components?
42A data engineer wants to create a BigQuery table snapshot for point-in-time recovery of a critical dataset. The snapshot should be created daily and retained for 30 days. What should they use?
43A company wants to monitor features in Vertex AI Feature Store for drift over time. Which two services should they use? (Choose two.)
44A team uses Vertex AI Pipelines and wants to track lineage of artifacts and executions. Which three resources should they use? (Choose three.)
45A company wants to implement a centralized model registry for governance. Which two features should they use? (Choose two.)
46An ML team wants to share feature definitions across multiple projects to reduce training-serving skew and ensure consistency. They currently store features in Cloud Storage and manually coordinate updates, leading to errors. Which Google Cloud service should they use to centrally manage and serve features for both training and online inference?
47A data scientist is training a model using Vertex AI Experiments and wants to automatically log model parameters, metrics, and artifacts without modifying their training script. Which approach should they use?
48An ML engineer has a model trained in Vertex AI and wants to deploy it to an endpoint with autoscaling and traffic splitting for canary testing. They have the model artifact stored in Vertex AI Model Registry with alias 'champion'. What is the correct sequence of steps?
49A team wants to track the lineage of ML pipeline runs, including which datasets, parameters, and models were used in each execution. Which Vertex AI service should they use?
50An organization uses Vertex AI Workbench user-managed notebooks and wants to enable collaboration where multiple data scientists can edit the same notebook simultaneously. Which configuration should they use?
51A company uses BigQuery as their data warehouse. They want to version datasets for ML experiments and be able to query snapshots at specific points in time. Which approach is most cost-effective and requires minimal operational overhead?
52An ML team uses Vertex AI Pipelines and wants to automatically generate model cards documenting model purpose, evaluation results, and intended use. Which approach should they take?
53A team is using Vertex AI Feature Store with an online store for low-latency serving. They notice increasing latency during peak hours. The feature data is updated frequently and requires strong consistency. Which online store type should they use?
54A data scientist needs to retrieve training data from Vertex AI Feature Store that exactly matches the feature values as they were at a specific historical timestamp to avoid label leakage. Which feature view configuration should they use?
55An ML team wants to monitor feature drift in their production model. Which Vertex AI Feature Store capability should they use?
56A company uses Vertex AI Pipelines to orchestrate ML workflows. After a pipeline run, they want to query the lineage of a particular model artifact to find out which dataset and hyperparameters were used to produce it. Which API method should they use?
57A team is using Delta Lake on Dataproc for their data lake with ACID transactions. They want to version data for ML experiments and roll back to a previous version if needed. Which Delta Lake feature should they use?
58An organization wants to implement central governance for ML models across teams. Which TWO services should they use together to achieve model versioning, lineage, and deployment management? (Select 2)
59An ML team uses Vertex AI Workbench managed notebooks and wants to version their notebook code and collaborate using Git. Which THREE steps are required to set up Git integration? (Select 3)
60A company wants to use DVC for data versioning alongside their ML code in Git. Which TWO statements about DVC are correct? (Select 2)
61A team is building a fraud detection model that requires joining real-time transaction features with historical user features. They need to ensure that the training data does not use future information (data leakage). Which Vertex AI Feature Store capability should they use?
62An ML team wants to automatically track training runs, including hyperparameters and metrics, with minimal code changes. Which Vertex AI service should they use?
63A data science team needs to share features across multiple ML models while ensuring consistency between training and serving. Which approach best achieves this?
64An ML engineer trained a model and registered it in Vertex AI Model Registry. They want to assign the alias 'champion' to the best-performing version for production deployment. Which gcloud command should they use?
65Which Vertex AI service is used to track the lineage of ML pipeline components, artefacts, and executions?
66A team uses Vertex AI Workbench managed notebooks. They want to version control their notebook files and collaborate using Git. What is the best way to integrate Git?
67An ML team wants to implement data versioning for large datasets stored in Google Cloud Storage. They need to track changes over time and reproduce previous data states. Which tool is most appropriate?
68A company uses Vertex AI Feature Store with an online store for low-latency serving. They observe high latency during peak hours. The feature values are small (< 1 KB each) and the workload is read-heavy. Which change would most effectively reduce latency?
69What is the primary benefit of using a centralised model registry in MLOps?
70An ML engineer needs to deploy a model to an endpoint and gradually shift traffic from the previous version (champion) to a new version (challenger) for A/B testing. How should they configure the endpoint?
71A team wants to implement automated model documentation that captures training data, feature importance, evaluation metrics, and intended use. Which Vertex AI feature supports this?
72A company is implementing MLOps with Vertex AI. They need to ensure that only approved models can be deployed to production. Which TWO practices should they adopt?
73An ML team uses Delta Lake on Dataproc for data versioning. Which THREE benefits does Delta Lake provide?
74A team wants to monitor features in Vertex AI Feature Store for drift. Which TWO configurations are required?
75Which THREE are valid uses of Vertex AI Metadata?
76A data science team wants to share a set of engineered features across multiple projects and teams to reduce training-serving skew and ensure consistency. They need low-latency serving (single-digit milliseconds) for online predictions and also need to retrieve historical feature values for training. Which approach should they take?
77A team is operationalizing a machine learning pipeline using Vertex AI. They want to automatically track experiment runs, log model parameters and metrics, and store model artifacts for reproducibility. They also need to capture lineage between pipeline components (e.g., which dataset and hyperparameter tuning job produced a model). Which TWO services should they use together to achieve this? (Choose two.)
78A company is implementing MLOps on Google Cloud and needs to manage model versions, assign aliases (e.g., 'champion' for production, 'challenger' for staging), store evaluation metrics alongside each model version, and deploy models to endpoints. Which service should they use? (Choose THREE that are part of the solution.)
79A team uses Vertex AI Workbench notebooks for collaborative model development. They want to ensure that code changes are version-controlled, that multiple data scientists can work on the same notebook without conflicts, and that the environment is reproducible across team members. Which approach should they take?
The Collaborating Within and Across Teams to Manage Data and Models domain covers the key concepts tested in this area of the PMLE exam blueprint published by Google Cloud. Courseiva provides free domain-focused practice, mock exams, missed-question review, and readiness tracking across all PMLE domains — no account required.
The Courseiva PMLE question bank contains 79 questions in the Collaborating Within and Across Teams to Manage Data and Models domain. Click any question to see the full explanation and answer breakdown.
Start with a 10-question focused session to identify your baseline accuracy in this domain. Read every explanation — even for questions you answer correctly — to understand the reasoning. Once you score consistently above 80%, move to a 20–30 question session to confirm depth before moving to the next domain.
Yes — the session launcher on this page draws questions exclusively from the Collaborating Within and Across Teams to Manage Data and Models domain. Choose 10, 20, 30, or 50 questions for a focused session, or click individual questions to review them one by one.
Save your results, see per-domain analytics, and get readiness scores — free, for every certification.
Sign Up FreeFree forever · Every certification included