Question 88 of 506

Quick Answer

The answer is to store training code in a Cloud Source Repository with tags linked to experiment IDs, record the dataset path and version in experiment parameters, and log the complete environment specification including framework versions. These three actions are correct because they create a full lineage for each Vertex AI experiment, capturing the three critical pillars of reproducibility: code, data, and environment. By tagging code commits to specific experiment IDs, you ensure any team member can checkout the exact source; recording dataset parameters locks in the data source; and logging environment details prevents dependency drift. On the Google Professional Machine Learning Engineer exam, this question tests your understanding of Vertex AI Experiments’ built-in lineage tracking, often appearing as a multi-select scenario where distractors include vague actions like “save the model artifact” without versioning or “use default runtime” without pinning dependencies. A common trap is forgetting that environment specification must be explicit, not implicit. Memory tip: think “Code, Data, Env” as the three legs of the reproducibility stool—if any leg is missing, the experiment falls over.

PMLE Practice Question: Collaborating within and across teams to manage data and models

This PMLE practice question tests your understanding of collaborating within and across teams to manage data and models. Read the scenario carefully and evaluate each option against the stated constraints before committing to an answer. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

A machine learning team is collaborating on a project using Vertex AI Experiments to track model training runs. They want to ensure that all team members can reproduce any experiment by using the same code, data, and environment. Which THREE actions should the team take?

Question 1mediummulti select
Full question →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

Store the training code in a Cloud Source Repository and tag commits with the experiment ID.

Option A is correct because storing training code in a Cloud Source Repository with tags linked to experiment IDs ensures that every team member can retrieve the exact code version used for a given experiment. This is a core reproducibility practice in Vertex AI Experiments, where the code snapshot is a key component of the experiment lineage.

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

  • Store the training code in a Cloud Source Repository and tag commits with the experiment ID.

    Why this is correct

    This ensures the exact code version is tied to the experiment.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Build a custom container image for training and push it to Artifact Registry with a fixed tag.

    Why this is correct

    A fixed container image ensures the same environment is used across runs.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Record the path and version of the training dataset in the experiment parameters.

    Why this is correct

    This allows others to use the exact same data for reproduction.

    Related concept

    Read the scenario before looking for a memorised answer.

  • Share a service account key with all team members so they can access the same resources.

    Why it's wrong here

    Sharing keys is a security risk and not a standard reproducibility practice.

  • Use Vertex AI's hyperparameter tuning job to automatically find the best parameters.

    Why it's wrong here

    Automatic tuning can change parameters between runs, making exact reproduction difficult.

Common exam traps

Common exam trap: answer the scenario, not the keyword

Google Cloud often tests the distinction between actions that enable reproducibility versus actions that improve model performance or access control, so candidates mistakenly select hyperparameter tuning or service account sharing as reproducibility measures.

Detailed technical explanation

How to think about this question

Vertex AI Experiments automatically logs parameters, metrics, and artifacts, but it does not automatically version the training code or container image. By using Cloud Source Repository tags and Artifact Registry fixed tags, teams create immutable snapshots that can be referenced later. The dataset path and version recorded as experiment parameters allow the exact data to be retrieved, forming a complete reproducibility triad: code, data, environment.

KKey Concepts to Remember

  • Read the scenario before looking for a memorised answer.
  • Find the constraint that changes the correct option.
  • Eliminate answers that are true in general but not in this case.

TExam Day Tips

  • Watch for words such as best, first, most likely and least administrative effort.
  • Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

A company's IT admin needs to give a contractor read-only access to production logs without sharing account credentials. Using role-based access control (RBAC) and temporary scoped permissions — not a permanent shared password — is the correct pattern. Questions like this test whether you can apply least-privilege access across cloud identity services.

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

Related practice questions

Related PMLE practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Practice this exam

Start a free PMLE practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

FAQ

Questions learners often ask

What does this PMLE question test?

Collaborating within and across teams to manage data and models — This question tests Collaborating within and across teams to manage data and models — Read the scenario before looking for a memorised answer..

What is the correct answer to this question?

The correct answer is: Store the training code in a Cloud Source Repository and tag commits with the experiment ID. — Option A is correct because storing training code in a Cloud Source Repository with tags linked to experiment IDs ensures that every team member can retrieve the exact code version used for a given experiment. This is a core reproducibility practice in Vertex AI Experiments, where the code snapshot is a key component of the experiment lineage.

What should I do if I get this PMLE question wrong?

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

What is the key concept behind this question?

Read the scenario before looking for a memorised answer.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Same concept, more angles

1 more ways this is tested on PMLE

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A data science team is collaborating on a project to build a churn prediction model. They use Vertex AI Workbench instances for development. Each data scientist has their own instance with a persistent disk. They share code via a GitHub repository. They want to ensure that the model training is reproducible across different team members' environments. Currently, they manually install Python packages in their instances, and they have noticed that the model metrics differ slightly between runs on different instances. Which of the following is the best action to ensure reproducibility?

medium
  • A.Standardize the instance machine type and ensure all have the same number of CPUs.
  • B.Use Cloud Functions to run the training code instead.
  • C.Use Vertex AI Experiments with a fixed environment by specifying a prebuilt container.
  • D.Create a custom Docker image with all dependencies and use it in Vertex AI Training jobs.
  • E.Ask all team members to use the same Python virtual environment and install packages from a requirements.txt file.

Why C: Option C is correct because Vertex AI Experiments with a prebuilt container ensures a fixed, reproducible environment by pinning the exact OS, Python version, and all dependencies. This eliminates the variability introduced by manual package installations and differing instance configurations, directly addressing the team's issue of inconsistent model metrics across runs.

Keep practising

More PMLE practice questions

Last reviewed: Jun 30, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

Sign in to join the discussion.

This PMLE practice question is part of Courseiva's free Google Cloud certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the PMLE exam.