Question 226 of 500

Fundamentals of Generative AI →hardMultiple ChoiceObjective-mapped

Quick Answer

The correct answer is to use RAG with Vertex AI Vector Search and embeddings. This approach is most appropriate because Retrieval-Augmented Generation allows the system to convert proprietary documents into vector embeddings stored in a vector database, then retrieve only the most relevant chunks for the foundation model’s prompt, keeping the full documents isolated and never exposed to the model for training or memorization. On the Google Cloud Generative AI Leader exam, this scenario tests your understanding of how to balance data privacy with generative AI capabilities, often appearing as a trap where candidates might incorrectly choose fine-tuning or direct model access. The key insight is that RAG acts as a privacy buffer: the model sees only the retrieved context, not the raw documents. Memory tip: think of RAG as a librarian who brings you a relevant page from a locked vault—you never see the vault itself.

Generative AI Leader Fundamentals of Generative AI Practice Question

This Generative AI Leader practice question tests your understanding of fundamentals of generative ai. Compare every option against the stated constraints before choosing — the best answer satisfies all requirements, not just the most obvious one. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.

A company has a large dataset of proprietary documents and wants to build a Q&A system using a foundation model without exposing the documents to the model. Which approach is most appropriate?

Question 1hardmultiple choice

Full question →

A
Use RAG with Vertex AI Vector Search and embeddings
RAG retrieves documents at query time without training on them.
B
Use a zero-shot model with context in prompt
Why wrong: Zero-shot with context is limited by context length and may not scale.
C
Fine-tune the model on the documents
Why wrong: Fine-tuning embeds data into the model, risking exposure.
D
Use prompt engineering to instruct the model
Why wrong: Prompt engineering alone cannot handle large volumes of proprietary data effectively.

Full breakdown with real-world context →

Answer choices

Why each option matters

Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.

Correct answer & explanation

✓

Use RAG with Vertex AI Vector Search and embeddings

Option A is correct because Retrieval-Augmented Generation (RAG) with Vertex AI Vector Search allows the system to retrieve relevant document chunks via embeddings without exposing the full documents to the foundation model. The model only sees the retrieved context in the prompt, ensuring proprietary data remains isolated and not used for training or memorization.

Key principle: Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Answer analysis

Option-by-option breakdown

For each option: why learners choose it and why it is or isn't the right answer here.

✓
Use RAG with Vertex AI Vector Search and embeddings
Why this is correct
RAG retrieves documents at query time without training on them.
Related concept
Read the scenario before looking for a memorised answer.
✗
Use a zero-shot model with context in prompt
Why it's wrong here
Zero-shot with context is limited by context length and may not scale.
✗
Fine-tune the model on the documents
Why it's wrong here
Fine-tuning embeds data into the model, risking exposure.
✗
Use prompt engineering to instruct the model
Why it's wrong here
Prompt engineering alone cannot handle large volumes of proprietary data effectively.

Common exam traps

Common exam trap: answer the scenario, not the keyword

Google Cloud often tests the misconception that fine-tuning or prompt engineering can solve data privacy concerns, when in reality RAG is the only approach that keeps proprietary documents isolated from the model's training and inference pipeline.

Detailed technical explanation

How to think about this question

RAG works by first converting documents into vector embeddings using a model like text-embedding-004, storing them in a vector database such as Vertex AI Vector Search, then at query time retrieving the top-k most relevant chunks via cosine similarity. The foundation model (e.g., Gemini) only receives these retrieved chunks as context, never the full dataset, ensuring compliance with data governance policies. In practice, this approach is critical for industries like healthcare or finance where documents contain PII or trade secrets that must not be exposed to the model's training data.

KKey Concepts to Remember

Read the scenario before looking for a memorised answer.
Find the constraint that changes the correct option.
Eliminate answers that are true in general but not in this case.

TExam Day Tips

Watch for words such as best, first, most likely and least administrative effort.
Review why wrong options are wrong, not only why the correct option is correct.

Key takeaway

Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option.

Real-world example

How this comes up in practice

A cloud solutions architect for a retail company is evaluating services for a new workload. The correct answer here reflects best practice for the specific scenario described — not a general cloud recommendation. Answer the scenario, not the keyword: identify the specific constraint before choosing the most familiar-sounding option. Cloud exam questions reward reading the constraint carefully: the same technology can be right or wrong depending on the use case.

What to study next

Got this wrong? Here's your next step.

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

Related Generative AI Leader practice-question pages

Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.

Fundamentals of Generative AI practice questions

Practise Generative AI Leader questions linked to Fundamentals of Generative AI.

Business Strategies for Generative AI Solutions practice questions

Practise Generative AI Leader questions linked to Business Strategies for Generative AI Solutions.

Google Cloud's Generative AI Offerings practice questions

Practise Generative AI Leader questions linked to Google Cloud's Generative AI Offerings.

Techniques to Improve Generative AI Model Output practice questions

Practise Generative AI Leader questions linked to Techniques to Improve Generative AI Model Output.

Generative AI Leader fundamentals practice questions

Practise Generative AI Leader questions linked to Generative AI Leader fundamentals.

Generative AI Leader scenario practice questions

Practise Generative AI Leader questions linked to Generative AI Leader scenario.

Generative AI Leader troubleshooting practice questions

Practise Generative AI Leader questions linked to Generative AI Leader troubleshooting.

Practice this exam

Start a free Generative AI Leader practice session

Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.

10 questions 20 questions 30 questions 50 questions Timed 30

Generative AI Leader practice-test guide →Study guide →Browse all practice tests

FAQ

Questions learners often ask

What does this Generative AI Leader question test?

Fundamentals of Generative AI — This question tests Fundamentals of Generative AI — Read the scenario before looking for a memorised answer..

What is the correct answer to this question?

The correct answer is: Use RAG with Vertex AI Vector Search and embeddings — Option A is correct because Retrieval-Augmented Generation (RAG) with Vertex AI Vector Search allows the system to retrieve relevant document chunks via embeddings without exposing the full documents to the foundation model. The model only sees the retrieved context in the prompt, ensuring proprietary data remains isolated and not used for training or memorization.

What should I do if I get this Generative AI Leader question wrong?

Identify which exam domain this question belongs to, review the core concept, then practise similar questions from the same domain.

What is the key concept behind this question?

Read the scenario before looking for a memorised answer.

About these practice questions

Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →

How Courseiva writes practice questions · Editorial policy

Same concept, more angles

3 more ways this is tested on Generative AI Leader

These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.

Variation 1. A startup is building a customer service chatbot that generates responses in real-time. They want the model to have up-to-date information on the latest product catalog but cannot afford frequent fine-tuning. Which technique should they use to inject current data into the model without retraining?

easy

A.Rely on the model's zero-shot capabilities to infer product details.
✓ B.Use retrieval-augmented generation (RAG) to fetch relevant documents from a vector database at inference time.
C.Craft detailed system prompts that include the entire product catalog in the prompt.
D.Fine-tune the base model weekly on the latest product catalog.

Why B: Retrieval-Augmented Generation (RAG) is the correct technique because it allows the chatbot to fetch the most current product catalog entries from an external vector database at inference time, without requiring any model retraining. This keeps responses grounded in up-to-date information while avoiding the cost and latency of frequent fine-tuning.

Variation 2. A startup is building a customer support chatbot using Vertex AI and wants to ground responses in their product documentation to reduce hallucinations. Which approach should they use?

easy

✓ A.Enable Vertex AI Grounding with a custom enterprise data store containing the documentation.
B.Use the Codey API for text generation.
C.Use the base model without any grounding to maximize flexibility.
D.Fine-tune the model on the documentation and deploy.

Why A: Vertex AI Grounding with a custom enterprise data store is the correct approach because it allows the chatbot to retrieve and cite specific chunks from the product documentation in real time, directly reducing hallucinations by constraining responses to verified content. This method uses the underlying grounding service to query a vector-based data store (powered by Vertex AI Search) and append source references to the model's output, ensuring factual accuracy without retraining.

Variation 3. A healthcare company is building a clinical decision support system using Gemini 1.5 Pro on Vertex AI. They need responses that are highly accurate and comply with medical regulations, including traceability to source documents. They have a large corpus of curated medical guidelines stored in PDFs in Cloud Storage. Their team has experience with both fine-tuning and prompt engineering. Which approach best ensures regulatory compliance and accuracy?

medium

✓ A.Use a combination of grounding to the medical guidelines and prompt engineering with system instructions specifying compliance requirements.
B.Use prompt engineering with system instructions and few-shot examples, but no grounding.
C.Use grounding to the medical guidelines but rely on prompt engineering only for compliance instructions.
D.Fine-tune the model on the medical guidelines corpus to internalize the knowledge.

Why A: Option D is correct because combining grounding (which ties answers to the actual guidelines) with prompt engineering (which enforces compliance requirements) provides traceability and accuracy. Option A (fine-tuning only) risks the model memorizing rather than citing sources, and updates require retraining. Option B (grounding only) may still allow the model to generate ungrounded responses if not properly constrained. Option C (prompt engineering only) relies on the model's pre-trained knowledge, which is less reliable.

Last reviewed: Jun 30, 2026

Question Discussion

Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.

Loading comments…

This Generative AI Leader practice question is part of Courseiva's free Google Cloud certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the Generative AI Leader exam.