Practice Fundamentals of Generative AI questions for the Generative AI Leader exam, with a full explanation for every answer.
Fundamentals of Generative AI — choose a session length
Click any question to see the full explanation and answer options, or start a focused practice session above.
1. A startup is building a customer support chatbot using Vertex AI and wants to ground responses in their product documentation to reduce hallucinations. Which approach should they use?
2. A data scientist notices that a text generation model deployed on Vertex AI returns repetitive outputs after a few turns in a chat application. What is the most likely cause and the best parameter adjustment?
3. A financial services company wants to use generative AI to generate personalized investment advice. They must ensure responses comply with regulatory requirements (e.g., no guarantees of returns). Which Vertex AI safety feature should they primarily use?
4. A company is using Vertex AI to generate marketing copy. They notice that the output sometimes contains factual inaccuracies. Which parameter adjustment is most likely to improve factual accuracy?
5. A team is fine-tuning a large language model on custom data using Vertex AI. They find that the training loss decreases but validation loss increases. What is the best course of action?
6. A developer wants to generate product descriptions from a list of features using Vertex AI. Which model type is best suited for this task?
7. A company is using Vertex AI to generate email responses. They want to ensure sensitive customer data (PII) is not included in the output. What is the most effective approach?
8. Which TWO statements are true about generative AI models?
9. A company is deploying a generative AI model for medical diagnosis support. Which THREE considerations are critical for responsible AI?
10. A team is evaluating generative AI models for a content moderation system. Which THREE metrics are most important to assess?
11. A company wants to build a chatbot using Vertex AI that can answer customer questions based on their internal knowledge base. Which Google Cloud service should they use to store and retrieve the knowledge base efficiently?
12. A data scientist is fine-tuning a large language model using Vertex AI. The training job fails with an out-of-memory error. Which action should they take to resolve this issue?
13. A company is deploying a generative AI application that generates medical reports. They need to ensure the output is factual and minimizes hallucinations. Which approach is most effective?
14. A developer is using Vertex AI PaLM API to generate code snippets. The responses sometimes contain security vulnerabilities. What is the best practice to mitigate this?
15. A machine learning engineer is building a text-to-image model using Vertex AI. They want to reduce inference latency. Which strategy is most effective?
16. A company is using Vertex AI to generate personalized marketing emails. The model sometimes produces biased content. What is the most effective way to detect and mitigate bias?
17. Which TWO options are best practices for deploying generative AI models on Vertex AI? (Choose two.)
18. Which THREE factors should be considered when choosing between fine-tuning and prompt engineering for a generative AI task? (Choose three.)
19. A data scientist sees the above error when trying to deploy a model to an endpoint. What is the most likely cause?
20. A developer receives the above JSON response from a Vertex AI PaLM API call for a medical advice application. What should the developer be most concerned about?
21. You are an ML engineer at a retail company. You have deployed a generative AI model on Vertex AI to generate product descriptions. The model uses a custom container and is deployed to a single endpoint. Recently, you noticed that inference latency has increased significantly during peak hours, causing timeouts. You have checked the logs and found that the CPU utilization on the deployed instances is consistently above 90% during peak hours. The model is currently deployed with a single machine type (n1-standard-4) and no scaling. You need to reduce latency without incurring excessive cost. What should you do?
22. You are a data scientist at a financial institution. You are using Vertex AI to fine-tune a large language model (LLM) for generating financial reports. You have prepared a dataset of 10,000 examples. During fine-tuning, you notice that the training loss is decreasing steadily, but the validation loss is increasing after 5 epochs. The model's generated reports on the validation set contain many factual errors and nonsensical statements. You suspect overfitting. You have limited compute budget and need to improve generalization. What should you do?
23. A retail company is building a generative AI chatbot to assist customers with product recommendations and order tracking. The chatbot uses Vertex AI with Gemini 1.5 Pro, and the development team has implemented a Retrieval-Augmented Generation (RAG) pipeline using Vertex AI Search for grounding. The pipeline uses a vector store containing product descriptions and order history. During testing, the team observes that the chatbot sometimes provides incorrect order statuses, for example, claiming an order is 'shipped' when it is actually 'pending'. The team suspects the issue is related to how context is retrieved and used. The RAG pipeline currently retrieves the top 5 chunks based on cosine similarity from the vector store, and passes them as context to the model. The team is considering several changes to improve factual accuracy. Which single action would most effectively reduce hallucinations in this scenario?
24. A marketing team wants to generate product descriptions using generative AI. They need to ensure factual accuracy and avoid hallucinations. Which approach should they use?
25. A company fine-tunes a text model on internal HR policies. After deployment, the model sometimes outputs sensitive employee information. What is the most likely cause?
26. A developer uses the Vertex AI Python SDK to call a Gemini model for structured JSON output. However, the model often returns malformed JSON. Which parameter should the developer set in the generation configuration to enforce valid JSON output?
27. A graphic design company wants to generate high-quality synthetic images for product mockups. Which Google Cloud generative AI service is most suitable?
28. A data scientist fine-tunes a foundation model on customer support transcripts. After evaluation, the model's responses are too formal. Which adjustment during fine-tuning is most likely to make responses more conversational?
29. An organization uses a fine-tuned model for medical diagnosis and must comply with HIPAA. Which measure is essential when deploying the model on Vertex AI?
30. A prompt engineer wants to improve the model's adherence to a specific output format (e.g., always start with a greeting). Which technique should they try first?
31. During a RAG pipeline implementation, the retrieval system frequently returns irrelevant documents, causing the generator to produce incorrect answers. Which change is most likely to improve the relevance of retrieved documents?
32. A team is training a custom foundation model using JAX on TPUs on Google Cloud. They encounter frequent Out of Memory (OOM) errors. Which action is most effective in resolving the OOM error?
33. A data scientist is selecting a base model for generating Python code. Which TWO factors are most important to consider?
34. What are THREE benefits of using embedding models in a Retrieval Augmented Generation (RAG) system?
35. A company deploys a Gemini model on Vertex AI for a customer-facing chatbot. They observe the chatbot occasionally produces toxic language. Which TWO measures should they implement immediately to reduce toxic outputs?
36. Refer to the exhibit. A data scientist runs the gcloud command and sees the model listed. However, when they try to deploy the model to an endpoint, they get an error: 'Model is not deployable'. What is the most likely reason?
37. Refer to the exhibit. This IAM policy is applied to a Vertex AI project. A user 'test@example.com' reports they cannot create a ModelEvaluationPipelineJob. Which action should the administrator take?
38. Refer to the exhibit. A developer sees this error when trying to call a Vertex AI endpoint for online prediction. What permission does the requesting identity need to be granted?
39. A startup wants to use a pre-trained model to generate product descriptions without training. Which Google Cloud service should they use?
40. A data scientist fine-tunes a large language model on Vertex AI but gets poor results on validation data. What is the most likely cause?
41. A gen AI application produces hallucinations (factually incorrect outputs). Which mitigation strategy is LEAST effective?
42. What is the purpose of grounding in Vertex AI?
43. A company wants to build a chatbot that answers questions based on internal documents. Which approach is most appropriate?
44. A developer uses Vertex AI to generate code but the output is not syntactically correct. Which parameter should be adjusted?
45. Which Google Cloud product provides access to pre-trained foundation models like Gemini?
46. A company fine-tunes a model using Vertex AI and notices the model's performance drops on the original training task (e.g., language understanding) after fine-tuning for a new task (e.g., summarization). What could be the cause?
47. Which of the following is a best practice when using Vertex AI for prompt engineering?
48. Which TWO are benefits of using retrieval-augmented generation (RAG) over fine-tuning?
49. Which THREE are valid methods to reduce bias in generative AI outputs?
50. Which TWO are components of the Vertex AI Generative AI Studio?
51. Refer to the exhibit. A developer runs this command. What is the primary purpose?
52. Refer to the exhibit. A user with this IAM role tries to deploy a model to a Vertex AI Endpoint but fails. What is the most likely reason?
53. Refer to the exhibit. What is the most likely cause of this error?
54. A company wants to use generative AI to summarize customer support tickets. Which Google Cloud tool is best suited for this task?
55. A developer is using Vertex AI Gemini API for a chatbot. The chatbot sometimes outputs harmful content. What is the best first step to mitigate this?
56. A company has a large dataset of proprietary documents and wants to build a Q&A system using a foundation model without exposing the documents to the model. Which approach is most appropriate?
57. A data scientist notices that a Gemini model generates inconsistent responses to similar prompts. What is the likely cause?
58. A company wants to generate images from text descriptions using Google Cloud. Which service should they use?
59. A team is building a medical diagnosis assistant using a foundation model. To comply with regulations, they need to ensure the model does not make up facts. What is the best approach?
60. After fine-tuning a foundation model on company emails, the model outputs confidential information. What is the most likely cause?
61. A developer wants to quickly experiment with different foundation models available in Google Cloud. Which tool should they use?
62. A company is deploying a chatbot that uses a foundation model. They want to minimize latency for user queries. Which action is most effective?
63. A team is evaluating generative AI models on Vertex AI. They need to compare models based on specific criteria. Which TWO criteria are most important for selecting a model for a text summarization task?
64. A company is fine-tuning a Gemma model using Vertex AI. They observe that the model overfits. Which TWO actions should they take to mitigate overfitting?
65. Which THREE components are core to a typical Retrieval Augmented Generation (RAG) system?
66. Refer to the exhibit. A developer creates a model resource with this YAML config but gets an error that the model is not deployable. What is missing?
67. Refer to the exhibit. A developer sees this error when trying to deploy a model from Vertex AI Model Registry. What is the most likely cause?
68. Refer to the exhibit. A developer runs this command but forgets to specify the model name. What will happen?
69. A company is using Vertex AI to deploy a text generation model for a chatbot. They want to reduce the response latency. Which configuration change is most effective?
70. A data scientist needs to fine-tune a foundation model for a sentiment analysis task without managing infrastructure. Which Google Cloud service should they use?
71. An organization wants to ensure their generative AI application does not produce toxic or harmful content. Which Vertex AI feature should they implement?
72. A team uses PaLM 2 API to generate product descriptions, but the output sometimes contains factual inaccuracies. What is the best approach to improve accuracy?
73. A developer wants to build a RAG application using Vertex AI. Which vector database is natively integrated with Vertex AI for storing embeddings?
74. During model evaluation, a team observes good performance on training data but poor performance on validation data. Which regularization technique is most appropriate to address this?
75. A financial institution deploys a chatbot using Gemini Pro in Vertex AI. Compliance requires logging all user inputs and model outputs for audit. Which approach meets this requirement?
76. A research team is training a large language model from scratch using TPUs on Google Cloud. Which storage solution provides the highest throughput for training data?
77. An MLOps engineer wants to implement continuous evaluation of a generative model in production. Which Vertex AI component should they use?
78. A company is designing a prompt engineering strategy for a customer service chatbot using Gemini. Which two practices are recommended for improving response quality? (Choose TWO)
79. A developer is using Vertex AI Studio to test a text generation model. Which two actions can be performed in Vertex AI Studio? (Choose TWO)
80. A company is migrating an on-premises NLP pipeline to Vertex AI. Which three capabilities of Vertex AI align with common MLOps best practices for generative AI? (Choose THREE)
81. Refer to the exhibit. A developer executed the command to list endpoints. They notice that two models are deployed to the same endpoint. What is the most likely reason for this configuration?
82. Refer to the exhibit. An administrator creates this IAM policy for a Vertex AI project. What is the effect of this policy?
83. Refer to the exhibit. A machine learning engineer is configuring a model using this YAML. What is the purpose of the 'tuningPipeline' field?
84. A company wants to use a pre-trained language model for customer support summarization. They need to ensure responses are concise and accurate. Which prompt engineering technique is most effective?
85. A developer is using Vertex AI Generative AI Studio to fine-tune a PaLM 2 model for code generation. After training, they notice the model generates plausible but incorrect code. What is the most likely cause?
86. A company wants to build a chatbot that answers questions using their internal knowledge base. Which approach is most suitable?
87. A company is deploying a generative AI model for medical advice. What is the most important consideration?
88. A team uses Vertex AI to host a large language model. They want to reduce latency for real-time applications. What is the best strategy?
89. A company has fine-tuned a foundation model on proprietary data. During evaluation, they find the model performs well on seen examples but poorly on unseen but similar tasks. What is the problem?
90. Which Google Cloud service provides a managed environment for prompt engineering and model evaluation?
91. A company wants to generate images from text descriptions. Which model in Vertex AI Model Garden should they use?
92. While fine-tuning a model on Vertex AI, the job fails with error 'ResourceExhausted: Out of memory'. What is the most likely cause?
93. Which TWO of the following are best practices for prompt engineering?
94. Which THREE of the following are potential risks when deploying generative AI?
95. Which TWO are benefits of using pre-trained foundation models instead of training from scratch?
96. Refer to the exhibit. A team has deployed a model to an endpoint with the configuration shown. They notice that during peak traffic, the endpoint frequently returns 429 (Too Many Requests) errors. Which action should they take to resolve this issue?
97. Refer to the exhibit. A company has this IAM policy on a Vertex AI project. Alice complains she cannot create a new model. What is the most likely reason?
98. Refer to the exhibit. A data scientist is fine-tuning a model. The training loss and accuracy are improving each epoch. However, after training, the model performs poorly on a held-out validation set. What is the most likely issue?
99. A startup is building a customer service chatbot that generates responses in real-time. They want the model to have up-to-date information on the latest product catalog but cannot afford frequent fine-tuning. Which technique should they use to inject current data into the model without retraining?
100. A medical imaging team wants to generate synthetic X-ray images to augment a training dataset for a rare disease. Which type of generative model is most suitable for generating high-fidelity, realistic medical images?
101. An enterprise deploys a large language model (LLM) for internal document summarization. Users complain that summaries sometimes include statements not present in the original document. Which mitigation strategy should the team prioritize to address this hallucination issue?
102. A company is developing a code generation assistant and wants to ensure the model respects access control policies, e.g., it should not generate code that uses internal APIs that the user is not authorized to access. Which technique is most effective for embedding such policy constraints into the model's behavior?
103. A team is tuning a large language model for a question-answering task. They notice the model gives high confidence scores to answers that are factually incorrect. Which evaluation metric should they primarily use to detect this overconfidence problem?
104. A generative AI model is trained on a dataset containing biased text. The team wants to debias the model without significantly sacrificing performance on the original task. Which approach is most appropriate?
105. A multimodal generative AI system processes both image and text inputs to produce captions. During inference, the image encoder sometimes produces noisy or missing features. Which architectural design decision best handles such input degradation without retraining?
106. An organization wants to use a generative model to automatically generate legal contracts. The model must produce clauses that are not only grammatically correct but also legally enforceable and consistent with current jurisdiction laws. Which combination of techniques best ensures legal compliance?
107. Which TWO of the following are key differences between generative AI and discriminative AI? (Choose two.)
108. Which THREE of the following are common techniques to reduce harmful biases in generative AI models? (Choose three.)
109. Which THREE of the following are key considerations when deploying a generative AI model in a production environment with strict latency requirements? (Choose three.)
110. You are a generative AI lead at a healthcare startup developing a system to summarize patient medical records for quick review by doctors. The system uses a fine-tuned LLM. After deployment, doctors report that the summaries often miss critical details like medication dosages and allergy information. The current pipeline preprocesses patient records by extracting text from EHR, feeding it to the LLM, and outputting a summary. The team has limited time and budget. They cannot retrain the model because it is hosted as a managed API. Which action should you take to most effectively improve the summarization quality without changing the model?
111. You are the lead AI engineer at a financial services firm. You have fine-tuned a large language model on historical trade reports to generate daily market summaries. The model is deployed on Google Cloud's Vertex AI using a custom container. A few weeks after deployment, the operations team notices that inference latency has increased by 300%, causing timeouts. You investigate and find that the model's memory consumption has grown unexpectedly, and the GPUs are idling due to high data transfer wait times. The model architecture and code have not changed. Which action is most likely to resolve the latency issue?
112. You are a generative AI architect at a social media company. You are tasked with building a content moderation system that uses a generative model to flag toxic comments. The system must have very low false positive rates (i.e., not flag harmless comments) to avoid user backlash, but it must catch nearly all toxic comments. You have a large dataset of labeled toxic and non-toxic comments. You plan to use a pre-trained LLM and fine-tune it for classification. During experimentation, you notice that the model's recall for toxic comments is high (95%) but its precision is low (60%), leading to many false positives. You need to improve precision without substantially reducing recall. Which approach should you try first?
113. You are the AI lead at an e-commerce company that uses a generative model to write product descriptions from images and key attributes. The model is a multimodal transformer that encodes both image and text (attributes) and decodes a description. Recently, your team deployed a new version of the image encoder that uses a more powerful backbone (ViT-L instead of ViT-B). After deployment, the generated descriptions became longer but often include irrelevant visual details (e.g., background objects) and occasionally misrepresent the product's main features. The model was fine-tuned on the same dataset as before. The descriptions from the old model were concise and focused. What is the most likely cause of the degradation and the best fix?
114. A company is building a conversational AI using the Gemini API on Vertex AI. They want to reduce the chance of generating toxic content while still allowing creative and engaging responses for their gaming community. Which TWO safety settings should they adjust in the safety_settings parameter?
115. A startup is developing a customer support chatbot using Vertex AI PaLM 2 API. They notice that the model sometimes generates plausible-sounding but factually incorrect information about company policies. The chatbot currently uses no external data. To reduce these hallucinations without retraining the model, the team needs a solution that can be implemented quickly and maintains low latency. They have access to the company's internal policy database stored in Cloud SQL. Which approach should they take?
116. A marketing team wants to generate product descriptions using a text generation model on Vertex AI. They need consistent output style across all descriptions, including tone and length. They have a small set of 10 high-quality example descriptions that capture the desired style. The team has limited ML expertise and wants a quick solution that does not require model retraining. Which approach should they use?
117. A healthcare company is building a clinical decision support system using Gemini 1.5 Pro on Vertex AI. They need responses that are highly accurate and comply with medical regulations, including traceability to source documents. They have a large corpus of curated medical guidelines stored in PDFs in Cloud Storage. Their team has experience with both fine-tuning and prompt engineering. Which approach best ensures regulatory compliance and accuracy?
118. A company is deploying a Gemini 1.0 Ultra model for a code generation assistant. They have set up Vertex AI Model Evaluation with a custom evaluation dataset to measure pass@1 accuracy. The initial evaluation shows 65% pass@1. They want to improve to 80% without collecting more training data. They have already attempted basic prompt engineering (e.g., 'write correct code') with limited improvement. Which approach is most likely to achieve the desired improvement?
119. A large enterprise runs a production application that uses the Gemini API on Vertex AI for real-time content moderation. They are experiencing occasional 429 (Too Many Requests) errors during peak hours. Their current quota is 1000 requests per minute (RPM) and they are hitting around 950 RPM on average, with spikes up to 1050. They have already implemented exponential backoff and retry logic. They need to reduce the error rate without reducing the quality of moderation. Which additional measure should they take?
120. A financial services company is building a customer service agent using Vertex AI Agent Builder. They want the agent to only answer questions based on their approved policy documents, which are stored in Cloud Storage. They also need to ensure that the agent never reveals internal employee names or account numbers. They have set up grounding with the documents but find that the agent sometimes ignores the grounding and generates responses using the model's internal knowledge. What should they do to strictly constrain the agent to only use the provided documents?
121. A retail company uses the Vertex AI Gemini API to generate product descriptions. Recently, the model started producing factually incorrect statements about product specifications, such as wrong dimensions and materials. Which strategy should be implemented to improve factual accuracy?
122. A company is deploying a large language model (LLM) for customer support using Vertex AI. Which TWO best practices should they follow to ensure high-quality and cost-effective responses?
123. A developer runs the command above to test a text classification model deployed on a Vertex AI endpoint. The model returns an error. What is the most likely cause?
124. A healthcare company is using Vertex AI to build a generative AI assistant that helps doctors draft clinical notes. The assistant uses a fine-tuned PaLM 2 model deployed on a private endpoint. Recently, doctors have reported that the assistant takes over 30 seconds to respond, causing workflow delays. Additionally, the monthly Vertex AI costs have increased by 40% without a proportional increase in usage. The model responses are generally accurate but sometimes include irrelevant details. The company wants to improve response time and cost while maintaining acceptable quality. A review of logs shows that most requests are for similar note types (e.g., progress notes, discharge summaries) and that the same prompt is used repeatedly with minor variations. What should the company do first?
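For readers unfamiliar with the retrieval step mentioned in question 23 (retrieving the top 5 chunks by cosine similarity and passing them to the model as context), the idea can be sketched roughly as follows. This is an illustrative sketch only: the function names, the toy three-dimensional vectors, and the chunk texts are hypothetical, and nothing here represents Vertex AI Search or any Google Cloud API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_chunks(query_vec, store, k=5):
    """Return the k (score, text) pairs most similar to query_vec, best first."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in store]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical toy "vector store": (chunk text, embedding) pairs.
# Real embeddings come from an embedding model and have many more dimensions.
TOY_STORE = [
    ("order 1001 status: pending", [0.9, 0.1, 0.0]),
    ("order 1002 status: shipped", [0.8, 0.3, 0.1]),
    ("red running shoes, size 9", [0.0, 0.2, 0.9]),
]

if __name__ == "__main__":
    # A made-up query embedding that points toward the "order" chunks.
    for score, text in top_k_chunks([1.0, 0.0, 0.0], TOY_STORE, k=2):
        print(f"{score:.3f}  {text}")
```

In this toy data both order chunks rank above the product chunk, which shows that similarity ranking by itself does not distinguish one order from another.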
The Fundamentals of Generative AI domain covers the key concepts tested in this area of the Generative AI Leader exam blueprint published by Google Cloud. Courseiva provides free domain-focused practice, mock exams, missed-question review, and readiness tracking across all Generative AI Leader domains — no account required.
The Courseiva Generative AI Leader question bank contains 124 questions in the Fundamentals of Generative AI domain. Click any question to see the full explanation and answer breakdown.
Start with a 10-question focused session to identify your baseline accuracy in this domain. Read every explanation — even for questions you answer correctly — to understand the reasoning. Once you score consistently above 80%, move to a 20–30 question session to confirm depth before moving to the next domain.
The session launcher on this page draws questions exclusively from the Fundamentals of Generative AI domain. Choose 10, 20, 30, or 50 questions for a focused session, or click individual questions to review them one by one.