Knowledge + Practice

CCNA Gcl Applying Genai Business Questions

75 of 150 questions · Page 2/2 · Gcl Applying Genai Business topic · Answers revealed

Practice these questions Exam hub All questions

76

Multi-Selectmedium

A company wants to deploy a GenAI code review assistant that integrates into their existing Git workflow. They want to use a managed Google Cloud service to minimize operational overhead. Which TWO services should they consider? (Choose two.)

Select 2 answers

A.Cloud Build

B.Compute Engine

C.Cloud Spanner

D.Vertex AI Agent Builder

E.Cloud Run

AnswersA, D

Cloud Build can be configured to trigger code review actions on pull requests, integrating with Git.

Why this answer

Cloud Build is correct because it is a managed CI/CD platform that integrates natively with Git repositories, enabling automated code review workflows triggered by pull requests or commits. It can invoke a GenAI code review service as a build step, minimizing operational overhead by eliminating the need to manage infrastructure.

Exam trap

Cisco often tests the distinction between managed CI/CD services (Cloud Build) and general-purpose compute (Cloud Run, Compute Engine), where candidates mistakenly choose Cloud Run because it is serverless, but it lacks native Git integration for automated code review triggers.

Practice this question →

77

MCQhard

A data scientist is using Vertex AI RAG Engine to build a question-answering system over a large corpus of technical manuals. Users report that answers are often verbose and include irrelevant details. Which configuration change is MOST likely to improve answer conciseness?

A.Increase the chunk size of documents in the index

B.Use a different embedding model for retrieval

C.Decrease the number of retrieved documents (k) in the retrieval step

D.Increase the max output token limit

AnswerC

Reducing k limits the context, forcing the model to focus on the most relevant snippets, leading to more concise answers.

Why this answer

RAG Engine allows tuning the number of retrieved documents (k). Reducing k limits the context, encouraging concise answers. Changing chunk size might affect relevance but not directly conciseness.

Embedding model type affects retrieval quality. Increasing max output tokens would worsen verbosity.

Practice this question →

78

MCQmedium

A company is rolling out a generative AI tool to employees. To ensure successful adoption, they plan to provide training and identify early adopters. Which change management practice is MOST critical early in the rollout?

A.Set up a dedicated help desk for AI tool issues

B.Conduct mandatory training for all employees before launch

C.Identify and train AI champions who can support their teams

D.Monitor usage metrics for the first month

AnswerC

Champions provide peer support and drive adoption more effectively.

Why this answer

Identifying AI champions who can advocate and help peers is crucial for organic adoption. Training all employees at once is less effective. Monitoring usage is important but later.

Setting up a help desk is reactive.

Practice this question →

79

MCQmedium

A company runs a pilot for a GenAI-powered internal knowledge base assistant. They want to measure adoption. Which metric is BEST for this purpose?

A.Number of follow-up questions asked per session

B.User satisfaction score from surveys

C.Average response time of the assistant

D.Number of unique users per week divided by total employees

AnswerD

This calculates adoption rate.

Why this answer

Option D is the best metric for measuring adoption because it directly captures the breadth of usage across the organization. Adoption is defined as the proportion of the target user base that actively uses the system, and dividing unique weekly users by total employees provides a clear percentage of uptake, which is the standard measure for adoption in enterprise GenAI deployments.

Exam trap

The trap here is that candidates confuse adoption with engagement or satisfaction, picking metrics like follow-up questions or survey scores, but Cisco specifically tests that adoption is about the proportion of the target population using the system, not how deeply or happily they use it.

How to eliminate wrong answers

Option A is wrong because the number of follow-up questions per session measures engagement depth or conversational complexity, not adoption; a single power user could inflate this metric while overall adoption remains low. Option B is wrong because user satisfaction scores measure quality or user experience, not adoption; a system can have high satisfaction but low usage if few employees try it. Option C is wrong because average response time measures performance latency, not adoption; a fast assistant is irrelevant if no one uses it.

Practice this question →

80

MCQhard

A legal team wants to use GenAI to review contracts and highlight risky clauses. They need the AI to consistently follow a specific classification taxonomy. The team has a small set of labeled examples (500 contracts). Which approach yields the BEST accuracy for this use case?

A.Prompt engineer a large foundation model with few-shot examples in Vertex AI Studio

B.Use the RAG Engine to retrieve similar clauses and ask the model to classify

C.Fine-tune a base model using the labeled examples in Vertex AI

D.Use a larger foundation model without fine-tuning and rely on its pre-trained knowledge

AnswerC

Fine-tuning adapts the model to the specific classification task and taxonomy, achieving higher accuracy with limited examples.

Why this answer

Fine-tuning a base model on the labeled examples teaches the model the specific taxonomy with high accuracy, even with a small dataset. Prompt engineering alone may be inconsistent. Larger model without fine-tuning may not adhere to taxonomy.

RAG is for retrieval, not classification.

Practice this question →

81

MCQhard

A legal firm wants to automate contract analysis. They need to extract key clauses (e.g., termination, indemnification) from scanned PDFs. The team expects high accuracy and must maintain data privacy. Which combination of services is most suitable?

A.Use AutoML Tables to train a classification model on text features

B.Use Document AI for OCR and Vertex AI with a custom fine-tuned model for clause extraction

C.Use Gemini API directly with a prompt to analyze PDFs

D.Use AppSheet to create a form for manual entry and then use BigQuery ML

AnswerB

Document AI extracts text from scans; a fine-tuned model on Vertex AI provides high accuracy and data stays in Google Cloud.

Why this answer

Document AI performs OCR and extracts text from scanned PDFs; Vertex AI with a custom fine-tuned model provides high accuracy for clause extraction while keeping data within the customer's project.

Practice this question →

82

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

B.Fine-tune a base LLM on the policy documents monthly

C.Use a larger foundation model with a longer context window and paste all documents into each prompt

D.Train a custom model from scratch on the policy documents each month

AnswerA

RAG retrieves the latest document chunks at query time, eliminating the need to retrain.

Why this answer

Retrieval-Augmented Generation (RAG) is the most appropriate approach because it allows the chatbot to retrieve relevant policy document chunks from a vector store at inference time, without requiring model retraining when documents are updated monthly. This decouples the knowledge base from the model weights, enabling dynamic updates by simply re-indexing the new documents into the vector database.

Exam trap

Cisco often tests the misconception that fine-tuning is the only way to incorporate domain knowledge, but here the key trap is ignoring the cost and frequency of updates—candidates may pick fine-tuning (B) because it seems 'customized,' without realizing RAG avoids retraining entirely.

How to eliminate wrong answers

Option B is wrong because fine-tuning a base LLM monthly on updated policy documents is costly, time-consuming, and risks catastrophic forgetting of previous policies, making it impractical for frequent updates. Option C is wrong because pasting all policy documents into each prompt exceeds typical context window limits (e.g., 4K–128K tokens), leading to truncation, high latency, and increased cost, while also violating the constraint of not retraining. Option D is wrong because training a custom model from scratch each month is prohibitively expensive, requires massive computational resources and data, and is unnecessary when a retrieval-based approach can leverage existing pre-trained LLMs.

Practice this question →

83

Multi-Selectmedium

An enterprise is evaluating whether to build a custom fine-tuned model or use a pre-built API for code generation. Which three factors should they consider in the build vs. buy decision? (Choose THREE)

Select 3 answers

A.Data privacy and security requirements

B.Number of developer seats in the organization

C.Compatibility with on-premises legacy systems

D.Level of customization needed for the organization's coding standards

E.Availability of pre-built models for the specific programming language

AnswersA, D, E

If code contains proprietary logic, a custom model deployed within VPC may be necessary.

Why this answer

The three key factors are data privacy (build may be needed for IP-sensitive code), customization requirements (build offers more control), and cost/ROI (build requires upfront investment but may reduce per-query cost).

Practice this question →

84

MCQmedium

During a proof-of-concept for a GenAI document summarization tool, the team wants to evaluate whether the summaries are accurate and retain key information before scaling. Which evaluation approach is most appropriate for this stage?

A.Measure latency and cost as the primary evaluation metrics

B.Deploy to all users and collect feedback via a survey

C.Run an A/B test with a small user group and have domain experts manually review a sample of summaries for accuracy

D.Use ROUGE scores exclusively to compare summaries against human-written ones

AnswerC

A/B testing with manual expert review provides reliable quality assessment before broader rollout.

Why this answer

A/B testing with manual review by domain experts provides qualitative and quantitative feedback on accuracy and completeness, which is crucial for a pilot. Automated metrics alone may not capture business relevance.

Practice this question →

85

Multi-Selectmedium

A retail company wants to generate personalized marketing content (emails, social posts) at scale using generative AI. They need consistent brand voice and the ability to review outputs before publishing. Which two Google Cloud capabilities should they use? (Choose TWO)

Select 2 answers

A.Google Workspace Duet AI in Docs for drafting, then manual review

B.AutoML Tables for predicting customer segments

C.Vertex AI Agent Builder with grounding and human-in-the-loop

D.Pre-trained Gemini model via Vertex AI API with no customization

E.Cloud Vision API for image analysis

AnswersA, C

Duet AI can assist in drafting content quickly, and manual review ensures brand alignment.

Why this answer

Option A is correct because Google Workspace Duet AI in Docs allows marketers to draft personalized content using generative AI while maintaining control over brand voice through iterative editing. The manual review step ensures outputs meet quality and compliance standards before publishing, addressing the need for human oversight in content generation.

Exam trap

The trap here is that candidates may confuse AutoML Tables (a predictive modeling tool) with generative AI capabilities, or overlook that pre-trained models without customization fail to meet brand voice requirements, while Cloud Vision API is irrelevant to text generation tasks.

Practice this question →

86

MCQhard

A data science team wants to build a custom model for generating product descriptions that adhere to specific brand guidelines. They have 5,000 high-quality examples. Which approach balances cost and accuracy?

A.Fine-tune a foundation model (e.g., PaLM 2) using Vertex AI Model Garden

B.Use a pre-built API with prompt engineering and few-shot examples

C.Use Vertex AI Agent Builder with a custom prompt

D.Train a model from scratch using TensorFlow on Vertex AI

AnswerA

Fine-tuning adapts the model to the specific style with a reasonable cost.

Why this answer

Fine-tuning a foundation model on the examples yields high accuracy with moderate cost. Training from scratch is overkill; prompt engineering may not capture all nuances.

Practice this question →

87

MCQeasy

A product manager wants to quickly build a conversational agent that can answer FAQs from the company's help center articles. They have limited coding experience. Which Google Cloud service is BEST suited for this task?

A.Vertex AI Pipelines

B.Vertex AI Agent Builder

C.Vertex AI Studio

D.Model Garden

AnswerB

Agent Builder allows building chatbots with a visual interface and easy integration with data stores.

Why this answer

Vertex AI Agent Builder is the best choice because it provides a no-code/low-code interface specifically designed for building conversational agents and search experiences. It allows the product manager to connect help center articles as a data source and automatically generate a FAQ-answering agent without writing code, making it ideal for someone with limited coding experience.

Exam trap

Cisco often tests the distinction between tools for building agents (Agent Builder) versus tools for model experimentation (Studio) or model selection (Model Garden), and candidates mistakenly choose Studio or Model Garden because they think any generative AI tool can build a chatbot, ignoring the specific no-code agent-building capability required.

How to eliminate wrong answers

Option A is wrong because Vertex AI Pipelines is a tool for orchestrating and automating ML workflows (e.g., training, deployment) and requires pipeline definition via code or SDK, not for quickly building a conversational agent. Option C is wrong because Vertex AI Studio is a platform for experimenting with and tuning generative AI models (like prompts and foundation models), but it does not provide a built-in agent builder for connecting to help center articles and generating FAQ responses without custom development. Option D is wrong because Model Garden is a repository of pre-trained foundation models and does not include the agent-building or data-connecting capabilities needed to create a conversational FAQ agent.

Practice this question →

88

MCQmedium

A company is piloting a GenAI feature for email drafting in Gmail. They want to measure productivity improvement. Which metric is MOST directly tied to the business goal of reducing time spent on email composition?

A.Adoption rate of the GenAI feature among the pilot group

B.Increase in employee satisfaction survey scores

C.Reduction in average time-to-send per email

D.Reduction in total tokens consumed per email

AnswerC

This directly measures the productivity improvement (time saved) when drafting emails.

Why this answer

Option C is correct because the primary business goal is to reduce the time employees spend composing emails. Measuring the average time-to-send per email directly quantifies this efficiency gain, as it captures the end-to-end duration from initiation to dispatch, which the GenAI feature aims to shorten by generating draft content.

Exam trap

Cisco often tests the distinction between proxy metrics (like token consumption or adoption) and direct business outcome metrics, so the trap here is that candidates confuse technical efficiency (fewer tokens) with user productivity (time saved).

How to eliminate wrong answers

Option A is wrong because adoption rate measures how many users try the feature, not the actual productivity impact; high adoption could occur even if the feature does not save time. Option B is wrong because employee satisfaction scores are a lagging, subjective indicator that can be influenced by factors unrelated to email composition speed, such as overall morale or feature usability. Option D is wrong because reduction in total tokens consumed per email measures model efficiency or verbosity, not the business-relevant time savings; fewer tokens do not guarantee faster composition due to latency or user review time.

Practice this question →

89

MCQeasy

A marketing team wants to generate consistent brand-aligned social media posts using Vertex AI Studio. Which prompt engineering technique should they use to ensure the output tone matches their brand voice?

A.Set the temperature to 0 and use a long system instruction

B.Provide a few-shot prompt with examples of previous brand-aligned posts

C.Use a zero-shot prompt describing the brand voice

D.Use chain-of-thought prompting to explain the reasoning behind each post

AnswerB

Few-shot examples guide the model to replicate the desired tone and style.

Why this answer

Few-shot examples provide the model with clear examples of desired tone, ensuring consistency. Zero-shot or chain-of-thought are less effective for tone adherence.

Practice this question →

90

MCQhard

A company is building a GenAI application using the Gemini API. They want to minimize latency and cost for a high-volume use case. Which strategy is MOST effective?

A.Use a fine-tuned smaller model for the specific task

B.Increase temperature to generate more diverse responses

C.Use the largest available model for all requests

D.Implement caching for identical or similar user queries

AnswerD

Caching avoids re-computation for repeated inputs, reducing latency and cost.

Why this answer

Caching frequent requests reduces redundant processing, reducing both latency and token costs. Batching can also help, but caching directly addresses repeated queries.

Practice this question →

91

MCQhard

A company is using a large language model to generate code reviews. They want to reduce token costs while maintaining quality. Which approach is MOST effective?

A.Reduce the max output tokens to 100

B.Use a zero-shot prompt instead of few-shot to reduce prompt tokens

C.Cache frequently used code snippets in a prompt template

D.Use a smaller model fine-tuned for code (e.g., Codey)

AnswerD

Codey is optimized for code tasks and is more cost-effective than a general large model.

Why this answer

Using a smaller, specialized model for code tasks reduces cost per token and often performs well on code tasks. Caching is not applicable for code generation. Reducing max tokens might truncate output.

Fewer examples may harm quality.

Practice this question →

92

Multi-Selecthard

A company is deploying a code generation assistant for internal developers. They want to ensure the generated code is secure and follows best practices. Which two Vertex AI features should they use? (Choose TWO)

Select 2 answers

A.Vertex AI Agent Builder for conversation

B.Grounding with Google Search for real-time security best practices

C.Vertex AI Model Evaluation to assess code quality metrics

D.Cloud DLP for data loss prevention

E.AutoML Image for code snippet images

AnswersB, C

Grounding retrieves current security guidelines to inform the model.

Why this answer

Grounding with Google Search can retrieve up-to-date security best practices, and Model Evaluation can assess code quality. The other options are not directly relevant to security or best practices.

Practice this question →

93

MCQhard

A company deploys a GenAI-powered code review assistant. During evaluation, they find that the assistant often suggests security vulnerabilities as improvements. What is the MOST likely cause?

A.The model was trained on a dataset with many insecure code examples

B.The model's temperature is set too low

C.The model is too small for code generation tasks

D.The prompt does not include a security constraint

AnswerA

Training data bias toward insecure code can cause the model to suggest vulnerabilities.

Why this answer

The most likely cause is that the model was trained on a dataset containing many insecure code examples. A GenAI code review assistant learns patterns from its training data; if that data includes prevalent security vulnerabilities (e.g., SQL injection, buffer overflows), the model will internalize those patterns as 'normal' or even 'desirable' improvements. This leads to the assistant suggesting insecure code changes because it is statistically replicating the flawed logic it was exposed to during training.

Exam trap

Cisco often tests the misconception that prompt engineering alone (e.g., adding a security constraint) can override fundamental training data biases, when in fact the model's learned weights from the training corpus are the dominant factor in output quality.

How to eliminate wrong answers

Option B is wrong because setting the temperature too low (e.g., near 0) makes the model more deterministic and conservative, reducing randomness and the likelihood of suggesting unusual or insecure patterns; it would not cause the model to actively suggest vulnerabilities. Option C is wrong because model size (number of parameters) affects capability and fluency, not the tendency to generate insecure code; a small model can still produce secure suggestions if trained on secure data, while a large model trained on insecure data will replicate those flaws. Option D is wrong because while a missing security constraint in the prompt might fail to guide the model away from vulnerabilities, the root cause is the training data; even with a security constraint, a model trained on insecure examples may still suggest vulnerabilities due to its ingrained patterns, and the question asks for the 'most likely' cause, which is the data quality issue.

Practice this question →

94

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Fine-tune a base LLM on the policy documents monthly

C.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

D.Use a larger foundation model with a longer context window and paste all documents into each prompt

AnswerC

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Practice this question →

95

MCQmedium

A company wants to embed a generative AI writing assistant into their Google Docs workflow. The assistant should help users draft emails and reports based on prompts. Which Google Workspace feature should they leverage?

A.Google Workspace Add-ons with Vertex AI

B.Vertex AI API integrated via Apps Script

C.Gmail Smart Compose

D.Duet AI in Google Docs (Gemini for Google Workspace)

AnswerD

Duet AI provides native 'Help me write' capabilities in Docs, ideal for drafting.

Why this answer

Duet AI in Docs (now Gemini for Google Workspace) provides 'Help me write' functionality for drafting content. Apps Script can be used for custom add-ons but requires development. Vertex AI API is for external integration.

Smart Compose is for Gmail only.

Practice this question →

96

MCQeasy

A project manager wants to automatically generate weekly status reports from meeting notes and project data. The team uses Google Workspace. Which built-in capability is the QUICKEST to implement?

A.Use Gemini for Workspace (Duet AI) in Google Docs and Google Meet to generate summaries and reports

B.Select a model from Model Garden and deploy it as a private endpoint for report generation

C.Use Vertex AI Studio to design a custom prompt and call the Gemini API from a custom app

D.Write a Google Apps Script to call the Gemini API and format the report

AnswerA

Gemini for Workspace provides built-in AI capabilities that can summarize meeting notes and help write reports directly in Docs, requiring no custom code.

Why this answer

Gemini for Workspace (Duet AI) can generate summaries directly in Google Docs and Meet, leveraging existing data with no custom development. Custom API integration or Model Garden would require more effort. Apps Script is for custom automation, not built-in.

Practice this question →

97

Multi-Selectmedium

A company is using Vertex AI to generate report summaries. They want to measure ROI. Which three metrics should they track? (Choose THREE)

Select 3 answers

A.Error rate (e.g., factual errors in summaries)

B.Number of employees trained on the system

C.Number of active users per month

D.Accuracy score of generated summaries compared to human-written ones

E.Time saved per report (minutes)

AnswersA, D, E

Reducing errors improves trust and saves correction time.

Why this answer

Option A is correct because error rate directly measures the quality and reliability of generated summaries, which is a key factor in determining ROI. If summaries contain factual errors, they require human correction, reducing the time savings and potentially introducing business risks. Tracking error rate helps quantify the cost of inaccuracies against the benefits of automation.

Exam trap

The trap here is that candidates confuse adoption metrics (active users, training count) with ROI metrics, but ROI requires direct financial or efficiency quantification, not just usage statistics.

Practice this question →

98

Multi-Selecthard

A company is deploying a document summarization solution using Vertex AI. They want to minimize cost while maintaining quality. Which three strategies should they implement? (Choose THREE)

Select 3 answers

A.Use prompt compression to cut token usage

B.Choose a smaller, specialized model (e.g., Gemini 1.5 Flash)

C.Implement response caching for repeated queries

D.Use the largest available model for best quality

E.Batch multiple document summarization requests together

AnswersB, C, E

Smaller models are cheaper and often adequate for summarization.

Why this answer

Using caching reduces repeated processing, choosing a smaller model lowers token cost, and batching requests minimizes overhead. Prompt compression is not a standard Vertex AI feature, and using the largest model increases cost.

Practice this question →

99

MCQmedium

A financial services firm wants to use generative AI to assist employees in drafting emails. They are evaluating Duet AI in Gmail (now Gemini in Workspace). Which capability directly supports this use case?

A.Speaker notes generation in Slides

B.Smart Compose in Gmail

C.Meeting summaries in Meet

D.Help me write in Docs

AnswerB

Smart Compose uses generative AI to suggest complete sentences as the user types, speeding up email drafting.

Why this answer

Smart Compose in Gmail is a generative AI feature that provides real-time, context-aware suggestions to help users draft emails faster. It directly supports the use case of assisting employees in drafting emails by predicting and completing sentences as they type, reducing effort and improving efficiency.

Exam trap

Cisco often tests the distinction between generative AI features across different Workspace apps, and the trap here is confusing 'Help me write in Docs' (a document-focused tool) with the email-specific Smart Compose in Gmail, leading candidates to pick a feature that is not directly integrated into the email drafting workflow.

How to eliminate wrong answers

Option A is wrong because Speaker notes generation in Slides is designed to create presentation notes, not assist with email drafting. Option C is wrong because Meeting summaries in Meet provide post-meeting recaps, not real-time email composition assistance. Option D is wrong because Help me write in Docs is a generative AI feature for document creation in Google Docs, not for drafting emails within Gmail.

Practice this question →

100

Multi-Selectmedium

A company wants to adopt GenAI for code generation and review. To ensure code quality and security, they plan to implement a change management program. Which THREE actions are most effective?

Select 3 answers

A.Create a code review checklist specifically for AI-generated code

B.Conduct training sessions on prompt engineering for code generation

C.Appoint a single AI champion to manage all code generation

D.Phase out senior developers to rely entirely on AI-generated code

E.Pilot the tool with a small group of developers before company-wide rollout

AnswersA, B, E

A checklist helps reviewers catch common AI errors like security flaws.

Why this answer

Training developers, piloting with a small team, and creating review checklists address skills, risk mitigation, and quality control. Phasing out senior developers would be counterproductive. A single champion is insufficient.

Practice this question →

101

MCQeasy

Which Vertex AI feature allows a business user to explore and test over 300 foundation models from Google and partners, including Gemma, Llama, and Claude, without writing code?

A.Vertex AI Pipelines

B.Vertex AI Studio

C.Vertex AI Agent Builder

D.Model Garden

AnswerD

Model Garden offers over 300 foundation models for exploration, testing, and deployment.

Why this answer

Model Garden is the central hub in Vertex AI that provides access to a wide variety of foundation models for exploration and testing.

Practice this question →

102

MCQmedium

A company wants to build an internal knowledge base that allows employees to ask questions about company policies in natural language. The knowledge base is stored in a Google Cloud SQL database. Which architecture should they use?

A.Use Gemini API with a prompt that includes all policies

B.Use AutoML Natural Language to classify questions

C.Export Cloud SQL to BigQuery and use BigQuery ML

D.Use Vertex AI Agent Builder with Grounding to connect to Cloud SQL

AnswerD

Agent Builder can ground answers in database content via search or RAG.

Why this answer

Vertex AI Agent Builder with Grounding allows connecting to the database and answering questions based on its content, providing a natural language interface.

Practice this question →

103

MCQmedium

A healthcare organization wants to use generative AI to draft email responses to patient inquiries. They need to ensure that the model never generates medical advice and always includes a disclaimer. Where should they enforce these constraints?

A.Select a model from Model Garden that is pre-trained on medical data

B.Fine-tune the model with a dataset that only includes disclaimers

C.Use Grounding with Google Search to restrict knowledge to approved sources

D.Configure safety settings and system instructions in the Vertex AI API call

AnswerD

Safety settings and system instructions provide runtime constraints to prevent unwanted outputs and enforce disclaimers.

Why this answer

Safety settings and system instructions are applied at the API call level to constrain model behavior. Fine-tuning would embed the rules but is heavy. Grounding doesn't enforce constraints.

Model Garden is for model selection.

Practice this question →

104

MCQhard

A financial services firm needs to analyze thousands of legal contracts to extract key clauses (e.g., termination, indemnification) with high accuracy. They plan to use GenAI but are concerned about data privacy because contracts contain sensitive information. They want a solution where data is not used for model training and remains in their own Google Cloud project. Which approach best meets these requirements?

A.Access a foundation model through Model Garden and use it via a Colab notebook

B.Deploy a fine-tuned model on Vertex AI and query via Vertex AI API with data residency in the same region

C.Use Gemini for Google Workspace to open each contract in Docs and ask Gemini to summarize clauses

D.Use Vertex AI Studio's prompt library with few-shot examples, ensuring prompts are saved in the project

AnswerB

Vertex AI API allows you to deploy models with data residency controls and contracts prohibit use of data for training. Fine-tuning can be done on compliant infrastructure.

Why this answer

Vertex AI API with data residency controls and no training data sharing provides the privacy and security needed. Vertex AI Studio and Model Garden are not inherently private, and Gemini for Workspace may use data for improvement unless disabled.

Practice this question →

105

MCQmedium

A legal firm wants to use GenAI for contract analysis. They need to extract key clauses and flag risky terms. Which combination of services on Vertex AI is BEST suited?

A.Use Vertex AI RAG Engine with a vector store of example clauses

B.Fine-tune a foundation model on a dataset of labeled contracts and deploy it via Vertex AI Endpoint

C.Use Vertex AI Model Garden to select a model and then deploy it to an endpoint for real-time analysis

D.Use Vertex AI Agent Builder with a tool for document analysis

AnswerB

Fine-tuning on labeled contracts improves extraction and risk flagging accuracy.

Why this answer

Vertex AI's foundation models can parse contracts, and a fine-tuned model on legal documents would improve accuracy for clause extraction and risk flagging.

Practice this question →

106

MCQeasy

A developer wants to add a GenAI feature to their existing web application. They need to integrate with the app's backend using REST APIs. Which integration pattern is MOST appropriate?

A.Use Apps Script to call the Gemini API

B.Use Vertex AI Agent Builder to create an agent and embed it via iframe

C.Integrate via Vertex AI API or Gemini API

D.Build a Google Workspace add-on

AnswerC

REST APIs are the standard way to call GenAI models from any backend.

Why this answer

API-first integration using Vertex AI API or Gemini API is the standard way to add GenAI to existing applications. Workspace add-ons are for Google Workspace apps. Apps Script is for automating Workspace.

Agent Builder is for building conversational agents, not general API integration.

Practice this question →

107

MCQeasy

Which Google Cloud feature enables you to experiment with different prompts and model parameters interactively, and also supports model tuning without writing code?

A.Vertex AI Agent Builder

B.Google Cloud Console

C.Model Garden

D.Vertex AI Studio

AnswerD

Vertex AI Studio offers a visual interface for prompt engineering, model tuning, and evaluation without requiring coding.

Why this answer

Vertex AI Studio is the correct answer because it provides an interactive, code-free environment for experimenting with prompts and model parameters, and also supports model tuning through a graphical interface. This aligns directly with the question's requirement for interactive experimentation and no-code tuning, which are core features of Vertex AI Studio within the Google Cloud AI Platform.

Exam trap

The trap here is that candidates may confuse Model Garden's model discovery and selection capabilities with the interactive experimentation and tuning features that are exclusive to Vertex AI Studio.

How to eliminate wrong answers

Option A is wrong because Vertex AI Agent Builder is designed for creating conversational agents and search experiences, not for interactive prompt experimentation or model tuning. Option B is wrong because Google Cloud Console is a general management interface for all GCP services, lacking the specialized, interactive prompt engineering and tuning capabilities of Vertex AI Studio. Option C is wrong because Model Garden is a repository for discovering and accessing pre-trained models, but it does not provide an interactive environment for prompt experimentation or direct model tuning without code.

Practice this question →

108

MCQmedium

A company is evaluating ROI for a GenAI-based code review assistant. Which metric set BEST captures both productivity and quality improvements?

A.Number of reviews completed per day and lines of code written

B.Model inference latency and cost per token

C.Developer satisfaction score and reduction in code churn (percentage of code rewritten)

D.Time saved per code review and bug detection rate (percentage of bugs caught before deployment)

AnswerD

Time saved measures productivity; bug detection rate measures quality improvement. Together they provide a balanced view of the assistant's value.

Why this answer

Time saved per review (productivity) and bug detection rate (quality) directly measure the tool's impact. Code churn and developer satisfaction are secondary. Defect escape rate is important but harder to measure directly for code review.

Practice this question →

109

MCQmedium

An enterprise is concerned about the cost of using a large LLM for a high-volume customer support chatbot. They want to reduce token consumption while maintaining response quality. Which strategy would be MOST effective?

A.Use a multimodal model to handle both text and images

B.Always use the largest available model to ensure best quality

C.Increase the max output tokens to capture more detail

D.Implement response caching for common questions and batch similar requests

AnswerD

Caching eliminates redundant model calls for frequent queries; batching reduces per-call overhead.

Why this answer

Caching frequent queries reduces costs because the response is served from cache without model inference. Batching requests also saves on per-request overhead. Choosing a smaller model may reduce quality.

Practice this question →

110

MCQmedium

A company wants to adopt GenAI for internal knowledge management. They plan to start with a small pilot team, gather feedback, and then expand. Which change management approach is MOST aligned with this strategy?

A.Pilot with a small group, identify AI champions, collect feedback, and iteratively expand

B.Conduct mandatory training for all employees before the rollout

C.Deploy the solution to the entire organization at once with a communication campaign

D.Focus solely on the technical deployment without change management activities

AnswerA

This approach minimizes risk, builds advocates, and allows for improvements based on real usage.

Why this answer

Iterative rollout with a pilot group, AI champions, and measuring adoption is a proven change management pattern. Starting with the entire organization is risky. Mandatory training may cause resistance.

Focusing only on technical deployment ignores the human side.

Practice this question →

111

Multi-Selecthard

A company is planning an iterative rollout of a GenAI code generation tool for developers. They want to ensure adoption and minimize resistance. Which THREE change management practices are most effective?

Select 3 answers

A.Start with a small pilot group and gather feedback before expanding

B.Provide mandatory training sessions for all developers before rollout

C.Identify AI champions within the team to advocate and support peers

D.Immediately replace existing code review processes with AI-generated reviews

E.Monitor only the number of prompts submitted as the success metric

AnswersA, B, C

Pilot allows learning and refinement before full rollout.

Why this answer

Identifying AI champions, providing hands-on training, and starting with a small pilot group are proven strategies for AI adoption.

Practice this question →

112

Multi-Selecthard

A company deploys a GenAI chatbot for customer support using Vertex AI Agent Builder. The chatbot sometimes gives incorrect answers. The team wants to improve accuracy without retraining the underlying model. Which THREE actions should they take? (Choose 3)

Select 3 answers

A.Enable Grounding with company knowledge base documents

B.Use Vertex AI Prompt Tuning to optimize the system prompt

C.Increase the model's temperature setting for more creative responses

D.Fine-tune the underlying model on past conversations

E.Implement a human-in-the-loop escalation for low-confidence responses

AnswersA, B, E

Grounding allows the agent to retrieve answers from authoritative sources.

Why this answer

Grounding with company documents (RAG) provides authoritative knowledge. Prompt tuning guides the model. Adding a human-in-the-loop fallback catches errors.

Fine-tuning would require retraining. Increasing temperature would worsen accuracy.

Practice this question →

113

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Use a larger foundation model with a longer context window and paste all documents into each prompt

C.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

D.Fine-tune a base LLM on the policy documents monthly

AnswerC

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

Retrieval-Augmented Generation (RAG) is the most appropriate approach because it allows the chatbot to answer questions based on the latest policy documents without retraining the model. By indexing the documents in a vector store and retrieving relevant chunks at query time, RAG ensures the model uses up-to-date information while keeping the underlying LLM static, which is both cost-effective and scalable for monthly updates.

Exam trap

Cisco often tests the misconception that fine-tuning is the only way to incorporate new information into an LLM, leading candidates to overlook the efficiency and flexibility of RAG for dynamic, frequently updated knowledge bases.

How to eliminate wrong answers

Option A is wrong because training a custom model from scratch each month is prohibitively expensive and time-consuming, requiring significant computational resources and data preparation, and it does not leverage the benefits of a pre-trained foundation model. Option B is wrong because pasting all policy documents into each prompt would exceed the context window limits of even the largest models (e.g., 128K tokens for GPT-4 Turbo), leading to truncated inputs, high token costs, and degraded performance due to irrelevant context. Option D is wrong because fine-tuning a base LLM monthly on the policy documents would still require retraining the model each time, which is costly and introduces the risk of catastrophic forgetting, where the model loses previously learned knowledge from the fine-tuning process.

Practice this question →

114

MCQhard

A retail company has deployed a generative AI chatbot for customer support. They notice that the model sometimes provides incorrect product information. The team wants to ground the model's responses in their product catalog to improve accuracy. Which Vertex AI feature should they enable?

A.Use Vertex AI RAG Engine

B.Enable Grounding with Google Search

C.Increase the model's temperature setting

D.Fine-tune the model with product catalog updates

AnswerB

Grounding allows the model to retrieve real-time information from a designated data source, ensuring responses are based on the catalog.

Why this answer

Option B is correct because Grounding with Google Search allows the model to retrieve real-time, authoritative information from the product catalog via Vertex AI's grounding service, ensuring responses are based on verified data rather than the model's internal knowledge. This feature directly addresses the need to reduce hallucinations by anchoring the model's output to a trusted source, such as a product database, without requiring custom retrieval infrastructure.

Exam trap

Cisco often tests the distinction between grounding (real-time retrieval from a trusted source) and fine-tuning (static model updates), leading candidates to mistakenly choose fine-tuning when the question emphasizes dynamic accuracy improvements.

How to eliminate wrong answers

Option A is wrong because Vertex AI RAG Engine (Retrieval-Augmented Generation) is a framework for building custom retrieval pipelines, but it requires additional setup and indexing of the product catalog, whereas Grounding with Google Search provides a simpler, out-of-the-box solution for grounding responses in external data. Option C is wrong because increasing the model's temperature setting would make responses more random and creative, which is the opposite of what is needed to improve accuracy and reduce incorrect product information. Option D is wrong because fine-tuning the model with product catalog updates would require retraining the model on static data, which is inefficient for dynamic catalogs and does not guarantee real-time grounding; it also risks catastrophic forgetting and does not leverage Vertex AI's built-in grounding capabilities.

Practice this question →

115

Multi-Selecteasy

A project manager wants to measure the business impact of a GenAI code review tool. Which THREE metrics should they track to evaluate ROI? (Choose 3)

Select 3 answers

A.Total tokens consumed by the GenAI model

B.Developer satisfaction score

C.Average time saved per code review

D.Number of lines of code generated

E.Defect escape rate (bugs found in production)

AnswersB, C, E

Measures team acceptance and morale, which affects long-term adoption and productivity.

Why this answer

Developer satisfaction score (B) is a critical metric for GenAI code review ROI because it directly measures user adoption and perceived value. If developers find the tool frustrating or inaccurate, they will bypass it, negating any potential time savings or defect reduction. High satisfaction correlates with sustained usage, which is necessary for long-term return on investment.

Exam trap

Cisco often tests the distinction between cost/usage metrics (like tokens consumed) and value/outcome metrics (like time saved or defect reduction), leading candidates to mistakenly select operational metrics instead of business impact metrics.

Practice this question →

116

MCQhard

A financial services firm is deploying a GenAI-powered contract analysis tool. The tool must extract key clauses and flag risky language. Which strategy BEST ensures structured, machine-readable output that downstream systems can parse?

A.Ask the model to write a summary of the contract in natural language

B.Fine-tune the model on a dataset of contracts with clause labels

C.Use a few-shot prompt with examples of JSON output containing the desired fields

D.Rely on the model's pre-trained ability to extract clauses without any formatting instructions

AnswerC

Few-shot examples guide the model to consistently produce JSON, enabling automated extraction and integration with downstream systems.

Why this answer

Option C is correct because using a few-shot prompt with JSON output examples directly instructs the model to produce structured, machine-readable data. This approach leverages the model's in-context learning ability to follow a specific schema, ensuring downstream systems can parse the extracted clauses without additional transformation. It balances flexibility and precision without requiring costly fine-tuning or relying on unreliable free-form text.

Exam trap

Cisco often tests the misconception that fine-tuning (Option B) is the only way to achieve structured output, when in fact few-shot prompting with JSON examples can provide a more flexible and cost-effective solution for many business use cases.

How to eliminate wrong answers

Option A is wrong because asking for a natural language summary produces unstructured text that downstream systems cannot reliably parse for key clauses and risk flags, requiring additional NLP processing. Option B is wrong because fine-tuning on labeled contracts improves extraction accuracy but does not guarantee structured output; the model may still return free-form text unless explicitly prompted for a format like JSON. Option D is wrong because relying on the model's pre-trained ability without formatting instructions leads to inconsistent, ad-hoc responses that vary in structure and completeness, making automated parsing impossible.

Practice this question →

117

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Use a larger foundation model with a longer context window and paste all documents into each prompt

C.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

D.Fine-tune a base LLM on the policy documents monthly

AnswerC

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Practice this question →

118

MCQeasy

Which feature in Vertex AI allows users to browse over 300 foundation models and deploy them with minimal code?

A.Vertex AI Studio

B.Vertex AI Agent Builder

C.Vertex AI Model Garden

D.Vertex AI Pipelines

AnswerC

Model Garden is a central repository for discovering and deploying foundation models.

Why this answer

Option C, Vertex AI Model Garden, is correct because it provides a centralized repository where users can browse, discover, and deploy over 300 foundation models—including first-party, open-source, and third-party models—with minimal code. It abstracts away infrastructure complexity by offering pre-built deployment templates and one-click integration with Vertex AI endpoints, enabling rapid experimentation and production deployment without deep ML engineering overhead.

Exam trap

The trap here is that candidates confuse Vertex AI Studio's prompt engineering capabilities with Model Garden's model discovery and deployment functionality, leading them to select A instead of C.

How to eliminate wrong answers

Option A is wrong because Vertex AI Studio is a low-code environment for prototyping and tuning generative AI models, but it does not serve as a model repository for browsing and deploying over 300 foundation models; it focuses on prompt design and model customization. Option B is wrong because Vertex AI Agent Builder is designed for creating conversational agents and search experiences using pre-built components, not for browsing and deploying a broad catalog of foundation models. Option D is wrong because Vertex AI Pipelines is an orchestration service for building and managing ML workflows, not a model discovery and deployment interface.

Practice this question →

119

MCQeasy

A marketing team needs to generate personalized email campaigns for thousands of customers. They want to maintain brand tone consistency and avoid manual writing. Which GenAI approach is BEST suited?

A.Use Vertex AI Studio with prompt design and few-shot examples in the prompt

B.Fine-tune a small model on brand guidelines only

C.Embed a rules-based template engine with no AI

D.Train a custom model from scratch on past campaigns

AnswerA

Vertex AI Studio enables rapid prompt iteration. Few-shot examples ensure consistent tone and structure without custom training.

Why this answer

Option A is correct because Vertex AI Studio enables prompt engineering with few-shot examples, allowing the team to generate personalized emails while maintaining brand tone consistency without fine-tuning or custom training. This approach leverages a pre-trained large language model (LLM) with carefully designed prompts that include brand guidelines and a few examples, ensuring output adheres to the desired style and context. It avoids the overhead of fine-tuning or building custom models, making it ideal for rapid deployment and iterative refinement.

Exam trap

Cisco often tests the misconception that fine-tuning or custom training is always necessary for domain-specific tasks, when in fact prompt engineering with few-shot examples can achieve comparable results with far less effort and cost.

How to eliminate wrong answers

Option B is wrong because fine-tuning a small model on brand guidelines only may lead to catastrophic forgetting or insufficient generalization, as the model might overfit to the narrow dataset and lose the broad language understanding needed for diverse customer personalization. Option C is wrong because a rules-based template engine cannot adapt to the nuanced, context-aware personalization required for thousands of unique customers; it would produce rigid, repetitive content that fails to capture brand tone dynamically. Option D is wrong because training a custom model from scratch on past campaigns is resource-intensive, requires massive labeled datasets, and is unnecessary when pre-trained models with prompt engineering can achieve the same goal more efficiently.

Practice this question →

120

MCQeasy

A project manager wants to track the ROI of a generative AI feature that assists customer support agents. Which metric is MOST directly tied to productivity improvement?

A.Adoption rate of the AI tool

B.Customer satisfaction (CSAT) score

C.Average handle time (AHT) per ticket

D.Cost per API call

AnswerC

AHT directly reflects agent efficiency; a decrease indicates productivity improvement.

Why this answer

Average handle time (AHT) directly measures the time agents spend per interaction, so a reduction indicates productivity gain. CSAT measures satisfaction, not efficiency. Cost per API call is a cost metric.

Adoption rate measures usage.

Practice this question →

121

MCQeasy

Which of the following best describes the primary benefit of using Grounding with Google Search when building a GenAI chatbot?

A.It reduces the model's latency by caching responses

B.It enables the model to generate images based on text descriptions

C.It provides fine-tuning capabilities for domain-specific data

D.It allows the model to access real-time information from the internet to reduce hallucinations

AnswerD

Grounding connects the model to live search results, ensuring responses are based on current data.

Why this answer

Grounding with Google Search connects the GenAI chatbot to real-time internet data, allowing it to retrieve current facts and events that the model was not trained on. This reduces hallucinations by ensuring responses are based on verified, up-to-date information rather than relying solely on the model's static training data.

Exam trap

The trap here is that candidates often confuse Grounding (a retrieval-based technique for real-time accuracy) with fine-tuning (a training-based technique for domain adaptation), leading them to select Option C incorrectly.

How to eliminate wrong answers

Option A is wrong because Grounding with Google Search does not cache responses to reduce latency; it introduces additional retrieval latency by querying live search results. Option B is wrong because Grounding is a text-based retrieval mechanism and does not enable image generation, which requires a multimodal model or separate image generation service. Option C is wrong because Grounding is a retrieval-augmented generation (RAG) technique, not a fine-tuning method; fine-tuning adjusts model weights on domain-specific data, whereas Grounding retrieves external data at inference time.

Practice this question →

122

Multi-Selectmedium

A company wants to build a generative AI-powered internal knowledge base for employees. They need to integrate with existing Google Workspace documents (Docs, Slides) and allow natural language queries. Which TWO services should they combine?

Select 2 answers

A.Model Garden

B.Gemini API

C.Vertex AI Agent Builder

D.Vertex AI RAG Engine

E.Document AI (DocAI)

AnswersC, D

Agent Builder creates the conversational interface for the knowledge base.

Why this answer

Vertex AI Agent Builder can create a conversational agent, and RAG Engine can index Google Workspace documents for retrieval. Gemini API alone is not a knowledge base. DocAI is for document parsing, not retrieval.

Model Garden is a model repository.

Practice this question →

123

MCQmedium

A company is piloting a GenAI feature that summarizes customer support tickets. They want to measure the impact on agent productivity before rolling out to all teams. Which approach BEST evaluates the pilot?

A.Survey agents on their perception of productivity after using the tool

B.Run an A/B test where half the agents use the tool and half do not, then compare average handling time

C.Compare the cost of the API before and after deployment

D.Measure the number of summaries generated per day

AnswerB

A/B testing provides a statistically sound comparison of actual metrics.

Why this answer

A/B testing with a control group provides a rigorous comparison of productivity metrics. The other options lack a baseline for comparison.

Practice this question →

124

Multi-Selectmedium

A company wants to use GenAI to generate marketing content such as blog posts and social media updates. They need the content to be on-brand and factually accurate. Which TWO features should they use?

Select 2 answers

A.Provide few-shot examples in the prompt to set the brand tone

B.Use a longer context window to include all brand guidelines

C.Fine-tune the model on a large corpus of past marketing content

D.Enable grounding with Google Search for factual accuracy

E.Use Vertex AI Studio to design prompts with no additional grounding

AnswersA, D

Few-shot learning guides the model to produce content consistent with the brand voice.

Why this answer

Few-shot examples in prompts help maintain brand voice. Grounding with Google Search ensures factual accuracy. Vertex AI Studio is for prompt design but not directly for accuracy.

Fine-tuning may be overkill. Longer context may dilute the message.

Practice this question →

125

MCQhard

A company is deploying a GenAI-powered email drafting feature. They want to control costs while maintaining low latency for real-time suggestions. Which strategy is MOST effective?

A.Batch all email drafting requests and run them every hour

B.Implement caching for frequently generated email drafts and use a smaller model variant for real-time requests

C.Use the largest available model and increase the number of tokens per request to generate more complete drafts

D.Use a large model with a longer context window to reduce the number of API calls

AnswerB

Caching avoids repeated inference for common drafts, and a smaller model reduces cost and latency for less common requests.

Why this answer

Caching common prompt-output pairs reduces API calls for repeated inputs. Choosing a smaller model balances speed and cost. Batching is for offline processing, not real-time.

Long context is more expensive.

Practice this question →

126

MCQeasy

A team wants to build a GenAI application that can interact with external APIs (e.g., to check inventory or place orders). Which Vertex AI component provides this capability?

A.Grounding with Google Search

B.Model Garden

C.Vertex AI Agent Builder

D.Vertex AI Extensions

AnswerD

Extensions enable agents to connect to external APIs and execute actions.

Why this answer

Extensions allow agents to call external APIs and perform actions beyond text generation. Agent Builder is the platform but Extensions provide the specific API integration. Model Garden and Grounding do not support API calls.

Practice this question →

127

MCQhard

A global company wants to deploy a GenAI application that must comply with GDPR and CCPA. They need to ensure that user data submitted to the LLM is not used for model training or improvement. Which combination of actions should they take on Vertex AI?

A.Disable data logging and use a pai-saas-llm model with a contract that prohibits training on inference data

B.Enable data logging for debugging and rely on model cards for compliance

C.Use the default Vertex AI settings, which automatically anonymize data

D.Store all user prompts in a separate BigQuery table for audit trails

AnswerA

Disabling logging prevents data retention; contractual prohibition ensures data is not used for training.

Why this answer

Disabling data logging and using a model that does not train on user data (like a custom deployment) ensures compliance. Default settings may allow training on data.

Practice this question →

128

MCQmedium

A product manager wants to generate meeting summaries automatically using Gemini for Google Workspace. They need summaries to be sent to all participants immediately after the meeting ends. Which Gemini feature should they use?

A.Gemini in Google Docs - Help me write

B.Gemini in Gmail - Smart Compose

C.Vertex AI Agent Builder with a meeting transcription model

D.Gemini in Google Meet - Take notes and summaries

AnswerD

This feature captures notes and summaries during or after a meeting and can automatically distribute them.

Why this answer

Gemini in Google Meet can automatically generate meeting summaries and share them after the meeting. The other options are for different Workspace apps or manual use.

Practice this question →

129

Multi-Selecthard

A company is deploying a GenAI system that generates product descriptions. During A/B testing, the new system shows a 20% increase in click-through rate (CTR) but a 15% increase in average cost per query due to the model size. The team wants to optimize cost without sacrificing the CTR gain. Which THREE actions should they take? (Choose three.)

Select 3 answers

A.Batch similar requests to reduce per-request overhead

B.Use a larger model with higher accuracy to further increase CTR

C.Increase the number of few-shot examples in the prompt

D.Switch to a smaller model and re-A/B test to confirm CTR impact

E.Implement response caching for repeated product SKUs

AnswersA, D, E

Batching reduces the number of API calls and can lower cost.

Why this answer

Option A is correct because batching similar requests reduces the per-request overhead by combining multiple inference calls into a single batch, which amortizes the fixed costs (e.g., model loading, token processing) across more outputs. This directly lowers the average cost per query while preserving the model architecture and CTR gains, as the model's output quality remains unchanged.

Exam trap

Cisco often tests the misconception that adding more few-shot examples always improves output quality, but in reality, it increases token costs and can degrade performance due to context window limits or irrelevant examples.

Practice this question →

130

Multi-Selectmedium

A company wants to build a GenAI-powered customer support chatbot. They require the chatbot to provide accurate answers based on the latest product documentation, and they need to control costs by minimizing token usage. Which TWO strategies should they use?

Select 2 answers

A.Use the largest available model for maximum accuracy

B.Cache frequent queries and their responses

C.Implement Retrieval-Augmented Generation (RAG) to retrieve relevant document chunks

D.Use a longer context window to include entire documents in the prompt

E.Fine-tune a large language model on the product documentation

AnswersB, C

Caching reduces repeated API calls, saving tokens and cost.

Why this answer

RAG ensures answers are grounded in latest docs. Caching common queries reduces token usage. Fine-tuning is expensive and not dynamic.

Using a large model increases cost. Long context windows increase token usage.

Practice this question →

131

MCQhard

A hospital wants to summarize patient-doctor conversations into structured clinical notes using GenAI. They need high accuracy and must avoid hallucinated medical information. Which combination of techniques is BEST?

A.Use RAG with medical textbooks and a low temperature setting

B.Fine-tune a medical-specific model on de-identified transcripts and use a structured output format in the prompt

C.Use a large model with zero-shot prompting and post-process the output with rule-based checks

D.Use a Gemini model with a high temperature setting to encourage creativity

AnswerB

Fine-tuning on medical data improves accuracy; structured output reduces hallucination.

Why this answer

Fine-tuning with structured output (e.g., JSON schema) and a strict prompt ensures the model produces accurate, formatted notes and reduces hallucinations.

Practice this question →

132

MCQmedium

A startup is building a GenAI application and must decide between using a pre-built API (e.g., Vertex AI Gemini API) or fine-tuning a custom model. Which factor STRONGLY favors using the pre-built API?

A.The application must process sensitive data that cannot leave the company's VPC

B.The startup has a large dataset of labeled examples and high compute budget

C.The application requires highly accurate, domain-specific terminology

D.The startup needs to launch quickly with minimal ML infrastructure and operational overhead

AnswerD

Pre-built APIs allow rapid integration without managing models or training pipelines.

Why this answer

Option D is correct because using a pre-built API like Vertex AI Gemini API eliminates the need to manage ML infrastructure, handle model training, or operationalize a custom model. This allows the startup to integrate GenAI capabilities rapidly via simple API calls, focusing on application logic rather than the complexities of model deployment, scaling, and maintenance.

Exam trap

Cisco often tests the misconception that 'more control equals better performance,' leading candidates to choose fine-tuning when the question explicitly asks for the factor that favors a pre-built API, which is speed and reduced operational burden.

How to eliminate wrong answers

Option A is wrong because processing sensitive data that cannot leave the VPC actually favors a self-hosted or fine-tuned model within the VPC, not a pre-built API which typically requires data to be sent to an external endpoint. Option B is wrong because having a large dataset of labeled examples and a high compute budget are prerequisites for fine-tuning, not reasons to use a pre-built API; fine-tuning would leverage those resources for domain adaptation. Option C is wrong because achieving highly accurate, domain-specific terminology is a primary reason to fine-tune a model on proprietary data, as a general-purpose pre-built API may lack the specialized vocabulary or context.

Practice this question →

133

MCQhard

A company is deploying a GenAI contract analysis system that processes confidential legal documents. They need to ensure that the model does not retain or train on customer data. Which configuration is REQUIRED?

A.Select a model with a context window large enough to hold the entire contract

B.Use a public model endpoint with data encryption in transit

C.Opt out of model logging and data retention in the API settings

D.Use a smaller model to reduce the risk of data memorization

AnswerC

Opting out of logging and data retention ensures the provider does not store or train on your data, fulfilling confidentiality.

Why this answer

Option C is correct because the primary requirement is to prevent the GenAI model from retaining or training on confidential legal documents. Most enterprise GenAI APIs (e.g., OpenAI, Azure OpenAI) provide a data privacy setting that allows customers to opt out of model logging and data retention, ensuring that prompts and responses are not stored or used for model improvement. This configuration directly addresses the compliance need for data confidentiality in contract analysis.

Exam trap

The trap here is that candidates confuse data security measures (encryption, context window, model size) with data privacy controls (opt-out of logging and retention), leading them to select technically valid but irrelevant options for the specific requirement of preventing data retention and training.

How to eliminate wrong answers

Option A is wrong because a large context window does not prevent data retention or training; it only allows the model to process longer inputs, which is unrelated to privacy controls. Option B is wrong because data encryption in transit (e.g., TLS 1.3) protects data during transmission but does not prevent the model provider from logging or retaining the data on their servers. Option D is wrong because using a smaller model does not inherently reduce the risk of data memorization; memorization depends on training data and model architecture, not model size alone, and does not address API-level data retention policies.

Practice this question →

134

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Train a custom model from scratch on the policy documents each month

B.Fine-tune a base LLM on the policy documents monthly

C.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

D.Use a larger foundation model with a longer context window and paste all documents into each prompt

AnswerC

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

Retrieval-Augmented Generation (RAG) is the most appropriate approach because it allows the chatbot to answer questions using the latest policy documents without retraining the model. By indexing the documents in a vector store and retrieving relevant chunks at query time, RAG provides up-to-date answers while keeping the underlying LLM static, which is ideal for monthly document updates.

Exam trap

Cisco often tests the misconception that fine-tuning is the only way to adapt an LLM to domain-specific knowledge, but the trap here is that candidates overlook RAG's ability to handle dynamic, frequently updated documents without retraining, which is a core requirement in the question.

How to eliminate wrong answers

Option A is wrong because training a custom model from scratch each month is prohibitively expensive and time-consuming, requiring full GPU-based training cycles and large datasets, which is not feasible for monthly updates. Option B is wrong because fine-tuning a base LLM monthly still requires significant computational resources and risks catastrophic forgetting of previous knowledge, and it does not inherently handle dynamic document updates without retraining. Option D is wrong because pasting all documents into each prompt exceeds typical context window limits (e.g., 4K–128K tokens) and degrades performance due to attention mechanism scaling issues, making it impractical for large or growing document sets.

Practice this question →

135

MCQmedium

A company uses Vertex AI Agent Builder to create a customer support agent. They need the agent to answer questions about order status by calling an internal API. Which Vertex AI feature should they use?

A.Vertex AI RAG Engine

B.Vertex AI Extensions

C.Grounding with Google Search

D.Vertex AI Model Garden

AnswerB

Extensions enable the agent to call custom APIs as tools.

Why this answer

Extensions in Vertex AI Agent Builder allow the agent to call external APIs (including internal ones) as tools during conversation.

Practice this question →

136

Multi-Selectmedium

A company wants to use Gemini for Google Workspace to improve productivity. They want to generate meeting summaries in Google Meet and draft email replies in Gmail. Which two Duet AI features should they enable? (Choose TWO)

Select 2 answers

A.Meet 'Take notes for me'

B.Gmail Smart Compose

C.Docs 'Help me write'

D.Sheets formula assistance

E.Slides speaker notes generation

AnswersA, B

This feature generates meeting notes and summaries automatically.

Why this answer

Option A is correct because the 'Take notes for me' feature in Google Meet uses Duet AI to automatically capture meeting notes, action items, and summaries, directly addressing the requirement to generate meeting summaries. This feature leverages Gemini's natural language processing to transcribe and summarize conversations in real time, enhancing productivity by eliminating manual note-taking.

Exam trap

Cisco often tests the distinction between features that are specific to a single application (like Docs or Sheets) versus cross-application productivity tools, leading candidates to select features that are technically correct but do not match the exact use case described.

Practice this question →

137

MCQmedium

A legal department wants to automate contract analysis using GenAI. They need to identify risky clauses and extract key dates. Which Google Cloud service is best suited for this task?

A.Document AI

B.Vertex AI Agent Builder

C.BigQuery

D.Model Garden

AnswerB

Agent Builder allows creating an agent that can ingest contracts, use an LLM for analysis, and extract structured data like clauses and dates.

Why this answer

Vertex AI Agent Builder is the correct choice because it enables the creation of custom generative AI agents that can analyze contract text, identify risky clauses, and extract key dates using large language models (LLMs) and retrieval-augmented generation (RAG). It provides a no-code/low-code environment to build agents that leverage enterprise data, making it ideal for automating contract analysis without extensive ML expertise.

Exam trap

The trap here is that candidates often confuse Document AI's structured extraction capabilities with generative AI's ability to perform nuanced risk analysis, leading them to choose Document AI without recognizing that Vertex AI Agent Builder provides the necessary generative and agentic capabilities for this specific use case.

How to eliminate wrong answers

Option A is wrong because Document AI is a document understanding service focused on OCR, parsing, and structured data extraction (e.g., forms, invoices) using pre-trained models, not generative AI for clause risk analysis or flexible date extraction. Option C is wrong because BigQuery is a serverless data warehouse for SQL-based analytics and large-scale data processing, not a service for building generative AI agents or analyzing contract text. Option D is wrong because Model Garden is a repository of pre-trained models and foundation models for experimentation and fine-tuning, but it does not provide the agent-building framework, RAG integration, or deployment tools needed for this task.

Practice this question →

138

MCQeasy

A data analyst wants to use Gemini in Google Sheets to help with complex formulas. Which feature should they use?

A.Model Garden in Vertex AI

B.Smart Compose in Gmail

C.Gemini for Workspace in Google Sheets

D.Help me write in Google Docs

AnswerC

Gemini in Sheets offers formula suggestions and assistance directly within the spreadsheet.

Why this answer

Gemini for Workspace in Google Sheets provides an AI-powered side panel that can generate, explain, and debug complex formulas directly within the spreadsheet environment. This feature is specifically designed to assist with formula creation and data analysis tasks, making it the correct choice for a data analyst using Gemini in Google Sheets.

Exam trap

The trap here is that candidates may confuse general-purpose AI writing features (like Help me write in Docs or Smart Compose in Gmail) with the specialized, context-aware formula assistance provided by Gemini for Workspace in Sheets, failing to recognize that each Workspace tool has a domain-specific integration.

How to eliminate wrong answers

Option A is wrong because Model Garden in Vertex AI is a repository of foundation models for building and deploying custom AI applications, not a feature integrated into Google Sheets for formula assistance. Option B is wrong because Smart Compose in Gmail is a feature for suggesting complete sentences in email composition, unrelated to spreadsheet formulas or data analysis. Option D is wrong because Help me write in Google Docs is a generative writing assistant for document creation, not designed to handle complex formulas or spreadsheet-specific tasks.

Practice this question →

139

MCQeasy

A product manager wants to add a feature that drafts meeting summaries automatically in Google Meet. Which Gemini for Google Workspace capability should they use?

A.Vertex AI Model Garden

B.Gemini for Workspace in Google Meet

C.Vertex AI Agent Builder

D.Duet AI in Google Slides

AnswerB

Gemini for Workspace (formerly Duet AI) includes meeting summaries in Google Meet.

Why this answer

Gemini for Workspace in Google Meet provides meeting summaries. Duet AI for Slides does not generate meeting summaries. Vertex AI Agent Builder and Model Garden are not directly integrated into Meet.

Practice this question →

140

Multi-Selectmedium

A company wants to integrate generative AI into their existing CRM workflow to draft personalized email responses. They have limited engineering resources. Which two approaches should they consider? (Choose TWO)

Select 2 answers

A.Use Vertex AI API with a low-code integration platform (e.g., Apigee)

B.Fine-tune a model on historical email data to ensure brand voice

C.Use Gemini API via Google Apps Script to add a custom menu in the CRM

D.Deploy a dedicated GPU cluster for inference

E.Build a custom web UI for the assistant from scratch

AnswersA, C

Low-code platforms reduce the need for custom coding.

Why this answer

Using Gemini API via Apps Script is a lightweight integration, and using Vertex AI API with a low-code tool like Apigee or Cloud Functions can also minimize engineering effort. Building a custom UI or fine-tuning is resource-intensive.

Practice this question →

141

Multi-Selecthard

A manufacturing company wants to use GenAI to generate maintenance reports from sensor data. They need structured output (JSON) for downstream systems, and they want to reduce token costs. Which THREE strategies should they use?

Select 3 answers

A.Use batch API requests for multiple sensor readings

B.Use the largest available model to ensure accuracy

C.Use structured output formatting in the prompt (e.g., 'Return JSON')

D.Choose the smallest model that meets accuracy requirements

E.Include multiple few-shot examples of JSON in every prompt

AnswersA, C, D

Batch requests reduce per-token cost.

Why this answer

Structured output ensures JSON format; batch requests reduce cost; the smallest suitable model minimizes token usage. Few-shot adds tokens; caching may not help for diverse sensor data.

Practice this question →

142

MCQmedium

A company wants to integrate GenAI into their existing customer relationship management (CRM) system. The CRM is a third-party SaaS application. Which implementation pattern is MOST suitable?

A.API-first integration by calling Vertex AI API from the CRM's custom code

B.Building a standalone AI application and exporting data manually

C.Embedding GenAI using Google Workspace add-ons

D.Using Apps Script to extend Google Sheets connected to the CRM

AnswerA

The CRM can make API calls to Vertex AI for predictions, keeping the CRM intact.

Why this answer

API-first integration allows the CRM to call Google Cloud GenAI APIs without modifying the CRM's core. Workspace add-ons are for Google Workspace, not SaaS CRM.

Practice this question →

143

Multi-Selecthard

A company is moving a GenAI proof-of-concept to production. They need to ensure the system can handle variable traffic and maintain low latency. Which THREE practices should they implement? (Choose 3)

Select 3 answers

A.Implement response caching for common queries

B.Enable auto-scaling for the serving infrastructure

C.Reduce the input context length to the absolute minimum

D.Set up monitoring and alerting on latency metrics

E.Use a single, large instance to handle all traffic

AnswersA, B, D

Caching reduces latency and cost by reusing responses for identical requests.

Why this answer

Option A is correct because response caching stores the outputs of frequently requested queries, allowing the system to serve them instantly without recomputation. This drastically reduces latency for repeated requests and offloads the underlying model, which is critical for maintaining responsiveness under variable traffic patterns in production.

Exam trap

Cisco often tests the misconception that minimizing input context length universally improves performance, ignoring the trade-off with output quality, and that a single large instance is simpler and sufficient for production traffic, overlooking scalability and fault tolerance requirements.

Practice this question →

144

MCQeasy

A marketing team wants to generate social media posts from product descriptions using Generative AI. They need consistent brand tone and the ability to iterate quickly. Which tool is BEST suited for this task?

A.Duet AI in Google Docs

B.Vertex AI Model Garden

C.Vertex AI Studio

D.Vertex AI Agent Builder

AnswerC

Vertex AI Studio offers a user interface for designing and testing prompts, tuning models, and evaluating outputs — ideal for marketing content creation.

Why this answer

Vertex AI Studio is the correct choice because it provides a purpose-built environment for prompt engineering, model tuning, and rapid iteration with foundation models. It allows the marketing team to experiment with prompts, adjust parameters like temperature and top-p, and maintain consistent brand tone through saved prompt templates and versioning, directly supporting the need for quick iteration.

Exam trap

Cisco often tests the distinction between a general-purpose AI assistant (Duet AI) and a dedicated prompt engineering platform (Vertex AI Studio), leading candidates to choose Duet AI because they confuse document assistance with content generation.

How to eliminate wrong answers

Option A is wrong because Duet AI in Google Docs is an AI-powered assistant for document creation and editing, not a tool for generating social media posts from product descriptions with iterative prompt engineering. Option B is wrong because Vertex AI Model Garden is a repository for discovering and deploying pre-trained models, but it lacks the integrated prompt engineering and iterative testing environment needed for fine-tuning brand tone. Option D is wrong because Vertex AI Agent Builder is designed for building conversational agents and chatbots, not for generating and iterating on social media content from product descriptions.

Practice this question →

145

MCQeasy

A developer needs to integrate a GenAI model into an existing customer relationship management (CRM) system. The CRM exposes REST APIs and runs on-premises. Which integration pattern is MOST suitable?

A.Use Apps Script to call the model from Google Sheets

B.Fine-tune a model on CRM data and deploy it on-premises

C.Implement an API-first integration by calling Vertex AI API from the CRM's backend

D.Build a custom Google Workspace add-on

AnswerC

Vertex AI API can be called via HTTPS from any system that supports REST, making it ideal for integrating with an on-premises CRM.

Why this answer

API-first integration using Vertex AI API allows any system with HTTP capabilities to call the GenAI model. Workspace add-ons and Apps Script are for Google Workspace only. Fine-tuning doesn't help with integration.

Practice this question →

146

MCQhard

A company has deployed a GenAI-powered report generation system using Vertex AI. They notice that the cost is higher than expected. Investigation shows that many requests include very long prompts with repetitive boilerplate text. Which cost optimization strategy is MOST effective?

A.Increase the batch size for batch requests

B.Enable context caching for repeated prompt prefixes

C.Switch to a smaller model size

D.Ignore the cost increase as it will stabilize

AnswerB

Context caching stores repeated prefixes so they are not re-sent with each request, reducing token consumption and cost.

Why this answer

Caching repeated prompt prefixes can significantly reduce token usage and cost. Batch requests help with throughput but not with per-request token savings. Reducing model size may hurt quality.

Ignoring is not a strategy.

Practice this question →

147

MCQhard

A legal firm wants to automate contract analysis to extract key clauses and risks. They have 10,000 contracts in PDF format. The solution must handle varying layouts and be cost-effective. Which approach is BEST?

A.Use Document AI to convert PDFs to structured text, then use Vertex AI with a prompt that specifies clauses to extract

B.Build a retrieval-augmented generation (RAG) system in Vertex AI Agent Builder

C.Use the Gemini API directly with the raw PDF files as input

D.Fine-tune a Gemini model on 100 annotated contracts and run inference on all contracts

AnswerA

Document AI handles layout parsing, and the structured text is then fed to a GenAI model with a well-designed prompt. This combination is scalable and cost-effective.

Why this answer

Using Document AI to parse PDFs into text, then Vertex AI with a structured prompt for clause extraction combines robust document understanding with flexible GenAI. Fine-tuning on 100 contracts is insufficient for layout variation. Agent Builder is overkill.

Direct Gemini API on raw PDFs loses document structure.

Practice this question →

148

MCQeasy

A marketing agency uses generative AI to create social media posts. They need the output to be in a specific JSON format for downstream processing. Which prompt technique should they use?

A.Set the temperature to 0.0

B.Use a model fine-tuned for JSON generation

C.Include a system instruction to output JSON

D.Specify the desired JSON schema in the prompt and use few-shot examples

AnswerD

Explicit schema description plus few-shot examples reliably produce JSON output.

Why this answer

Option D is correct because explicitly specifying the desired JSON schema in the prompt combined with few-shot examples provides the most reliable way to enforce structured output from a generative AI model. This technique leverages the model's pattern-matching ability by showing concrete input-output pairs, which is more effective than vague instructions or parameter adjustments alone for achieving exact JSON formatting.

Exam trap

Cisco often tests the misconception that a simple parameter change (like temperature) or a vague instruction is sufficient to enforce structured output, when in reality explicit schema definition with examples is required for reliable JSON generation.

How to eliminate wrong answers

Option A is wrong because setting temperature to 0.0 only reduces randomness in token selection, making output more deterministic, but does not guarantee the model will output valid JSON or follow a specific schema—it can still produce malformed or non-JSON text. Option B is wrong because while a fine-tuned model may improve JSON generation, fine-tuning requires significant data and compute resources, and the question asks for a prompt technique, not a model customization approach. Option C is wrong because a system instruction to output JSON is a weak constraint—models often ignore or partially follow system instructions, especially for complex schemas, leading to missing fields, extra keys, or incorrect nesting.

Practice this question →

149

MCQeasy

A data scientist wants to compare the performance of three different foundation models for a text summarization task. They have a labeled dataset of summaries. Which Vertex AI tool should they use to perform this evaluation?

A.Vertex AI RAG Engine - Retrieval evaluation

B.Vertex AI Agent Builder - Agent evaluation

C.Model Garden - Model comparison view

D.Vertex AI Studio - Evaluation

AnswerD

Vertex AI Studio includes evaluation capabilities to compare model responses against ground truth data.

Why this answer

Vertex AI Studio provides a model evaluation interface where you can compare model outputs using automatic metrics. Model Garden is a model hub. Agent Builder is for building agents.

RAG Engine is for retrieval.

Practice this question →

150

MCQeasy

A company wants to generate images for slide decks using Gemini in Google Slides. Which Gemini feature in Google Slides should they use?

A.Speaker notes generation

B.Slide layout optimization

C.Image generation from text prompts

D.Summation of slide content

AnswerC

Gemini in Slides can generate images based on text prompts, ideal for creating visuals.

Why this answer

Gemini in Slides can generate images from text prompts, allowing users to create visuals directly within the presentation.

Practice this question →

← PreviousPage 2 of 2 · 150 questions total

Ready to test yourself?

Try a timed practice session using only Gcl Applying Genai Business questions.

Start 20-question session

CCNA Gcl Applying Genai Business Questions — Page 2 of 2 | Courseiva