Knowledge + Practice

CCNA Gcl Applying Genai Business Questions

75 of 150 questions · Page 1/2 · Gcl Applying Genai Business topic · Answers revealed

Practice these questions Exam hub All questions

1

Multi-Selectmedium

A retail company wants to build an internal knowledge base chatbot using Vertex AI. They need to ensure the chatbot only answers from approved company documents and can handle updates without retraining. Which TWO components should they include? (Choose 2)

Select 2 answers

A.RAG Engine to connect the chatbot to a document index

B.A vector store (e.g., Vertex AI Vector Search) to index the documents

C.Fine-tuned model on company documents

D.Model Garden to select a pre-trained model

E.Apps Script to update the document index

AnswersA, B

RAG Engine retrieves relevant chunks from approved documents, ensuring answers are grounded.

Why this answer

RAG Engine retrieves answers from the indexed documents, satisfying the approved-source requirement. A vector store indexes the documents for retrieval. Fine-tuning is not needed.

Model Garden is just a model hub. Apps Script is for Workspace automation.

Practice this question →

2

MCQhard

A financial services firm uses a fine-tuned model for contract analysis. They observe that the model's performance degrades after a few months because contract language evolves. The team wants to maintain accuracy without full retraining. What is the MOST cost-effective approach?

A.Switch to a larger base model and use zero-shot prompting

B.Retrain the model from scratch every quarter with all historical data

C.Perform incremental fine-tuning with a small representative sample of new contracts

D.Use Vertex AI Model Monitoring to detect drift and alert, then manually adjust prompts

AnswerC

Incremental fine-tuning updates the model on new patterns efficiently, using far less compute than full retraining.

Why this answer

Fine-tuning the existing model with a small amount of new data (incremental fine-tuning) is the most cost-effective way to adapt to language evolution without full retraining. The other options are either too costly or do not adapt the model.

Practice this question →

3

MCQmedium

A company wants to add a GenAI-powered meeting summarization feature to their internal app. They have existing meeting transcript data in Cloud Storage. Which Google Cloud service should they use to build the summarization pipeline with minimal infrastructure management?

A.Cloud Functions

B.Compute Engine with custom containers

C.Google Chat API

D.Vertex AI Agent Builder

AnswerD

Vertex AI Agent Builder offers a managed, serverless environment to build generative AI pipelines, ideal for summarization.

Why this answer

Vertex AI Agent Builder is the correct choice because it provides a managed, serverless platform specifically designed for building GenAI-powered applications like meeting summarization. It integrates directly with existing data sources (e.g., Cloud Storage transcripts), offers pre-built models for summarization, and handles infrastructure scaling, security, and monitoring automatically, minimizing operational overhead.

Exam trap

Cisco often tests the misconception that any serverless compute (like Cloud Functions) can easily handle AI workloads, but the trap here is that Cloud Functions lacks native GenAI integration and would require significant custom code and external API calls, whereas Vertex AI Agent Builder is purpose-built for such tasks with minimal management.

How to eliminate wrong answers

Option A is wrong because Cloud Functions is a serverless compute service for event-driven code, but it lacks native GenAI capabilities and would require manual integration with external AI models, increasing complexity and management. Option B is wrong because Compute Engine with custom containers requires full infrastructure management (provisioning VMs, scaling, patching) and does not provide built-in GenAI summarization features, contradicting the requirement for minimal infrastructure management. Option C is wrong because Google Chat API is a messaging API for building chat interfaces, not a pipeline for processing meeting transcripts and generating summaries; it cannot perform AI summarization tasks.

Practice this question →

4

MCQhard

A healthcare startup wants to generate synthetic patient notes for training medical residents. They need the output to follow a strict template with sections: Chief Complaint, History, Assessment, Plan. Which prompt engineering strategy should they use to ensure consistent structure?

A.Use a zero-shot prompt asking for a patient note in bullet points

B.Provide a few-shot example of the template filled out and instruct the model to follow that format for new cases

C.Set temperature to 0 and max tokens to a high value

D.Use a system prompt that lists the sections as instructions

AnswerB

Few-shot examples with the exact template teach the model the required structure and sections.

Why this answer

Structured output via few-shot examples showing the exact template ensures the model learns the required format. Adding instructions alone may not enforce the structure as reliably.

Practice this question →

5

MCQmedium

A healthcare organization is deploying a generative AI application that processes Protected Health Information (PHI). They must ensure compliance with HIPAA. Which Google Cloud offering should they use?

A.Google Workspace with Duet AI

B.Model Garden open-source models deployed on Compute Engine

C.Gemini API with default settings

D.Vertex AI APIs with data residency and HIPAA compliance enabled

AnswerD

Vertex AI offers HIPAA-compliant deployments with data residency controls, suitable for PHI.

Why this answer

Vertex AI APIs with data residency and HIPAA compliance enabled is the correct choice because it meets regulatory requirements for PHI. Other options either lack HIPAA coverage or introduce unnecessary complexity.

Practice this question →

6

MCQmedium

A compliance officer requires that all AI-generated content in Google Workspace be reviewed before sharing externally. Which change management approach BEST supports this requirement while maintaining user adoption?

A.Roll out the feature to a pilot group with training and a feedback loop before company-wide deployment

B.Allow all sharing but audit logs after the fact

C.Disable AI features in Workspace for all users until a review tool is built

D.Immediately block all external sharing of AI-generated content

AnswerA

An iterative rollout with training and champions allows users to adapt to the review process, improving adoption and compliance.

Why this answer

Iterative rollout with training and a champion program helps users learn the review workflow and provide feedback, leading to higher adoption. Strict enforcement without support reduces adoption. Limiting access doesn't teach compliance.

No review is a risk.

Practice this question →

7

MCQeasy

A company wants to generate marketing images for a new product launch using GenAI. Which Google Cloud service should they use?

A.Gemini for Google Workspace

B.Vertex AI Agent Builder

C.Vertex AI Model Garden

D.Vertex AI Imagen

AnswerD

Imagen is the image generation model on Vertex AI.

Why this answer

Imagen on Vertex AI is Google Cloud's image generation model, available through Vertex AI APIs or Model Garden.

Practice this question →

8

MCQhard

A company is migrating a GenAI proof-of-concept to production. During the pilot, they used a large model (e.g., Gemini 1.5 Pro) and incurred high costs. The use case is simple: generating short product descriptions from structured data. Which cost optimization strategy should they implement first?

A.Fine-tune a smaller model on the specific task

B.Implement batch processing to group requests

C.Reduce the model's temperature to 0.0

D.Switch to a smaller model like Gemini 1.5 Flash and use structured prompts

AnswerD

A smaller model is cheaper per token and sufficient for simple descriptions; structured prompts maintain quality.

Why this answer

Option D is correct because the primary cost driver in this scenario is the model size itself. Since the use case is simple (generating short product descriptions from structured data), a smaller model like Gemini 1.5 Flash can handle the task with significantly lower inference cost per token. Structured prompts further optimize by reducing token waste and ensuring consistent output, making this the most direct and impactful first step.

Exam trap

Cisco often tests the misconception that fine-tuning is the first step for any production optimization, when in reality, model selection and prompt engineering are cheaper and faster to implement for simple tasks.

How to eliminate wrong answers

Option A is wrong because fine-tuning a smaller model is a valid long-term optimization but introduces upfront cost and complexity (data preparation, training compute, and validation) that is unnecessary for a simple task that can be handled by a smaller model with prompt engineering. Option B is wrong because batch processing reduces per-request overhead but does not address the fundamental cost per token of the large model; the savings are marginal compared to switching to a cheaper model. Option C is wrong because reducing temperature to 0.0 only affects output randomness and token selection, not the model's size or inference cost; it may improve determinism but does not reduce the number of parameters or compute required per request.

Practice this question →

9

MCQmedium

A company is building a GenAI chatbot that needs to answer questions using real-time data from their CRM and inventory systems. They want to ensure the model can access external data on demand. Which approach should they use?

A.Fine-tune the model on historical CRM and inventory data

B.Prompt the model to guess the data based on general knowledge

C.Use Vertex AI Extensions to connect to CRM and inventory APIs

D.Export CRM data to BigQuery and use that static snapshot

AnswerC

Extensions enable the model to call external APIs at inference time for real-time data.

Why this answer

Option C is correct because Vertex AI Extensions allow the GenAI chatbot to connect to external APIs (like CRM and inventory systems) in real time, enabling on-demand data retrieval without retraining the model. This approach uses a retrieval-augmented generation (RAG) pattern where the model queries live data sources via API calls, ensuring responses are based on current information rather than static snapshots.

Exam trap

Cisco often tests the distinction between fine-tuning (which changes model weights for static knowledge) and real-time data access via extensions or RAG, where candidates mistakenly think fine-tuning can provide live data when it only captures historical patterns.

How to eliminate wrong answers

Option A is wrong because fine-tuning on historical CRM and inventory data would only embed past patterns into the model, not provide real-time access to current data; the model would still be unable to answer questions about live inventory levels or recent customer interactions. Option B is wrong because prompting the model to guess data based on general knowledge would produce hallucinated or outdated responses, as the model has no inherent access to proprietary, real-time business data. Option D is wrong because exporting CRM data to BigQuery as a static snapshot would create a fixed dataset that becomes stale over time, failing the requirement for on-demand, real-time data access.

Practice this question →

10

Multi-Selecteasy

A marketing team wants to generate social media posts using generative AI. They need the tone to be consistent with their brand voice. Which two prompt engineering techniques should they use? (Choose TWO)

Select 2 answers

A.Set maximum output tokens to a low value

B.Use few-shot examples of approved posts

C.Use negative prompts like 'do not be casual'

D.Set high temperature to encourage creativity

E.Include a detailed brand style guide in the system prompt

AnswersB, E

Examples demonstrate the desired style and help the model replicate it.

Why this answer

Providing a style guide in the system prompt and using few-shot examples are effective techniques to enforce brand voice. Random examples and negative phrasing are not recommended. Maximum tokens does not affect tone.

Practice this question →

11

MCQhard

A financial services firm is deploying a GenAI application that processes sensitive client data. They must ensure that no prompts or model outputs are logged by the model provider. Which configuration in Vertex AI is REQUIRED to meet this data residency and privacy requirement?

A.Set the data residency location to a specific region in Vertex AI

B.Configure VPC Service Controls to create a service perimeter around Vertex AI

C.Use Model Garden to select a model hosted entirely on-premises

D.Enable Customer-Managed Encryption Keys (CMEK) on the Vertex AI endpoint

AnswerB

VPC Service Controls prevent data from being exfiltrated or logged outside the defined perimeter, ensuring provider-side logging is blocked.

Why this answer

VPC Service Controls create a security perimeter around Vertex AI resources, preventing data from being logged outside the customer's VPC. CMEK encrypts data at rest but doesn't prevent logging. Model Garden doesn't control logging.

Data residency settings are broader than the specific logging control needed.

Practice this question →

12

MCQmedium

A startup with limited ML expertise wants to add a GenAI feature to their SaaS application that can generate personalized email drafts for users. They need fast time-to-market and low maintenance. Which build-vs-buy decision is BEST?

A.Select a model from Model Garden and deploy it on Vertex AI

B.Fine-tune an open-source model on a corpus of email drafts to create a custom model

C.Buy a pre-built API such as the Gemini API and integrate it with prompt engineering for personalization

D.Build a custom transformer model from scratch

AnswerC

Pre-built APIs offer quick integration, low maintenance, and the startup can focus on prompt engineering to personalize drafts without ML overhead.

Why this answer

Using pre-built APIs (like Gemini API) with prompt engineering is the fastest path and requires no ML expertise. Custom fine-tuning would require data and expertise. Building from scratch is too heavy.

Model Garden still requires deployment and management.

Practice this question →

13

Multi-Selectmedium

A company is using Vertex AI RAG Engine to ground a chatbot in internal documents. The chatbot sometimes returns outdated information. Which TWO steps should they take to improve freshness?

Select 2 answers

A.Set up automated re-indexing on a schedule (e.g., daily)

B.Reduce the chunk size to 50 tokens

C.Increase the model's temperature to 1.0

D.Implement document chunking with metadata such as version or timestamp

E.Disable grounding and rely on the model's pre-training data

AnswersA, D

Regular re-indexing ensures the vector store reflects document updates.

Why this answer

Automated indexing syncs with document updates; chunking with timestamps allows retrieval of newer documents. Disabling groundings and increasing temperature do not help freshness.

Practice this question →

14

MCQmedium

A company is evaluating the ROI of implementing GenAI for code generation. Which metric BEST captures the productivity improvement of developers?

A.Percentage of code that passes unit tests on the first attempt

B.Time saved per development task (e.g., from 2 hours to 30 minutes)

C.Number of lines of code generated per day

D.Number of bugs found in production after code review

AnswerB

Time saved directly reflects productivity gain.

Why this answer

Time saved per task directly measures productivity. Code quality is an improvement metric, but time saved is the primary productivity indicator.

Practice this question →

15

MCQeasy

A marketing team wants to use generative AI to create ad copy that matches their brand voice. They have several examples of previous high-performing ads. Which Vertex AI Studio feature would best help them achieve consistent tone and style without custom model training?

A.Use of a pre-built template in Vertex AI Studio

B.Supervised fine-tuning on the ad examples

C.Few-shot prompting with examples of previous ads

D.Model evaluation to compare outputs

AnswerC

Few-shot prompting allows the model to learn from examples within the prompt, producing copy that matches the brand voice without training.

Why this answer

Option C is correct because few-shot prompting in Vertex AI Studio allows the model to infer the desired tone and style from a small set of example ads without requiring custom model training. This approach leverages the model's in-context learning capability, making it ideal for quickly adapting to a brand voice while avoiding the cost and complexity of fine-tuning.

Exam trap

The trap here is that candidates often confuse few-shot prompting with fine-tuning, assuming that any use of examples requires model retraining, when in fact few-shot prompting achieves style transfer through in-context learning without modifying model parameters.

How to eliminate wrong answers

Option A is wrong because pre-built templates provide generic structures and do not adapt to a specific brand voice or learn from provided examples. Option B is wrong because supervised fine-tuning requires custom model training, which the question explicitly states should be avoided. Option D is wrong because model evaluation is a post-generation step used to assess output quality, not a method for guiding the model to produce consistent tone and style.

Practice this question →

16

MCQeasy

An employee wants to use GenAI to assist with writing formulas in Google Sheets. Which Gemini for Google Workspace feature should they use?

A.Formula assistance in Google Sheets

B.Help me write in Google Docs

C.Image generation in Google Slides

D.Smart Compose in Gmail

AnswerA

Formula assistance is the Sheets-specific Gemini feature.

Why this answer

Gemini for Sheets provides formula assistance, helping users generate, explain, or debug formulas.

Practice this question →

17

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

C.Train a custom model from scratch on the policy documents each month

D.Use a larger foundation model with a longer context window and paste all documents into each prompt

AnswerB

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Practice this question →

18

MCQhard

A large enterprise is planning to roll out a GenAI assistant for contract negotiation. The legal team wants to ensure that the assistant's outputs are consistent and follow a predefined format for downstream processing. What is the BEST prompt engineering technique?

A.Use few-shot examples with variable formatting

B.Add a system instruction to 'be consistent'

C.Define a structured output schema (e.g., JSON) in the prompt and request the model to output in that format

D.Use chain-of-thought prompting to have the model reason step-by-step

AnswerC

Specifying a structured format like JSON ensures the output can be programmatically parsed and is consistent.

Why this answer

Option C is correct because defining a structured output schema (e.g., JSON) in the prompt explicitly constrains the model's output format, ensuring consistency and machine-readability for downstream contract negotiation processing. This technique leverages the model's instruction-following capability to produce parseable, schema-compliant responses, which is critical for automated legal workflows where variable formatting would break integration.

Exam trap

Cisco often tests the misconception that vague instructions or reasoning techniques (like chain-of-thought) are sufficient for output consistency, when in fact only explicit, machine-readable format constraints guarantee the structured output required for enterprise automation.

How to eliminate wrong answers

Option A is wrong because few-shot examples with variable formatting do not enforce a strict output structure; the model may still deviate from the desired format, leading to inconsistent downstream parsing. Option B is wrong because adding a system instruction to 'be consistent' is too vague and lacks explicit formatting constraints, relying on the model's ambiguous interpretation of consistency rather than a concrete schema. Option D is wrong because chain-of-thought prompting improves reasoning quality but does not guarantee a predefined output format; it focuses on step-by-step reasoning rather than structured output compliance.

Practice this question →

19

MCQmedium

A company is evaluating whether to use a pre-built API or fine-tune a model for their use case. They have a large dataset of domain-specific jargon and need high accuracy on specialized terms. Which factor MOST strongly suggests fine-tuning?

A.The team wants to rapidly prototype a solution

B.The team has a limited budget for compute resources

C.The application requires low latency responses

D.The model needs to understand and generate domain-specific jargon accurately

AnswerD

Fine-tuning on domain data adapts the model to specialized terminology, improving accuracy.

Why this answer

Domain-specific vocabulary is a classic reason to fine-tune, as pre-built APIs may not handle jargon well. Latency sensitivity favors smaller models or APIs. Tight budget favors pre-built.

Rapid prototyping favors pre-built.

Practice this question →

20

MCQmedium

A data science team wants to build a RAG pipeline to ground a chatbot in proprietary knowledge. They need to choose a vector database and embedding model. Which combination is NATIVELY integrated with Vertex AI and requires the least custom infrastructure?

A.TensorFlow Embedding Projector + BigQuery

B.Vertex AI Embeddings API + Vertex AI Vector Search

C.Custom embeddings using a BERT model + Elasticsearch

D.Vertex AI Embeddings API + Pinecone

AnswerB

Both services are fully managed and natively integrated into Vertex AI, reducing infrastructure overhead.

Why this answer

Option B is correct because Vertex AI Embeddings API and Vertex AI Vector Search are both native, fully managed services within the Vertex AI ecosystem, requiring zero custom infrastructure for deployment. The Embeddings API generates text embeddings directly, and Vector Search provides a scalable, low-latency vector database that integrates seamlessly without additional servers or third-party tools.

Exam trap

Cisco often tests the distinction between 'natively integrated' and 'compatible' — candidates may assume any popular vector database like Pinecone works seamlessly with Vertex AI, but only Vertex AI Vector Search offers native, infrastructure-free integration.

How to eliminate wrong answers

Option A is wrong because TensorFlow Embedding Projector is a visualization tool, not a production vector database, and BigQuery is a data warehouse without native vector search capabilities, requiring custom infrastructure for indexing and retrieval. Option C is wrong because custom embeddings using a BERT model require manual model hosting, scaling, and maintenance, while Elasticsearch is not natively integrated with Vertex AI and demands custom infrastructure for embedding generation and vector indexing. Option D is wrong because Pinecone is a third-party vector database that requires separate account setup, API key management, and network configuration, lacking native integration with Vertex AI and increasing infrastructure complexity.

Practice this question →

21

MCQmedium

A company is adopting GenAI for internal knowledge base search. They want to measure the success of the rollout. Which metric is LEAST relevant for evaluating adoption?

A.Employee satisfaction with search results

B.Reduction in time to find information

C.Percentage of employees using the tool weekly

D.Number of queries answered per day

AnswerD

Query volume indicates usage but does not differentiate between casual and effective adoption; it can be inflated by repeated queries from the same users.

Why this answer

Option D is the least relevant metric because it measures raw system throughput rather than adoption or business value. A high number of queries answered per day could indicate system abuse, redundant searches, or poor initial retrieval quality, not successful adoption. Adoption metrics should focus on user engagement, satisfaction, and efficiency gains, not just volume.

Exam trap

Cisco often tests the distinction between operational metrics (e.g., query volume) and adoption/outcome metrics (e.g., satisfaction, time savings), leading candidates to mistakenly choose a volume metric as relevant for adoption evaluation.

How to eliminate wrong answers

Option A is wrong because employee satisfaction with search results directly measures user experience and perceived value, which are core to adoption success. Option B is wrong because reduction in time to find information quantifies efficiency gains, a primary business outcome of GenAI search. Option C is wrong because percentage of employees using the tool weekly measures active engagement and habitual use, a standard adoption KPI.

Practice this question →

22

Multi-Selectmedium

A team is evaluating whether to build a custom fine-tuned model or use a pre-built API for a document summarization task. Which TWO factors most strongly support using a pre-built API?

Select 2 answers

A.The summarization domain is very specialized and requires unique terminology

B.The team lacks machine learning expertise to fine-tune a model

C.The expected usage volume is millions of API calls per day

D.The company has strict data residency requirements that prevent sending data to an API

E.The company needs to launch the feature in two weeks

AnswersB, E

Pre-built APIs require no ML expertise, reducing time and risk.

Why this answer

Option B is correct because a pre-built API eliminates the need for in-house machine learning expertise, as the team can simply call the API endpoint without managing model training, hyperparameter tuning, or infrastructure. This is especially critical when the team lacks the specialized skills required for fine-tuning, such as data preparation, gradient computation, and evaluation on domain-specific metrics like ROUGE.

Exam trap

Cisco often tests the misconception that high usage volume always favors a pre-built API, but candidates must recognize that cost and scalability trade-offs can make a custom model more economical at scale.

Practice this question →

23

MCQeasy

A sales team wants to use Gemini for Google Workspace to draft personalized emails to prospects. Which feature in Gmail should they use?

A.Help me write (in Docs)

B.Vertex AI Studio text generation

C.Smart Compose

D.Google Sheets formula assistance

AnswerC

Smart Compose provides inline suggestions as you type.

Why this answer

Smart Compose in Gmail suggests complete sentences to help draft emails faster. 'Help me write' is in Docs, not Gmail.

Practice this question →

24

Multi-Selecthard

A company is moving a GenAI proof-of-concept to production. The pilot showed promising results, but the production environment must handle higher traffic, ensure low latency, and manage costs. Which THREE actions should they take?

Select 3 answers

A.Implement caching for common or repetitive queries

B.Set up A/B testing to compare model versions and prompt strategies

C.Select a smaller, more efficient model variant if accuracy is acceptable

D.Deploy a single model endpoint without A/B testing to simplify architecture

E.Fine-tune the model on the entire dataset for better performance

AnswersA, B, C

Caching reduces latency and cost for frequent requests.

Why this answer

A/B testing helps compare versions. Choosing a smaller model balances cost and latency. Caching reduces repeated work.

Fine-tuning is expensive and not necessary. A single endpoint without testing is risky.

Practice this question →

25

Multi-Selecteasy

A project manager wants to measure the success of a generative AI feature that summarizes meeting transcripts. Which TWO metrics are MOST appropriate for evaluating quality improvement?

Select 2 answers

A.Average handle time for meeting follow-ups

B.Accuracy of summaries (e.g., factuality, completeness)

C.API response time

D.Cost per summary generated

E.User satisfaction rating of the summaries

AnswersB, E

Accuracy directly measures output quality.

Why this answer

Accuracy of summaries and user satisfaction directly measure quality. AHT measures productivity. Cost per summary is financial.

API latency is performance.

Practice this question →

26

MCQhard

A company notices that their GenAI-powered code review assistant sometimes suggests code with security vulnerabilities. They want to reduce this risk without sacrificing review speed. Which approach is MOST effective?

A.Disable the code review assistant for security-sensitive modules

B.Include the company's secure coding guidelines in the system prompt and provide few-shot examples of secure code

C.Manually review every suggestion from the assistant before accepting it

D.Switch to a smaller, less capable model that is less likely to generate complex code

AnswerB

This directly guides the model to follow security standards and reduces vulnerabilities without major overhead.

Why this answer

Option B is correct because it directly addresses the root cause of insecure code generation by providing the model with explicit secure coding guidelines and few-shot examples. This technique, known as in-context learning, steers the model's output toward desired behaviors without altering its architecture or speed. It effectively reduces security vulnerabilities while maintaining the assistant's rapid review capability.

Exam trap

Cisco often tests the misconception that disabling or manually reviewing the assistant is the safest approach, when in fact the most effective solution is to guide the model's behavior through prompt engineering without sacrificing automation speed.

How to eliminate wrong answers

Option A is wrong because disabling the assistant for security-sensitive modules eliminates its benefits entirely, leaving those modules without automated review and potentially slowing down development. Option C is wrong because manual review of every suggestion negates the speed advantage of the GenAI assistant, defeating the purpose of using it for code review. Option D is wrong because switching to a smaller, less capable model may reduce code complexity but does not inherently improve security; it could even miss vulnerabilities that a larger model might catch, and it still requires additional measures to enforce secure coding practices.

Practice this question →

27

MCQmedium

A company is evaluating the ROI of deploying a GenAI code review assistant. They want to measure productivity gains. Which metric is MOST directly tied to developer efficiency?

A.Reduction in post-release bug count

B.Time saved per code review

C.Developer satisfaction score

D.Number of lines of code reviewed

AnswerB

Time saved per review directly measures the productivity gain from using the GenAI assistant.

Why this answer

Time saved per code review (Option B) is the most direct measure of developer efficiency because it quantifies the reduction in manual review effort, which is the primary benefit of a GenAI code review assistant. Unlike indirect metrics, this directly captures the core value proposition of automating or accelerating the review process, leading to faster development cycles and reduced context switching.

Exam trap

Cisco often tests the distinction between efficiency (time/output) and effectiveness (quality/outcome), so candidates mistakenly choose bug reduction (quality) instead of time saved (efficiency) when the question explicitly asks for productivity gains.

How to eliminate wrong answers

Option A is wrong because reduction in post-release bug count measures code quality and defect prevention, not developer efficiency or productivity gains directly; a GenAI assistant might improve quality, but efficiency is about time and throughput. Option C is wrong because developer satisfaction score is a subjective, lagging indicator of morale or usability, not a direct measure of efficiency or time savings. Option D is wrong because number of lines of code reviewed is a volume metric that can increase with inefficient processes (e.g., reviewing more code due to poor suggestions) and does not capture time savings or productivity improvements.

Practice this question →

28

MCQmedium

A company wants to measure the ROI of a GenAI-based report generation tool. Which metric is MOST directly tied to business value?

A.Model accuracy on a test set

B.Number of API calls made per month

C.Percentage of reports that require human editing

D.Average time saved per report by analysts

AnswerD

Time saved translates to cost savings and productivity, directly impacting ROI.

Why this answer

Time saved per report directly quantifies productivity gains, a key component of ROI. Other metrics are indirect or operational.

Practice this question →

29

Multi-Selectmedium

An organization is planning to roll out a generative AI internal knowledge base assistant to employees. They want to ensure adoption and manage change effectively. Which two change management practices should they prioritize? (Choose TWO)

Select 2 answers

A.Provide training sessions on how to write effective prompts

B.Roll out to all employees on day one to maximize impact

C.Mandate usage of the assistant for all employees

D.Start with a pilot group of AI champions to gather feedback

E.Disable the assistant after two weeks if usage is low

AnswersA, D

Training empowers users to get better results, increasing satisfaction and adoption.

Why this answer

Option A is correct because effective prompt engineering is critical for generative AI assistants; without training, employees may produce vague or poorly structured prompts, leading to irrelevant or low-quality responses, which undermines adoption. Providing training on prompt writing directly addresses the skill gap and empowers users to leverage the assistant effectively.

Exam trap

Cisco often tests the distinction between 'maximizing immediate impact' (Option B) and 'phased adoption with feedback loops' (Option D), where candidates mistakenly choose a rapid full rollout thinking it drives faster adoption, ignoring the proven change management principle of starting small to build advocacy and refine the tool.

Practice this question →

30

Multi-Selectmedium

A company wants to adopt GenAI for internal knowledge base Q&A. They have a collection of PDFs, internal wikis, and slide decks. Which TWO services should they combine to build a RAG-based solution?

Select 2 answers

A.Vertex AI Search for grounding

B.Model Garden for model selection

C.Gemini for Google Workspace

D.Vertex AI Agent Builder with a data store for documents

E.Vertex AI Studio for prompt design

AnswersA, D

Grounding with Vertex AI Search retrieves relevant document chunks to augment LLM responses.

Why this answer

Vertex AI Agent Builder can create a data store from these documents, and grounding with Vertex AI Search retrieves relevant chunks for the LLM.

Practice this question →

31

Multi-Selectmedium

A data science team wants to decide between using a pre-built API (e.g., Vertex AI Gemini API) and fine-tuning a custom model for a specific business task. Which TWO factors are most important in making this build versus buy decision?

Select 2 answers

A.Color scheme of the user interface

B.Developer preference for programming languages

C.Number of users who will interact with the system

D.Cost of inference per query for pre-built API vs fine-tuned model

E.Availability of high-quality labeled data for the specific task

AnswersD, E

Cost comparison is crucial; fine-tuning may reduce per-query cost for high volume.

Why this answer

Pre-built APIs are best when task complexity is low and data privacy is not a concern. Fine-tuning is better for high complexity and when data must remain private.

Practice this question →

32

MCQmedium

A company is piloting a GenAI feature for internal knowledge base search. During the pilot, users report that the AI sometimes gives incorrect answers based on outdated documents. What is the MOST effective way to address this issue?

A.Add a system instruction to the prompt telling the model to only answer if it is confident

B.Decrease the temperature parameter of the model to 0 to reduce randomness

C.Implement Retrieval-Augmented Generation (RAG) with the knowledge base documents indexed in a vector store and ensure the index is updated when documents change

D.Fine-tune the model on the current knowledge base to improve accuracy

AnswerC

RAG retrieves the most relevant and current documents at inference time, directly addressing the issue of outdated information.

Why this answer

Grounding with Google Search (or enterprise search) ties responses to fresh sources, but ensuring the knowledge base is up-to-date and using RAG with the latest documents is more direct. Fine-tuning on old data will not fix freshness. Reducing temperature helps but does not solve outdated content.

Prompting to refuse uncertain answers can help but not as reliable as RAG.

Practice this question →

33

MCQeasy

A marketing team wants to generate blog post ideas and draft outlines. They need a solution that works within Google Docs and leverages a foundation model without leaving the document editor. Which service should they use?

A.Vertex AI Studio

B.Gemini for Google Workspace (Duet AI)

C.Model Garden

D.Vertex AI Agent Builder

AnswerB

Gemini for Google Workspace integrates directly into Docs, Slides, and other Workspace apps for tasks like drafting text.

Why this answer

Gemini for Google Workspace (formerly Duet AI) is the correct choice because it is the only service that integrates generative AI directly into Google Docs, allowing users to generate blog post ideas and outlines without leaving the document editor. It leverages a foundation model (Gemini) natively within the Workspace environment, enabling seamless, in-context assistance for content creation tasks.

Exam trap

The trap here is that candidates often confuse Vertex AI Studio (a model development platform) with Gemini for Google Workspace (a productivity assistant), assuming any Google Cloud generative AI service can be used directly within Docs, but only the Workspace-integrated tool provides the seamless, in-editor experience described.

How to eliminate wrong answers

Option A is wrong because Vertex AI Studio is a standalone platform for building, testing, and customizing generative AI models, not an integrated assistant within Google Docs; it requires leaving the document editor to interact with the model. Option C is wrong because Model Garden is a repository of pre-trained foundation models and does not provide a direct, in-editor assistant for generating content within Google Docs. Option D is wrong because Vertex AI Agent Builder is designed for creating conversational agents and search applications, not for inline content generation within a document editor like Google Docs.

Practice this question →

34

Multi-Selecthard

A company is implementing a GenAI-powered internal knowledge base chatbot. They need to ensure answers are based on company documents and the system should be easy to update as documents change. Which TWO components should they use?

Select 2 answers

A.Document AI

B.Vertex AI Studio

C.Vertex AI RAG Engine

D.Model Garden

E.Vertex AI Agent Builder

AnswersC, E

RAG Engine handles retrieval of relevant document chunks.

Why this answer

Option C (Vertex AI RAG Engine) is correct because it provides a managed Retrieval-Augmented Generation (RAG) service that directly ingests company documents, indexes them into a vector database, and retrieves relevant chunks to ground the chatbot's answers. This ensures responses are based on the latest document content, and the system is easy to update by simply re-indexing or refreshing the document source without retraining the model.

Exam trap

Cisco often tests the distinction between general-purpose AI services (like Document AI or Model Garden) and specialized RAG/agent-building services, so candidates mistakenly pick Document AI thinking it handles document-based Q&A, but it lacks retrieval and grounding capabilities.

Practice this question →

35

MCQmedium

A retail company wants to generate personalized marketing emails at scale. They have a customer database and past purchase history. Which implementation pattern is most cost-effective and scalable?

A.Fine-tune a model on all customer data and deploy a dedicated endpoint

B.Use Vertex AI API in batch mode, sending customer data in each request and caching responses for common segments

C.Use Duet AI in Gmail to manually draft each email

D.Use real-time streaming with WebSocket connections to generate each email on-demand

AnswerB

Batch mode allows processing many requests efficiently, and caching reduces token usage for repeated content.

Why this answer

Option B is correct because using Vertex AI API in batch mode allows the company to process large volumes of customer data asynchronously, significantly reducing costs compared to real-time or dedicated endpoint deployments. Caching responses for common segments further optimizes by avoiding redundant API calls, making this pattern both cost-effective and scalable for generating personalized marketing emails at scale.

Exam trap

Cisco often tests the misconception that fine-tuning is always the best approach for personalization, but the trap here is that batch inference with caching is more cost-effective and scalable for high-volume, non-real-time tasks like marketing email generation.

How to eliminate wrong answers

Option A is wrong because fine-tuning a model on all customer data and deploying a dedicated endpoint incurs high infrastructure costs (e.g., GPU/TPU provisioning) and is unnecessary for a task that can be achieved with prompt engineering and batch processing. Option C is wrong because using Duet AI in Gmail to manually draft each email is not scalable for generating emails at scale and defeats the purpose of automation, as it requires human intervention for every email. Option D is wrong because real-time streaming with WebSocket connections is overkill for batch email generation, leading to higher latency and cost due to maintaining persistent connections, and does not align with the asynchronous nature of bulk email campaigns.

Practice this question →

36

MCQmedium

A retail company uses Vertex AI Agent Builder to create a virtual assistant for order tracking. Users frequently ask about delivery dates, but the assistant sometimes gives incorrect information. The team wants to improve accuracy without retraining the underlying model. Which technique should they apply?

A.Increase the temperature parameter for more deterministic outputs

B.Switch to a larger model size for better reasoning

C.Add more few-shot examples to the prompt template

D.Enable Grounding with Google Search or connect to a custom data store

AnswerD

Grounding allows the agent to retrieve up-to-date information from search or enterprise databases, reducing hallucinations about order status.

Why this answer

Grounding with Google Search or enterprise data sources (like order databases) ensures the agent retrieves real-time, accurate information instead of relying solely on the model's training data.

Practice this question →

37

Multi-Selecthard

A company is transitioning a generative AI pilot to production. They need to ensure cost predictability and scalability. Which THREE actions should they take?

Select 3 answers

A.Provision reserved throughput for all requests

B.Implement response caching for common queries

C.Select the smallest model that meets quality requirements

D.Use batch API for non-real-time requests

E.Conduct A/B testing on model versions

AnswersB, C, D

Caching avoids redundant API calls, reducing token usage and cost.

Why this answer

Response caching for common queries reduces latency and API costs by serving repeated requests from a cache instead of invoking the generative AI model each time. This directly improves cost predictability (fewer model invocations) and scalability (reduced load on the model endpoint), making it a core optimization for production deployments.

Exam trap

Cisco often tests the misconception that reserved throughput (Option A) is always the best way to ensure cost predictability, when in fact it can increase costs for spiky workloads and ignores the scalability benefits of caching and batch processing.

Practice this question →

38

MCQeasy

A project manager wants to reduce the cost of using Gemini API for batch processing of customer feedback. The team is on a tight budget. Which cost management strategy is MOST effective?

A.Use batch requests to group multiple prompts

B.Increase the temperature to max to reduce output length

C.Disable logging to reduce storage costs

D.Switch to the largest model available for better accuracy

AnswerA

Batch requests are cheaper than individual calls for large volumes.

Why this answer

Batch requests reduce per-token cost by grouping multiple prompts into one API call. Caching helps but is less impactful for varied feedback.

Practice this question →

39

MCQmedium

A financial services company wants to automate contract analysis to extract key clauses and identify risky terms. They have thousands of PDF contracts and need a solution that can be quickly integrated into their existing document management system. Which Google Cloud service is MOST suitable?

A.Model Garden

B.Document AI (DocAI)

C.Vertex AI Studio

D.Vertex AI Agent Builder

AnswerB

DocAI provides pre-trained models for parsing contracts and extracting custom entities, directly integrated via API.

Why this answer

DocAI is purpose-built for document understanding and structured extraction from PDFs. Vertex AI Studio focuses on prompt design for generative models, while Model Garden is for selecting models. Vertex AI Agent Builder is for building conversational agents, not document processing.

Practice this question →

40

MCQmedium

A financial firm wants to use GenAI to draft emails for client communications. They need to ensure regulatory compliance and maintain a consistent professional tone. Which approach is MOST suitable?

A.Use zero-shot prompting with a compliance checklist in the prompt

B.Use Vertex AI Agent Builder with grounding on regulatory documents

C.Fine-tune a model on historical approved emails and deploy via Vertex AI API

D.Use Gemini in Gmail with Smart Compose enabled

AnswerC

Fine-tuning on approved examples ensures regulatory compliance and tone consistency.

Why this answer

Fine-tuning on approved examples ensures consistent tone and compliance. Fine-tuning adapts the model to the regulated domain better than prompting alone.

Practice this question →

41

MCQhard

A company wants to deploy a GenAI system for contract analysis that must produce structured output (e.g., JSON) for downstream processing. They need to ensure the output format is consistent. Which prompt engineering technique is MOST effective?

A.Post-process the output with a regex to fix formatting

B.Provide a few-shot example with the desired JSON format

C.Use a system prompt that says 'output JSON'

D.Use a larger model with better formatting capabilities

AnswerB

Few-shot examples explicitly show the model the exact structure, leading to more consistent outputs than instructions alone.

Why this answer

Option B is correct because providing a few-shot example with the desired JSON format explicitly demonstrates the exact structure, keys, and nesting required, which guides the model to generate consistent structured output. This technique leverages in-context learning, where the model infers the output pattern from the examples, making it more reliable than vague instructions or post-processing fixes.

Exam trap

Cisco often tests the misconception that a vague instruction like 'output JSON' or a larger model alone can guarantee format consistency, when in fact explicit few-shot examples are required to enforce a specific schema and reduce output variability.

How to eliminate wrong answers

Option A is wrong because post-processing with regex is a brittle workaround that cannot fix semantic errors (e.g., missing keys, incorrect nesting) and assumes the model already produced nearly correct JSON, which is not guaranteed; it also adds latency and maintenance overhead. Option C is wrong because a system prompt that simply says 'output JSON' is too vague—it does not specify the schema, key names, data types, or nesting, leading to inconsistent or malformed outputs. Option D is wrong because using a larger model does not inherently solve formatting consistency; even large models can produce varied JSON structures without explicit formatting guidance, and this approach increases cost and latency without addressing the root cause.

Practice this question →

42

MCQeasy

A marketing team wants to use GenAI to generate social media posts. They need consistent brand voice and the ability to output structured JSON for downstream scheduling. Which combination of Vertex AI features should they use?

A.Model Garden to select a model and then fine-tune it for brand voice

B.Vertex AI Studio with few-shot prompting and a response schema for JSON

C.Vertex AI RAG Engine with a vector store of past posts

D.Vertex AI Agent Builder and Grounding with Google Search

AnswerB

Vertex AI Studio supports few-shot prompting to maintain tone and a response schema to enforce JSON output, meeting both needs without extra complexity.

Why this answer

Vertex AI Studio offers prompt design capabilities, and you can define few-shot examples to enforce tone. Structured output can be requested by specifying a response schema. Model Garden is a model hub, not a prompt tool; Fine-tuning is overkill for this task.

Practice this question →

43

MCQeasy

A company is deciding between using a pre-built GenAI API (like Gemini API) and building a custom fine-tuned model. Which factor would MOST strongly favor the custom fine-tuned model?

A.Limited budget for AI development

B.Low latency requirements for real-time responses

C.Small volume of inference requests per day

D.Need for specialized domain knowledge that general models lack

AnswerD

Fine-tuning on domain-specific data improves accuracy on specialized tasks.

Why this answer

Custom fine-tuning is best when the task requires specialized domain knowledge not available in the pre-built model's training data.

Practice this question →

44

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Fine-tune a base LLM on the policy documents monthly

B.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

C.Train a custom model from scratch on the policy documents each month

D.Use a larger foundation model with a longer context window and paste all documents into each prompt

AnswerB

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

Practice this question →

45

Multi-Selectmedium

A company wants to build a GenAI application that summarizes customer reviews into actionable insights. The solution must be cost-effective and handle variable traffic. Which TWO strategies should they implement? (Choose 2)

Select 2 answers

A.Deploy on a dedicated GPU cluster to minimize latency

B.Use the largest available foundation model for best accuracy

C.Use a small, fine-tuned model for summarization

D.Implement batching of review summaries to reduce per-request overhead

E.Store all raw reviews in the prompt context to avoid retrieval

AnswersC, D

Smaller models cost less per token and are sufficient for summarization.

Why this answer

Using a smaller model for summarization reduces cost for high volume. Implementing batching groups requests to optimize token usage. Other options increase cost or latency.

Practice this question →

46

Multi-Selectmedium

A company wants to adopt GenAI for contract analysis. They are evaluating build vs. buy. Which TWO factors are MOST important when deciding to build a custom fine-tuned model instead of using a pre-built API? (Choose 2)

Select 2 answers

A.Need to deploy the solution within a week

B.Low budget for AI development

C.Desire to keep sensitive contract data within the company's VPC

D.Inability to evaluate model accuracy

E.Requirement for high accuracy on domain-specific legal terminology

AnswersC, E

Build option allows data not to leave the environment, meeting privacy requirements.

Why this answer

Need for domain-specific terminology and data privacy requirements are key reasons to build. Cost and speed of deployment favor buy. Model accuracy can be high for both.

Practice this question →

47

MCQmedium

A company wants to create a GenAI application that can answer questions about their internal policy documents, but they require that the model's responses are grounded in the actual documents to avoid hallucination. Which Vertex AI feature should they use?

A.Vertex AI Studio

B.Vertex AI Agent Builder

C.Grounding with Google Search (or enterprise data sources)

D.Model Garden

AnswerC

Grounding connects the model to external sources, reducing hallucinations by basing responses on retrieved information.

Why this answer

Grounding with Google Search or enterprise data sources ensures responses are based on verified sources. RAG Engine is a broader tool for building RAG applications. Vertex AI Studio is for prompt design.

Agent Builder is for conversational agents but may not provide grounding directly.

Practice this question →

48

MCQeasy

A sales team needs to draft personalized emails to prospects. They want a tool integrated into Gmail that suggests completions and can generate full drafts based on brief prompts. Which Gemini for Google Workspace feature should they use?

A.Gemini in Sheets with formula assistance

B.Gemini in Meet for meeting summaries

C.Gemini in Gmail with Smart Compose and 'Help me write'

D.Gemini in Docs with 'Help me write'

AnswerC

These features provide inline suggestions and draft generation in Gmail.

Why this answer

Smart Compose and 'Help me write' in Gmail provide AI-powered suggestions and draft generation directly within the email interface.

Practice this question →

49

MCQhard

A healthcare organization wants to deploy a GenAI chatbot for patient intake. They must ensure that the model does not generate harmful advice and that patient data remains confidential. Which set of controls should they implement on Vertex AI?

A.Use model tuning with only positive examples

B.Enable safety filters, use VPC Service Controls, and enforce Private Google Access

C.Disable logging and restrict API keys

D.Use a public model endpoint and rely on prompt engineering

AnswerB

Safety filters moderate outputs; VPC Service Controls and Private Google Access keep data on-premises.

Why this answer

Safety filters block harmful content; VPC Service Controls keep data within the customer's VPC; Private Google Access prevents data exfiltration. The other options lack these combined controls.

Practice this question →

50

MCQeasy

A marketing team wants to generate product descriptions for 1000 new items. They need consistent brand voice and the ability to review and edit outputs before publishing. Which approach is most suitable?

A.Use the Gemini API with grounding in Google Search to ensure factual accuracy

B.Deploy a customer service chatbot in Vertex AI Agent Builder to generate descriptions

C.Use the Gemini API with a few-shot prompt that includes examples of the desired brand voice

D.Build a custom fine-tuned model on past product descriptions

AnswerC

Few-shot prompting is fast, cost-effective, and allows the team to iterate on prompts until the tone is right. Outputs can be reviewed and edited before publishing.

Why this answer

Using the Gemini API with a few-shot prompt containing examples of desired tone gives the team control and consistency while allowing quick edits. Fine-tuning would take time and cost, and a chatbot is not designed for batch generation. Grounding with Google Search is unnecessary for generated descriptions.

Practice this question →

51

Multi-Selecthard

A law firm wants to use generative AI to analyze contracts and extract key clauses. They need high accuracy and the ability to handle diverse contract formats. Which three steps should they take in their proof-of-concept (PoC) phase? (Choose THREE)

Select 3 answers

A.Use few-shot prompting with examples of desired clause extraction

B.Implement a human-in-the-loop review process for extracted clauses

C.Test on a diverse set of 50-100 contracts covering common variations

D.Deploy a pre-built contract analysis API without customization

E.Train a custom model from scratch using all historical contracts

AnswersA, B, C

Few-shot examples guide the model to produce consistent, structured outputs.

Why this answer

Few-shot prompting provides the model with specific examples of desired clause extraction, guiding it to produce accurate outputs without requiring fine-tuning. This technique is efficient for a proof-of-concept because it leverages the model's existing capabilities while adapting to the task through in-context learning, which is critical for handling diverse contract formats with high accuracy.

Exam trap

Cisco often tests the misconception that a proof-of-concept should aim for a production-ready solution immediately, leading candidates to choose options like D (pre-built API) or E (custom training) instead of focusing on iterative, low-cost validation steps like few-shot prompting and human review.

Practice this question →

52

Multi-Selecteasy

A data analyst wants to use Gemini for Google Workspace to assist with Sheets. Which two capabilities does Duet AI in Sheets provide? (Choose TWO)

Select 2 answers

A.Generate formula suggestions based on natural language descriptions

B.Classify and tag data in columns automatically

C.Apply conditional formatting rules based on patterns

D.Create charts from images pasted into the sheet

E.Write custom Apps Script code automatically

AnswersA, B

Users can describe what they want, and Duet AI suggests the formula.

Why this answer

Option A is correct because Duet AI in Sheets can interpret natural language descriptions of desired calculations and automatically generate the corresponding formula. This leverages Gemini's language understanding to translate user intent into spreadsheet functions, significantly reducing the need to manually recall or construct complex formulas.

Exam trap

The trap here is that candidates may confuse Duet AI's formula suggestion capability with broader automation features like custom scripting or image-based chart creation, which are not supported in the current Duet AI for Sheets implementation.

Practice this question →

53

Multi-Selectmedium

A company is planning an iterative rollout of a GenAI assistant for sales teams. They want to maximize adoption and minimize disruption. Which TWO change management practices should they prioritize? (Choose two.)

Select 2 answers

A.Provide hands-on training sessions and documentation

B.Disable the legacy system immediately after GenAI launch

C.Roll out to all employees simultaneously to create urgency

D.Identify AI champions in the sales team to pilot the tool first

E.Set a mandatory usage target for each sales rep

AnswersA, D

Training reduces confusion and helps employees use the tool effectively.

Why this answer

Option A is correct because hands-on training and documentation directly address the learning curve and anxiety associated with adopting a GenAI assistant, ensuring users understand how to interact with the model effectively (e.g., prompt engineering basics, interpreting outputs) and reducing friction. This practice aligns with the iterative rollout strategy by providing staged support, which maximizes adoption through competence and confidence rather than coercion.

Exam trap

Cisco often tests the misconception that aggressive mandates or full-scale rollouts create urgency and speed adoption, but in GenAI contexts, they actually increase resistance and risk of failure due to the need for trust calibration and iterative refinement of model behavior.

Practice this question →

54

MCQmedium

During a proof-of-concept, a team notices that their GenAI-powered meeting summarizer occasionally includes hallucinated details. They want to improve summary accuracy before production. Which action would be most effective?

A.Upgrade to a larger model

B.Provide the meeting transcript as context in the prompt

C.Decrease the temperature to 0.0

D.Use few-shot prompting with example summaries

AnswerB

Grounding the model with the actual transcript ensures the summary is based on real content, reducing hallucinations.

Why this answer

Providing the meeting transcript as context grounds the model in factual data, reducing hallucinations. Few-shot prompting may help, but the primary issue is lack of source material. Adjusting temperature is less impactful.

Switching to a larger model without context may not help.

Practice this question →

55

Multi-Selectmedium

A company wants to use GenAI to create a knowledge base assistant that can answer questions from internal documentation. They need the assistant to always have access to the latest documents without retraining. Which TWO services should they combine? (Choose 2)

Select 2 answers

A.Vertex AI Pipelines

B.Vertex AI Agent Builder

C.Vertex AI Model Garden

D.Duet AI in Google Docs

E.Vertex AI Vector Search

AnswersB, E

Agent Builder is used to create the conversational assistant.

Why this answer

Vertex AI Agent Builder provides the conversational agent framework, and Vertex AI Vector Search enables retrieval from the latest documents using embeddings.

Practice this question →

56

MCQhard

A company is piloting a GenAI code review assistant. Developers report that the assistant often suggests incorrect or insecure code snippets. The team wants to improve the assistant's reliability before expanding the pilot. Which approach should they prioritize?

A.Use RAG to retrieve code snippets from a database of known vulnerabilities

B.Increase the model's temperature parameter to generate more diverse suggestions

C.Switch to a larger foundation model without additional tuning

D.Fine-tune the base model on a curated dataset of secure code review examples

AnswerD

Fine-tuning on high-quality examples teaches the model to prioritize secure and correct suggestions.

Why this answer

Fine-tuning on a curated dataset of secure code review examples directly addresses the assistant's tendency to suggest incorrect or insecure code by adapting the model's behavior to the specific patterns and standards of secure coding. Unlike retrieval or parameter adjustments, this approach modifies the model's weights to prioritize security and correctness in its outputs, making it the most effective method for improving reliability in a targeted domain.

Exam trap

Cisco often tests the misconception that adding more data (via RAG) or increasing model size (larger foundation model) automatically improves output quality, when in fact targeted fine-tuning on domain-specific, high-quality data is required to correct systematic errors in generative outputs.

How to eliminate wrong answers

Option A is wrong because RAG retrieves code snippets from a database of known vulnerabilities, which would surface insecure examples rather than correct ones, potentially worsening the assistant's suggestions. Option B is wrong because increasing the temperature parameter makes the model more random and less deterministic, which would increase the likelihood of generating diverse but even less reliable and more insecure code. Option C is wrong because switching to a larger foundation model without tuning does not guarantee improved security or correctness; larger models can still produce insecure code if not specifically aligned with secure coding practices, and they may introduce higher latency and cost without addressing the root cause.

Practice this question →

57

MCQmedium

A company uses Gemini for Google Workspace to help employees draft emails. After rollout, the IT team notices that the Smart Compose suggestions are sometimes inappropriate for external communications. What is the BEST way to address this while maintaining productivity gains?

A.Switch to a third-party email assistant not integrated with Workspace

B.Train employees to write more specific prompts and provide few-shot examples for desired tone

C.Disable Smart Compose for all external email accounts

D.Fine-tune the Gemini model on the company's past external emails

AnswerB

Prompt engineering training helps employees guide the model effectively, reducing inappropriate suggestions while maintaining productivity.

Why this answer

Providing few-shot examples or using custom prompts for tone can guide the model to produce more appropriate suggestions. Disabling the feature or using a generic model would not leverage Workspace integration.

Practice this question →

58

MCQeasy

A sales team wants to use Gemini for Google Workspace to draft emails in Gmail. Which feature should they enable to generate full email drafts from a brief prompt?

A.Translate

B.Smart Reply

C.Help me write

D.Smart Compose

AnswerC

Help me write generates a complete email draft from a prompt, ideal for drafting.

Why this answer

Option C is correct because 'Help me write' is the specific Gemini for Google Workspace feature designed to generate full email drafts from a brief prompt. It leverages the underlying large language model to compose complete, context-aware email content, unlike features that only suggest short phrases or replies.

Exam trap

The trap here is that candidates often confuse 'Smart Compose' (which only suggests short text completions) with the full-draft generation capability of 'Help me write', leading them to select the wrong feature.

How to eliminate wrong answers

Option A is wrong because 'Translate' is a language translation feature, not a generative drafting tool; it converts existing text from one language to another. Option B is wrong because 'Smart Reply' only suggests short, pre-generated response phrases (e.g., 'Yes, sounds good') based on the email content, not full drafts from a prompt. Option D is wrong because 'Smart Compose' provides inline word and phrase suggestions as you type, but does not generate a complete email draft from a brief prompt.

Practice this question →

59

MCQmedium

A company wants to evaluate the ROI of their GenAI content creation tool. Which metric is LEAST useful for assessing productivity gains?

A.Volume of content produced per week

B.Average time to create a piece of content

C.Content uniqueness score compared to competitors

D.Reduction in time from draft to final approval

AnswerC

Uniqueness measures content quality or differentiation, not productivity.

Why this answer

Content uniqueness score compared to competitors is a measure of originality or differentiation, not a direct metric for productivity gains. Productivity in GenAI content creation focuses on efficiency and throughput, such as volume, speed, and cycle time reduction, not competitive benchmarking.

Exam trap

Cisco often tests the distinction between productivity metrics (efficiency/throughput) and quality or differentiation metrics, leading candidates to mistakenly select a quality metric like uniqueness as relevant to productivity gains.

How to eliminate wrong answers

Option A is wrong because volume of content produced per week directly measures output quantity, a core productivity metric for GenAI tools that automate generation. Option B is wrong because average time to create a piece of content directly measures efficiency gains, a primary productivity indicator. Option D is wrong because reduction in time from draft to final approval measures workflow acceleration, a key productivity benefit of GenAI in content pipelines.

Practice this question →

60

MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Use a larger foundation model with a longer context window and paste all documents into each prompt

B.Fine-tune a base LLM on the policy documents monthly

C.Train a custom model from scratch on the policy documents each month

D.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store

AnswerD

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

Retrieval-Augmented Generation (RAG) is the most appropriate approach because it allows the chatbot to answer questions based on the latest policy documents without retraining the model. By indexing the documents in a vector store and retrieving relevant chunks at inference time, RAG ensures the model's responses are grounded in the current data, accommodating monthly updates efficiently.

Exam trap

Cisco often tests the misconception that fine-tuning is the only way to incorporate new data, when in fact RAG provides a dynamic, cost-effective alternative that avoids retraining.

How to eliminate wrong answers

Option A is wrong because pasting all documents into each prompt would exceed the context window limits of even the largest foundation models (e.g., 128K tokens), leading to high latency, cost, and potential truncation of critical information. Option B is wrong because fine-tuning a base LLM monthly on the policy documents is computationally expensive, time-consuming, and risks catastrophic forgetting of previously learned knowledge, making it unsustainable for frequent updates. Option C is wrong because training a custom model from scratch each month is prohibitively expensive, requires massive datasets and compute resources, and is entirely unnecessary when RAG can achieve the same goal with far less overhead.

Practice this question →

61

MCQmedium

A company wants to evaluate the ROI of deploying a GenAI tool for customer support. They plan to measure productivity gains. Which metric is most directly tied to productivity improvement?

A.Total token consumption

B.Average handling time per ticket

C.Ticket deflection rate

D.Customer satisfaction score (CSAT)

AnswerB

Reduction in handling time directly indicates productivity improvement.

Why this answer

Average handling time (AHT) per ticket is the most direct metric for productivity improvement because it quantifies the time an agent spends resolving a customer issue. A GenAI tool that generates response drafts or retrieves knowledge base articles reduces the agent's wrap-up and research time, directly lowering AHT. This is a standard contact center metric (e.g., measured in seconds or minutes) that maps to cost savings and throughput gains.

Exam trap

Cisco often tests the distinction between efficiency metrics (like AHT) and effectiveness or automation metrics (like deflection or CSAT), trapping candidates who confuse 'productivity' with 'customer satisfaction' or 'automation rate'.

How to eliminate wrong answers

Option A is wrong because total token consumption measures the volume of text processed by the GenAI model (e.g., input + output tokens billed per API call), not the efficiency or speed of human agents; it is a cost metric, not a productivity metric. Option C is wrong because ticket deflection rate (e.g., percentage of issues resolved by a chatbot without human intervention) measures automation success, not the productivity improvement of human agents; it is a containment metric. Option D is wrong because customer satisfaction score (CSAT) measures perceived quality of service, not the speed or efficiency of ticket resolution; it is an outcome metric that can be influenced by many factors beyond agent productivity.

Practice this question →

62

MCQeasy

A marketing team wants to generate social media posts in a consistent brand voice. They have a few examples of high-performing posts. Which prompt engineering technique should they use?

A.Few-shot prompting with examples of past successful posts

B.Fine-tuning the model on all past social media posts

C.Zero-shot prompting with a detailed description of the brand voice

D.Chain-of-thought prompting to explain reasoning

AnswerA

Few-shot examples guide the model to match the desired style and tone.

Why this answer

Few-shot prompting provides the model with examples of the desired output style and tone, enabling consistent brand voice without fine-tuning.

Practice this question →

63

MCQmedium

A developer is using Vertex AI Studio to design a prompt for a content moderation system. They need the model to return a structured JSON with fields 'category' and 'confidence_score'. Which prompt engineering technique should they use?

A.Set a role prompt instructing the model to act as a content moderator

B.Use a response schema (structured output) in Vertex AI Studio

C.Use zero-shot prompting and rely on the model's ability to infer JSON

D.Include a few-shot example of the desired JSON output in the prompt

AnswerB

Response schemas enforce the JSON format and fields, ensuring consistent structured output from the model.

Why this answer

Response schemas in Vertex AI Studio allow you to define the exact structure of the output, ensuring consistent JSON. Few-shot provides examples but not a strict schema. Zero-shot is too variable.

Role-setting is for tone, not structure.

Practice this question →

64

MCQmedium

A company is planning an iterative rollout of a GenAI tool. Which change management practice is MOST critical for ensuring high adoption among employees?

A.Implement a quick rollback plan in case employees resist

B.Conduct mandatory training sessions for all employees before rollout

C.Identify and train AI champions who can demonstrate value and support colleagues

D.Restrict access to all non-GenAI tools to force usage

AnswerC

AI champions act as internal advocates, providing peer support and driving organic adoption.

Why this answer

Option C is correct because AI champions serve as peer-level advocates who can demonstrate practical use cases, address context-specific concerns, and provide ongoing support, which is critical for overcoming the skepticism and learning curve associated with GenAI tools. Unlike top-down mandates, this grassroots approach builds trust and organic adoption by leveraging social proof and localized expertise.

Exam trap

Cisco often tests the distinction between technical contingency planning (rollback) and human-centric change management, leading candidates to confuse a safety net with an adoption strategy.

How to eliminate wrong answers

Option A is wrong because a quick rollback plan is a technical contingency for system failure, not a change management practice; it does not address the human factors of adoption and may even signal organizational uncertainty. Option B is wrong because mandatory training sessions often lead to passive compliance rather than genuine engagement, and they fail to account for varying skill levels and use cases across roles, which is especially problematic with GenAI's open-ended interaction models. Option D is wrong because restricting access to non-GenAI tools forces usage through coercion, which breeds resentment and workarounds, and ignores that GenAI is a complementary tool, not a replacement for all existing workflows.

Practice this question →

65

MCQmedium

A company is evaluating the ROI of deploying a GenAI-powered code review assistant. Which metric would BEST capture the quality improvement from using the assistant?

A.Number of code reviews completed per week

B.Lines of code written per developer per day

C.Developer satisfaction survey score

D.Defect escape rate (bugs found in production)

AnswerD

Defect escape rate directly reflects the quality of code reviews; fewer production bugs indicate higher quality.

Why this answer

Defect escape rate measures bugs that reach production; a reduction indicates higher code quality. Productivity metrics like lines of code per hour measure speed, not quality. User acceptance is a satisfaction measure, not direct quality.

Practice this question →

66

Multi-Selectmedium

A company wants to implement a change management strategy for GenAI adoption. Which TWO actions are MOST effective in driving adoption among employees?

Select 2 answers

A.Remove existing tools to force adoption

B.Appoint AI champions in each department to advocate and assist

C.Limit access to GenAI tools to senior leadership only

D.Make the use of GenAI tools mandatory for all tasks

E.Provide comprehensive training programs for employees

AnswersB, E

Champions provide local support and motivation.

Why this answer

Training programs build competence; AI champions provide peer support. Mandatory adoption and tool removal are counterproductive.

Practice this question →

67

Multi-Selecteasy

A company wants to start using Gemini for Google Workspace to improve productivity. Which TWO applications can provide meeting summarization features? (Choose 2)

Select 2 answers

A.Google Docs

B.Google Chat

C.Google Sheets

D.Gmail

E.Google Meet

AnswersB, E

Chat can display meeting summaries if integrated with Meet.

Why this answer

Google Meet and Google Chat both offer meeting summarization features powered by Gemini. In Google Meet, Gemini can automatically generate meeting notes and summaries after a recorded meeting. In Google Chat, Gemini can summarize conversations and meetings that occurred within a space, providing a concise recap of key points and action items.

Exam trap

The trap here is that candidates may confuse general AI assistance features (like drafting emails or documents) with the specific meeting summarization capability, which is only available in Google Meet and Google Chat.

Practice this question →

68

MCQeasy

A marketing team wants to generate multiple versions of ad copy for A/B testing. They need consistent brand tone across all outputs. Which prompt engineering technique is most effective?

A.Use a zero-shot prompt with a detailed description of the brand voice

B.Use a system instruction that says 'Be consistent'

C.Include few-shot examples of previous ad copy that match the desired tone

D.Set the temperature to 0.0 to eliminate randomness

AnswerC

Few-shot examples provide concrete patterns for the model to emulate, leading to consistent tone.

Why this answer

Providing few-shot examples (sample outputs with the desired tone) is the most direct way to enforce consistency. Zero-shot gives no guidance; system instructions help but are less effective than examples for tone consistency. Temperature settings control randomness.

Practice this question →

69

MCQmedium

A company wants to build a proof-of-concept generative AI application quickly. They have limited ML expertise and need to test multiple foundation models with different prompts. Which Vertex AI tool should they use?

A.Vertex AI Studio

B.Vertex AI Agent Builder

C.Model Garden

D.Vertex AI Training

AnswerA

Vertex AI Studio provides a no-code interface to experiment with prompts and models, ideal for quick prototyping.

Why this answer

Vertex AI Studio is designed for rapid prototyping with prompt design, model testing, and evaluation without requiring coding. Model Garden is for browsing models but not for interactive testing. Agent Builder is for building agents.

Custom training is for production.

Practice this question →

70

MCQhard

A developer is using the Vertex AI Gemini API to generate email drafts. They notice that the cost is higher than expected. Which cost optimization strategy is MOST effective?

A.Use a higher temperature to generate more creative drafts

B.Use a smaller model like Gemini Nano instead of Gemini Pro

C.Increase the max output tokens to reduce the number of requests

D.Cache common prompt prefixes so that the model reuses computation

AnswerD

Prompt caching reduces input token processing for repeated prefixes, lowering cost.

Why this answer

Caching repeated prompt prefixes reduces tokens processed per request. Batching requests can also reduce cost, but caching is more impactful for repetitive tasks.

Practice this question →

71

MCQhard

A healthcare company is deploying a GenAI-powered report generation system that processes patient summaries. They need the output to be structured JSON for downstream ingestion. The team is using Vertex AI Studio prompt design. Which approach best ensures consistent structured output?

A.Ask the model to output plain text and parse it with a regex after generation

B.Define the JSON schema in the system instruction and provide one few-shot example of the expected JSON

C.Use a lower temperature (0.0) and hope the model outputs valid JSON

D.Generate output in Markdown and convert to JSON using a secondary script

AnswerB

This guides the model to generate output into the exact JSON structure, reducing formatting errors.

Why this answer

Option B is correct because defining the JSON schema in the system instruction and providing a few-shot example directly constrains the model's output format at inference time, leveraging Vertex AI's instruction-following capabilities. This approach ensures the model generates valid JSON consistently without post-processing, as the schema acts as a structural template that the model learns to replicate.

Exam trap

The trap here is that candidates assume lowering temperature to 0.0 guarantees deterministic and correct structured output, but without explicit schema guidance, the model can still produce syntactically invalid JSON due to inherent token-level variability.

How to eliminate wrong answers

Option A is wrong because relying on regex parsing of plain text is brittle and error-prone; the model may produce unstructured or malformed text that regex cannot reliably parse, especially with variable patient summary content. Option C is wrong because lowering temperature to 0.0 reduces randomness but does not guarantee valid JSON syntax; the model can still output malformed JSON (e.g., missing commas, unescaped quotes) without explicit schema guidance. Option D is wrong because generating Markdown and converting to JSON adds an unnecessary secondary processing step, introducing potential conversion errors and latency, and does not leverage the model's ability to output structured data directly.

Practice this question →

72

MCQmedium

A company is evaluating whether to build a custom fine-tuned model for code generation or use a pre-built API like Gemini API. The code generation needs to follow the company's internal coding standards. Which consideration is MOST important in deciding to build vs buy?

A.Pre-built APIs may not generate code that adheres to internal standards without extensive prompting

B.Pre-built APIs are always cheaper than fine-tuned models

C.Fine-tuned models have lower latency than pre-built APIs

D.Data privacy is easier to achieve with pre-built APIs

AnswerA

A fine-tuned model can be trained on internal codebases to enforce standards, whereas a pre-built API requires careful prompt engineering and may not be consistent.

Why this answer

Fine-tuning allows the model to learn company-specific coding standards, which pre-built APIs cannot guarantee. Cost and latency are secondary; data privacy is important but can be addressed with both options.

Practice this question →

73

Multi-Selecthard

A healthcare startup is building a GenAI application that answers patient queries based on medical literature. They need to ensure factual accuracy and compliance with healthcare regulations. Which TWO strategies should they use? (Choose 2)

Select 2 answers

A.Rely on few-shot prompting with example Q&A pairs

B.Fine-tune the model on medical literature

C.Implement a response schema for structured JSON output

D.Use RAG Engine with a curated medical knowledge base

E.Use Grounding with Google Search to verify facts

AnswersD, E

RAG retrieves answers from a controlled set of medical documents, ensuring sources are authoritative and up-to-date.

Why this answer

Grounding with Google Search improves factual accuracy by basing answers on verified search results. A response schema for structured output is not directly about accuracy. RAG with a curated medical knowledge base ensures answers come from trusted sources.

Few-shot prompting alone is insufficient. Fine-tuning on medical data is not selected because two correct options are already chosen.

Practice this question →

74

MCQeasy

A company is deciding between building a custom fine-tuned model vs. using a pre-built API for a document summarization task. The documents contain domain-specific jargon. Which factor STRONGLY favors using a pre-built API with prompt engineering?

A.The need to handle highly specialized industry terminology

B.The need for the model to learn a unique writing style from past summaries

C.The need to keep all data on-premises for security compliance

D.The requirement for low initial development cost and fast time-to-market

AnswerD

Pre-built APIs require no training, making them cheaper and faster to deploy, which is a strong advantage when speed and cost matter.

Why this answer

Option D is correct because using a pre-built API with prompt engineering eliminates the need for expensive model training infrastructure and specialized ML expertise, enabling rapid deployment at low initial cost. For a document summarization task, prompt engineering can leverage the API's existing capabilities without custom fine-tuning, making it ideal when speed and budget are primary constraints.

Exam trap

Cisco often tests the misconception that prompt engineering can fully replace fine-tuning for domain adaptation, when in reality prompt engineering is limited by context window size and cannot permanently encode specialized knowledge or writing styles.

How to eliminate wrong answers

Option A is wrong because handling highly specialized industry terminology actually favors fine-tuning, as pre-built APIs may lack domain-specific vocabulary and require additional prompt engineering that cannot fully capture nuanced jargon. Option B is wrong because learning a unique writing style from past summaries requires the model to internalize patterns through training data, which is a core strength of fine-tuning, not prompt engineering that only provides temporary context. Option C is wrong because keeping data on-premises for security compliance strongly favors building a custom model, as pre-built APIs typically require data to be sent to external servers, violating data residency requirements.

Practice this question →

75

MCQhard

A company is running a GenAI proof-of-concept (PoC) for internal document Q&A. The PoC shows high latency and cost. The team suspects they are using an unnecessarily large model for the task. What is the BEST action to optimize?

A.Disable grounding and rely solely on the model's internal knowledge

B.Increase the batch size for requests

C.Switch to a smaller model from Vertex AI Model Garden

D.Fine-tune the existing model on the company's documents

AnswerC

Model Garden provides access to many model sizes; testing a smaller, faster model can reduce latency and cost.

Why this answer

Switching to a smaller model from Vertex AI Model Garden directly addresses the root cause of high latency and cost: an unnecessarily large model. Smaller models have fewer parameters, requiring less compute per inference, which reduces both response time and operational expense while often being sufficient for domain-specific tasks like internal document Q&A.

Exam trap

Cisco often tests the misconception that fine-tuning or disabling features like grounding can solve performance issues, when the real bottleneck is model size and compute efficiency.

How to eliminate wrong answers

Option A is wrong because disabling grounding removes the ability to retrieve and cite actual document content, forcing the model to rely on its internal knowledge which may be outdated or incorrect for company-specific documents, and does not address model size or latency. Option B is wrong because increasing batch size improves throughput for bulk processing but does not reduce per-request latency or cost for interactive Q&A; it may even increase memory pressure and latency for real-time use cases. Option D is wrong because fine-tuning the existing large model on company documents can improve answer relevance but does not reduce the model's parameter count or inference cost; it may actually increase latency if the fine-tuned model is still large and requires additional serving infrastructure.

Practice this question →

Page 1 of 2 · 150 questions totalNext →

Ready to test yourself?

Try a timed practice session using only Gcl Applying Genai Business questions.

Start 20-question session

CCNA Gcl Applying Genai Business Questions — Page 1 of 2 | Courseiva