Microsoft Azure AI Fundamentals AI-900 (AI-900) — Questions 151225

1020 questions total · 14pages · All types, answers revealed

Page 2

Page 3 of 14

Page 4
151
MCQmedium

What is transfer learning and how does it apply to NLP models?

A.Moving a trained model from one Azure region to another for deployment
B.Using knowledge from a large pre-trained model as the starting point for learning a new, related task
C.Transferring labeled training data between different ML projects
D.Automatically translating ML models from Python to other programming languages
AnswerB

Transfer learning applies pre-trained model knowledge (general language understanding) to new tasks, requiring less data and compute than training from scratch.

Why this answer

Transfer learning in NLP involves taking a large pre-trained model (like BERT or GPT) that has been trained on a massive corpus and fine-tuning it on a smaller, task-specific dataset. This approach dramatically reduces the amount of labeled data and training time needed, while leveraging the linguistic knowledge already captured by the base model. In Azure, services like Azure Cognitive Service for Language use transfer learning to provide high-accuracy custom models with minimal training data.

Exam trap

The trap here is that candidates confuse the general idea of 'transferring' something (data, code, or location) with the specific machine learning concept of transferring learned knowledge from a pre-trained model to a new task.

How to eliminate wrong answers

Option A is wrong because moving a model between Azure regions is a deployment or migration operation, not a learning technique; it has nothing to do with reusing learned representations for a new task. Option C is wrong because transferring labeled data between projects is a data management activity, not a model training methodology; transfer learning specifically refers to transferring learned weights or features, not raw data. Option D is wrong because translating ML models between programming languages is a code conversion or interoperability concern, unrelated to the core concept of leveraging pre-trained knowledge for a new but related task.

152
MCQmedium

What is 'Azure AI Foundry's model hub' and what models are available there?

A.A marketplace where organisations can sell their custom-trained AI models to other Azure customers
B.A curated collection of leading AI models from OpenAI, Microsoft (Phi), Meta, Mistral, and others
C.A version control system for AI models similar to Git for code
D.A centralised repository of Microsoft's internal research models not available to customers
AnswerB

The model hub provides one-stop model discovery — GPT-4o, Phi-4, Llama 3, and more — deployable as Azure endpoints.

Why this answer

Azure AI Foundry's model hub is a curated collection of leading AI models from providers like OpenAI, Microsoft (Phi), Meta, Mistral, and others. It enables developers to discover, compare, and deploy pre-built models for generative AI workloads without needing to train models from scratch. This aligns with the exam's focus on leveraging existing AI services in Azure.

Exam trap

The trap here is that candidates confuse the model hub with a general marketplace or version control system, overlooking that it is specifically a curated collection of pre-built, ready-to-deploy models from multiple leading AI providers.

How to eliminate wrong answers

Option A is wrong because the model hub is not a marketplace for selling custom-trained models; it is a curated catalog of pre-built models from major providers. Option C is wrong because the model hub is not a version control system like Git; it is a repository for model discovery and deployment, not for tracking code changes. Option D is wrong because the model hub includes models from multiple third-party vendors and is fully available to customers, not restricted to Microsoft's internal research models.

153
MCQmedium

What is fine-tuning in the context of large language models?

A.Adjusting the model's response speed for production deployment
B.Training a pre-trained model further on domain-specific data to improve task performance
C.Manually reviewing and correcting model outputs
D.Compressing a large model into a smaller, faster version
AnswerB

Fine-tuning adapts a foundation model to specific tasks or domains through additional training on targeted data.

Why this answer

Fine-tuning takes a pre-trained large language model (LLM) and continues the training process on a smaller, domain-specific dataset. This adjusts the model's weights to specialize its outputs for particular tasks (e.g., legal document summarization or medical Q&A) without retraining from scratch. It is distinct from prompt engineering or retrieval-augmented generation because it permanently modifies the model parameters.

Exam trap

The trap here is that candidates confuse fine-tuning with inference optimization or model compression, because all three can improve performance in production, but only fine-tuning actually modifies model weights through additional training on domain-specific data.

How to eliminate wrong answers

Option A is wrong because adjusting response speed for production deployment is an inference optimization technique (e.g., model quantization, batching, or using Azure OpenAI's throughput settings), not a training process like fine-tuning. Option C is wrong because manually reviewing and correcting outputs is a post-processing or human-in-the-loop validation step, not a model training method. Option D is wrong because compressing a large model into a smaller, faster version describes model distillation or pruning, which reduces model size and latency but does not involve training on domain-specific data to improve task performance.

154
MCQmedium

A museum wants to automatically generate descriptive tags for its digital art collection. They need to identify objects, themes, and artistic styles in the images without any custom training. Which Azure Computer Vision feature should they use?

A.Azure AI Custom Vision
B.Azure AI Computer Vision Image Analysis
C.Azure AI Face service
D.Azure AI Form Recognizer
AnswerB

The prebuilt Image Analysis service can detect objects, themes, and generate tags and descriptions from images without any custom training.

Why this answer

Azure AI Computer Vision Image Analysis provides pre-built models that can automatically generate descriptive tags for images, identifying objects, themes, and artistic styles without any custom training. This feature uses a set of thousands of recognizable objects, living beings, scenery, and actions, making it ideal for the museum's requirement to tag digital art without custom model development.

Exam trap

The trap here is that candidates may confuse Custom Vision (which requires training) with the pre-built Image Analysis feature, mistakenly thinking custom training is needed for domain-specific tasks like art tagging, when in fact the pre-built model already covers common objects and themes.

How to eliminate wrong answers

Option A is wrong because Azure AI Custom Vision requires users to upload and label their own images to train a custom model, which contradicts the 'without any custom training' requirement. Option C is wrong because Azure AI Face service is specialized for detecting and analyzing human faces (e.g., age, emotion, facial landmarks) and cannot identify general objects, themes, or artistic styles. Option D is wrong because Azure AI Form Recognizer is designed to extract text and structure from documents (e.g., invoices, receipts) and is not intended for image content analysis or tagging.

155
MCQmedium

What capability does Azure AI Vision's 'celebrity recognition' feature provide?

A.Automatically scheduling meetings with celebrities based on their availability
B.Identifying well-known public figures in images and returning their names with confidence scores
C.Generating fictional celebrity lookalikes for entertainment applications
D.Verifying celebrity identities for event access control
AnswerB

Celebrity recognition uses a specialized domain model to identify famous public figures in images for media and content applications.

Why this answer

Azure AI Vision's celebrity recognition feature is a specialized domain-specific model that identifies well-known public figures (e.g., actors, politicians, athletes) within images. It returns the recognized celebrity's name along with a confidence score, enabling applications like media indexing or social media analysis. This capability is built on top of the general object detection and facial recognition models, but is pre-trained on a curated dataset of celebrity faces.

Exam trap

The trap here is that candidates confuse celebrity recognition (a pre-built, domain-specific model for identifying famous people) with general facial recognition or verification, which are separate capabilities in Azure AI Vision with different use cases and APIs.

How to eliminate wrong answers

Option A is wrong because Azure AI Vision does not have any scheduling or calendar integration capabilities; it is an image analysis service, not a productivity or meeting management tool. Option C is wrong because the feature does not generate or synthesize fictional lookalikes; it only identifies real, known individuals from a pre-defined database. Option D is wrong because celebrity recognition is not designed for identity verification or access control; it lacks the liveness detection and high-accuracy matching required for security scenarios, and Azure Face API (with person groups) would be used for that purpose.

156
MCQmedium

What is 'Azure AI Services multi-service resource' and what is its advantage?

A.A resource that automatically selects the best AI model for each request based on the task
B.A single resource providing one API key for Vision, Language, Speech, and Translator with unified billing
C.A resource type that runs multiple AI workloads simultaneously on shared compute
D.An enterprise licence for unlimited usage of all Azure AI services
AnswerB

Multi-service resource simplifies management — one key, one endpoint, consolidated bill for multiple Azure AI services.

Why this answer

Option B is correct because an Azure AI Services multi-service resource provides a single endpoint and API key to access multiple Azure AI services (Vision, Language, Speech, Translator) under one resource, enabling unified billing and simplified management. This is distinct from single-service resources, which require separate keys and endpoints for each service, increasing administrative overhead.

Exam trap

The trap here is that candidates confuse 'multi-service resource' with a load balancer or auto-scaling feature, when in reality it is purely a billing and key-management convenience with no impact on how AI models are selected or executed.

How to eliminate wrong answers

Option A is wrong because it describes a hypothetical auto-selection mechanism that does not exist in Azure AI Services; multi-service resources do not automatically choose models—they expose individual APIs that must be called explicitly. Option C is wrong because multi-service resources do not run workloads on shared compute; they are logical containers for API access, and each service runs on its own dedicated backend infrastructure. Option D is wrong because there is no 'enterprise licence for unlimited usage'—Azure AI Services are billed per-call or per-transaction, and multi-service resources simply consolidate billing under one meter, not provide unlimited usage.

157
MCQmedium

A logistics company needs to automatically read shipping labels on packages. The labels contain printed text in various fonts and sizes, as well as handwritten addresses. Which Azure Computer Vision capability should they use to extract the text from the labels?

A.Image Analysis
B.Face API
C.Optical Character Recognition (OCR) - Read API
D.Custom Vision
AnswerC

The Read API is purpose-built for extracting printed and handwritten text from images and documents, supporting various fonts and sizes.

Why this answer

The Read API (part of Azure Computer Vision's OCR capabilities) is specifically designed to extract printed and handwritten text from images, handling varied fonts, sizes, and styles. This makes it the correct choice for reading shipping labels that contain both printed text and handwritten addresses.

Exam trap

The trap here is that candidates often confuse Image Analysis (which can detect text in images but not extract it reliably from mixed formats) with the dedicated OCR Read API, or they mistakenly think Custom Vision can be trained for text extraction when it is designed for custom visual patterns.

How to eliminate wrong answers

Option A is wrong because Image Analysis provides general image descriptions, object detection, and tags, but does not include text extraction from mixed printed and handwritten content. Option B is wrong because Face API is dedicated to detecting, recognizing, and analyzing human faces, not text. Option D is wrong because Custom Vision is used to train custom image classification or object detection models, not for out-of-the-box text extraction from labels.

158
MCQhard

A medical research team needs to analyze CT scans to identify and outline the exact boundaries of lung nodules. Which Azure Computer Vision capability should they use?

A.Image Classification
B.Object Detection
C.Semantic Segmentation
D.Optical Character Recognition (OCR)
AnswerC

Semantic Segmentation classifies every pixel, providing exact boundaries of each object, which is ideal for outlining lung nodules.

Why this answer

Semantic segmentation is the correct capability because it classifies each pixel in an image, enabling precise delineation of object boundaries. For CT scans, this allows the model to outline the exact shape and contour of lung nodules, which is essential for medical analysis. Image classification and object detection only provide labels or bounding boxes, not pixel-level boundaries.

Exam trap

The trap here is that candidates confuse object detection with semantic segmentation, assuming bounding boxes are sufficient for boundary outlining, but the exam tests the distinction between rectangular region identification and pixel-level precision.

How to eliminate wrong answers

Option A is wrong because image classification assigns a single label to the entire image, not identifying or outlining individual objects like nodules. Option B is wrong because object detection provides bounding boxes around objects, which are rectangular and cannot capture the irregular, precise boundaries of lung nodules. Option D is wrong because OCR extracts text from images, which is irrelevant to analyzing CT scans for nodule boundaries.

159
MCQeasy

What is 'natural language processing' (NLP) as a category of AI workload?

A.Using AI to process and understand text and speech in human languages
B.Programming computers using natural spoken language instead of code
C.A network protocol for low-latency language model inference
D.Automatically converting speech to a natural-sounding language
AnswerA

NLP enables computers to work with human language — covering translation, sentiment, summarisation, chatbots, and more.

Why this answer

Natural language processing (NLP) is an AI workload that focuses on enabling computers to interpret, understand, and generate human language in both text and speech forms. It combines computational linguistics with statistical machine learning models to perform tasks like sentiment analysis, language translation, and speech recognition. This makes option A the correct definition.

Exam trap

The trap here is that candidates often confuse a specific NLP application (like speech synthesis or translation) with the entire NLP workload category, leading them to select option D instead of the broader, correct definition in option A.

How to eliminate wrong answers

Option B is wrong because it describes a hypothetical scenario of programming using natural language, which is not a current AI workload category; NLP processes language but does not replace programming languages. Option C is wrong because it incorrectly defines NLP as a network protocol for low-latency inference, which is unrelated to language processing and more akin to infrastructure concepts like gRPC or HTTP/2. Option D is wrong because it describes text-to-speech (TTS) synthesis, which is a specific application of NLP, not the broad category of NLP itself.

160
MCQmedium

A developer is using Azure OpenAI Service to classify customer support tickets into categories such as 'Billing', 'Technical Issue', and 'Account Management'. The developer provides three labeled examples for each category in the prompt to improve the model's accuracy. What technique is the developer applying?

A.Fine-tuning
B.Few-shot learning
C.Prompt engineering
D.Retrieval-augmented generation
AnswerB

By providing a few labeled examples in the prompt, the developer is using few-shot learning to guide the model's classification behavior without retraining.

Why this answer

Few-shot learning is the correct technique because the developer is providing a small number of labeled examples (three per category) directly in the prompt to guide the model's output without updating the model's weights. This approach leverages the model's in-context learning ability, where the examples act as a pattern for the model to follow when classifying new tickets.

Exam trap

The trap here is that candidates often confuse few-shot learning with fine-tuning, assuming that any use of examples to improve accuracy must involve retraining the model, but few-shot learning does not modify model weights—it only uses examples in the prompt.

How to eliminate wrong answers

Option A is wrong because fine-tuning involves retraining the model on a custom dataset to update its weights, which is not what is happening here—the developer is only adding examples to the prompt, not training the model. Option C is wrong because prompt engineering is a broader practice that includes designing prompts for clarity and structure, but the specific technique of including labeled examples in the prompt to improve accuracy is called few-shot learning, not just prompt engineering. Option D is wrong because retrieval-augmented generation (RAG) involves fetching external data from a knowledge base at inference time to augment the prompt, whereas here the examples are static and pre-defined in the prompt itself.

161
MCQmedium

What is 'document processing' as an AI workload and what pipeline does it typically involve?

A.Using Azure Blob Storage to store and manage document files efficiently
B.Automating extraction, understanding, and routing of business documents through OCR, extraction, and NLP
C.Digitising physical documents by scanning them and converting to PDF format
D.Managing document access permissions and version control in SharePoint
AnswerB

Document processing pipelines combine OCR + Document Intelligence + NLP — replacing manual data entry with automated understanding.

Why this answer

Document processing as an AI workload involves automating the extraction, understanding, and routing of information from documents. This pipeline typically uses Optical Character Recognition (OCR) to digitize text, followed by AI models (e.g., Azure Form Recognizer) for data extraction, and Natural Language Processing (NLP) for semantic understanding and classification. Option B correctly captures this end-to-end automation, which is a core AI workload in Azure.

Exam trap

The trap here is that candidates confuse basic document digitization (Option C) or storage/management (Options A and D) with the full AI pipeline of extraction, understanding, and routing, which requires OCR, NLP, and automated workflows.

How to eliminate wrong answers

Option A is wrong because Azure Blob Storage is a general-purpose object storage service for unstructured data, not an AI workload for document processing; it lacks the OCR, extraction, and NLP pipeline required for intelligent document handling. Option C is wrong because digitizing documents by scanning and converting to PDF is a basic digitization step, not an AI workload—it omits the automated extraction, understanding, and routing that define AI-driven document processing. Option D is wrong because managing document access permissions and version control in SharePoint is a content management and governance task, not an AI workload; it does not involve OCR, data extraction, or NLP.

162
MCQmedium

What is 'Microsoft Copilot Studio' and what is it used for?

A.A professional audio/video editing suite powered by AI for content creators
B.A low-code platform for building custom AI bots and agents integrated with Microsoft 365
C.An IDE for enterprise developers building high-performance LLM applications in C#
D.A tool for generating Copilot-branded marketing content for Microsoft partners
AnswerB

Copilot Studio enables citizen developers to build domain-specific bots — extending Microsoft Copilot with custom knowledge and workflows.

Why this answer

Microsoft Copilot Studio is a low-code platform that allows users to build custom AI-powered bots and agents that integrate seamlessly with Microsoft 365 services. It extends the capabilities of Microsoft Copilot by enabling tailored conversational experiences, such as automating workflows, answering queries, and handling tasks within the Microsoft ecosystem without requiring extensive coding.

Exam trap

The trap here is that candidates may confuse 'Copilot Studio' with a general-purpose development tool or creative suite, rather than recognizing it as a low-code platform specifically for building custom AI bots integrated with Microsoft 365.

How to eliminate wrong answers

Option A is wrong because Microsoft Copilot Studio is not a professional audio/video editing suite; that describes tools like Adobe Premiere Pro or DaVinci Resolve, not a low-code AI bot builder. Option C is wrong because Copilot Studio is not an IDE for building LLM applications in C#; it is a low-code platform, whereas an IDE like Visual Studio or JetBrains Rider would be used for such development. Option D is wrong because Copilot Studio is not a marketing content generation tool for partners; it is a platform for creating custom AI agents, not for producing Copilot-branded marketing materials.

163
MCQhard

A company develops an AI system to screen job candidates based on their resumes. The system is trained on historical data. Analysis reveals that the model has an adverse impact against female candidates due to a proxy feature (e.g., 'years of continuous employment') that correlates with gender. The team removes the protected attribute 'gender' from the training data but the biased outcome persists. According to Microsoft's responsible AI principles, which additional step should the team take to address this unfairness?

A.Remove the offending proxy feature 'years of continuous employment' from the training data.
B.Use a tool like Fairlearn to detect and mitigate the bias while maintaining model performance.
C.Train a separate model for each gender group to ensure equal outcomes.
D.Collect more training data from underrepresented groups.
AnswerB

Fairlearn provides algorithms and metrics to detect and mitigate unfairness, directly addressing the persistent bias even after removing protected attributes.

Why this answer

Option B is correct because Microsoft's responsible AI principle of fairness requires not just removing protected attributes but also detecting and mitigating proxy features that cause bias. Fairlearn is a Microsoft open-source toolkit specifically designed to assess and mitigate unfairness in AI systems, offering algorithms like 'Exponentiated Gradient Reduction' or 'Grid Search' to reduce disparity while preserving model performance. Simply removing the proxy feature (A) may not always be feasible if it carries predictive value, and Fairlearn provides a systematic way to balance fairness and accuracy.

Exam trap

The trap here is that candidates assume removing the protected attribute (gender) alone solves fairness, but Microsoft's responsible AI principles emphasize that proxy features can perpetuate bias, requiring tools like Fairlearn for detection and mitigation rather than simplistic feature removal or data collection.

How to eliminate wrong answers

Option A is wrong because removing the proxy feature 'years of continuous employment' may eliminate valuable predictive information and does not guarantee that other correlated features or interactions won't reintroduce bias; Fairlearn's mitigation techniques address bias without necessarily discarding features. Option C is wrong because training separate models for each gender group can lead to different treatment and may violate fairness principles by reinforcing segregation, and it does not align with Microsoft's approach of mitigating bias within a unified model. Option D is wrong because collecting more data from underrepresented groups can help but does not directly address the existing proxy bias; it may reduce imbalance but does not mitigate the specific correlation between 'years of continuous employment' and gender that causes adverse impact.

164
MCQmedium

A data scientist trains a binary classification model to detect fraudulent transactions. The dataset contains 99% legitimate transactions (negative class) and 1% fraudulent transactions (positive class). The model predicts 'legitimate' for every transaction in the test set and achieves 99% accuracy. Which metric would best reveal that the model is failing to identify any fraudulent transactions?

A.Accuracy
B.Precision
C.Recall
D.F1-score
AnswerC

Recall for the fraud class is 0 since no fraudulent transactions are identified; this directly shows the model's failure to catch any positive cases.

Why this answer

Recall (also known as sensitivity or true positive rate) measures the proportion of actual positive cases (fraudulent transactions) that the model correctly identifies. With 99% accuracy but zero true positives, the recall is 0%, which immediately reveals the model's complete failure to detect fraud. In Azure Machine Learning, the classification metrics pane would show recall = 0.0 for the positive class, highlighting this issue despite high accuracy.

Exam trap

Microsoft often tests the trap that high accuracy implies a good model, especially with imbalanced data, leading candidates to overlook that recall (or sensitivity) is the critical metric for detecting minority class failures.

How to eliminate wrong answers

Option A is wrong because accuracy only measures overall correctness (99% here) and is misleading when classes are imbalanced; it does not reveal the model's inability to detect the minority class. Option B is wrong because precision measures the proportion of predicted positives that are actually positive, but since the model never predicts any positive, precision is undefined (0/0) or reported as 0, which does not directly expose the failure to find any actual fraud. Option D is wrong because F1-score is the harmonic mean of precision and recall; with recall = 0, F1-score is 0, but recall itself is the more direct and interpretable metric for identifying that no fraudulent transactions were caught.

165
MCQhard

A manufacturing company wants to use Azure Computer Vision to inspect products on an assembly line for defects. They have a labeled dataset with images of defective and non-defective products. They need to not only classify products as defective or not, but also identify the exact location of the defect (e.g., a crack) in the image. Which Azure Computer Vision capability should they use?

A.Custom Vision object detection
B.Custom Vision image classification
C.Azure Face API
D.Optical Character Recognition (OCR)
AnswerA

Correct. Object detection can be trained to identify and locate defects (e.g., cracks) within an image, providing both classification and location.

Why this answer

Custom Vision object detection is the correct choice because it not only classifies images (defective vs. non-defective) but also localizes defects by drawing bounding boxes around them. The labeled dataset with defect locations directly supports training a model to output both class labels and spatial coordinates, which is exactly what object detection provides.

Exam trap

The trap here is that candidates confuse image classification (which only labels the whole image) with object detection (which provides both classification and localization), leading them to choose Custom Vision image classification despite the explicit need for defect location.

How to eliminate wrong answers

Option B is wrong because Custom Vision image classification only assigns a single label to the entire image (e.g., 'defective' or 'non-defective') and cannot identify the exact location of a defect. Option C is wrong because Azure Face API is specialized for detecting, analyzing, and recognizing human faces, not for industrial defect localization. Option D is wrong because Optical Character Recognition (OCR) extracts text from images and has no capability to detect or localize physical defects like cracks.

166
Drag & Dropmedium

Drag and drop the steps to train a custom vision model in Azure Custom Vision into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order

Why this order

Training a custom vision model requires uploading tagged images, training, evaluating, and publishing.

167
MCQmedium

A data scientist trains a linear regression model to predict house prices. The model's training error is very high, and its test error is nearly as high. Which term best describes this situation?

A.Underfitting
B.Overfitting
C.High bias
D.High variance
AnswerA

Underfitting is characterized by a model that does not learn the training data well, leading to high error on both training and test sets.

Why this answer

Underfitting occurs when a model is too simple to capture the underlying patterns in the data, resulting in high training error and similarly high test error. In this linear regression scenario, the model fails to learn the relationship between features and house prices, leading to poor performance on both training and test sets.

Exam trap

The trap here is that candidates often confuse 'high bias' with 'underfitting' as the best descriptor, but the question asks for the term that best describes the situation, and 'underfitting' is the direct behavioral term while 'high bias' is a contributing cause.

How to eliminate wrong answers

Option B (Overfitting) is wrong because overfitting would show very low training error but high test error, not high training error. Option C (High bias) is incorrect because high bias is a cause of underfitting, not a separate term describing the situation itself. Option D (High variance) is wrong because high variance is associated with overfitting, where the model is too sensitive to training data, not with high training error.

168
MCQmedium

A company uses Azure OpenAI Service to automatically generate customer support email responses. They want to ensure that the model does not produce responses containing offensive language, hate speech, or biased content. Which Microsoft responsible AI principle is most directly addressed by implementing content filters that screen the model's output before it is sent?

A.A. Transparency
B.B. Reliability and Safety
C.C. Inclusiveness
D.D. Fairness
AnswerD

Fairness is the principle that AI systems should treat all people fairly and avoid bias. Implementing content filters to block hate speech and offensive language is a direct application of Fairness.

Why this answer

Implementing content filters to screen model outputs for offensive language, hate speech, or biased content directly addresses the Fairness principle, which requires AI systems to treat all people equitably and avoid reinforcing societal biases. By filtering out harmful or biased content, the organization ensures that the generated responses do not discriminate against or marginalize any group, aligning with Microsoft's commitment to fairness in AI.

Exam trap

The trap here is that candidates often confuse Reliability and Safety (which deals with system uptime and operational failures) with the specific need to prevent biased or offensive outputs, which falls under Fairness in Microsoft's responsible AI framework.

How to eliminate wrong answers

Option A is wrong because Transparency refers to the principle of making AI systems understandable and providing clear information about their capabilities and limitations, not about filtering outputs for harmful content. Option B is wrong because Reliability and Safety focuses on ensuring the AI system operates dependably and safely under normal conditions, which includes preventing failures but does not specifically target bias or offensive language filtering. Option C is wrong because Inclusiveness aims to design AI systems that empower and include all people, often through accessible interfaces and diverse data representation, but it does not directly address the screening of outputs for offensive or biased content.

169
MCQmedium

What is 'batch document translation' in Azure AI Translator and what file formats does it support?

A.Translating database records in batches by sending SQL queries to the translation API
B.Asynchronous translation of Word, PDF, Excel, and HTML documents preserving their layout
C.Translating phone call transcripts in batches after calls are completed
D.A real-time API that translates one document page at a time as users scroll
AnswerB

Batch document translation handles formatted files from Blob Storage — preserving structure while translating content.

Why this answer

Option B is correct because Azure AI Translator's batch document translation is an asynchronous operation that translates entire documents (such as Word, PDF, Excel, and HTML) while preserving their original layout and structure. This is distinct from real-time or synchronous translation, as batch translation processes files in bulk via a job-based API, making it ideal for large-scale or non-interactive scenarios.

Exam trap

The trap here is that candidates confuse batch document translation with real-time translation or assume it applies to non-document data like databases or transcripts, but the key differentiator is that it's an asynchronous, file-based service that preserves document layout.

How to eliminate wrong answers

Option A is wrong because batch document translation does not involve SQL queries or database records; it translates document files, not database content. Option C is wrong because batch document translation is designed for document files, not for phone call transcripts, which would require a different service like Azure Speech-to-Text or Conversation Transcription. Option D is wrong because batch document translation is asynchronous, not real-time; the real-time API for translating individual pages or text is the synchronous Translator API, not batch translation.

170
MCQeasy

What is the purpose of Azure AI Speech's 'batch transcription' capability?

A.Real-time transcription of live audio streams for immediate use
B.Asynchronous processing of large volumes of audio files for cost-efficient transcription at scale
C.Synchronizing speech transcription across multiple languages simultaneously
D.Training a custom speech recognition model on audio samples
AnswerB

Batch transcription handles large audio file collections asynchronously — submit files, retrieve transcripts later, ideal for call center archives.

Why this answer

Azure AI Speech's batch transcription is designed for asynchronous processing of large volumes of pre-recorded audio files. It allows you to submit multiple audio files for transcription without requiring real-time interaction, making it cost-efficient for scenarios like call center analytics or media captioning where immediate results are not needed.

Exam trap

The trap here is confusing batch transcription with real-time transcription, as candidates often assume 'batch' implies faster processing rather than asynchronous, cost-efficient bulk processing.

How to eliminate wrong answers

Option A is wrong because real-time transcription of live audio streams is handled by Azure AI Speech's real-time transcription API, not batch transcription. Option C is wrong because synchronizing speech transcription across multiple languages simultaneously is a feature of real-time translation or multi-language transcription, not batch transcription. Option D is wrong because training a custom speech recognition model is done through Azure AI Speech's Custom Speech service, which uses audio samples and transcription data, not batch transcription.

171
MCQeasy

A developer uses Azure OpenAI Service to generate marketing copy. They want the model to produce more focused and deterministic responses, reducing the variety of outputs for the same prompt. Which parameter should the developer decrease?

A.Temperature
B.Max tokens
C.Top P
D.Frequency penalty
AnswerA

Lowering temperature reduces randomness, making outputs more deterministic and focused.

Why this answer

Temperature controls the randomness of the model's output. Lowering temperature (e.g., from 1.0 to 0.2) makes the model more deterministic and focused, reducing output variety for the same prompt. This is the correct parameter to adjust for more consistent marketing copy.

Exam trap

The trap here is that candidates often confuse temperature with Top P, thinking both control randomness identically, but temperature directly scales logits while Top P sets a cumulative probability cutoff for token selection.

How to eliminate wrong answers

Option B (Max tokens) is wrong because it controls the maximum length of the output, not the randomness or determinism. Option C (Top P) is wrong because it controls nucleus sampling, which also affects output diversity but through cumulative probability threshold, not directly reducing variety in a deterministic way. Option D (Frequency penalty) is wrong because it reduces repetition of tokens based on their frequency in the output, not the overall randomness or determinism of the response.

172
MCQmedium

A customer service team wants to build an Azure AI-powered bot that can understand the intent behind customer messages. For example, the bot should recognize that 'I want to return my shoes' maps to a 'ReturnItem' intent, and 'Where is my order?' maps to 'TrackOrder'. Which Azure service provides pre-built models specifically for intent recognition?

A.Language Understanding (LUIS)
B.Text Analytics
C.Translator Text
D.Speech-to-text
AnswerA

LUIS (part of Azure Language service) is designed for intent recognition and entity extraction from conversational utterances. It provides pre-built models for common intents.

Why this answer

Language Understanding (LUIS) is the correct Azure service because it provides pre-built models and custom capabilities specifically designed for intent recognition and entity extraction from natural language utterances. The scenario requires mapping customer messages like 'I want to return my shoes' to a 'ReturnItem' intent, which is exactly the core function of LUIS—it analyzes user input to identify the user's goal (intent) and any relevant details (entities).

Exam trap

The trap here is that candidates often confuse Text Analytics (which can extract entities and sentiment) with LUIS, but Text Analytics lacks the pre-built intent recognition models and the ability to map utterances to custom intents like 'ReturnItem' or 'TrackOrder'.

How to eliminate wrong answers

Option B (Text Analytics) is wrong because it focuses on extracting insights like sentiment, key phrases, and named entities from text, but it does not provide pre-built models for intent recognition or mapping utterances to specific intents. Option C (Translator Text) is wrong because it is a machine translation service that converts text between languages, with no capability for understanding or classifying user intents. Option D (Speech-to-text) is wrong because it transcribes spoken audio into text, but it does not perform any semantic analysis or intent classification on the transcribed text.

173
MCQeasy

What is 'Azure AI Vision's image analysis v4.0' and what new capability does it add?

A.A version supporting 4K resolution images for the first time
B.Florence-powered advanced capabilities including dense captioning, embeddings, and improved background removal
C.A version requiring 4x more compute than the previous version
D.The fourth iteration of Microsoft's Kinect 3D depth sensor SDK
AnswerB

v4.0 brings Florence's language-vision understanding — enabling dense regional captions, vector embeddings, and richer scene understanding.

Why this answer

Azure AI Vision's image analysis v4.0 is a major update that leverages the Florence foundation model to deliver advanced capabilities such as dense captioning (generating detailed descriptions for multiple regions in an image), image embeddings (vector representations for similarity search), and improved background removal. This version significantly enhances the depth and accuracy of image understanding compared to previous versions.

Exam trap

The trap here is that candidates confuse 'version 4.0' with a simple incremental update (like resolution or performance tweaks) rather than recognizing it as a paradigm shift powered by the Florence foundation model, which is the core new capability tested.

How to eliminate wrong answers

Option A is wrong because Azure AI Vision v4.0 does not specifically introduce 4K resolution support; resolution handling was already available in prior versions, and the key new capability is the Florence-powered AI features, not a resolution threshold. Option C is wrong because the update does not require 4x more compute; the Florence model is optimized for efficiency, and the exam focuses on functional improvements, not resource requirements. Option D is wrong because Azure AI Vision is a cloud-based image analysis service, not related to the Kinect 3D depth sensor SDK, which is a separate hardware product for motion sensing.

174
MCQhard

What is 'differential privacy' and how is it relevant to AI model training?

A.The difference in model accuracy between a private deployment and a public API
B.A mathematical guarantee that model training reveals negligible information about any individual's data
C.Encrypting model weights so they remain private from users accessing the model API
D.Using different models for different privacy tiers of customers
AnswerB

Differential privacy adds noise during training — providing formal guarantees that models don't memorise or expose individual records.

Why this answer

Differential privacy is a mathematical framework that ensures the output of a model training process does not reveal whether any specific individual's data was included in the training dataset. It achieves this by adding calibrated noise to the training process or query results, providing a formal privacy guarantee quantified by the epsilon parameter. This is directly relevant to AI model training because it allows organizations to train models on sensitive data while protecting individual privacy, which is a core requirement for compliance with regulations like GDPR and HIPAA.

Exam trap

The trap here is that candidates confuse data privacy techniques (like encryption or access control) with the formal mathematical guarantee of differential privacy, which specifically addresses information leakage from the model's outputs rather than protecting the data at rest or in transit.

How to eliminate wrong answers

Option A is wrong because it describes a comparison of model accuracy between deployment environments, which has nothing to do with the mathematical privacy guarantee of differential privacy. Option C is wrong because encrypting model weights protects the model's intellectual property from API users, but does not prevent the model from memorizing and leaking individual training data points. Option D is wrong because using different models for different privacy tiers is a policy or access control mechanism, not a mathematical technique for limiting information leakage about individuals.

175
MCQhard

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset contains only 1% fraudulent transactions. The model achieves 99% accuracy on the test set, but when deployed, it fails to detect most actual fraud cases. Which metric would best reveal this issue?

A.Accuracy
B.Precision
C.Recall
D.F1 score
AnswerC

Recall measures the fraction of actual fraud cases that the model correctly identifies. A low recall reveals the model's failure to detect fraud.

Why this answer

Recall (sensitivity) measures the proportion of actual positive cases correctly identified. In this highly imbalanced dataset (1% fraud), a model can achieve 99% accuracy by simply predicting 'non-fraud' for every transaction, which yields zero true positives. Recall reveals this failure because it focuses solely on how many fraudulent transactions were caught, ignoring the vast majority of non-fraud cases.

Exam trap

The trap here is that candidates see '99% accuracy' and assume the model is performing well, failing to recognize that accuracy is a poor metric for imbalanced datasets, and that recall specifically measures the ability to detect the minority class.

How to eliminate wrong answers

Option A is wrong because accuracy is misleading in imbalanced datasets; a 99% accuracy can be achieved by a trivial classifier that never predicts fraud, hiding the model's inability to detect any positive cases. Option B is wrong because precision measures the proportion of predicted fraud cases that are actually fraud, but if the model rarely predicts fraud, precision may be undefined or artificially high, and it does not capture the failure to identify actual fraud. Option D is wrong because the F1 score is the harmonic mean of precision and recall; while it balances both, it can still be low if recall is poor, but the question asks for the metric that best reveals the issue, and recall directly exposes the lack of true positive detections.

176
MCQmedium

What is 'time series forecasting' and what Azure ML tools support it?

A.Forecasting how long model training will take based on dataset size
B.Predicting future values in time-ordered data (sales, demand, energy) using Azure ML AutoML
C.Scheduling ML jobs to run at specific times using Azure ML pipelines
D.Analysing historical model performance over time to detect degradation
AnswerB

Time series forecasting handles temporal patterns — Azure ML AutoML tries many algorithms and handles seasonality for business prediction.

Why this answer

Time series forecasting is a machine learning technique that predicts future values based on historical, time-ordered data, such as sales, demand, or energy consumption. Azure ML AutoML supports this by automatically selecting the best model (e.g., ARIMA, Prophet, or gradient boosting) and tuning hyperparameters for time-dependent features like seasonality and trends.

Exam trap

The trap here is confusing time series forecasting with unrelated Azure ML features like job scheduling or model monitoring, leading candidates to pick options that describe operational tasks rather than predictive modeling.

How to eliminate wrong answers

Option A is wrong because it describes estimating training duration, which is a performance optimization concern, not a forecasting task on time-ordered data. Option C is wrong because scheduling ML jobs is a pipeline orchestration feature, not a predictive modeling technique. Option D is wrong because analyzing historical model performance for degradation is model monitoring (data drift/concept drift), not forecasting future values.

177
Multi-Selectmedium

A manufacturing company wants to use Azure Computer Vision to automatically inspect products on an assembly line for defects. They need to identify and locate specific types of defects (e.g., scratch, dent, crack) in product images. Which Azure Computer Vision capabilities could be used together to achieve this? (Select two options.)

Select 2 answers
A.Object Detection
B.Semantic Segmentation
C.Optical Character Recognition (OCR)
D.Image Classification
AnswersA, B

Object Detection identifies and locates multiple objects (defects) in an image with bounding boxes.

Why this answer

Option A is correct because Object Detection in Azure Computer Vision can identify and locate multiple specific defect types (e.g., scratch, dent, crack) within product images by drawing bounding boxes around each defect. This capability directly meets the requirement to both identify and locate defects on the assembly line.

Exam trap

The trap here is that candidates often confuse Image Classification with Object Detection, not realizing that classification cannot locate multiple defects or distinguish between defect types in a single image.

178
MCQhard

A data scientist evaluates a regression model that predicts house prices. On the test set, the Mean Absolute Error (MAE) is $8,000 and the Root Mean Squared Error (RMSE) is $25,000. What does the large difference between MAE and RMSE indicate about the model's errors?

A.The model is overfitting the training data
B.The model predictions are consistently biased high
C.The model has some predictions with very large errors
D.The model has high variance due to outliers in training data
AnswerC

RMSE penalizes large errors more heavily than MAE. A significantly higher RMSE relative to MAE implies that while most errors are moderate, there are a few predictions with extremely large errors (outliers).

Why this answer

The large difference between MAE ($8,000) and RMSE ($25,000) indicates that the model has some predictions with very large errors. RMSE squares the errors before averaging, which heavily penalizes large deviations, so a significantly higher RMSE relative to MAE suggests the presence of outliers or extreme prediction errors in the test set.

Exam trap

The trap here is that candidates confuse the mathematical behavior of RMSE (which amplifies large errors) with concepts like overfitting or bias, rather than recognizing it as a direct indicator of outlier errors in the predictions.

How to eliminate wrong answers

Option A is wrong because overfitting is characterized by low training error and high test error, not by a specific relationship between MAE and RMSE; the given metrics are both on the test set, so overfitting cannot be inferred from this difference alone. Option B is wrong because consistently biased high predictions would affect both MAE and RMSE similarly (e.g., both would be elevated), not cause a large disparity; bias shifts the mean error but does not disproportionately inflate RMSE over MAE. Option D is wrong because high variance due to outliers in training data is a cause of overfitting or poor generalization, but the question focuses on the test set errors; the large RMSE relative to MAE on the test set directly indicates the presence of large errors in predictions, not the source of variance in training.

179
MCQmedium

A museum wants to create an app that allows visitors to take a photo of a painting and receive information about the artist, year, and style. The app needs to identify the painting from a database of thousands of artworks. Which Azure Computer Vision capability is most suitable?

A.Optical Character Recognition (OCR)
B.Image classification
C.Object detection
D.Face detection
AnswerB

Image classification analyzes the entire image content and returns a label. This can be trained to recognize each specific painting and provide its details.

Why this answer

Image classification is the correct choice because the app needs to assign a single label (the specific painting) to the entire photo. Azure Computer Vision's image classification models are trained to recognize and categorize entire images into predefined classes, which matches the requirement of identifying a painting from a database of thousands of artworks based on the visual content of the photo.

Exam trap

The trap here is that candidates confuse image classification (labeling the whole image) with object detection (locating objects within the image), but the requirement to identify the painting from a photo of the entire artwork makes classification the precise fit.

How to eliminate wrong answers

Option A is wrong because Optical Character Recognition (OCR) extracts text from images, not visual features of paintings; it would only work if the painting had a visible label or plaque. Option C is wrong because object detection identifies and locates multiple objects within an image (e.g., people, furniture) and returns bounding boxes, but the app needs to classify the entire painting as a single entity, not detect sub-objects. Option D is wrong because face detection specifically identifies human faces in images, which is irrelevant to recognizing a painting's artistic attributes.

180
MCQhard

A developer uses Azure OpenAI Service to generate product descriptions. They want to ensure that the model only considers the most likely tokens that together have a cumulative probability of 0.95, ignoring very low-probability tokens that could lead to nonsensical outputs. Which parameter should they configure?

A.Temperature
B.Top_p
C.Frequency penalty
D.Presence penalty
AnswerB

Correct. Top_p (nucleus sampling) sets a cumulative probability threshold so that only the most probable tokens that together reach that threshold are considered, eliminating very unlikely tokens.

Why this answer

Option B (Top_p) is correct because the developer wants to limit token selection to those with a cumulative probability of 0.95, which is exactly what the Top_p (nucleus sampling) parameter controls. By setting Top_p to 0.95, the model will only consider the smallest set of tokens whose combined probability mass reaches 0.95, effectively ignoring low-probability tokens that could produce nonsensical outputs.

Exam trap

The trap here is that candidates often confuse Top_p with Temperature, assuming both control randomness, but Temperature scales logits without filtering low-probability tokens, whereas Top_p directly removes them based on cumulative probability mass.

How to eliminate wrong answers

Option A (Temperature) is wrong because it controls the randomness of token selection by scaling the logits before applying softmax, not by filtering based on cumulative probability. Option C (Frequency penalty) is wrong because it reduces the likelihood of tokens that have already appeared in the generated text, aiming to avoid repetition, not to filter low-probability tokens. Option D (Presence penalty) is wrong because it penalizes tokens that have appeared at least once in the text, encouraging the model to introduce new topics, but does not perform cumulative probability filtering.

181
MCQmedium

A law firm receives hundreds of legal documents daily. They need to automatically extract key entities like names of parties, dates, jurisdictions, and also classify each document as 'contract', 'pleading', or 'memo'. Which combination of Azure AI Language features should they use?

A.Entity recognition and key phrase extraction
B.Entity recognition and custom text classification
C.Sentiment analysis and language detection
D.Summarization and conversation analysis
AnswerB

Entity recognition extracts specific entities like party names and dates; custom text classifies documents into the required types.

Why this answer

Option B is correct because the law firm needs both entity extraction (to identify parties, dates, jurisdictions) and document classification (contract, pleading, memo). Azure AI Language's prebuilt entity recognition handles the entity extraction, while custom text classification allows the firm to train a model to classify documents into their specific categories. This combination directly addresses both requirements without unnecessary features.

Exam trap

The trap here is that candidates often confuse key phrase extraction with entity recognition, assuming key phrases can replace entities, but key phrases are unstructured and not mapped to predefined categories like dates or jurisdictions.

How to eliminate wrong answers

Option A is wrong because key phrase extraction returns general important phrases (e.g., 'breach of contract'), not structured entities like dates or jurisdictions, and it does not classify documents into custom categories. Option C is wrong because sentiment analysis detects positive/negative/neutral tone, not entities or document types, and language detection only identifies the language of the text, neither of which meets the classification or entity extraction needs. Option D is wrong because summarization condenses text into a shorter version and conversation analysis is designed for dialogue between speakers (e.g., chat logs), not for extracting entities or classifying single-document legal texts.

182
MCQmedium

What is the difference between face detection and face identification?

A.Face detection identifies who the person is; face identification counts how many faces are present
B.Face detection finds face locations; face identification determines who the person is from an enrolled database
C.Face detection works on videos; face identification works on static images only
D.They are the same operation with different names
AnswerB

Detection = locating faces in an image; identification = matching detected faces to known individuals in an enrolled group.

Why this answer

Face detection is a computer vision task that locates human faces in an image or video, returning bounding box coordinates. Face identification (or recognition) goes a step further by matching a detected face against a database of enrolled individuals to determine a specific identity. Option B correctly distinguishes these two operations: detection finds where faces are, while identification determines who the person is.

Exam trap

The trap here is confusing the terms 'detection' and 'identification' as interchangeable, when in fact detection is a prerequisite for identification and they serve fundamentally different roles in a computer vision pipeline.

How to eliminate wrong answers

Option A is wrong because it reverses the definitions: face detection does not identify who the person is, and face identification does not count faces—that is a separate task called face counting. Option C is wrong because both face detection and face identification can work on both videos and static images; Azure Face API supports both modalities. Option D is wrong because they are distinct operations with different purposes and outputs—detection returns bounding boxes, identification returns identity matches from a person group.

183
MCQmedium

What is 'speech synthesis markup language' (SSML) used for in Azure AI Speech?

A.A programming language for writing speech recognition algorithms
B.An XML markup language for controlling TTS voice characteristics like pitch, rate, pauses, and pronunciation
C.A system for transcribing speech in real time to a database
D.A security protocol for encrypting speech API calls
AnswerB

SSML provides fine-grained control over how text is spoken — inserting pauses, changing pitch/rate, emphasizing words, and switching voices.

Why this answer

SSML is an XML-based markup language that allows you to fine-tune text-to-speech (TTS) output by controlling prosodic elements such as pitch, speaking rate, volume, and pronunciation. It also supports inserting pauses, specifying phonetic pronunciations, and adjusting emphasis, making it essential for generating natural-sounding speech in Azure AI Speech.

Exam trap

The trap here is that candidates confuse SSML with a general-purpose programming language or a transcription tool, when in fact it is a specialized XML markup for fine-tuning TTS output, not for speech recognition or real-time transcription.

How to eliminate wrong answers

Option A is wrong because SSML is not a programming language for writing speech recognition algorithms; it is a markup language for controlling TTS output, and speech recognition algorithms are built using models and APIs like Azure Speech-to-Text, not SSML. Option C is wrong because SSML does not perform real-time transcription; real-time transcription is handled by the Speech-to-Text API, while SSML is used exclusively for synthesizing speech from text. Option D is wrong because SSML is not a security protocol; encryption of API calls is managed by TLS/SSL and Azure security features, not by SSML.

184
MCQeasy

What is 'AI accountability' in Microsoft's Responsible AI principles?

A.Billing accountability — ensuring costs are tracked and charged to the correct Azure subscription
B.Humans remaining responsible for AI systems with oversight mechanisms and clear lines of accountability
C.AI systems reporting their own mistakes and triggering automatic self-correction
D.Holding AI vendors legally accountable for damages caused by their models
AnswerB

Accountability ensures AI doesn't operate without human responsibility — requiring oversight, audit trails, and clear ownership of AI outcomes.

Why this answer

Microsoft's Responsible AI principle of accountability means that humans are ultimately responsible for AI systems. This includes establishing oversight mechanisms, clear lines of accountability, and ensuring that AI systems are designed and operated under human control. It does not refer to billing, automatic self-correction, or vendor liability.

Exam trap

The trap here is that candidates confuse 'accountability' with technical automation (like self-correction) or legal liability, rather than understanding it as the human responsibility and oversight required by Microsoft's Responsible AI framework.

How to eliminate wrong answers

Option A is wrong because it confuses 'accountability' with Azure billing and subscription cost tracking, which is a financial operations (FinOps) concept, not a Responsible AI principle. Option C is wrong because it describes an autonomous self-healing system, which contradicts the principle that humans must remain responsible and in control; AI systems should not independently correct mistakes without human oversight. Option D is wrong because while legal liability may be a related topic, Microsoft's Responsible AI principle of accountability focuses on organizational and human responsibility, not on holding vendors legally accountable for damages.

185
MCQmedium

What is 'retrieval augmented generation' (RAG) and which Azure services typically implement it?

A.Using Azure Storage to retrieve training data for model fine-tuning
B.Combining Azure AI Search (retrieval) with Azure OpenAI (generation) to ground LLM responses in a knowledge base
C.Using Azure CDN to deliver AI-generated content faster globally
D.A method of compressing large datasets before training language models
AnswerB

RAG: AI Search retrieves relevant documents → provided as context to Azure OpenAI → LLM generates answers grounded in retrieved content.

Why this answer

Retrieval Augmented Generation (RAG) is a pattern that combines a retrieval step with a generative step. In Azure, this is typically implemented by using Azure AI Search to retrieve relevant documents or chunks from a knowledge base, then passing those results as context to an Azure OpenAI model (e.g., GPT-4) to generate a grounded, fact-based response. This approach reduces hallucinations and ensures the output is based on authoritative data rather than the model's training data alone.

Exam trap

The trap here is that candidates confuse RAG with fine-tuning, mistakenly thinking retrieval modifies the model's training data, whereas RAG is a prompt-time augmentation that leaves the model unchanged.

How to eliminate wrong answers

Option A is wrong because RAG does not involve fine-tuning the model; it retrieves external data at inference time to augment the prompt, not to update the model's weights. Option C is wrong because Azure CDN is a content delivery network for caching and accelerating static assets, not a component of the RAG pipeline which focuses on retrieval and generation. Option D is wrong because RAG is not a compression technique; it is an architecture that retrieves relevant information from a vector or keyword index to provide context for generation, leaving the dataset and model unchanged.

186
MCQmedium

A legal firm needs to automatically sort incoming legal documents into predefined categories such as 'Contract', 'Brief', 'Motion', and 'Discovery'. They have a set of 500 manually labeled documents to use as examples. Which Azure AI Language feature should they use to build this classification system?

A.Custom text classification
B.Key phrase extraction
C.Named entity recognition
D.Sentiment analysis
AnswerA

Custom text classification is designed to classify documents into user-defined categories using labeled examples, perfect for this scenario.

Why this answer

Custom text classification is the correct choice because it allows the legal firm to train a model using their 500 labeled documents to classify text into predefined categories like 'Contract', 'Brief', 'Motion', and 'Discovery'. This feature enables supervised learning where the model learns from labeled examples to automatically sort incoming documents, which is exactly the requirement described.

Exam trap

The trap here is that candidates may confuse custom text classification with built-in features like key phrase extraction or named entity recognition, mistakenly thinking those can perform document-level categorization when they are designed for different NLP tasks.

How to eliminate wrong answers

Option B (Key phrase extraction) is wrong because it extracts significant terms or phrases from text without assigning documents to predefined categories, so it cannot sort documents into 'Contract', 'Brief', etc. Option C (Named entity recognition) is wrong because it identifies and classifies entities like people, organizations, or dates in text, but does not categorize entire documents into user-defined classes. Option D (Sentiment analysis) is wrong because it determines the emotional tone (positive, negative, neutral) of text, which is irrelevant for sorting legal documents by document type.

187
MCQmedium

A large company deploys an AI system to screen job applications and recommend candidates for interviews. After six months, an audit reveals that the system recommends candidates from certain ethnic groups at a much lower rate than others, even when those candidates have similar qualifications. Which Microsoft responsible AI principle is most directly violated?

A.Inclusiveness
B.Fairness
C.Reliability and safety
D.Privacy and security
AnswerB

Fairness requires that AI systems do not discriminate against individuals or groups. The system's biased recommendations based on ethnicity directly violate this principle.

Why this answer

The scenario describes an AI system that produces biased outcomes against certain ethnic groups despite similar qualifications, which directly violates the Fairness principle. Fairness in responsible AI requires that systems treat all people equitably and do not discriminate based on sensitive attributes like ethnicity, race, or gender. The audit finding shows the system is not fair, as it systematically disadvantages specific groups.

Exam trap

The trap here is that candidates may confuse Fairness with Inclusiveness, but Inclusiveness is about accessibility and broad user engagement, not about preventing discriminatory bias in model outcomes.

How to eliminate wrong answers

Option A is wrong because Inclusiveness focuses on designing AI to empower and engage everyone, including people with disabilities, but does not directly address the discriminatory bias in candidate selection. Option C is wrong because Reliability and safety concerns whether the AI system performs consistently and safely under expected conditions, not the fairness of its recommendations across demographic groups. Option D is wrong because Privacy and security deals with protecting personal data and preventing unauthorized access, not with biased outcomes in decision-making.

188
MCQmedium

Which responsible AI principle ensures that AI systems work reliably across different conditions and for all users, including those from different demographics?

A.Privacy
B.Reliability and safety
C.Transparency
D.Accountability
AnswerB

Reliability and safety ensures AI systems work dependably for all users under varied conditions and fail safely when errors occur.

Why this answer

The Reliability and safety principle ensures that AI systems perform consistently and correctly under a wide range of conditions, including edge cases and diverse demographic groups. This principle requires rigorous testing, validation, and monitoring to prevent failures or biased outcomes that could harm users. In the context of AI-900, this principle directly addresses the need for systems to work reliably for all users, regardless of age, gender, ethnicity, or other demographic factors.

Exam trap

Microsoft often tests the trap where candidates confuse 'Reliability and safety' with 'Transparency' because both involve user trust, but reliability is about consistent performance across conditions, while transparency is about explainability of decisions.

How to eliminate wrong answers

Option A (Privacy) is wrong because privacy focuses on protecting user data and controlling how personal information is collected, stored, and used, not on ensuring consistent performance across conditions or demographics. Option C (Transparency) is wrong because transparency is about making AI systems understandable and explainable to users, such as disclosing how decisions are made, not about operational reliability across different user groups. Option D (Accountability) is wrong because accountability deals with assigning responsibility for AI system outcomes and ensuring human oversight, not with the technical robustness of the system under varying conditions.

189
MCQmedium

A marketing team wants to automatically analyze thousands of customer reviews to identify the most commonly discussed aspects, such as 'price', 'durability', or 'customer service'. They do not have any labeled data for custom training. Which prebuilt Azure AI Language feature should they use?

A.Key phrase extraction
B.Sentiment analysis
C.Entity recognition
D.Language detection
AnswerA

Correct because key phrase extraction automatically identifies the most important words and phrases that summarize the main topics discussed in a document. It directly answers the need to find commonly discussed aspects like 'price' and 'durability'.

Why this answer

Key phrase extraction is the correct choice because it automatically identifies the most important points or topics (like 'price', 'durability', 'customer service') from unstructured text without requiring any labeled training data. This prebuilt Azure AI Language feature is designed specifically to surface commonly discussed aspects from large volumes of text, making it ideal for analyzing thousands of customer reviews.

Exam trap

The trap here is that candidates often confuse 'key phrase extraction' with 'entity recognition', mistakenly thinking that named entities like 'price' or 'customer service' are entities, when in fact they are general concepts extracted as key phrases, not predefined entity categories.

How to eliminate wrong answers

Option B is wrong because sentiment analysis determines the emotional tone (positive, negative, neutral) of text, not the specific aspects or topics being discussed. Option C is wrong because entity recognition identifies named entities such as people, organizations, and locations, not general product aspects like 'price' or 'durability'. Option D is wrong because language detection identifies the language in which the text is written, which is irrelevant to extracting discussed aspects from reviews.

190
MCQmedium

A data scientist is building a classification model to predict customer churn. The dataset has only 5% churn cases. The model achieves 95% accuracy on the test set, but upon investigation, the data scientist finds the model predicts 'not churn' for nearly every customer. Which metric should the data scientist primarily use to evaluate the model's performance on this imbalanced dataset?

A.Accuracy
B.F1 score
C.Mean Absolute Error (MAE)
D.R-squared
AnswerB

The F1 score is the harmonic mean of precision and recall, making it a robust metric for imbalanced datasets as it accounts for false positives and false negatives.

Why this answer

In an imbalanced dataset with only 5% churn, a model that predicts 'not churn' for every case achieves 95% accuracy by always guessing the majority class. This accuracy is misleading because it fails to identify any churn cases. The F1 score (option B) is the harmonic mean of precision and recall, making it the primary metric for evaluating classification performance on imbalanced data, as it penalizes both false positives and false negatives and is not skewed by class imbalance.

Exam trap

The trap here is that candidates often default to accuracy as the primary metric for classification, failing to recognize that on imbalanced datasets, accuracy can be artificially high and misleading, while the F1 score provides a more truthful evaluation of minority class prediction.

How to eliminate wrong answers

Option A is wrong because accuracy is misleading on imbalanced datasets; a model that always predicts the majority class can achieve high accuracy (95%) while failing to detect any minority class (churn) cases. Option C is wrong because Mean Absolute Error (MAE) is a regression metric that measures average absolute differences between predicted and actual continuous values, not classification performance. Option D is wrong because R-squared is a regression metric that indicates the proportion of variance in the dependent variable explained by the model, and it is not applicable to classification tasks or imbalanced class evaluation.

191
MCQmedium

A multinational corporation receives customer support emails in multiple languages. They need to automatically identify the language of each email so it can be routed to the appropriate support team. Which Azure AI Language feature should they use?

A.Sentiment analysis
B.Key phrase extraction
C.Language detection
D.Entity recognition
AnswerC

Language detection automatically identifies the language of the input text, making it the correct feature for this routing scenario.

Why this answer

Language detection is the correct Azure AI Language feature because it is specifically designed to identify the written language of text input. The multinational corporation's requirement to automatically determine the language of each email for routing directly matches the core functionality of this prebuilt capability, which returns a language name and ISO 639-1 code for each document.

Exam trap

The trap here is that candidates may confuse language detection with sentiment analysis or key phrase extraction because all three are Natural Language Processing features, but only language detection answers the 'which language?' question directly.

How to eliminate wrong answers

Option A is wrong because sentiment analysis evaluates the emotional tone (positive, negative, neutral) of text, not the language it is written in. Option B is wrong because key phrase extraction identifies important terms and concepts within text but does not determine the language of the text. Option D is wrong because entity recognition identifies and categorizes named entities (e.g., people, places, organizations) in text, not the language of the text.

192
MCQeasy

A museum wants to automatically transcribe handwritten labels on historical artifacts. The handwriting varies in style and may include numbers and special characters. Which Azure Computer Vision capability should they use?

A.Image captioning
B.Optical Character Recognition (OCR)
C.Facial recognition
D.Object detection
AnswerB

OCR extracts text from images, including handwritten text, numbers, and special characters, which matches the requirement.

Why this answer

Optical Character Recognition (OCR) is the correct choice because it is specifically designed to extract printed or handwritten text from images, including numbers and special characters. Azure Computer Vision's OCR API can handle varied handwriting styles and convert them into machine-readable text, making it ideal for transcribing historical artifact labels.

Exam trap

The trap here is that candidates may confuse OCR with image captioning, thinking both can 'read' text, but captioning describes the image contextually rather than extracting exact characters.

How to eliminate wrong answers

Option A is wrong because image captioning generates a natural language description of the overall scene or objects in an image, not the extraction of specific text characters. Option C is wrong because facial recognition identifies or verifies individuals based on facial features, which is unrelated to text transcription. Option D is wrong because object detection identifies and locates objects (e.g., vases, tools) within an image, but it does not read or transcribe any text present on those objects.

193
Matchingmedium

Match each Azure AI service to its primary capability.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

AI-powered cloud search service

Build conversational AI bots

Extract information from documents

Extract insights from videos

Monitor and detect anomalies in metrics

Why these pairings

These Azure AI services each focus on a specific AI capability.

194
MCQhard

A city deploys an AI system that automatically issues parking fines based on camera images. A citizen disputes a fine, claiming the system misidentified their car. The city cannot provide an explanation of how the system reached its decision because the model is too complex to interpret. Which Microsoft responsible AI principle is most directly violated?

A.Transparency
B.Privacy and security
C.Inclusiveness
D.Reliability and safety
AnswerA

Correct. Transparency requires that AI systems be understandable and that their decisions can be explained.

Why this answer

The city cannot explain how the AI system reached its decision, which directly violates the transparency principle. Transparency requires that AI systems be understandable and that organizations provide meaningful explanations of their behavior, especially when decisions have legal or financial consequences. The inability to interpret the model's reasoning prevents the citizen from understanding or challenging the fine, undermining trust and accountability.

Exam trap

The trap here is that candidates may confuse 'transparency' with 'reliability and safety', assuming that if the system works accurately, no principle is violated, but the core issue is the inability to explain the decision, not the system's correctness.

How to eliminate wrong answers

Option B (Privacy and security) is wrong because the scenario does not involve unauthorized data access, data breaches, or failure to protect personal information; the issue is about explainability, not data protection. Option C (Inclusiveness) is wrong because the problem is not about bias or accessibility for diverse user groups, but about the lack of interpretability of the model's decision. Option D (Reliability and safety) is wrong because the system may be functioning correctly and safely from a technical standpoint; the violation is the inability to provide an explanation, not a failure in accuracy or safety.

195
MCQmedium

A data scientist trains a multiclass classification model to identify different species of flowers (Iris setosa, Iris virginica, Iris versicolor). The overall accuracy is 94%, but the accuracy for the Iris virginica class is only 60%. Which additional metric should the data scientist examine to better understand the model's performance on the minority class?

A.Precision
B.Recall
C.F1-score
D.Mean Absolute Error (MAE)
AnswerC

F1-score is the harmonic mean of precision and recall. It provides a single metric that balances both for a specific class, making it ideal for evaluating performance on the underperforming Iris virginica class.

Why this answer

The F1-score is the harmonic mean of precision and recall, providing a single metric that balances both false positives and false negatives. Since the model has high overall accuracy but poor performance on the minority class (Iris virginica), the F1-score is ideal for evaluating the model's effectiveness on that class, as it accounts for class imbalance better than accuracy alone.

Exam trap

The trap here is that candidates often choose precision or recall individually, not realizing that the F1-score is specifically designed to combine both metrics and is the standard choice for evaluating performance on imbalanced classes in classification tasks.

How to eliminate wrong answers

Option A is wrong because precision alone measures the proportion of true positive predictions among all positive predictions, but it does not consider false negatives, so it cannot fully capture the model's weakness on the minority class. Option B is wrong because recall alone measures the proportion of actual positives correctly identified, but it ignores false positives, providing an incomplete picture of performance on the minority class. Option D is wrong because Mean Absolute Error (MAE) is a regression metric that measures average absolute differences between predicted and actual values, and it is not applicable to classification tasks like multiclass flower species identification.

196
Matchingmedium

Match each Azure AI service to its pricing model.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Pay per transaction or per API call

Pay per message or channel

Pay per training hour and prediction

Pay per token (input and output)

Pay per storage and queries

Why these pairings

Pricing varies by service and usage.

197
MCQeasy

Which Azure AI service can detect the language of a text input and return the identified language name and confidence score?

A.Azure AI Vision
B.Azure AI Language (language detection)
C.Azure AI Translator
D.Azure AI Document Intelligence
AnswerB

Azure AI Language's language detection feature identifies the language of text input with a confidence score.

Why this answer

Azure AI Language's language detection feature is specifically designed to identify the language of a text input, returning both the language name and a confidence score between 0 and 1. This capability is part of the Natural Language Processing (NLP) workloads within Azure AI Language, making option B the correct choice for this task.

Exam trap

The trap here is that candidates often confuse Azure AI Translator's ability to detect language as a side effect with the dedicated language detection feature in Azure AI Language, which explicitly returns a confidence score and is the correct service for this specific requirement.

How to eliminate wrong answers

Option A is wrong because Azure AI Vision is focused on analyzing images and video content (e.g., object detection, OCR), not text language detection. Option C is wrong because Azure AI Translator translates text between languages but does not natively return a confidence score for language identification; it relies on language detection as a sub-step, not as a primary output. Option D is wrong because Azure AI Document Intelligence (formerly Form Recognizer) extracts structured data from documents (e.g., forms, invoices) and does not include a dedicated language detection feature with confidence scores.

198
MCQeasy

What is the Azure AI Vision Image Analysis 4.0's 'Florence' foundation model capable of?

A.Only detecting faces in images
B.Advanced image understanding including detailed captions, dense captioning, and multimodal embeddings
C.Only processing medical imaging for diagnostic purposes
D.Converting images into 3D models
AnswerB

Florence foundation model enables detailed image captioning, multi-region dense captions, background removal, and vision-language embeddings.

Why this answer

Option B is correct because the Florence foundation model in Azure AI Vision Image Analysis 4.0 is a multimodal model designed for advanced image understanding. It can generate detailed image captions, produce dense captions (describing multiple regions within an image), and create multimodal embeddings that align visual and textual representations for tasks like image search and similarity.

Exam trap

The trap here is that candidates may assume 'foundation model' only applies to language tasks (like GPT) and overlook that Florence is a multimodal vision-language model, leading them to choose a narrow option like face detection or medical imaging.

How to eliminate wrong answers

Option A is wrong because the Florence model goes far beyond face detection; it is a general-purpose vision model capable of scene understanding, object recognition, and captioning, not limited to facial analysis. Option C is wrong because Florence is not specialized for medical imaging; Azure AI Vision offers separate healthcare-specific APIs (e.g., Medical Imaging) for diagnostic purposes, but Florence is a general foundation model. Option D is wrong because Florence does not convert images into 3D models; 3D model generation is not a capability of Image Analysis 4.0, which focuses on 2D image understanding and metadata extraction.

199
MCQmedium

What is 'healthcare AI' and what capabilities does Azure provide for it?

A.AI that gives patients direct medical advice as a substitute for doctors
B.AI for extracting medical entities, radiology insights, clinical trial matching, and patient analysis
C.A hospital management system for scheduling, billing, and patient record management
D.AI that monitors patients' vitals in real time using IoT medical devices
AnswerB

Azure healthcare AI covers clinical NLP, radiology AI, trial matching, and patient insights — augmenting clinical workflows.

Why this answer

Option B is correct because healthcare AI refers to AI solutions tailored for the healthcare industry, and Azure provides specific capabilities such as extracting medical entities (e.g., symptoms, medications) via Azure Health Bot and Text Analytics for Health, analyzing radiology images with Azure AI Vision, matching patients to clinical trials using Azure Cognitive Services, and performing patient analysis with Azure Machine Learning. These capabilities support clinical decision-making and operational efficiency without replacing doctors.

Exam trap

The trap here is that candidates confuse general healthcare IT systems (like scheduling or IoT monitoring) with AI-specific workloads, or assume AI replaces doctors, when Azure's healthcare AI is strictly an assistive technology for extracting insights and supporting clinical workflows.

How to eliminate wrong answers

Option A is wrong because healthcare AI is designed to assist healthcare professionals, not to give direct medical advice as a substitute for doctors; Azure's AI tools are decision-support systems, not autonomous diagnosticians. Option C is wrong because a hospital management system for scheduling, billing, and patient record management is a traditional IT system, not an AI workload; Azure provides such systems via Azure Health Data Services, but the question specifically asks about AI capabilities. Option D is wrong because while Azure IoT Hub can monitor patients' vitals in real time, that is an IoT workload, not a core healthcare AI capability; healthcare AI focuses on data analysis and insights, not raw device monitoring.

200
MCQmedium

What is 'Azure AI Custom Vision' and how does it differ from Azure AI Vision?

A.Azure AI Vision is for video; Custom Vision is for still images only
B.Azure AI Vision offers pre-built general models; Custom Vision lets you train models for your specific categories
C.Custom Vision is more expensive because it uses more advanced AI algorithms
D.Azure AI Vision requires GPU compute; Custom Vision runs on CPU only
AnswerB

Custom Vision trains on your labelled images for domain-specific classification or detection — while Azure AI Vision's models are general-purpose.

Why this answer

Azure AI Vision provides pre-trained models for common computer vision tasks like object detection, OCR, and image analysis without requiring custom training data. Azure AI Custom Vision, on the other hand, allows you to upload your own labeled images and train a model to recognize specific categories or objects that are unique to your business scenario. This distinction makes B correct because it highlights the key difference: pre-built general models versus custom-trained models.

Exam trap

The trap here is that candidates often confuse 'Custom Vision' with being a more advanced or expensive version of Azure AI Vision, when in fact the core distinction is about customization versus pre-built functionality, not cost or hardware requirements.

How to eliminate wrong answers

Option A is wrong because Azure AI Vision is not limited to video; it supports both images and video analysis, while Custom Vision also works with still images and can be used for image classification and object detection. Option C is wrong because Custom Vision is not inherently more expensive due to 'more advanced AI algorithms'; pricing is based on compute time, training hours, and prediction API calls, not on algorithm complexity, and both services use similar underlying deep learning techniques. Option D is wrong because neither service strictly requires GPU compute; both can run on CPU-based infrastructure, though GPU acceleration may be used for training in Custom Vision to improve speed, but it is not a mandatory requirement.

201
MCQmedium

A customer support team wants to create a chatbot that can answer common questions about employee benefits. They have a PDF document containing a list of frequently asked questions with their answers. Which Azure AI Language feature should they use to build a solution that extracts answers directly from this content?

A.Sentiment Analysis
B.Key Phrase Extraction
C.Custom Question Answering
D.Text Analytics for Health
AnswerC

Custom Question Answering creates a knowledge base from provided content, such as FAQ documents, and provides answers to user queries based on that knowledge base.

Why this answer

Custom Question Answering (C) is the correct choice because it is specifically designed to ingest documents like PDFs and extract question-answer pairs from them, enabling a chatbot to respond directly with answers from the content. This feature uses a pre-built or custom knowledge base to match user queries to the most relevant answer, making it ideal for the described scenario.

Exam trap

The trap here is that candidates often confuse Key Phrase Extraction (B) with question answering, not realizing that Key Phrase Extraction only identifies terms without providing direct answers, while Custom Question Answering is the only feature that returns extracted answers from a document.

How to eliminate wrong answers

Option A is wrong because Sentiment Analysis detects positive, negative, or neutral sentiment in text, not extracting answers from documents. Option B is wrong because Key Phrase Extraction identifies important words or phrases but does not map questions to answers or provide direct responses. Option D is wrong because Text Analytics for Health is specialized for extracting medical entities and relationships from clinical documents, not general FAQ content.

202
MCQeasy

A research organization is developing an AI system to assist with medical diagnosis. They want to ensure that if the system makes an error, there is a clear process for auditing and determining responsibility. Which Microsoft responsible AI principle is most relevant?

A.Privacy and Security
B.Accountability
C.Inclusiveness
D.Transparency
AnswerB

Accountability means that the organization takes responsibility for the AI system's outcomes and has mechanisms for auditing and governance, which directly addresses the need for a clear process when errors occur.

Why this answer

Accountability is the Microsoft responsible AI principle that requires organizations to define and maintain clear processes for auditing, reviewing, and taking responsibility for AI system outcomes. In this scenario, the need for a clear process to audit errors and determine responsibility directly aligns with accountability, which mandates that AI systems have governance structures, human oversight, and audit trails to assign ownership for decisions and mistakes.

Exam trap

The trap here is that candidates often confuse transparency (making AI explainable) with accountability (having a process to assign responsibility), but transparency alone does not ensure that someone is held responsible for errors or that an audit trail exists.

How to eliminate wrong answers

Option A (Privacy and Security) is wrong because it focuses on protecting data confidentiality and system integrity, not on establishing processes for error auditing and responsibility assignment. Option C (Inclusiveness) is wrong because it addresses designing AI to empower and include diverse user groups, not the governance and audit mechanisms needed when errors occur. Option D (Transparency) is wrong because while it involves making AI decisions understandable, it does not specifically require a defined process for auditing errors and determining who is responsible; transparency is about communication, not accountability workflows.

203
MCQmedium

What is the difference between 'supervised' and 'unsupervised' learning?

A.Supervised learning requires a human to watch the training process; unsupervised runs automatically
B.Supervised learning trains on labelled data; unsupervised discovers patterns in unlabelled data
C.Supervised learning uses neural networks; unsupervised uses decision trees only
D.Unsupervised learning always produces better results than supervised learning
AnswerB

Supervised = learn from labeled input-output pairs. Unsupervised = find structure in unlabelled data (clusters, anomalies, components).

Why this answer

Option B is correct because supervised learning requires a labeled dataset where each training example has an input-output pair, allowing the model to learn a mapping from inputs to known outputs. In contrast, unsupervised learning works with unlabeled data, and the model must find inherent patterns, groupings, or structures (e.g., clustering or dimensionality reduction) without any predefined labels. This distinction is fundamental to choosing the right machine learning approach in Azure, such as using Azure Machine Learning for supervised tasks like regression/classification or Azure Cognitive Services for unsupervised clustering.

Exam trap

The trap here is that candidates confuse the need for human oversight with the technical definition of supervision, mistakenly thinking 'supervised' means a human must monitor the process, when it actually refers to the presence of labeled training data.

How to eliminate wrong answers

Option A is wrong because supervised learning does not require a human to watch the training process; both supervised and unsupervised learning can run automatically once the data and algorithm are configured. Option C is wrong because both supervised and unsupervised learning can use neural networks (e.g., supervised CNNs for image classification, unsupervised autoencoders for anomaly detection), and neither is restricted to decision trees. Option D is wrong because unsupervised learning does not always produce better results; the quality depends on the problem, data, and evaluation metrics—supervised learning often outperforms when high-quality labeled data is available.

204
MCQmedium

A logistics company needs to automatically read shipping labels on packages, which include text printed in various fonts and sizes, as well as handwritten addresses. Which Azure Computer Vision capability should they use?

A.Optical Character Recognition (OCR) via the Read API
B.Dense Captioning
C.Image Analysis - Object Detection
D.Image Analysis - Tagging
AnswerA

The Read API extracts text from images, including handwritten and printed text, making it ideal for reading shipping labels.

Why this answer

The Read API is the correct choice because it is specifically designed for extracting printed and handwritten text from images, handling various fonts, sizes, and styles. This makes it ideal for reading shipping labels that contain both machine-printed text and handwritten addresses.

Exam trap

The trap here is that candidates may confuse general image analysis capabilities (like tagging or object detection) with text extraction, not realizing that OCR via the Read API is the dedicated service for reading text from images.

How to eliminate wrong answers

Option B is wrong because Dense Captioning generates descriptive captions for regions of an image, not text extraction. Option C is wrong because Object Detection identifies and locates objects (e.g., boxes, pallets) but does not read text. Option D is wrong because Image Analysis - Tagging assigns descriptive tags to the entire image (e.g., 'package', 'label') but does not extract the textual content.

205
Multi-Selectmedium

A customer support team wants to automatically analyze incoming emails to (1) determine the overall emotional tone (e.g., frustrated, satisfied) and (2) identify specific key phrases that indicate the reason for contact (e.g., 'return item', 'refund policy'). Which two Azure AI Language features should they use? (Choose two.)

Select 2 answers
A.Sentiment analysis
B.Key phrase extraction
C.Entity recognition
D.Language detection
AnswersA, B

Correct: Determines the emotional tone of the text.

Why this answer

Sentiment analysis is the correct choice because it evaluates text to determine the overall emotional tone, such as frustration or satisfaction, by assigning a sentiment score (positive, negative, neutral, or mixed) at the document and sentence level. This directly addresses the requirement to analyze the emotional tone of incoming emails.

Exam trap

The trap here is that candidates often confuse entity recognition with key phrase extraction, but entity recognition focuses on predefined categories (e.g., person, location) while key phrase extraction identifies context-specific important terms not limited to named entities.

206
MCQeasy

A company wants to build a chatbot that answers customer questions using only their internal knowledge base, which consists of several PDFs and Word documents. They do not want the chatbot to use any information from the model's pre-trained knowledge. Which Azure OpenAI feature should they use to achieve this?

A.Content filtering
B.Prompt flow
C.Azure OpenAI on your data
D.Temperature parameter
AnswerC

This feature integrates the model with your own data sources (e.g., PDFs, databases) to generate responses based exclusively on your data, overriding the model's general knowledge.

Why this answer

Azure OpenAI on your data allows you to connect Azure OpenAI models to your own data sources (such as PDFs and Word documents) and restrict the model to generate responses solely from that data, without using the model's pre-trained knowledge. This is achieved by indexing the documents into an Azure Cognitive Search index and using retrieval-augmented generation (RAG) to ground the model's responses in your specific content.

Exam trap

The trap here is that candidates often confuse prompt engineering techniques (like setting temperature or using Prompt flow) with the data grounding mechanism provided by Azure OpenAI on your data, mistakenly thinking they can control knowledge sources through parameters or workflow tools.

How to eliminate wrong answers

Option A is wrong because content filtering is a safety feature that blocks harmful or policy-violating content in inputs and outputs, but it does not restrict the model's knowledge source to your own data. Option B is wrong because Prompt flow is a development tool for building and orchestrating AI workflows, not a feature that confines the model's knowledge to your documents. Option D is wrong because the temperature parameter controls the randomness of the model's responses, not the source of information the model uses.

207
MCQmedium

What is 'AI-assisted labelling' in Azure Machine Learning data labelling?

A.Automatically generating descriptive captions for images using a pre-trained model
B.Using a partially trained model to pre-populate labels that human annotators verify and correct
C.Deploying a model to production without any human review of its outputs
D.Using AI to detect and remove incorrectly labelled examples from a completed dataset
AnswerB

AI-assisted labelling speeds annotation by having the model guess labels — humans only review and fix, reducing effort dramatically.

Why this answer

AI-assisted labelling in Azure Machine Learning uses a partially trained model to automatically suggest labels for unlabelled data. Human annotators then review and correct these suggestions, which speeds up the labelling process while maintaining quality. This is a form of active learning where the model iteratively improves as more labelled data is verified.

Exam trap

The trap here is confusing AI-assisted labelling with fully automated AI tasks (like image captioning or model deployment) and overlooking the critical human-in-the-loop verification step that distinguishes this feature from pure automation.

How to eliminate wrong answers

Option A is wrong because automatically generating descriptive captions for images using a pre-trained model is a computer vision task (image captioning), not a data labelling technique in Azure ML. Option C is wrong because deploying a model without human review contradicts the core purpose of AI-assisted labelling, which requires human verification to ensure label accuracy. Option D is wrong because detecting and removing incorrectly labelled examples is a data cleaning or quality assurance step, not the AI-assisted labelling workflow that pre-populates labels for human review.

208
MCQmedium

What is 'retrieval-augmented generation' (RAG) and what problem does it solve?

A.Storing model responses in a cache to retrieve them faster for repeated questions
B.Retrieving relevant documents from a knowledge base to provide accurate context for LLM responses
C.Generating random responses and selecting the most relevant using a ranker model
D.A technique for making LLM responses shorter by removing irrelevant sections
AnswerB

RAG grounds LLM answers in retrieved documents — solving hallucination, knowledge cutoff, and private data limitations.

Why this answer

Retrieval-augmented generation (RAG) combines a retrieval step with a generative language model. It first retrieves relevant documents or passages from an external knowledge base (e.g., Azure Cognitive Search) and then feeds that context into the LLM to ground its response. This solves the problem of LLMs producing outdated, hallucinated, or factually incorrect answers by ensuring the model has access to current, authoritative information.

Exam trap

The trap here is that candidates confuse RAG with simple caching or response shortening, overlooking that the core innovation is grounding generation in externally retrieved, up-to-date knowledge rather than relying solely on the model's parametric memory.

How to eliminate wrong answers

Option A is wrong because caching model responses improves latency for repeated queries but does not address factual accuracy or grounding; it is a performance optimization, not a solution for hallucination or outdated knowledge. Option C is wrong because generating random responses and then ranking them is not how RAG works; RAG retrieves relevant documents first, then generates a single response grounded in that context, not a random selection. Option D is wrong because RAG is about augmenting the input with retrieved context, not about shortening responses; truncation or summarization techniques are separate concerns.

209
MCQmedium

A developer is using Azure OpenAI Service to generate structured data in JSON format. They want to ensure that every response is valid JSON without adding instructions in every prompt. Which Azure OpenAI feature should they configure?

A.Set the temperature parameter to a low value (e.g., 0).
B.Set the top_p parameter to a high value (e.g., 1).
C.Set the response_format parameter to 'json_object'.
D.Set the max_tokens parameter to a high value (e.g., 2000).
AnswerC

Setting response_format to 'json_object' instructs the model to output valid JSON, which is exactly what the developer needs for structured data generation.

Why this answer

Option C is correct because Azure OpenAI Service provides a `response_format` parameter that can be set to `json_object`, which instructs the model to always return valid JSON output. This ensures structured data without requiring the developer to include formatting instructions in every prompt, as the service enforces JSON schema compliance at the API level.

Exam trap

The trap here is that candidates often confuse parameters that control randomness (temperature, top_p) or output length (max_tokens) with those that enforce output structure, leading them to incorrectly assume that low temperature alone can produce consistent JSON formatting.

How to eliminate wrong answers

Option A is wrong because setting the temperature parameter to a low value (e.g., 0) reduces randomness and makes output more deterministic, but it does not enforce any specific output format like JSON; it only controls creativity. Option B is wrong because setting top_p to a high value (e.g., 1) allows the model to consider a wider range of token probabilities, increasing diversity, but it does not guarantee structured JSON output. Option D is wrong because setting max_tokens to a high value (e.g., 2000) only controls the maximum length of the response, not its format; it cannot ensure the output is valid JSON.

210
MCQmedium

An autonomous driving company is developing a system that needs to understand the road scene at a granular level. For each pixel in a camera image, the system must classify whether it belongs to the road, a pedestrian, a vehicle, a traffic sign, or the sky. Which Azure Computer Vision capability should they use?

A.Image classification
B.Object detection
C.Semantic segmentation
D.Optical character recognition (OCR)
AnswerC

Correct. Semantic segmentation classifies every pixel, providing a dense understanding of the scene.

Why this answer

Semantic segmentation is the correct choice because it classifies every pixel in an image into a predefined category, such as road, pedestrian, vehicle, traffic sign, or sky. This pixel-level classification is essential for autonomous driving to understand the road scene at a granular level, enabling precise boundary detection and scene understanding.

Exam trap

The trap here is that candidates confuse object detection with pixel-level classification, assuming bounding boxes provide enough detail, but semantic segmentation is required for granular scene understanding where every pixel matters.

How to eliminate wrong answers

Option A is wrong because image classification assigns a single label to the entire image, not individual pixels, so it cannot distinguish between road, pedestrian, and sky in the same scene. Option B is wrong because object detection identifies and locates objects with bounding boxes, but it does not classify every pixel, missing fine-grained boundaries like the edge of a road or the shape of a traffic sign. Option D is wrong because optical character recognition (OCR) extracts text from images, such as reading a speed limit sign, but it does not classify pixels into scene categories like road or sky.

211
MCQmedium

What is 'few-shot prompting' and how does it improve model outputs?

A.Training a model with very few labelled examples using transfer learning
B.Including a small number of input-output examples in the prompt to demonstrate the desired task format
C.Generating a short (few-shot) response rather than a detailed answer
D.Running the model for only a few seconds to save compute costs
AnswerB

Few-shot prompting provides task demonstrations in the prompt — no training required, just examples that show the model what's expected.

Why this answer

Few-shot prompting improves model outputs by providing a small number of input-output examples directly in the prompt, which helps the model understand the desired task format, style, or reasoning pattern without requiring any fine-tuning or retraining. This technique leverages the model's in-context learning ability to generalize from the given examples and produce more accurate, consistent responses.

Exam trap

The trap here is that candidates confuse 'few-shot' with 'fewer training data' or 'shorter responses,' when the term specifically refers to the number of examples included in the prompt to guide the model's output.

How to eliminate wrong answers

Option A is wrong because few-shot prompting does not involve training or updating model weights; it relies on in-context learning within a single prompt, not transfer learning or additional training with labelled examples. Option C is wrong because 'few-shot' refers to the number of examples in the prompt, not the length of the response; the model can still generate detailed answers. Option D is wrong because few-shot prompting has nothing to do with compute time or cost savings; it is a prompt engineering technique that may actually increase token usage and latency.

212
MCQhard

A hospital deploys an AI diagnostic system that achieves 95% accuracy overall. However, for patients from a specific minority ethnic group, the accuracy drops to 60%. The hospital decides to continue using the system because the overall accuracy is acceptable. Which Microsoft responsible AI principle is most directly violated by this decision?

A.Fairness
B.Inclusiveness
C.Transparency
D.Accountability
AnswerA

Fairness requires that AI systems perform consistently across different demographic groups. Here, the minority group receives significantly worse diagnostic accuracy, violating this principle.

Why this answer

The decision to continue using the system despite a 60% accuracy for a minority ethnic group directly violates the Fairness principle. Fairness requires that AI systems treat all groups equitably and avoid discrimination, even if overall metrics are high. A 35% accuracy gap between groups indicates systemic bias, which the hospital is ignoring by prioritizing aggregate performance over equitable outcomes.

Exam trap

The trap here is that candidates confuse 'overall accuracy' with 'system quality' and fail to recognize that Fairness requires equal performance across all subgroups, not just a high average.

How to eliminate wrong answers

Option B (Inclusiveness) is wrong because inclusiveness focuses on designing systems that benefit all people, including those with disabilities or diverse needs, not specifically on equal accuracy across demographic groups. Option C (Transparency) is wrong because transparency concerns openness about how and when AI is used, not the ethical obligation to correct performance disparities. Option D (Accountability) is wrong because accountability refers to who is responsible for the system's outcomes, not the direct principle that prohibits discriminatory performance gaps.

213
MCQmedium

What is 'customer churn prediction' as an AI workload and what ML type does it use?

A.Analysing customer complaints to identify the root cause of service dissatisfaction
B.Using supervised classification to predict which customers are likely to cancel or become inactive
C.Detecting when a customer has already churned based on their last login date
D.Using NLP to understand why customers write negative reviews before leaving
AnswerB

Churn prediction trains on labelled historical data (churned/retained) — enabling proactive retention targeting of high-risk customers.

Why this answer

Customer churn prediction is a supervised machine learning workload where historical customer data (e.g., usage patterns, support interactions, billing history) is used to train a classification model. The model learns to assign a binary label (churn or not churn) to new customers, making it a supervised classification task. This directly matches option B, which correctly identifies the use of supervised classification to predict likely churners.

Exam trap

The trap here is that candidates confuse descriptive analytics (analyzing why churn happened) with predictive analytics (forecasting who will churn), leading them to pick option A or D, which describe post-hoc analysis rather than supervised classification.

How to eliminate wrong answers

Option A is wrong because analyzing customer complaints to identify root causes is a descriptive analytics or root cause analysis task, not a predictive churn model; it does not involve supervised classification to forecast future behavior. Option C is wrong because detecting that a customer has already churned based on last login date is a rule-based or anomaly detection task (often unsupervised or simple thresholding), not a predictive model that forecasts future churn. Option D is wrong because using NLP to understand why customers write negative reviews is a sentiment analysis or topic modeling workload, which is typically unsupervised or uses text classification, but it does not predict which customers will churn—it explains past sentiment, not future behavior.

214
MCQeasy

What is the Whisper model available in Azure OpenAI used for?

A.Generating images from text descriptions
B.Transcribing spoken audio to text with high accuracy across languages
C.Generating very quiet (whispering) text-to-speech audio
D.Summarizing long documents into concise bullet points
AnswerB

Whisper is OpenAI's speech recognition model — it transcribes audio to text across many languages and audio conditions.

Why this answer

The Whisper model in Azure OpenAI is a large-scale speech recognition system designed to transcribe spoken audio into text. It supports multiple languages and is optimized for high accuracy, making it the correct choice for audio-to-text tasks.

Exam trap

The trap here is that the name 'Whisper' misleads candidates into thinking it relates to quiet speech or text-to-speech, when it is actually a speech-to-text model.

How to eliminate wrong answers

Option A is wrong because generating images from text descriptions is the function of DALL-E models, not Whisper. Option C is wrong because Whisper is for speech-to-text transcription, not text-to-speech generation; 'whispering' refers to the model's name, not the volume of output. Option D is wrong because summarizing long documents is a text-based task handled by GPT models, not by Whisper, which focuses on audio processing.

215
MCQhard

What is the 'bias-variance tradeoff' in machine learning?

A.The tradeoff between model accuracy and inference speed
B.The tradeoff between underfitting (high bias) and overfitting (high variance) when choosing model complexity
C.The tradeoff between training data quantity and model quality
D.The difference in fairness metrics between biased and unbiased model versions
AnswerB

Bias-variance: simple models underfit (high bias), complex models overfit (high variance) — finding the optimal complexity is the core ML challenge.

Why this answer

Option B is correct because the bias-variance tradeoff describes the inverse relationship between underfitting (high bias, where the model is too simple to capture patterns) and overfitting (high variance, where the model is too complex and captures noise). In Azure Machine Learning, this tradeoff is managed by tuning hyperparameters like regularization strength or tree depth to balance model complexity and generalization.

Exam trap

The trap here is that candidates often confuse the term 'bias' in bias-variance tradeoff with ethical or fairness bias, leading them to incorrectly select Option D, which is a separate AI-900 concept about model fairness and responsible AI.

How to eliminate wrong answers

Option A is wrong because it confuses the bias-variance tradeoff with a performance optimization concern (accuracy vs. inference speed), which is unrelated to model complexity and generalization. Option C is wrong because it misrepresents the tradeoff as a data quantity issue; while more data can help reduce variance, the core tradeoff is about model complexity, not data volume. Option D is wrong because it conflates the bias-variance tradeoff with fairness metrics; bias in this context refers to statistical bias in model predictions, not ethical or demographic bias.

216
MCQeasy

A retail company wants to automatically group customers into segments based on their purchasing history, age, and location without using any predefined labels. The goal is to identify distinct customer profiles for targeted marketing campaigns. Which type of machine learning approach should they use?

A.Supervised learning
B.Unsupervised learning
C.Reinforcement learning
D.Regression
AnswerB

Correct. Unsupervised learning is used when the goal is to find patterns or groupings in data without pre-existing labels. Clustering algorithms like K-means are common for customer segmentation.

Why this answer

Unsupervised learning is the correct approach because the company wants to group customers into segments without predefined labels. The algorithm will discover natural patterns and clusters in the data (purchasing history, age, location) on its own, which is the core characteristic of unsupervised learning.

Exam trap

The trap here is that candidates often confuse clustering (unsupervised) with classification (supervised), mistakenly thinking that grouping customers always requires predefined labels like 'high value' or 'low value'.

How to eliminate wrong answers

Option A is wrong because supervised learning requires labeled training data with predefined output categories, but the question explicitly states 'without using any predefined labels'. Option C is wrong because reinforcement learning involves an agent learning through trial-and-error interactions with an environment to maximize cumulative reward, which is not relevant to grouping static customer data. Option D is wrong because regression is a supervised learning technique used to predict continuous numerical values, not to group data into discrete segments.

217
MCQmedium

A data scientist trains a regression model to predict house prices. The model achieves very low error on the training data but significantly higher error on a held-out test set. Which problem does this scenario best describe?

A.Underfitting
B.Overfitting
C.High bias
D.High variance
AnswerB

Correct. Overfitting is characterized by excellent performance on training data but poor performance on new data due to memorization of noise.

Why this answer

The scenario describes overfitting, where the model learns the training data too well, including noise and outliers, resulting in very low training error but poor generalization to new data. In Azure Machine Learning, this is often detected by comparing training and validation metrics; a large gap indicates overfitting. The correct answer is B.

Exam trap

The trap here is that candidates confuse 'high variance' (a statistical property) with the specific problem name 'overfitting', but the question explicitly asks for the problem description, not the underlying cause.

How to eliminate wrong answers

Option A is wrong because underfitting occurs when the model fails to capture patterns in the training data, resulting in high error on both training and test sets, not low training error. Option C is wrong because high bias typically leads to underfitting, where the model is too simple and performs poorly on both training and test data. Option D is wrong because high variance is a characteristic of overfitting, but the question asks for the problem described, not the statistical property; overfitting is the direct term for the scenario.

218
MCQmedium

What is 'hyperparameter tuning' in Azure Machine Learning?

A.Adjusting the physical voltage supplied to GPU hardware during training
B.Searching for the optimal algorithm settings (learning rate, batch size) that maximise model performance
C.Training the model to predict hyper-specific rare events in the data
D.Compressing model weights to reduce inference latency
AnswerB

Hyperparameter tuning explores the configuration space to find the settings that produce the best model — HyperDrive automates this in Azure ML.

Why this answer

Hyperparameter tuning in Azure Machine Learning is the process of searching for the optimal set of algorithm settings, such as learning rate, batch size, or number of epochs, to maximize model performance. Azure ML provides automated hyperparameter tuning via HyperDrive, which uses techniques like Bayesian sampling, random sampling, or grid search to efficiently explore the hyperparameter space. This is a core step in training a model to achieve the best accuracy or other metrics, not a hardware or compression task.

Exam trap

The trap here is that candidates confuse hyperparameter tuning with hardware tuning (Option A) or model compression (Option D), because both involve 'tuning' or 'adjusting' something, but hyperparameter tuning is strictly about algorithm configuration, not hardware or post-training optimization.

How to eliminate wrong answers

Option A is wrong because adjusting the physical voltage supplied to GPU hardware is a hardware-level operation (e.g., undervolting or overclocking) unrelated to Azure Machine Learning's software-based hyperparameter tuning, which operates on algorithm parameters. Option C is wrong because training a model to predict hyper-specific rare events describes imbalanced classification or anomaly detection, not the systematic search over hyperparameters to optimize model settings. Option D is wrong because compressing model weights to reduce inference latency refers to model quantization or pruning techniques (e.g., ONNX Runtime optimization), which are post-training steps, not part of hyperparameter tuning during training.

219
MCQhard

What is 'word sense disambiguation' (WSD) and why is it challenging for NLP?

A.Correcting spelling mistakes caused by homophones (words that sound the same)
B.Determining which meaning of an ambiguous word is intended based on surrounding context
C.Translating words from one language to their exact equivalent in another language
D.Measuring how many distinct meanings a word has across a dictionary
AnswerB

WSD resolves lexical ambiguity — understanding 'bank' means financial institution vs. riverbank based on sentence context.

Why this answer

Option B is correct because word sense disambiguation (WSD) is the NLP task of identifying which specific meaning of a polysemous word (a word with multiple meanings) is intended in a given context, using surrounding words, syntax, and semantic cues. This is challenging because many words have multiple, often unrelated meanings (e.g., 'bank' as a financial institution vs. river bank), and the correct sense depends on subtle contextual signals that are difficult for models to capture without deep understanding of the domain or world knowledge.

Exam trap

The trap here is that candidates confuse WSD with related but distinct NLP tasks like homophone correction (A) or machine translation (C), because all involve handling ambiguous words, but WSD specifically targets meaning selection within a single language based on context.

How to eliminate wrong answers

Option A is wrong because correcting spelling mistakes caused by homophones is a task related to homophone disambiguation or spell-checking, not word sense disambiguation; WSD deals with meaning selection, not orthographic correction. Option C is wrong because translating words to exact equivalents in another language is a machine translation task, which may involve WSD as a sub-step but is not the definition of WSD itself; WSD focuses on sense selection within a single language. Option D is wrong because measuring how many distinct meanings a word has across a dictionary is a lexical resource or lexicography task (e.g., counting senses in WordNet), not the process of disambiguating which sense is used in a specific context.

220
MCQmedium

A writer uses Azure OpenAI Service to generate story ideas. The current configuration uses a temperature setting of 0, causing the model to produce identical outputs for the same prompt. The writer wants more creative and diverse outputs. Which parameter should be increased?

A.max_tokens
B.temperature
C.top_p
D.frequency_penalty
AnswerB

Increasing temperature increases randomness, leading to more creative and diverse outputs.

Why this answer

Temperature controls the randomness of the model's output. A temperature of 0 makes the model deterministic, always choosing the most likely next token, which leads to identical outputs for the same prompt. Increasing the temperature (e.g., to 0.7 or higher) introduces more randomness, allowing the model to sample from less likely tokens and produce more creative, diverse story ideas.

Exam trap

The trap here is that candidates may confuse temperature with top_p, thinking both are equally responsible for randomness, but temperature is the direct control for randomness while top_p is an alternative sampling method that can also affect diversity but is not the parameter to increase for more creative outputs.

How to eliminate wrong answers

Option A is wrong because max_tokens controls the maximum length of the generated output, not the diversity or creativity of the content. Option C is wrong because top_p (nucleus sampling) also influences randomness, but the question specifically asks for the parameter to increase for more creative outputs; while top_p can be adjusted, temperature is the primary parameter for controlling randomness and is the most direct answer. Option D is wrong because frequency_penalty reduces the repetition of tokens by penalizing tokens that have already appeared, which can increase diversity but is not the primary parameter for controlling randomness; it is more about penalizing repetition rather than introducing creative randomness.

221
MCQmedium

What is 'prompt flow' in Azure AI Foundry?

A.A tool for managing the queue of prompt requests sent to Azure OpenAI during peak usage
B.A visual development tool for building, testing, and deploying LLM application pipelines
C.An automated system that suggests improvements to prompts based on output quality metrics
D.A monitoring dashboard showing the flow of prompts through an AI application in production
AnswerB

Prompt flow chains LLM calls, tools, and functions visually — enabling RAG pipelines and agents to be built, evaluated, and deployed.

Why this answer

Prompt flow in Azure AI Foundry is a visual development tool that enables developers to design, test, and deploy end-to-end pipelines for large language model (LLM) applications. It provides a graph-based interface to orchestrate LLM calls, data processing, and custom logic, making it easier to build complex generative AI workflows without writing extensive code.

Exam trap

The trap here is that candidates confuse 'prompt flow' with a monitoring or optimization tool, when in fact it is a visual pipeline builder for developing and testing LLM application workflows.

How to eliminate wrong answers

Option A is wrong because prompt flow is not a queue management tool for handling request spikes; Azure OpenAI provides built-in rate limiting and quota management for that purpose. Option C is wrong because prompt flow does not automatically suggest prompt improvements based on output metrics; that functionality is more aligned with features like prompt engineering guidance or evaluation tools within Azure AI Foundry. Option D is wrong because prompt flow is primarily a development and testing tool, not a production monitoring dashboard; monitoring is handled by separate services like Azure Monitor or Application Insights.

222
MCQmedium

What is 'AI democratisation' and how do Azure AI services support it?

A.Making AI governance decisions through a democratic voting process within organisations
B.Making AI capabilities accessible to all organisations and developers through pre-built APIs and low-code tools
C.Ensuring AI companies are publicly listed so retail investors can participate in AI growth
D.Open-sourcing all AI models so any developer can use them without licensing fees
AnswerB

Democratisation removes barriers — pre-built APIs, no-code portals, and pay-per-use pricing enable any organisation to use AI.

Why this answer

AI democratisation refers to making AI capabilities accessible to a broad range of users, not just experts. Azure AI services support this by offering pre-built APIs (e.g., Computer Vision, Language Understanding) and low-code tools like Azure Machine Learning designer and Power Platform AI Builder, enabling developers and organisations with limited AI expertise to integrate AI into their applications without building models from scratch.

Exam trap

The trap here is that candidates may confuse 'democratisation' with open-source licensing or corporate governance, but the exam specifically tests the concept of lowering technical barriers through pre-built, API-accessible AI services.

How to eliminate wrong answers

Option A is wrong because it misinterprets 'democratisation' as a governance voting process, which is unrelated to the technical goal of broadening AI access. Option C is wrong because it confuses financial market participation (public listing) with technical accessibility, which has no bearing on enabling developers to use AI services. Option D is wrong because it incorrectly assumes that open-sourcing all models is the only or primary method; Azure AI services support democratisation through managed APIs and low-code tools, not by requiring full model open-sourcing or waiving licensing fees.

223
MCQmedium

A research team wants to automatically extract the most important phrases from a large collection of scientific articles to identify emerging trends. The articles are all in English. They do not want to train a custom model. Which built-in Azure AI Language feature should they use?

A.Key phrase extraction
B.Sentiment analysis
C.Named entity recognition (NER)
D.Language detection
AnswerA

Correct. Key phrase extraction is designed to automatically extract the most salient phrases from text, which is exactly what the research team needs.

Why this answer

Key phrase extraction is the correct Azure AI Language feature because it automatically identifies and returns the most important phrases in a document, which directly supports the goal of extracting key terms from scientific articles to spot emerging trends. This built-in capability requires no custom model training and works out-of-the-box for English text, making it ideal for the research team's use case.

Exam trap

The trap here is that candidates often confuse named entity recognition (NER) with key phrase extraction, assuming NER can extract any important term, but NER is limited to predefined entity types (e.g., person, location) and cannot capture domain-specific scientific phrases like 'quantum entanglement' or 'CRISPR-Cas9'.

How to eliminate wrong answers

Option B is wrong because sentiment analysis detects positive, negative, or neutral sentiment in text, not important phrases or keywords, so it cannot identify emerging trends. Option C is wrong because named entity recognition (NER) identifies specific entities like people, organizations, or locations, not general key phrases that represent trends in scientific literature. Option D is wrong because language detection determines the language of the text, which is unnecessary since the articles are already known to be in English.

224
MCQmedium

What ethical consideration is MOST important when deploying AI systems for hiring decisions?

A.Ensuring the AI processes applications as quickly as possible
B.Auditing for and mitigating bias that could disadvantage protected demographic groups
C.Making the AI the final decision-maker for all candidates
D.Ensuring the AI is only deployed in large companies
AnswerB

Hiring AI must be audited for bias against protected characteristics — discriminatory AI can violate employment laws and cause real harm.

Why this answer

Option B is correct because the most critical ethical consideration in AI-driven hiring is fairness and non-discrimination. AI systems can inadvertently learn and amplify historical biases present in training data, leading to unfair outcomes for protected groups under laws like Title VII of the Civil Rights Act. Auditing for and mitigating bias ensures the AI model's decisions are equitable and legally compliant, which is a core principle of responsible AI.

Exam trap

The trap here is that candidates may confuse operational efficiency (speed) with ethical responsibility, or assume that automation alone is sufficient, when Microsoft and other vendors emphasize that human-in-the-loop and bias auditing are mandatory for responsible AI deployment.

How to eliminate wrong answers

Option A is wrong because processing speed is a performance metric, not an ethical consideration; prioritizing speed over fairness could lead to biased decisions being made faster. Option C is wrong because making the AI the final decision-maker removes human oversight, which is ethically problematic as AI lacks accountability and cannot interpret nuanced, context-dependent factors like a human recruiter can. Option D is wrong because ethical deployment of AI in hiring is equally important for companies of all sizes; restricting it to large companies does not address the underlying bias or fairness issues.

225
MCQmedium

A company uses a generative AI model to answer customer questions about their products. They observe that the model sometimes produces factually incorrect or fabricated information. To reduce these inaccuracies, they want to provide the model with relevant, up-to-date product documentation as context before generating a response. Which technique is being applied?

A.Prompt Engineering
B.Grounding
C.Fine-tuning
D.Reinforcement Learning from Human Feedback (RLHF)
AnswerB

Grounding connects the model to external data sources (like product documentation) to provide factual context, significantly reducing hallucinations and improving accuracy.

Why this answer

B is correct because grounding is the technique of providing a generative AI model with specific, authoritative source data (such as product documentation) as context before generating a response. This anchors the model's output to verified facts, directly reducing hallucinations and fabricated information by constraining the generation to the provided context.

Exam trap

Microsoft often tests the distinction between grounding (providing external context at inference time) and fine-tuning (updating model weights), so candidates mistakenly choose fine-tuning when the scenario describes adding new information without retraining.

How to eliminate wrong answers

Option A is wrong because prompt engineering involves crafting input instructions to guide model behavior, but it does not inherently supply new, up-to-date factual context; it only refines how the model uses its existing training data. Option C is wrong because fine-tuning retrains the model on a specific dataset to adapt its weights, which is a more resource-intensive process and does not dynamically inject current documentation at inference time. Option D is wrong because Reinforcement Learning from Human Feedback (RLHF) uses human preferences to align model outputs with desired qualities (e.g., helpfulness, safety), but it does not provide real-time factual context to reduce inaccuracies.

Page 2

Page 3 of 14

Page 4