Microsoft Azure AI Fundamentals AI-900 (AI-900) — Questions 751825

1020 questions total · 14pages · All types, answers revealed

Page 10

Page 11 of 14

Page 12
751
MCQeasy

A bank is developing an AI system to automatically approve or reject small business loan applications. The bank wants to ensure that the system does not unfairly discriminate against applicants based on their age, gender, or ethnicity. Which Microsoft responsible AI principle should most directly guide the design and evaluation of this system?

A.Fairness
B.Reliability and safety
C.Privacy and security
D.Inclusiveness
AnswerA

Fairness ensures the AI system does not discriminate based on demographic characteristics, which is the core concern in loan approval scenarios.

Why this answer

The bank's goal is to prevent discrimination based on age, gender, or ethnicity in loan approvals. The Fairness principle directly addresses this by requiring AI systems to treat all groups equitably and to mitigate biases in training data and model predictions. This principle guides the design and evaluation of the system to ensure that outcomes are not skewed by protected attributes.

Exam trap

The trap here is that candidates often confuse 'Inclusiveness' (designing for diverse user needs) with 'Fairness' (preventing algorithmic bias in outcomes), leading them to select D instead of A.

How to eliminate wrong answers

Option B (Reliability and safety) is wrong because it focuses on the system's ability to function correctly and safely under all conditions, not on preventing discriminatory outcomes. Option C (Privacy and security) is wrong because it concerns protecting personal data from unauthorized access or misuse, not ensuring equitable treatment across demographic groups. Option D (Inclusiveness) is wrong because while it promotes designing for all users, it does not specifically address the detection or mitigation of algorithmic bias in decision-making outcomes.

752
MCQhard

What is 'neural architecture search' (NAS) and how does it relate to AutoML?

A.Searching the web for neural network architectures published in research papers
B.Automating the discovery of optimal neural network architectures using computational search
C.Querying a database of pre-built neural networks to find the closest match for a task
D.A legal search process for patenting new AI model architectures
AnswerB

NAS searches the space of possible architectures computationally — finding better network designs than human experts alone.

Why this answer

Neural Architecture Search (NAS) is an automated process that uses computational search methods—such as reinforcement learning, evolutionary algorithms, or gradient-based optimization—to discover optimal neural network architectures for a given task. It is a key component of AutoML because AutoML aims to automate the entire machine learning pipeline, including model selection and hyperparameter tuning, and NAS specifically automates the design of the neural network topology itself.

Exam trap

The trap here is that candidates confuse NAS with simply searching for existing models online or in a database, rather than understanding it as an automated, generative search process that creates new architectures.

How to eliminate wrong answers

Option A is wrong because NAS does not involve searching the web for published research papers; it is a computational search over a defined architecture space, not a web crawl. Option C is wrong because NAS does not query a static database of pre-built networks; it dynamically generates and evaluates candidate architectures during the search process. Option D is wrong because NAS is a technical optimization method, not a legal or patent-related search process.

753
MCQmedium

What is prompt injection and why is it a security concern for AI systems?

A.A technique for making AI prompts shorter and more efficient
B.A security attack where malicious inputs try to override AI system instructions or bypass guardrails
C.The process of adding examples to prompts to improve model performance
D.A method of injecting training data into a model after deployment
AnswerB

Prompt injection embeds malicious instructions in inputs to manipulate AI behavior — a critical security concern for AI applications.

Why this answer

Prompt injection is a security attack where a malicious user crafts input that overrides the AI system's original instructions or bypasses its guardrails, causing the model to behave in unintended ways. This is a critical security concern because it can lead to data exfiltration, unauthorized actions, or the generation of harmful content, undermining the trust and safety of AI deployments.

Exam trap

The trap here is that candidates confuse prompt injection with legitimate prompt engineering techniques like few-shot learning or prompt optimization, failing to recognize it as a distinct security vulnerability that targets the instruction hierarchy of AI systems.

How to eliminate wrong answers

Option A is wrong because prompt injection is not about making prompts shorter or more efficient; it is a security exploit, not a performance optimization technique. Option C is wrong because adding examples to prompts is a legitimate method called few-shot learning, which improves model performance without compromising security. Option D is wrong because injecting training data after deployment describes a data poisoning or model update scenario, not a runtime attack on the prompt itself.

754
MCQmedium

What is a vector database and why is it important for generative AI applications?

A.A database that stores traditional relational tables for AI training data
B.A database optimized for storing and searching high-dimensional embeddings for semantic similarity search
C.A database that stores the weights of trained neural networks
D.A database using vector graphics for visualizing AI models
AnswerB

Vector databases enable fast semantic search by finding embeddings closest to a query vector — powering RAG and recommendation systems.

Why this answer

Option B is correct because a vector database is specifically designed to store and index high-dimensional embeddings—numerical representations of data such as text, images, or audio—and to perform efficient similarity searches using distance metrics like cosine similarity or Euclidean distance. In generative AI applications, vector databases enable retrieval-augmented generation (RAG), where relevant context is retrieved from a knowledge base to ground the model's output, reducing hallucinations and improving accuracy.

Exam trap

The trap here is that candidates confuse a vector database with a traditional database or with model storage, because the term 'vector' is overloaded—it can refer to mathematical vectors (embeddings) in AI, but also to vector graphics or data structures in other contexts.

How to eliminate wrong answers

Option A is wrong because it describes a traditional relational database (RDBMS) that stores structured data in tables with rows and columns, not high-dimensional vectors; relational databases lack the specialized indexing (e.g., HNSW, IVF) needed for efficient similarity search. Option C is wrong because it describes a model weight repository, not a database; neural network weights are stored in serialized formats (e.g., .h5, .pt) and are not queried for semantic similarity. Option D is wrong because it confuses vector databases with vector graphics (e.g., SVG files) used for rendering images; vector databases have no role in visualizing AI models.

755
MCQmedium

A hospital receives patient notes in free text. They need to automatically identify entities like disease names, medications, and dosages from these notes without requiring any custom training. Which Azure AI Language feature is specifically designed for this medical entity extraction task?

A.Text Analytics for Health
B.Custom Named Entity Recognition
C.Key Phrase Extraction
D.Sentiment Analysis
AnswerA

This is correct. It is a prebuilt NLP model specialized for extracting medical entities from unstructured clinical text.

Why this answer

Text Analytics for Health is a pre-built Azure AI Language feature specifically designed to extract medical entities such as diseases, medications, dosages, symptoms, and procedures from unstructured clinical text without requiring any custom training. It uses specialized medical ontologies (e.g., UMLS, SNOMED CT) and natural language processing models trained on healthcare data, making it the correct choice for this task.

Exam trap

The trap here is that candidates may confuse Custom Named Entity Recognition (which requires training) with the pre-built medical entity extraction capability of Text Analytics for Health, especially since both involve 'entity recognition' in their names.

How to eliminate wrong answers

Option B is wrong because Custom Named Entity Recognition requires you to provide labeled training data to train a custom model, whereas the question explicitly states 'without requiring any custom training'. Option C is wrong because Key Phrase Extraction identifies general key terms and phrases from text but does not have the medical domain knowledge to specifically extract entities like disease names, medications, and dosages. Option D is wrong because Sentiment Analysis determines the emotional tone (positive, negative, neutral) of text, not medical entity extraction.

756
MCQeasy

A healthcare company deploys an AI system to assist doctors in diagnosing skin conditions from images. The system is a deep neural network that does not provide explanations for its predictions. The company implements a process where every AI recommendation is logged, and a medical team reviews any adverse outcomes to determine if the system or a human made an error. The company also clearly assigns responsibility for the system's outputs to a specific clinical oversight committee. Which Microsoft responsible AI principle is most directly being implemented by these actions?

A.Accountability
B.Fairness
C.Reliability and safety
D.Transparency
AnswerA

Accountability means that the organization takes ownership of the AI system's outcomes, establishes clear oversight, and has processes to audit and learn from mistakes. This matches the described logging, review, and committee assignment.

Why this answer

The actions described—logging AI recommendations, reviewing adverse outcomes to determine error source, and assigning a clinical oversight committee—directly implement the Accountability principle. Accountability requires that organizations clearly assign responsibility for AI system outputs and have processes to review and address impacts, which is exactly what the company does by creating a human review loop and a designated committee.

Exam trap

Microsoft often tests the distinction between Accountability (assigning responsibility and oversight) and Transparency (explainability), so candidates mistakenly choose Transparency because they conflate logging/review with making the model's reasoning visible, even though the model itself is a black box.

How to eliminate wrong answers

Option B (Fairness) is wrong because the described actions focus on oversight and error attribution, not on mitigating bias or ensuring equitable treatment across demographic groups. Option C (Reliability and safety) is wrong because while logging and review contribute to safety, the core of the question is about assigning responsibility and governance, not about system robustness or fail-safe mechanisms. Option D (Transparency) is wrong because transparency involves making AI systems understandable and explainable to users, whereas the company's deep neural network does not provide explanations, and the actions are about post-hoc accountability rather than interpretability.

757
MCQeasy

What are 'embeddings' in Azure OpenAI and what are they used for?

A.Embedded systems software that runs AI models on IoT devices
B.Numerical vector representations of text that capture semantic meaning for search and similarity tasks
C.HTML embed tags for displaying AI model outputs in web applications
D.Compressed versions of large language models that use fewer parameters
AnswerB

Embeddings encode semantic meaning as vectors — enabling similarity search, clustering, recommendations, and RAG retrieval.

Why this answer

Embeddings in Azure OpenAI are numerical vector representations of text that capture semantic meaning, enabling tasks like semantic search, clustering, and similarity comparisons. They convert words, sentences, or documents into high-dimensional vectors so that similar meanings are represented by vectors close to each other in the vector space. This is correct because embeddings are fundamental to modern AI search and recommendation systems, not related to hardware or web embedding tags.

Exam trap

The trap here is that the term 'embeddings' sounds like 'embedded systems' or 'embed tags,' leading candidates to confuse a core AI concept with unrelated hardware or web development terms.

How to eliminate wrong answers

Option A is wrong because it confuses 'embeddings' with 'embedded systems' — IoT device software is unrelated to Azure OpenAI's vector representations of text. Option C is wrong because it misinterprets 'embeddings' as HTML embed tags, which are used for embedding external content in web pages, not for semantic text representation. Option D is wrong because it describes model compression techniques like quantization or pruning, not embeddings; embeddings are full-precision vector outputs, not compressed versions of models.

758
MCQmedium

A global e-commerce company develops a chatbot to assist customers in multiple languages. The chatbot uses text-based responses. To ensure it serves diverse populations fairly, which Microsoft responsible AI principle should they prioritize?

A.Accountability
B.Inclusiveness
C.Privacy and security
D.Transparency
AnswerB

Inclusiveness ensures the AI system works for all people, including different languages and cultural contexts, which is the primary concern for a multilingual chatbot.

Why this answer

Inclusiveness is the correct principle because the chatbot must serve customers in multiple languages without bias or exclusion. Microsoft's responsible AI principle of inclusiveness ensures that AI systems are designed to empower everyone, including people of diverse backgrounds, languages, and abilities. By prioritizing inclusiveness, the company ensures the chatbot's text-based responses are accessible and fair across all supported languages.

Exam trap

The trap here is that candidates often confuse Transparency (explainability) with fairness, but inclusiveness specifically addresses equitable access and representation across diverse user groups, which is the core requirement for a multilingual chatbot.

How to eliminate wrong answers

Option A is wrong because Accountability focuses on establishing governance and ownership for AI systems, not directly on ensuring fair service across diverse languages. Option C is wrong because Privacy and security concerns data protection and confidentiality, not the fairness or accessibility of multilingual responses. Option D is wrong because Transparency involves explaining how and why AI systems make decisions, but it does not directly address the need to serve diverse populations equitably.

759
MCQeasy

Which Azure AI service detects and identifies human faces in images, including attributes like age estimate and emotion?

A.Azure AI Vision
B.Azure AI Face
C.Azure AI Custom Vision
D.Azure AI Video Indexer
AnswerB

Azure AI Face detects faces in images and provides attributes like age estimate, emotion, and supports face verification.

Why this answer

Azure AI Face is the correct service because it is specifically designed to detect and identify human faces in images, and it can extract attributes such as age estimates, emotions (e.g., happiness, sadness), and facial landmarks. Unlike general-purpose image analysis, Azure AI Face uses specialized face detection models and returns face rectangles along with optional attribute data.

Exam trap

The trap here is that candidates confuse Azure AI Vision's basic face detection (which only returns bounding boxes) with Azure AI Face's specialized attribute extraction, leading them to select Azure AI Vision when the question explicitly asks for age estimate and emotion attributes.

How to eliminate wrong answers

Option A is wrong because Azure AI Vision provides general image analysis (e.g., object detection, OCR, scene description) but does not offer dedicated face attribute extraction like age or emotion; it only returns a basic face bounding box without detailed attributes. Option C is wrong because Azure AI Custom Vision is used to train custom image classification or object detection models on user-provided datasets, not for pre-built face detection with age and emotion attributes. Option D is wrong because Azure AI Video Indexer is focused on extracting insights from video content (e.g., speech transcription, scene segmentation, and face detection in video), but it is not the primary service for still-image face attribute analysis and does not provide the same granular attribute extraction as Azure AI Face.

760
MCQmedium

What is 'semantic kernel' in Microsoft's AI development ecosystem?

A.The core algorithm that powers all Azure AI services internally
B.An open-source SDK for orchestrating LLMs with plugins, memory, and planning for AI applications
C.A database for storing semantic embeddings in Azure
D.A Linux kernel modification for optimized AI workloads
AnswerB

Semantic Kernel lets developers combine LLMs with custom functions, data sources, and planning to build sophisticated AI apps.

Why this answer

Semantic Kernel is an open-source SDK that enables developers to integrate large language models (LLMs) with their applications by providing abstractions for plugins, memory (vector storage), and planning (automatic orchestration of AI tasks). It is not a core algorithm, a database, or a kernel modification, but rather a lightweight orchestrator that works with Azure OpenAI and other LLM providers.

Exam trap

The trap here is that candidates confuse 'Semantic Kernel' with a low-level system component (like a kernel or database) due to the word 'kernel', when it is actually a high-level SDK for orchestrating LLM workflows.

How to eliminate wrong answers

Option A is wrong because Semantic Kernel is not the core algorithm powering Azure AI services; Azure AI services use their own specialized models and APIs (e.g., Azure OpenAI Service, Cognitive Services) and Semantic Kernel is a higher-level orchestration SDK. Option C is wrong because Semantic Kernel is not a database; Azure offers Azure Cognitive Search and Azure Cosmos DB for storing semantic embeddings, but Semantic Kernel itself provides memory abstractions that can use those databases. Option D is wrong because Semantic Kernel is not a Linux kernel modification; it is a cross-platform SDK (C#, Python, Java) that runs on standard operating systems without requiring kernel-level changes.

761
MCQmedium

A software company uses Azure OpenAI to generate code snippets. They want to evaluate how confident the model is in each token it generates. Which Azure OpenAI feature provides a numerical measure of confidence for each generated token?

A.Logprobs
B.Temperature
C.Top-p
D.Presence penalty
AnswerA

Logprobs provides log probabilities for each token, allowing calculation of confidence levels.

Why this answer

Logprobs (log probabilities) is the Azure OpenAI feature that provides a numerical measure of confidence for each generated token. It outputs the log probability of each token being selected by the model, allowing developers to assess how certain the model is about its predictions at the token level.

Exam trap

The trap here is that candidates confuse hyperparameters that control generation behavior (temperature, top-p, presence penalty) with output features that provide model confidence metrics, leading them to pick a parameter that influences randomness rather than the one that reports token-level probabilities.

How to eliminate wrong answers

Option B (Temperature) is wrong because it controls the randomness of token sampling, not the confidence measure of individual tokens. Option C (Top-p) is wrong because it sets a cumulative probability threshold for nucleus sampling, limiting the pool of candidate tokens but not providing per-token confidence scores. Option D (Presence penalty) is wrong because it penalizes tokens that have already appeared in the text to encourage topic diversity, and has no relation to outputting confidence values.

762
MCQeasy

Which AI principle ensures that AI systems are developed and used in ways that are transparent and understandable to affected stakeholders?

A.Reliability
B.Fairness
C.Transparency
D.Privacy
AnswerC

Transparency means AI systems and their decisions should be understandable, and people should know when AI is affecting them.

Why this answer

Transparency is the correct answer because it directly addresses the requirement that AI systems be open, explainable, and understandable to stakeholders. This principle ensures that decisions made by AI models can be audited, interpreted, and communicated clearly, which is essential for building trust and enabling informed consent.

Exam trap

The trap here is that candidates often confuse Transparency with Fairness or Privacy, mistakenly thinking that making a system fair or private automatically makes it understandable, whereas transparency specifically requires the ability to inspect and explain the model's logic.

How to eliminate wrong answers

Option A is wrong because Reliability focuses on the system's ability to perform consistently and correctly under defined conditions, not on making its operations understandable. Option B is wrong because Fairness is concerned with avoiding bias and ensuring equitable treatment across different groups, not with the clarity or openness of the AI's decision-making process. Option D is wrong because Privacy deals with protecting personal data and controlling access, not with the transparency or explainability of the AI system's behavior.

763
MCQeasy

A manufacturing company uses an AI system to predict when machines will need maintenance. The system must work correctly under varying factory floor conditions such as temperature changes and noise levels. Which Microsoft responsible AI principle is most directly focused on ensuring the system performs reliably in these different conditions?

A.Fairness
B.Reliability & Safety
C.Privacy & Security
D.Inclusiveness
AnswerB

This principle directly ensures that AI systems perform consistently and safely across a range of conditions, which matches the requirement for reliable operation in different factory environments.

Why this answer

B is correct because the Reliability & Safety principle ensures that AI systems operate consistently and predictably under varying conditions, such as temperature changes and noise levels on a factory floor. This principle mandates rigorous testing, monitoring, and fail-safe mechanisms to maintain performance and prevent harm when environmental factors deviate from expected ranges.

Exam trap

The trap here is that candidates often confuse 'Reliability & Safety' with 'Privacy & Security' because both involve 'protection,' but the former protects against system failure under environmental stress, while the latter protects data from breaches.

How to eliminate wrong answers

Option A is wrong because Fairness focuses on eliminating bias and ensuring equitable outcomes across demographic groups, not on system robustness to environmental variations. Option C is wrong because Privacy & Security deals with protecting data confidentiality and preventing unauthorized access, not with operational reliability under changing physical conditions. Option D is wrong because Inclusiveness aims to design AI that benefits all users regardless of ability or background, not to guarantee consistent performance under environmental stressors.

764
MCQmedium

A law firm wants to automatically categorize incoming legal documents into custom categories such as 'Motion', 'Contract', 'Discovery', and 'Memorandum'. The firm has a set of manually labeled documents that can be used to train the system. Which Azure AI Language feature should they use?

A.Prebuilt Text Analytics for sentiment
B.Custom text classification
C.Conversational Language Understanding
D.Key phrase extraction
AnswerB

Custom text classification enables training a model on your labeled data to classify documents into custom categories.

Why this answer

The law firm needs to categorize documents into custom categories using their own labeled data. Custom text classification in Azure AI Language is specifically designed for this purpose, allowing you to train a model on your own labeled documents to classify text into user-defined categories. Prebuilt Text Analytics for sentiment only detects sentiment (positive/negative/neutral), not custom categories.

Exam trap

The trap here is that candidates often confuse pre-built features (like sentiment analysis or key phrase extraction) with custom trainable features, assuming any NLP task can be solved with a pre-built model, but Azure requires custom text classification for user-defined categories.

How to eliminate wrong answers

Option A is wrong because Prebuilt Text Analytics for sentiment is a pre-trained model that only analyzes sentiment (positive, negative, neutral, mixed) and cannot be trained on custom categories like 'Motion' or 'Contract'. Option C is wrong because Conversational Language Understanding (CLU) is designed for intent and entity extraction from conversational utterances (e.g., chatbots), not for categorizing entire documents into custom categories. Option D is wrong because key phrase extraction is an unsupervised pre-built feature that extracts important phrases from text, but it does not classify documents into custom categories and cannot be trained on labeled data.

765
MCQeasy

What does Azure AI Language's 'language detection' feature return?

A.Only the language name in English
B.The language name, ISO code, and a confidence score
C.A translated version of the text in English
D.Only a binary yes/no for whether the text is English
AnswerB

Azure AI Language's language detection returns the detected language name, ISO 639-1 code, and a 0-1 confidence score.

Why this answer

Azure AI Language's language detection feature returns the language name, its ISO 639-1 code, and a confidence score between 0 and 1. This allows applications to identify the language of input text with a quantifiable level of certainty, enabling downstream processing like routing to the correct language-specific model.

Exam trap

The trap here is that candidates confuse language detection with translation or assume it only returns a language name, overlooking the ISO code and confidence score that are always part of the response.

How to eliminate wrong answers

Option A is wrong because language detection returns both the language name and its ISO code, not just the name in English. Option C is wrong because language detection identifies the language, it does not translate the text; translation is a separate Azure AI service (Translator). Option D is wrong because language detection supports over 100 languages and returns a confidence score, not a binary yes/no for English only.

766
MCQmedium

A bike-sharing company wants to predict the number of rentals per hour. Their model's predictions are usually close but occasionally have large errors due to unexpected events like sudden rain. They want a metric that heavily penalizes these large errors to ensure the model is not overly confident. Which evaluation metric should they primarily use?

A.Mean Absolute Error (MAE)
B.Mean Squared Error (MSE)
C.Classification Accuracy
D.R-squared
AnswerB

MSE squares each error, so large errors contribute disproportionately to the total. This aligns with the requirement to penalize large errors heavily.

Why this answer

Mean Squared Error (MSE) is the correct choice because it squares the residuals, which heavily penalizes large errors. Since the bike-sharing company wants to discourage occasional large prediction errors (e.g., due to sudden rain), MSE’s quadratic penalty ensures that models with even a few large outliers receive a much worse score, forcing the model to avoid overconfidence.

Exam trap

The trap here is that candidates often choose MAE because it is simpler and more interpretable, but they miss the explicit requirement to 'heavily penalize large errors,' which only MSE (or RMSE) accomplishes through squaring.

How to eliminate wrong answers

Option A is wrong because Mean Absolute Error (MAE) uses absolute differences and does not disproportionately penalize large errors; it treats all errors linearly, so occasional large errors would not be heavily weighted. Option C is wrong because Classification Accuracy is a metric for classification tasks (e.g., predicting categories), not for regression tasks like predicting a continuous number of rentals per hour. Option D is wrong because R-squared measures the proportion of variance explained by the model and does not inherently penalize large errors more than small ones; it can be misleadingly high even with large outliers if the overall variance is large.

767
MCQeasy

A developer is using Azure OpenAI Service to generate Python code snippets. They notice that the generated code often contains repetitive function definitions and loops. Which parameter should be increased to reduce this repetition?

A.Temperature
B.Max tokens
C.Frequency penalty
D.Top P
AnswerC

A higher frequency penalty discourages the model from repeating the same tokens, leading to less repetition.

Why this answer

The frequency penalty parameter reduces repetition by penalizing tokens that have already appeared in the generated text, making the model less likely to reuse the same functions or loops. Increasing this value directly discourages the model from generating repetitive patterns, which is exactly the issue described.

Exam trap

Microsoft often tests the distinction between parameters that control randomness (temperature, Top P) versus those that control repetition (frequency penalty, presence penalty), leading candidates to mistakenly choose temperature when the issue is repetitive content.

How to eliminate wrong answers

Option A is wrong because temperature controls randomness in token selection, not repetition; increasing it would make output more random, not less repetitive. Option B is wrong because max tokens limits the total length of the output, not the likelihood of repeating content. Option D is wrong because Top P (nucleus sampling) controls the cumulative probability threshold for token selection, affecting diversity but not specifically penalizing repetition.

768
MCQmedium

What is the purpose of Azure AI Video Indexer's transcript feature?

A.To translate video subtitles into multiple languages
B.To automatically convert speech in videos to searchable text with timestamps
C.To generate written scripts for producing new videos
D.To extract text visible in video frames (on-screen text)
AnswerB

Video Indexer transcribes spoken content to text with timestamps, enabling text-based search across video content.

Why this answer

Azure AI Video Indexer's transcript feature uses automatic speech recognition (ASR) to convert spoken audio in videos into a text transcript, which is then indexed with precise timestamps for each word or phrase. This enables users to search, navigate, and analyze video content by keyword or phrase, making the video's audio content fully searchable and accessible.

Exam trap

The trap here is that candidates often confuse the transcript feature (speech-to-text) with the OCR feature (on-screen text extraction) or with translation, because all three involve 'text' but serve fundamentally different purposes in Video Indexer's pipeline.

How to eliminate wrong answers

Option A is wrong because translation of subtitles is a separate feature in Video Indexer (the 'Translate' capability), not the core purpose of the transcript feature, which focuses on generating the original-language text from speech. Option C is wrong because the transcript feature extracts existing speech from a video; it does not generate new written scripts for producing videos, which would be a scriptwriting or content creation tool. Option D is wrong because extracting text visible in video frames (on-screen text) is handled by the OCR (optical character recognition) feature in Video Indexer, not the transcript feature, which deals exclusively with audio-derived speech.

769
MCQmedium

A marketing team needs to analyze thousands of product reviews to identify the most frequently mentioned topics, such as 'battery life', 'customer support', and 'price'. They want an automated method to extract these main concepts from each review. Which Azure AI Language feature should they use?

A.Language detection
B.Sentiment analysis
C.Key phrase extraction
D.Entity recognition
AnswerC

Key phrase extraction identifies and returns a list of important phrases from the text that represent the main concepts or topics discussed, such as 'battery life' or 'customer support'.

Why this answer

Key phrase extraction is the correct choice because it automatically identifies the main concepts, such as 'battery life', 'customer support', and 'price', from unstructured text like product reviews. This feature is specifically designed to extract the most salient topics or points from a document, making it ideal for analyzing thousands of reviews to find frequently mentioned themes.

Exam trap

The trap here is confusing key phrase extraction with entity recognition, as both extract information from text, but entity recognition is limited to predefined categories (e.g., person, location) while key phrase extraction captures any salient concept.

How to eliminate wrong answers

Option A is wrong because language detection identifies the language of the text (e.g., English, Spanish), not the topics or concepts within it. Option B is wrong because sentiment analysis determines the overall positive, negative, or neutral tone of the text, not the specific topics mentioned. Option D is wrong because entity recognition identifies named entities like people, places, or organizations (e.g., 'Microsoft', 'Seattle'), not general concepts or topics like 'battery life'.

770
MCQeasy

What is 'Azure ML's experiment tracking' and why do data scientists use it?

A.Monitoring the progress of Azure ML service new feature deployments
B.Recording hyperparameters, metrics, and configurations for each training run for comparison and reproduction
C.Tracking which Azure ML resources are used by which team members for billing allocation
D.A compliance audit log of all model predictions made in production
AnswerB

Experiment tracking is the data scientist's lab notebook — capturing all run details to enable systematic model improvement.

Why this answer

Azure ML's experiment tracking is a feature that automatically records hyperparameters, metrics, and configuration details for each training run. Data scientists use it to compare multiple runs, identify the best-performing model, and reproduce results by revisiting the exact settings and data used. This is essential for iterative experimentation and ensuring reproducibility in machine learning workflows.

Exam trap

The trap here is that candidates confuse experiment tracking (recording training run metadata) with monitoring or auditing of deployed models, leading them to choose options about deployment progress or production compliance logs.

How to eliminate wrong answers

Option A is wrong because it describes monitoring deployment progress of new features, which is a DevOps or MLOps concern, not the purpose of experiment tracking for training runs. Option C is wrong because it refers to resource usage tracking for billing allocation, which is handled by Azure Cost Management and resource tagging, not by experiment tracking. Option D is wrong because it describes a compliance audit log for model predictions in production, which is related to model monitoring and governance, not the recording of training run metadata.

771
MCQeasy

What is machine learning?

A.A process of manually programming computers with rules for every possible scenario
B.A subset of AI where algorithms learn from data to make predictions without explicit programming
C.A method of creating robots that can perform physical tasks
D.A type of computer network for processing large datasets
AnswerB

Machine learning algorithms identify patterns in training data and apply them to make predictions on new, unseen data.

Why this answer

Machine learning is a subset of artificial intelligence (AI) that enables systems to automatically learn and improve from experience without being explicitly programmed for every scenario. Instead of following static rules, ML algorithms use training data to identify patterns and make predictions or decisions. This is the core definition tested in AI-900, distinguishing ML from traditional rule-based programming.

Exam trap

The trap here is that candidates confuse machine learning with traditional programming (Option A) because both involve computers making decisions, but ML eliminates the need for explicit rule-writing by learning from data.

How to eliminate wrong answers

Option A is wrong because it describes traditional rule-based programming, not machine learning; ML does not require manual coding of rules for every possible scenario but instead learns patterns from data. Option C is wrong because machine learning is not limited to robotics or physical tasks; it is a data-driven approach used in software applications like recommendation systems and fraud detection. Option D is wrong because while machine learning may use computer networks for processing large datasets, this describes distributed computing or big data infrastructure, not the fundamental concept of learning from data to make predictions.

772
MCQhard

A healthcare organization needs to extract specific data elements (such as patient names, medication dosages, and dates) from unstructured doctors' notes. Which Azure Cognitive Service is best suited for this task?

A.Language Understanding (LUIS)
B.Text Analytics
C.Translator Text
D.Speech
AnswerB

Text Analytics includes Named Entity Recognition, which can extract predefined categories of entities (e.g., Person, Date, Quantity) from unstructured text, making it ideal for this task.

Why this answer

Text Analytics (now part of Azure AI Language) is the correct service because it provides pre-built entity extraction capabilities specifically designed to identify and extract named entities like people (patient names), quantities (medication dosages), and dates from unstructured text. This aligns directly with the requirement to extract specific data elements from doctors' notes without needing custom model training.

Exam trap

The trap here is that candidates confuse LUIS's conversational entity extraction with Text Analytics' document-level NER, assuming any 'language understanding' task requires LUIS, when in fact structured data extraction from static text is the domain of Text Analytics.

How to eliminate wrong answers

Option A is wrong because Language Understanding (LUIS) is designed for intent recognition and entity extraction from conversational utterances in chatbots, not for batch extraction of structured data elements from unstructured clinical notes. Option C is wrong because Translator Text is a machine translation service that converts text between languages, not an entity extraction or information retrieval tool. Option D is wrong because Speech is a speech-to-text and text-to-speech service that transcribes audio into text, but it does not perform any entity extraction or data element identification from the transcribed text.

773
MCQmedium

What is 'responsible AI impact assessment' for generative AI applications?

A.Measuring the compute cost impact of adding generative AI to an application
B.Identifying potential harms, affected groups, and mitigation measures before deploying AI applications
C.Measuring user satisfaction scores after a generative AI feature launches
D.Calculating the environmental impact of AI model training in terms of CO2 emissions
AnswerB

Impact assessment evaluates risks before deployment — mapping harms, affected stakeholders, and appropriate safeguards.

Why this answer

Responsible AI impact assessment is a structured process to identify potential harms (e.g., bias, fairness, privacy violations), affected groups (e.g., demographic segments), and mitigation measures before deploying generative AI applications. It aligns with Microsoft's Responsible AI principles and is a key governance step in Azure AI services to ensure ethical deployment.

Exam trap

The trap here is that candidates confuse 'impact assessment' with any measurable outcome (cost, satisfaction, or environment) instead of recognizing it as a specific governance process focused on identifying and mitigating potential harms before deployment.

How to eliminate wrong answers

Option A is wrong because it focuses on compute cost impact, which is a financial metric, not an assessment of ethical harms or societal impact. Option C is wrong because measuring user satisfaction scores is a post-launch performance metric, not a pre-deployment assessment of potential harms. Option D is wrong because calculating CO2 emissions relates to environmental sustainability, not the identification of harms, affected groups, or mitigations required for responsible AI governance.

774
MCQeasy

What is the primary difference between supervised and unsupervised machine learning?

A.Supervised learning requires human oversight; unsupervised runs automatically
B.Supervised uses labeled training data; unsupervised finds patterns in unlabeled data
C.Supervised is for images; unsupervised is for text
D.Supervised is older and less accurate than unsupervised
AnswerB

Supervised: learn from labeled examples; Unsupervised: discover patterns in data without predefined labels.

Why this answer

The primary difference between supervised and unsupervised machine learning is that supervised learning uses labeled training data to learn a mapping from inputs to outputs, while unsupervised learning finds hidden patterns or structures in unlabeled data without predefined labels. This distinction is fundamental to choosing the right approach for a given problem, such as classification (supervised) versus clustering (unsupervised).

Exam trap

The trap here is that candidates often confuse the need for human oversight with the use of labeled data, mistakenly thinking supervised learning requires constant human intervention, whereas the core distinction is the presence or absence of labels in the training data.

How to eliminate wrong answers

Option A is wrong because both supervised and unsupervised learning can run automatically once configured; human oversight is not a defining difference, as both may require human intervention for data preparation and model evaluation. Option C is wrong because neither technique is inherently restricted to a specific data type—supervised learning can be applied to images, text, or numeric data, and the same is true for unsupervised learning. Option D is wrong because supervised learning is not inherently older or less accurate; both have been developed over decades, and accuracy depends on the problem, data quality, and algorithm choice, not on the paradigm itself.

775
MCQeasy

A research team wants to automatically analyze thousands of online news articles to identify the main topics discussed in each article (e.g., 'technology', 'politics', 'sports'). They need a prebuilt Azure AI Language feature that returns a list of key concepts or topics without any custom training. Which feature should they use?

A.Key phrase extraction
B.Named Entity Recognition (NER)
C.Sentiment analysis
D.Language detection
AnswerA

Key phrase extraction identifies the most important phrases (e.g., 'machine learning', 'climate change') that summarize the main topics of the article.

Why this answer

Key phrase extraction is the correct Azure AI Language feature because it automatically identifies and returns a list of the main topics, concepts, or themes discussed in a document without requiring any custom training or labeled data. This prebuilt capability is designed specifically for extracting high-level topics from unstructured text, making it ideal for analyzing thousands of news articles to determine subjects like 'technology', 'politics', or 'sports'.

Exam trap

The trap here is that candidates often confuse Named Entity Recognition (NER) with key phrase extraction, mistakenly thinking NER identifies topics when it actually extracts specific named entities like 'Microsoft' or 'New York', not general themes.

How to eliminate wrong answers

Option B is wrong because Named Entity Recognition (NER) identifies and categorizes specific entities such as people, organizations, locations, and dates, not the overarching topics or themes of an article. Option C is wrong because Sentiment analysis evaluates the emotional tone (positive, negative, neutral) of text, not the subject matter or topics discussed. Option D is wrong because Language detection identifies the language in which the text is written (e.g., English, Spanish), not the topics or concepts within the content.

776
MCQeasy

A hospital uses an AI system to prioritize patient appointments based on urgency. The system is trained on historical data. The team wants to ensure that the system does not discriminate against patients based on age or disability. Which Microsoft responsible AI principle should most directly guide the design of this system?

A.Inclusiveness
B.Privacy and security
C.Reliability and safety
D.Transparency
AnswerA

Inclusiveness requires AI systems to avoid bias and ensure fair treatment for all groups, including age and disability.

Why this answer

The Inclusiveness principle directly guides the design of this system because it requires AI systems to empower everyone and avoid discrimination, including on the basis of age or disability. By proactively identifying and mitigating bias in the training data and model predictions, the hospital can ensure that appointment prioritization does not unfairly disadvantage any patient group. This principle is specifically about fairness and accessibility, making it the most relevant for preventing discriminatory outcomes.

Exam trap

Microsoft often tests the distinction between Inclusiveness (fairness and bias mitigation) and Transparency (explainability), so candidates mistakenly choose Transparency because they think 'understanding the decision' prevents discrimination, but the core design principle to actively avoid bias is Inclusiveness.

How to eliminate wrong answers

Option B is wrong because Privacy and security focuses on protecting patient data through encryption, access controls, and compliance with regulations like HIPAA, not on preventing discriminatory prioritization based on age or disability. Option C is wrong because Reliability and safety ensures the system operates consistently and safely under expected conditions, but does not directly address fairness or bias against protected attributes. Option D is wrong because Transparency is about making the AI system's decisions understandable and auditable, which supports accountability but does not by itself prevent discrimination; it is a means to an end, not the guiding principle for fairness.

777
MCQmedium

A fashion retailer wants to automatically generate new, unique images of clothing items based on textual descriptions (e.g., 'a blue silk dress with floral patterns'). Which Azure service would be most appropriate to accomplish this?

A.A) Azure Machine Learning
B.B) Azure OpenAI Service
C.C) Azure Cognitive Search
D.D) Custom Vision
AnswerB

Correct. Azure OpenAI Service includes models like DALL-E that can generate images from textual descriptions.

Why this answer

Azure OpenAI Service provides access to powerful generative AI models like GPT-4 and DALL-E, which can create new images from textual descriptions. This service is specifically designed for generative tasks, such as producing unique clothing images based on prompts like 'a blue silk dress with floral patterns', making it the most appropriate choice.

Exam trap

The trap here is that candidates may confuse Azure OpenAI Service (for generative AI) with Azure Machine Learning (for traditional ML) or Custom Vision (for classification), not realizing that only Azure OpenAI Service provides pre-built generative capabilities for text-to-image creation.

How to eliminate wrong answers

Option A is wrong because Azure Machine Learning is a platform for building, training, and deploying custom machine learning models, but it does not natively include pre-built generative image models; you would need to integrate a separate generative model, which is not the most direct solution. Option C is wrong because Azure Cognitive Search is a search and indexing service for retrieving existing documents or data, not for generating new images from text. Option D is wrong because Custom Vision is designed for image classification and object detection using labeled training data, not for generating novel images from textual descriptions.

778
MCQmedium

What is 'multi-label text classification' vs 'single-label' in Azure AI Language?

A.Single-label classifies one word; multi-label classifies entire sentences
B.Single-label assigns exactly one category; multi-label allows multiple categories per document
C.Multi-label is more accurate because it considers more information per document
D.Single-label requires more training data than multi-label classification
AnswerB

Multi-label handles documents that genuinely belong to multiple categories — topic tags, product attributes, or combined themes.

Why this answer

In Azure AI Language, single-label text classification assigns exactly one category to each document, while multi-label classification allows a document to be assigned multiple categories simultaneously. This distinction is fundamental to how the classification models are trained and how predictions are structured, with multi-label using a separate binary classifier per label rather than a single softmax output.

Exam trap

The trap here is that candidates confuse 'multi-label' with 'multi-class' (which still assigns only one label per document) or assume multi-label is always better, ignoring that it requires different model architecture and training data.

How to eliminate wrong answers

Option A is wrong because single-label classification operates on entire documents or text blocks, not individual words, and multi-label also works on documents or sentences, not just sentences. Option C is wrong because multi-label classification is not inherently more accurate; accuracy depends on the data and task, and multi-label may introduce complexity like label imbalance that can reduce performance. Option D is wrong because single-label classification does not inherently require more training data than multi-label; in fact, multi-label often needs more data to cover all label combinations effectively.

779
MCQmedium

A manufacturer trains a model to detect defective parts on an assembly line. Only 2% of parts are defective. The model predicts 'non-defective' for all parts and achieves 98% accuracy. Which metric best reveals the model's inability to identify defective parts?

A.Accuracy
B.Precision
C.Recall
D.F1 Score
AnswerC

Recall (sensitivity) is the proportion of actual defective parts that the model correctly identifies. A recall of 0% clearly shows the model fails to detect any defects.

Why this answer

Recall (sensitivity) measures the proportion of actual defective parts correctly identified by the model. With 98% accuracy but zero true positives (since the model labels everything as non-defective), recall is 0%, which directly exposes the model's failure to detect any defective parts despite high accuracy.

Exam trap

The trap here is that candidates see 98% accuracy and assume the model is performing well, overlooking that accuracy is inflated by class imbalance and does not measure the model's ability to detect the rare defective class.

How to eliminate wrong answers

Option A is wrong because accuracy is misleading here—it only reflects the overall correct predictions (98% non-defective) and hides the model's complete failure on the minority class (defective parts). Option B is wrong because precision measures the proportion of predicted defective parts that are actually defective; since the model never predicts defective, precision is undefined (division by zero) and does not reveal the inability to identify defects. Option D is wrong because the F1 score is the harmonic mean of precision and recall; with recall at 0%, F1 is also 0%, but recall alone more directly and intuitively shows the model's inability to detect defects.

780
MCQhard

What is 'curriculum learning' and how does it relate to training stability?

A.Designing a course curriculum using AI to personalise learning for students
B.Training models on progressively harder examples to improve stability and convergence
C.A structured plan for the sequence of ML courses a data scientist should take
D.Using a pre-defined curriculum of hyperparameter values to systematically explore the search space
AnswerB

Curriculum learning starts easy and increases difficulty — improving training stability and final performance vs. random example ordering.

Why this answer

Curriculum learning is a training strategy where a model is first exposed to simpler examples and then gradually introduced to more complex ones. This approach improves training stability by preventing the model from being overwhelmed by difficult patterns early on, which can cause large gradient updates and divergence. By structuring the learning process, the model converges more reliably and often achieves better generalization.

Exam trap

The trap here is that candidates confuse 'curriculum learning' with educational curricula or hyperparameter tuning, because the term 'curriculum' sounds like a course plan or a search schedule rather than a data ordering strategy.

How to eliminate wrong answers

Option A is wrong because it describes adaptive educational technology for human learners, not a machine learning training technique. Option C is wrong because it refers to a sequence of courses for a data scientist's professional development, not a model training methodology. Option D is wrong because it describes a hyperparameter search strategy (like grid or random search), not a curriculum-based ordering of training examples.

781
MCQeasy

What is facial recognition and what are the key responsible AI considerations for its use?

A.Facial recognition has no ethical concerns and should be deployed universally
B.Facial recognition requires ethical consideration regarding accuracy disparities, privacy, and potential for misuse
C.Facial recognition is only used for unlocking smartphones
D.Facial recognition is 100% accurate across all demographics
AnswerB

Responsible facial recognition deployment requires addressing demographic accuracy disparities, obtaining consent, protecting privacy, and preventing misuse.

Why this answer

Facial recognition is a computer vision technology that identifies or verifies individuals by analyzing facial features from images or video. The key responsible AI considerations include addressing accuracy disparities across demographic groups (e.g., higher false positive rates for certain ethnicities), ensuring privacy through data minimization and consent, and preventing misuse such as mass surveillance without oversight. Option B correctly captures these ethical imperatives, which are critical for trustworthy deployment.

Exam trap

The trap here is that candidates may assume facial recognition is either harmless or perfectly accurate, ignoring the documented bias and privacy risks that responsible AI frameworks like Microsoft's Responsible AI Standard explicitly address.

How to eliminate wrong answers

Option A is wrong because facial recognition has significant ethical concerns, including bias, privacy violations, and potential for misuse, making universal deployment irresponsible. Option C is wrong because facial recognition is used in many applications beyond smartphone unlocking, such as security systems, identity verification, and law enforcement. Option D is wrong because facial recognition is not 100% accurate across all demographics; studies show accuracy disparities, particularly for women and people with darker skin tones, due to training data imbalances.

782
MCQeasy

What is 'receipt analysis' in Azure AI Document Intelligence and what data does it extract?

A.Analysing customer satisfaction scores from post-purchase surveys
B.Extracting merchant name, items, prices, tax, and totals from retail receipt images
C.Verifying that a receipt matches the purchase record in a financial database
D.Detecting fraudulent receipts by comparing them to a known-good receipt database
AnswerB

Receipt analysis extracts structured financial fields from receipts — enabling automated expense management and bookkeeping.

Why this answer

Receipt analysis in Azure AI Document Intelligence is a prebuilt model designed to extract key-value pairs and line items from sales receipts. Option B correctly identifies that it extracts merchant name, items, prices, tax, and totals from retail receipt images, which is the primary function of this model.

Exam trap

The trap here is confusing the extraction of receipt data with downstream tasks like validation, fraud detection, or sentiment analysis, leading candidates to select options that describe post-processing steps rather than the core capability of the receipt analysis model.

How to eliminate wrong answers

Option A is wrong because analyzing customer satisfaction scores from post-purchase surveys is a text analytics or sentiment analysis task, not a document intelligence feature for structured data extraction from receipts. Option C is wrong because verifying a receipt against a financial database is a reconciliation or validation process, not a core extraction capability of the receipt analysis model. Option D is wrong because detecting fraudulent receipts by comparison to a known-good database is a fraud detection scenario, not a feature of the receipt analysis model, which focuses on extracting data rather than verifying authenticity.

783
MCQmedium

What is 'AI fairness' in Microsoft's Responsible AI principles?

A.Ensuring all Azure AI services are priced fairly for organisations of all sizes
B.Ensuring AI systems treat all demographic groups equitably without producing biased outcomes
C.Distributing AI compute resources equally across all team members in a project
D.Ensuring competition in the AI market by preventing monopolistic AI practices
AnswerB

Fairness requires equal performance and treatment across groups — Azure ML's Fairlearn integration detects and helps mitigate demographic disparities.

Why this answer

AI fairness in Microsoft's Responsible AI principles is about ensuring that AI systems treat all demographic groups equitably and do not produce biased outcomes. This involves designing and testing models to detect and mitigate unfairness, such as disparities in accuracy or impact across groups defined by race, gender, age, or other protected attributes.

Exam trap

The trap here is that candidates often associate 'fairness' with general ethical or economic concepts like pricing or competition, rather than recognizing it as a specific technical principle about demographic equity and bias mitigation in AI model outcomes.

How to eliminate wrong answers

Option A is wrong because it confuses 'fairness' with 'pricing fairness'—AI fairness is about equitable treatment of people, not cost structures for Azure services. Option C is wrong because it misinterprets fairness as resource allocation fairness within a team, which is unrelated to demographic equity in AI outcomes. Option D is wrong because it conflates fairness with market competition and antitrust concerns, which are not part of Microsoft's Responsible AI principles for model behavior.

784
MCQmedium

A customer service department wants to automatically extract the names of products mentioned in customer emails and the sentiment expressed about each product. For example, from the sentence 'The battery life of the X100 is excellent, but the screen is too dark,' they need to identify 'X100' and associate 'positive' sentiment with 'battery life' and 'negative' sentiment with 'screen'. Which Azure AI Language feature should they use?

A.Custom text classification
B.Key phrase extraction
C.Entity linking
D.Sentiment analysis with opinion mining
AnswerD

Correct. Opinion mining (a component of sentiment analysis) extracts aspects (such as product names or features) and the expressed sentiment toward each aspect.

Why this answer

Option D is correct because sentiment analysis with opinion mining is specifically designed to extract both the sentiment (positive, negative, neutral) and the associated target (e.g., 'battery life', 'screen') from text. This feature goes beyond simple sentiment scoring by identifying the opinion target and the sentiment expressed toward it, which directly matches the requirement to extract product names and their associated sentiments from customer emails.

Exam trap

The trap here is that candidates often confuse key phrase extraction (Option B) with sentiment analysis with opinion mining, because key phrases can include product names, but key phrase extraction does not provide any sentiment association, which is the core requirement of the question.

How to eliminate wrong answers

Option A is wrong because custom text classification assigns predefined labels to entire documents or sentences, not extracting specific entities or associating sentiment with them. Option B is wrong because key phrase extraction only identifies important words or phrases (e.g., 'battery life', 'X100', 'screen') but does not provide any sentiment analysis or link sentiment to specific targets. Option C is wrong because entity linking identifies and disambiguates named entities by linking them to a knowledge base (e.g., Wikipedia), but it does not perform sentiment analysis or associate sentiment with the linked entities.

785
MCQmedium

What is 'data augmentation' and how does it help with limited training data?

A.Collecting more labelled data from external sources to supplement training
B.Creating synthetic training variants (flips, rotations, synonyms) to expand small datasets
C.Increasing the number of compute nodes to process large training datasets faster
D.Adding more evaluation metrics to get a richer view of model performance
AnswerB

Augmentation multiplies effective training data by transforming existing examples — teaching invariances and reducing overfitting.

Why this answer

Data augmentation is a technique that artificially expands a training dataset by applying transformations (e.g., image flips, rotations, cropping, or text synonym replacement) to existing samples. This helps models generalize better when real-world data is scarce, reducing overfitting without requiring new labeled data collection.

Exam trap

The trap here is that candidates confuse 'data augmentation' with simply 'collecting more data' (Option A), failing to recognize that augmentation creates synthetic variants from existing data rather than acquiring new external samples.

How to eliminate wrong answers

Option A is wrong because collecting more labeled data from external sources is a separate process (data acquisition), not data augmentation—augmentation creates synthetic variants from existing data, not new external samples. Option C is wrong because increasing compute nodes relates to distributed training or scaling infrastructure, not to generating synthetic training variants to address limited data. Option D is wrong because adding evaluation metrics (e.g., precision, recall) improves model assessment but does not expand the training dataset or solve data scarcity.

786
MCQeasy

A retail company wants to build a solution that automatically reads the printed text on product labels to update inventory records. The labels contain alphanumeric characters and are in various fonts and sizes. Which Azure Cognitive Service should they use?

A.Azure Face Service
B.Azure Form Recognizer
C.Azure Computer Vision - OCR
D.Azure Video Indexer
AnswerC

Computer Vision's OCR (Optical Character Recognition) reads printed and handwritten text from images, making it ideal for product label text extraction.

Why this answer

Azure Computer Vision's OCR (Optical Character Recognition) API is specifically designed to extract printed text from images, handling various fonts, sizes, and alphanumeric characters. This makes it the ideal choice for reading product labels to update inventory records, as it can process the diverse label formats commonly found in retail environments.

Exam trap

The trap here is that candidates may confuse Azure Form Recognizer (which includes OCR capabilities) with the simpler Computer Vision OCR service, but Form Recognizer is designed for structured document extraction, not general-purpose text reading from labels.

How to eliminate wrong answers

Option A is wrong because Azure Face Service is designed for detecting and analyzing human faces (e.g., facial attributes, emotions, identification), not for reading printed text on labels. Option B is wrong because Azure Form Recognizer is optimized for extracting structured data from forms and documents (e.g., invoices, receipts) using prebuilt or custom models, but it is overkill and less efficient for simple printed text extraction from labels; it relies on OCR as a subcomponent but adds unnecessary complexity for this use case. Option D is wrong because Azure Video Indexer is used for analyzing video content (e.g., speech transcription, scene detection, face recognition), not for extracting printed text from static images of labels.

787
MCQeasy

What is 'credit scoring' as an AI workload and what responsible AI concerns does it raise?

A.A system for automatically assigning credit scores to software bugs in a development backlog
B.ML for predicting loan repayment risk — with fairness, bias, and explainability concerns
C.Monitoring whether a customer has used all their credit within an approved limit
D.An internal system for scoring the quality of AI projects within an organisation
AnswerB

Credit scoring has life-altering consequences — historical bias, demographic proxies, and GDPR explanation rights require careful responsible AI practices.

Why this answer

Credit scoring in AI refers to machine learning models that predict the likelihood of a borrower repaying a loan. This raises responsible AI concerns around fairness (e.g., models may discriminate against protected groups), bias (e.g., training data may reflect historical inequalities), and explainability (e.g., complex models like gradient-boosted trees are often black boxes, making it hard to justify decisions to regulators or customers).

Exam trap

The trap here is that candidates confuse 'credit scoring' with simple monitoring or non-AI scoring systems, but the exam specifically tests the understanding that it is a predictive ML workload with ethical implications around fairness, bias, and explainability.

How to eliminate wrong answers

Option A is wrong because it describes a system for prioritizing software bugs, not a financial AI workload; 'credit scoring' in AI-900 context refers to credit risk assessment, not bug triage. Option C is wrong because it describes a simple monitoring check of credit utilization, not a predictive ML model that scores repayment risk; this is a rule-based threshold check, not an AI workload. Option D is wrong because it describes an internal project evaluation metric, not a customer-facing AI system that assigns credit scores; the term 'scoring' here is generic and unrelated to financial credit.

788
MCQeasy

What is 'clustering' in unsupervised machine learning?

A.Grouping similar data points together without predefined labels based on natural patterns
B.Classifying data points into predefined categories using labelled training examples
C.Grouping Azure compute resources together for distributed training jobs
D.Organising model training runs into logical groups for experiment tracking
AnswerA

Clustering is unsupervised — it discovers natural groupings in data (customer segments, document topics) without requiring labels.

Why this answer

Clustering is an unsupervised learning technique that automatically groups data points based on inherent similarities or patterns in the data, without requiring any pre-existing labels. The algorithm identifies natural structures, such as distance or density relationships, to form clusters. In Azure Machine Learning, clustering is commonly implemented using algorithms like K-Means or DBSCAN for tasks such as customer segmentation or anomaly detection.

Exam trap

The trap here is that candidates confuse clustering (unsupervised) with classification (supervised), especially when the question mentions 'grouping' data, leading them to choose Option B which describes classification with predefined labels.

How to eliminate wrong answers

Option B is wrong because it describes supervised learning (classification), where models are trained on labelled examples to assign predefined categories, not unsupervised clustering. Option C is wrong because it refers to Azure compute cluster provisioning for distributed training, which is an infrastructure concept unrelated to machine learning algorithms. Option D is wrong because it describes organizing experiment tracking runs in Azure Machine Learning, which is a DevOps/MLOps practice, not a machine learning technique.

789
MCQhard

A company deploys an AI system to screen job resumes. The system consistently rejects candidates from a certain university, but the company cannot determine which features led to the decision or how the model arrived at that outcome. Which Microsoft responsible AI principle is most directly violated?

A.Fairness
B.Accountability
C.Transparency
D.Privacy and security
AnswerC

Transparency means AI systems should be understandable, and operators should be able to explain decisions. The company's inability to explain the rejection directly violates this principle.

Why this answer

The scenario describes a system that makes decisions without providing any insight into how or why those decisions were made. Transparency, as a Microsoft responsible AI principle, requires that AI systems be understandable and that their decisions can be explained. Since the company cannot determine which features led to the rejection or how the model arrived at the outcome, the lack of explainability directly violates the Transparency principle.

Exam trap

Microsoft often tests the distinction between Fairness (bias) and Transparency (explainability), so the trap here is that candidates see a potentially biased outcome and immediately choose Fairness, missing that the core violation is the lack of explainability, not the bias itself.

How to eliminate wrong answers

Option A is wrong because Fairness focuses on ensuring AI systems do not discriminate against groups or individuals, but the core issue here is the inability to understand the model's decision-making process, not the presence of bias itself. Option B is wrong because Accountability refers to the assignment of responsibility for AI system outcomes and governance, not the technical explainability of a model's decisions. Option D is wrong because Privacy and security concern the protection of personal data and system integrity, whereas the problem is about the lack of interpretability and explanation of model outputs.

790
MCQhard

What is the 'alignment problem' in AI safety and why is it significant?

A.Making AI models work consistently across different hardware platforms and cloud providers
B.The challenge of building AI systems that reliably pursue what humans actually intend rather than gaming the specification
C.Aligning AI model training data with current regulations and compliance requirements
D.Ensuring all team members agree on the objectives before beginning an AI project
AnswerB

Alignment means AI goals match human values — misaligned AI might satisfy a reward specification in harmful or unexpected ways.

Why this answer

Option B is correct because the alignment problem refers to the fundamental challenge in AI safety where a system may optimize for a literal or mis-specified objective, leading to unintended or harmful behavior. For example, a reinforcement learning agent tasked with 'maximizing score' might find a way to exploit a bug in the simulation rather than learning the intended skill. This is significant because misaligned AI can cause real-world harm, especially as systems become more capable and autonomous.

Exam trap

The trap here is that candidates confuse the term 'alignment' with general coordination or compliance tasks, such as aligning teams or regulations, rather than recognizing it as a specific AI safety concept about goal specification and reward design.

How to eliminate wrong answers

Option A is wrong because it describes hardware/cloud interoperability, which is a deployment or portability concern, not an AI safety or alignment issue. Option C is wrong because aligning training data with regulations is a compliance and data governance task, not the technical challenge of ensuring an AI's goals match human intent. Option D is wrong because it refers to team alignment on project objectives, which is a project management concept, not a technical AI safety problem.

791
MCQeasy

What is 'image classification' in Azure AI Custom Vision?

A.Organising image files into folders on Azure Blob Storage by date
B.Assigning a category label to an entire image based on its dominant visual content
C.Converting colour images to black and white for accessibility purposes
D.Sorting images by their file size and resolution metadata
AnswerB

Image classification labels the whole image (cat/dog/car) — simpler than object detection, which locates specific instances within the image.

Why this answer

Image classification in Azure AI Custom Vision involves training a model to assign a single category label (e.g., 'dog', 'cat') to an entire image based on its dominant visual content. This is a supervised learning task where the model learns from labeled images to predict the most likely class for new, unseen images. Option B correctly describes this core functionality.

Exam trap

The trap here is that candidates may confuse image classification with object detection (which identifies multiple objects and their locations) or with simple image processing tasks like filtering or sorting, leading them to pick options that describe non-AI operations.

How to eliminate wrong answers

Option A is wrong because organizing image files into folders on Azure Blob Storage by date is a storage management task, not a computer vision AI workload; it does not involve any model training or inference. Option C is wrong because converting color images to black and white is a simple image processing operation (e.g., using OpenCV or Azure Computer Vision's grayscale conversion), not a classification task that assigns semantic labels. Option D is wrong because sorting images by file size and resolution metadata is a file system or data preprocessing step, not a machine learning classification process that identifies visual content.

792
MCQeasy

What is 'regression' in machine learning and when is it used?

A.A model that predicts which category an item belongs to from a set of options
B.Predicting a continuous numerical value such as price, temperature, or demand
C.Going back to a previous model version when the current version performs poorly
D.A technique for reducing the dimensionality of training data before model fitting
AnswerB

Regression outputs a number — house price prediction, energy demand forecasting, and revenue estimation are classic regression tasks.

Why this answer

Regression is a supervised machine learning technique used to predict a continuous numerical value, such as price, temperature, or demand, based on input features. It models the relationship between independent variables and a dependent variable that has a real-valued output, making option B correct.

Exam trap

The trap here is that candidates confuse regression with classification, as both are supervised learning, but regression outputs a continuous number while classification outputs a discrete label.

How to eliminate wrong answers

Option A is wrong because it describes classification, not regression; classification predicts discrete categorical labels (e.g., 'cat' or 'dog'), not continuous values. Option C is wrong because it describes a version control or model rollback practice, not a machine learning algorithm or task. Option D is wrong because it describes dimensionality reduction (e.g., PCA), which is a preprocessing technique, not a predictive modeling task like regression.

793
MCQeasy

A global e-commerce company is designing an AI-powered chatbot to assist customers. They want to ensure the chatbot can be used by people with diverse abilities, including those who use screen readers or speak different languages. Which Microsoft responsible AI principle is most directly related to this requirement?

A.Fairness
B.Inclusiveness
C.Reliability and Safety
D.Transparency
AnswerB

Inclusiveness requires AI systems to empower everyone, including people with disabilities and diverse linguistic backgrounds, by providing accessible interfaces and multilingual capabilities.

Why this answer

The requirement to support screen readers and multiple languages directly aligns with Microsoft's responsible AI principle of inclusiveness, which aims to design AI systems that empower everyone, including people with disabilities and diverse linguistic backgrounds. In the context of a chatbot, inclusiveness ensures features like screen reader compatibility (via ARIA labels and semantic HTML) and multilingual natural language processing (NLP) models that can handle different languages and dialects, making the technology accessible to a broader audience.

Exam trap

The trap here is that candidates often confuse inclusiveness with fairness, thinking that ensuring equal access for all users is the same as preventing bias, but inclusiveness specifically targets accessibility and language support, while fairness targets equitable outcomes across protected attributes.

How to eliminate wrong answers

Option A (Fairness) is wrong because fairness focuses on preventing bias and ensuring equitable treatment across demographic groups, not on enabling accessibility for diverse abilities or multilingual support. Option C (Reliability and Safety) is wrong because it addresses system consistency, error handling, and safety under normal and adversarial conditions, not the specific design for screen readers or language diversity. Option D (Transparency) is wrong because it concerns explainability and openness about how AI systems make decisions, not the direct enablement of accessibility features like screen reader compatibility or multilingual interfaces.

794
MCQmedium

A manufacturing team wants to predict product defects based on sensor readings from the production line. They have 10,000 historical samples, each labeled as 'defective' or 'non-defective'. Which type of machine learning should they use in Azure Machine Learning?

A.Supervised learning
B.Unsupervised learning
C.Reinforcement learning
D.Semi-supervised learning
AnswerA

Supervised learning uses labeled data to train a model for prediction. The labeled outcomes (defective/non-defective) make this the correct approach.

Why this answer

This is a supervised learning problem because the dataset contains labeled historical samples (defective or non-defective), and the goal is to predict a categorical outcome based on sensor readings. In Azure Machine Learning, supervised learning algorithms such as two-class logistic regression or boosted decision trees are used to train a model that maps input features to known labels.

Exam trap

The trap here is that candidates may confuse 'predicting defects' with unsupervised anomaly detection, but the presence of explicit labels (defective/non-defective) makes this a supervised classification task, not an unsupervised one.

How to eliminate wrong answers

Option B (Unsupervised learning) is wrong because it is used when data has no labels and the goal is to find hidden patterns or groupings, not to predict a known outcome like defect status. Option C (Reinforcement learning) is wrong because it involves an agent learning through trial-and-error interactions with an environment to maximize a reward signal, which does not apply to static historical data with predefined labels. Option D (Semi-supervised learning) is wrong because it is designed for scenarios where only a small portion of data is labeled and the rest is unlabeled, but here all 10,000 samples are labeled.

795
MCQmedium

An autonomous drone needs to navigate a forest by identifying individual trees, including their exact shape and boundaries, to avoid colliding with branches. The drone also needs to distinguish between trees and other objects like rocks. Which Azure Computer Vision capability is best suited for this requirement?

A.Image classification
B.Object detection
C.Semantic segmentation
D.Optical character recognition (OCR)
AnswerC

Semantic segmentation labels each pixel with a class (e.g., 'tree', 'rock', 'sky'). This provides the precise shape and boundaries needed for collision avoidance.

Why this answer

Semantic segmentation is the correct choice because it classifies every pixel in an image, assigning each pixel to a specific class (e.g., 'tree', 'rock', 'branch'). This pixel-level precision allows the drone to identify the exact shape and boundaries of individual trees, which is essential for collision avoidance in a forest environment.

Exam trap

The trap here is that candidates confuse object detection (bounding boxes) with semantic segmentation (pixel-level masks), assuming bounding boxes provide enough detail for precise boundary avoidance, but the question explicitly requires 'exact shape and boundaries,' which only pixel-level segmentation can deliver.

How to eliminate wrong answers

Option A is wrong because image classification assigns a single label to the entire image, not individual objects or their boundaries, so it cannot provide the per-pixel detail needed to navigate around branches. Option B is wrong because object detection draws bounding boxes around objects, which gives approximate locations but not the precise shape or boundary of each tree, making it insufficient for avoiding fine branches. Option D is wrong because optical character recognition (OCR) extracts text from images, which is irrelevant to identifying trees, rocks, or other natural objects.

796
MCQmedium

A museum wants to create an interactive exhibit where visitors can type a description of a fictional creature, such as 'a fire-breathing dragon with emerald scales and golden wings,' and the system generates an image of that creature in real time. The museum must ensure that the generated images are safe and appropriate for all ages, including children. Which Azure service should they use, and which safety feature should they configure?

A.Azure OpenAI Service with the DALL-E 2 model and content filtering enabled
B.Azure Cognitive Services Computer Vision with custom vision image generation
C.Azure OpenAI Service with the GPT-4 model and content filtering enabled
D.Azure OpenAI Service with the DALL-E 2 model without content filtering
AnswerA

Correct. DALL-E 2 generates images from text, and Azure OpenAI Service provides built-in content filtering to block harmful outputs, ensuring age-appropriate images.

Why this answer

Option A is correct because Azure OpenAI Service with DALL-E 2 is specifically designed for generating images from text descriptions, and enabling content filtering ensures the output is safe for all ages, including children. This combination directly meets the museum's requirement for real-time, safe image generation from textual prompts.

Exam trap

The trap here is confusing Azure OpenAI Service's DALL-E 2 (image generation) with GPT-4 (text generation), or assuming that any AI service with content filtering can generate images, when only DALL-E 2 is designed for that task.

How to eliminate wrong answers

Option B is wrong because Azure Cognitive Services Computer Vision does not include image generation capabilities; it is used for analyzing and extracting information from images, not creating new ones. Option C is wrong because GPT-4 is a language model for text generation, not image generation; it cannot produce images from descriptions. Option D is wrong because disabling content filtering would allow potentially unsafe or inappropriate images, violating the museum's requirement for age-appropriate content.

797
MCQmedium

What is 'quality control' computer vision and how is it used in manufacturing?

A.Monitoring the quality of AI model outputs to ensure they meet accuracy standards
B.Detecting manufacturing defects at production line speeds with consistent accuracy
C.Verifying that factory video surveillance cameras meet quality standards
D.Controlling the quality of training images used to build computer vision models
AnswerB

QC vision identifies cracks, scratches, and assembly errors at speed — replacing inconsistent manual inspection with consistent AI.

Why this answer

Quality control in computer vision refers to using AI models to inspect products on a manufacturing line, detecting defects such as scratches, dents, or misalignments at high speed. Azure Custom Vision or Azure Computer Vision can be trained on labeled images of good and defective items to perform real-time inference, ensuring consistent accuracy far beyond human visual inspection. This directly addresses the need for automated, scalable defect detection in production environments.

Exam trap

The trap here is that candidates confuse 'quality control' of the AI model itself (Option A) with using computer vision to perform quality control on physical products, which is the core manufacturing use case.

How to eliminate wrong answers

Option A is wrong because it describes monitoring AI model output accuracy, which is a model governance or MLOps task, not the application of computer vision for physical product inspection in manufacturing. Option C is wrong because it confuses the quality of surveillance camera hardware with the computer vision workload used to inspect manufactured items; the question is about using vision AI for defect detection, not verifying camera specs. Option D is wrong because it refers to curating training data quality, which is a prerequisite for building models, not the operational use of computer vision for quality control on the factory floor.

798
MCQeasy

What is 'data preprocessing' and why is it important for machine learning?

A.Encrypting sensitive data before storing it in Azure for security compliance
B.Transforming raw data (handling nulls, scaling, encoding) to make it suitable for ML training
C.The process of splitting raw data into training and test sets
D.Compressing data files to reduce the cost of Azure Blob Storage
AnswerB

Preprocessing is foundational — cleaning, scaling, and encoding data significantly impacts model accuracy and training stability.

Why this answer

Data preprocessing is the transformation of raw data into a clean, structured format that machine learning algorithms can effectively learn from. Option B correctly identifies this as handling nulls, scaling numerical features, and encoding categorical variables, which are essential because ML models require numeric input and are sensitive to missing values and feature magnitudes.

Exam trap

The trap here is that candidates confuse data preprocessing with data splitting or security measures, but the core purpose is to clean and transform raw data so that ML models can interpret it correctly.

How to eliminate wrong answers

Option A is wrong because encrypting sensitive data is a security measure, not a preprocessing step that prepares data for ML training. Option C is wrong because splitting data into training and test sets is a separate step that occurs after preprocessing, not the preprocessing itself. Option D is wrong because compressing files reduces storage costs but does not transform data into a format suitable for ML algorithms.

799
MCQmedium

A news agency needs to automatically process thousands of news articles and extract specific structured information: the publication date, the author's name, and the headline. None of these fields are standard predefined entities in Azure AI Language. Which Azure AI Language feature should they use to train a model to extract these custom entities?

A.Key phrase extraction
B.Sentiment analysis
C.Custom Named Entity Recognition (Custom NER)
D.Language detection
AnswerC

Correct. Custom NER allows you to train a model to recognize specific entities like dates, author names, and headlines that are not in the prebuilt entity set.

Why this answer

Custom Named Entity Recognition (Custom NER) is the correct choice because it allows you to train a model to extract domain-specific entities—such as publication date, author name, and headline—that are not part of Azure's predefined entity catalog. By providing labeled examples, Custom NER learns to identify these custom fields from unstructured text, enabling automated extraction from thousands of news articles.

Exam trap

The trap here is that candidates confuse custom NER with key phrase extraction, assuming key phrases can extract structured fields like dates or names, but key phrases are untyped and not designed for entity-level extraction.

How to eliminate wrong answers

Option A is wrong because key phrase extraction returns general multi-word phrases (e.g., 'breaking news today') rather than specific structured fields like dates or names. Option B is wrong because sentiment analysis evaluates the emotional tone of text (positive/negative/neutral) and cannot extract entities or structured information. Option D is wrong because language detection only identifies the language of the text (e.g., English, Spanish) and provides no entity extraction capabilities.

800
MCQhard

A global e-commerce company receives customer support emails in over 30 languages. They want to automatically route each email to the correct department (Billing, Technical Support, or Returns) and also extract the order number and the product name mentioned in the email. The solution must be able to identify the language of each email before further processing. Which combination of Azure AI Language features should they use?

A.Language Detection, Custom Text Classification, and Entity Recognition
B.Language Detection, Key Phrase Extraction, and Sentiment Analysis
C.Entity Recognition, Sentiment Analysis, and Key Phrase Extraction
D.Custom Text Classification, Key Phrase Extraction, and Sentiment Analysis
AnswerA

Correct. Language Detection identifies the email's language. Custom Text Classification enables routing to departments. Entity Recognition extracts the order number and product name as entities.

Why this answer

Option A is correct because the scenario requires three distinct capabilities: Language Detection to identify the email's language (a prerequisite for further processing), Custom Text Classification to route emails to the correct department (Billing, Technical Support, or Returns), and Entity Recognition to extract the order number and product name. Azure AI Language provides these as separate, combinable features that directly map to the stated requirements.

Exam trap

The trap here is that candidates often confuse Key Phrase Extraction with Entity Recognition, or assume Sentiment Analysis is needed for routing, when in fact the scenario's explicit requirements (language identification, department routing, and specific entity extraction) map directly to Language Detection, Custom Text Classification, and Entity Recognition.

How to eliminate wrong answers

Option B is wrong because Key Phrase Extraction and Sentiment Analysis do not route emails to departments or extract structured entities like order numbers and product names; they only identify key terms and sentiment polarity. Option C is wrong because it lacks Language Detection, which is explicitly required to identify the language before further processing, and Sentiment Analysis is irrelevant to routing or entity extraction. Option D is wrong because it omits Language Detection and Entity Recognition, and Sentiment Analysis is unnecessary for the routing and extraction tasks.

801
MCQmedium

What is 'Azure OpenAI deployment' and how does it differ from a 'model'?

A.A model is the purchased licence; a deployment is the technical installation
B.A model is the underlying AI; a deployment is a named, quota-allocated instance your application calls
C.A deployment is always faster than a model because it uses optimised serving infrastructure
D.Models are available globally; deployments are restricted to specific Azure regions
AnswerB

Deployments are model instances with names and quotas — you create multiple deployments (dev, prod) of the same underlying model.

Why this answer

In Azure OpenAI, a 'model' refers to the underlying AI algorithm (e.g., GPT-4, GPT-3.5-Turbo) that defines the capabilities and behavior of the generative AI. A 'deployment' is a specific, named instance of that model provisioned within an Azure OpenAI resource, with its own endpoint, quota (tokens per minute), and configuration (e.g., content filter settings). This separation allows you to manage capacity and access for different applications or use cases independently, even when using the same base model.

Exam trap

The trap here is that candidates confuse the conceptual 'model' (the AI algorithm) with the operational 'deployment' (the provisioned instance), often assuming they are interchangeable or that a deployment is merely a 'copy' of the model, missing the critical quota and endpoint management aspects.

How to eliminate wrong answers

Option A is wrong because a model is not a purchased license; it is a specific AI algorithm (e.g., GPT-4) that you access via Azure, and a deployment is not a technical installation but a provisioned instance with its own endpoint and quota. Option C is wrong because a deployment does not inherently make the model faster; performance depends on the model's architecture, the deployment's region, and the allocated quota (tokens per minute), not on an optimized serving infrastructure specific to deployments. Option D is wrong because both models and deployments are available in specific Azure regions where the Azure OpenAI service is provisioned; models are not globally available without regional deployment, and deployments are also region-bound to the Azure OpenAI resource.

802
MCQeasy

An autonomous delivery robot uses AI to navigate sidewalks. The robot occasionally fails to detect pedestrians in low-light conditions, leading to near-collisions. The company wants to ensure the system is robust and safe before wider deployment. Which Microsoft responsible AI principle is most directly relevant?

A.Fairness
B.Privacy and security
C.Reliability and safety
D.Transparency
AnswerC

Reliability and safety ensure AI systems perform as intended and do not cause harm. The robot's detection failures in low light present a safety risk that must be mitigated.

Why this answer

The robot's failure to detect pedestrians in low-light conditions directly impacts the system's ability to operate reliably and safely. The Reliability and safety principle in Microsoft's responsible AI framework requires that AI systems perform consistently under expected conditions and fail gracefully when they cannot. Ensuring the robot can handle edge cases like low light is a core safety requirement before deployment.

Exam trap

The trap here is that candidates may confuse Transparency (which involves disclosing limitations) with the actual requirement to engineer the system to be safe and reliable, but the question asks for the principle most directly relevant to preventing near-collisions, which is Reliability and safety.

How to eliminate wrong answers

Option A is wrong because Fairness addresses bias and equitable treatment across demographic groups, not operational robustness in low-light environments. Option B is wrong because Privacy and security concerns data protection and unauthorized access, not the physical safety of navigation failures. Option D is wrong because Transparency involves explainability and disclosure of AI capabilities and limitations, but it does not directly mandate the system's operational reliability or safety performance.

803
MCQmedium

A news agency wants to automatically extract the names of people and organizations mentioned in thousands of articles. They plan to use a prebuilt Azure AI Language feature that requires no custom training. Which feature should they use?

A.Sentiment analysis
B.Named entity recognition
C.Key phrase extraction
D.Language detection
AnswerB

Named entity recognition identifies and classifies named entities like person names, organizations, and locations from text.

Why this answer

Named entity recognition (NER) is the correct feature because it is specifically designed to identify and categorize entities such as people, organizations, locations, and dates from unstructured text. The news agency's requirement to extract names of people and organizations from articles matches NER's prebuilt capability, which requires no custom training and is available in Azure AI Language.

Exam trap

The trap here is that candidates often confuse key phrase extraction with named entity recognition, but key phrase extraction returns multi-word phrases without entity type classification, while NER specifically labels entities like 'Person' or 'Organization'.

How to eliminate wrong answers

Option A is wrong because sentiment analysis evaluates the emotional tone (positive, negative, neutral) of text, not the extraction of specific entity names. Option C is wrong because key phrase extraction identifies important terms and phrases (e.g., 'climate change summit') but does not classify them into categories like person or organization. Option D is wrong because language detection identifies the language of the text (e.g., English, Spanish), not the entities mentioned within it.

804
MCQmedium

What is recall (sensitivity) in the context of binary classification model evaluation?

A.The proportion of positive predictions that are actually correct
B.The proportion of actual positives that the model correctly identified
C.The overall proportion of predictions that match the actual labels
D.How quickly the model can be updated with new training data
AnswerB

Recall = TP / (TP + FN). It measures how well the model finds all actual positives — minimizing missed detections.

Why this answer

Recall (sensitivity) measures the proportion of actual positive cases that the model correctly identifies. In binary classification, it answers: 'Of all the truly positive instances, how many did the model catch?' This is critical in scenarios where missing a positive (false negative) is costly, such as disease screening or fraud detection.

Exam trap

The trap here is that candidates confuse recall with precision (Option A) because both involve true positives, but recall focuses on actual positives while precision focuses on predicted positives.

How to eliminate wrong answers

Option A is wrong because it describes precision, not recall — precision is the proportion of positive predictions that are actually correct (true positives divided by all predicted positives). Option C is wrong because it describes accuracy, which is the overall proportion of correct predictions (both true positives and true negatives) out of all predictions. Option D is wrong because it describes model retraining or update speed, which is unrelated to evaluation metrics like recall; recall is a static performance measure, not a measure of training agility.

805
MCQmedium

A news agency receives thousands of articles daily from wire services. They want to automatically identify and tag mentions of people, organizations, and locations within each article to improve search and categorization. The agency has no labeled training data. Which built-in Azure AI Language feature should they use?

A.Sentiment Analysis
B.Key Phrase Extraction
C.Named Entity Recognition (NER)
D.Language Detection
AnswerC

NER automatically identifies and categorizes named entities such as persons, organizations, and locations. It requires no training data and is directly suitable for this task.

Why this answer

Named Entity Recognition (NER) is the correct choice because it is specifically designed to identify and categorize mentions of people, organizations, locations, and other entity types in unstructured text. Since the agency has no labeled training data, NER's pre-trained model can be used out-of-the-box without any custom training, making it ideal for automatically tagging articles with these entity types to improve search and categorization.

Exam trap

The trap here is that candidates often confuse Key Phrase Extraction with Named Entity Recognition, thinking that extracting important phrases is the same as identifying specific entity types, but Key Phrase Extraction does not categorize phrases into predefined classes like person or organization.

How to eliminate wrong answers

Option A is wrong because Sentiment Analysis evaluates the emotional tone (positive, negative, neutral) of text, not the identification of specific entities like people or organizations. Option B is wrong because Key Phrase Extraction identifies important phrases and keywords but does not classify them into predefined categories such as person, organization, or location. Option D is wrong because Language Detection determines the language of the text (e.g., English, Spanish) and has no capability to recognize named entities within the content.

806
MCQmedium

What is a foundation model in the context of AI?

A.A small specialized model optimized for a single specific task
B.A large general-purpose AI model trained at scale that can be adapted to many downstream tasks
C.The underlying hardware infrastructure for running AI workloads
D.A model that has been certified as ethically sound by regulators
AnswerB

Foundation models (GPT-4, DALL-E, etc.) are trained broadly and serve as the basis for many applications through fine-tuning or prompting.

Why this answer

A foundation model is a large-scale, general-purpose AI model trained on vast and diverse datasets, enabling it to be adapted or fine-tuned for a wide range of downstream tasks such as text generation, translation, and image recognition. This definition aligns with option B, as foundation models like GPT-4 or BERT are designed for broad applicability rather than a single task.

Exam trap

The trap here is that candidates often confuse foundation models with narrow AI models or hardware, mistakenly thinking a foundation model is either a small specialized tool or the underlying compute infrastructure, rather than recognizing its defining characteristic of being a large, adaptable, general-purpose model.

How to eliminate wrong answers

Option A is wrong because a foundation model is not small or specialized for a single task; it is large and general-purpose, unlike narrow models like a spam classifier. Option C is wrong because a foundation model refers to the AI model itself, not the hardware infrastructure (e.g., GPUs or TPUs) used to run AI workloads. Option D is wrong because ethical certification is not a defining characteristic of foundation models; they are defined by their scale and adaptability, not regulatory approval.

807
MCQmedium

What is 'model explainability' using SHAP values in Azure Machine Learning?

A.Explaining the model's predictions using a simplified version of the model that is easier to interpret
B.Calculating each feature's contribution to a specific prediction to explain why the model made that decision
C.Displaying the model's source code so users can verify what computations are performed
D.Testing the model on a separate evaluation dataset to report overall accuracy
AnswerB

SHAP values quantify each feature's impact on each prediction — providing mathematically rigorous local and global explanations.

Why this answer

SHAP (SHapley Additive exPlanations) values are a game-theoretic approach that assigns each feature an importance value for a particular prediction. Option B is correct because SHAP values quantify the contribution of each input feature to the model's output, providing a local explanation for why a specific decision was made. This is distinct from global feature importance or model simplification.

Exam trap

The trap here is that candidates confuse model explainability with model evaluation or model simplification, leading them to select Option A (surrogate model) or Option D (accuracy reporting) instead of recognizing that SHAP specifically provides per-feature contribution explanations for individual predictions.

How to eliminate wrong answers

Option A is wrong because it describes a surrogate model or model distillation (e.g., using a decision tree to approximate a black-box model), not SHAP values, which directly compute per-feature contributions without creating a separate simplified model. Option C is wrong because model explainability does not involve displaying source code; Azure Machine Learning does not expose model source code for verification, and SHAP is a post-hoc explanation method, not a code review tool. Option D is wrong because it describes model evaluation (testing accuracy on a holdout set), which is a performance metric, not an explainability technique that explains individual predictions.

808
MCQhard

A company develops an AI system to screen job resumes and rank candidates for interviews. The system is trained on historical hiring data that favored candidates from certain well-known universities. The company decides to deploy the system without any adjustments to address this bias. Which Microsoft responsible AI principle is most directly being violated?

A.Fairness
B.Inclusiveness
C.Reliability and Safety
D.Privacy and Security
AnswerA

Fairness ensures AI systems do not discriminate against individuals or groups. Deploying a biased system without correction violates this principle.

Why this answer

The correct answer is A (Fairness) because the AI system was trained on biased historical data that favored candidates from certain universities, and deploying it without adjustments directly violates the fairness principle. Fairness in responsible AI requires that systems treat all people equitably and do not discriminate based on protected attributes such as educational background. By not mitigating the bias, the system perpetuates historical inequities in the hiring process.

Exam trap

The trap here is that candidates may confuse 'fairness' with 'inclusiveness' because both relate to ethical AI, but inclusiveness is about designing for diverse user groups (e.g., accessibility), while fairness specifically addresses bias and discrimination in model outcomes.

How to eliminate wrong answers

Option B (Inclusiveness) is wrong because inclusiveness focuses on designing AI systems that empower and engage everyone, including people with disabilities, but the core issue here is not about accessibility or representation—it is about biased decision-making. Option C (Reliability and Safety) is wrong because reliability and safety concern the system's ability to operate consistently and without causing harm through errors or failures, not the fairness of its outcomes. Option D (Privacy and Security) is wrong because the scenario does not involve unauthorized data access, data breaches, or improper handling of personal information; the violation is about biased treatment of candidates, not data protection.

809
MCQmedium

What is computer vision and give three real-world application examples.

A.Computer vision is limited to text recognition only; it cannot detect objects
B.Computer vision enables machines to understand visual data — used in autonomous driving, medical imaging, and retail automation
C.Computer vision only works on satellite imagery for geographic analysis
D.Computer vision requires extremely expensive hardware unavailable in the cloud
AnswerB

Computer vision interprets images/video for tasks like autonomous vehicle navigation, medical diagnosis support, and automated checkout.

Why this answer

Option B is correct because computer vision is a field of AI that enables machines to interpret and make decisions based on visual data from the world, such as images and videos. The three examples given—autonomous driving (e.g., detecting pedestrians and lane markings), medical imaging (e.g., analyzing X-rays for tumors), and retail automation (e.g., self-checkout systems recognizing products)—are classic real-world applications that demonstrate the breadth of computer vision beyond simple text recognition.

Exam trap

The trap here is that candidates may assume computer vision is narrowly defined (e.g., only for text or satellite imagery) or that it requires prohibitively expensive hardware, when in fact it is a broad, cloud-accessible technology with many practical applications.

How to eliminate wrong answers

Option A is wrong because computer vision is not limited to text recognition; it encompasses a wide range of tasks including object detection, image classification, facial recognition, and scene understanding. Option C is wrong because computer vision is not restricted to satellite imagery; it is applied across diverse domains such as healthcare, manufacturing, security, and consumer technology. Option D is wrong because computer vision does not require extremely expensive hardware; cloud-based services like Azure Computer Vision provide scalable, cost-effective APIs that run on standard hardware, making the technology accessible without specialized on-premises equipment.

810
MCQmedium

A company wants to use Azure OpenAI to generate personalized marketing emails. They have a large dataset of customer purchase histories. They want the model to generate emails that recommend products based on individual customer preferences without retraining the entire model. Which technique should they use?

A.Fine-tuning
B.Prompt engineering with few-shot learning
C.Reinforcement learning from human feedback
D.Creating a custom neural network
AnswerB

This technique provides examples in the prompt to guide the model's output for a specific task without retraining, making it ideal for generating personalized emails based on customer data.

Why this answer

Prompt engineering with few-shot learning is correct because it allows the model to generate personalized marketing emails by providing a few examples of customer-product pairs in the prompt, without modifying the underlying model weights. This technique leverages the pre-trained knowledge of Azure OpenAI to recommend products based on individual customer purchase histories, avoiding the need for costly retraining.

Exam trap

The trap here is that candidates often confuse fine-tuning with prompt engineering, assuming that any customization requires retraining, when in fact few-shot learning can achieve personalization without modifying model weights.

How to eliminate wrong answers

Option A is wrong because fine-tuning requires retraining the model on a labeled dataset, which contradicts the requirement to avoid retraining the entire model. Option C is wrong because reinforcement learning from human feedback (RLHF) is used to align model behavior with human preferences through iterative feedback, not for generating personalized recommendations from static customer data without retraining. Option D is wrong because creating a custom neural network involves building and training a new model from scratch, which is unnecessary and contradicts the requirement to use Azure OpenAI without retraining.

811
MCQmedium

What is 'model compression' and what techniques does it include?

A.Compressing training data files to reduce storage costs
B.Reducing model size through pruning, quantisation, distillation, and factorisation for efficient deployment
C.Summarising model documentation into a shorter model card format
D.Packaging model code and dependencies into a container image for deployment
AnswerB

Model compression enables edge deployment and lower inference costs — multiple techniques trade small accuracy loss for large efficiency gains.

Why this answer

Model compression is a set of techniques used to reduce the size of a trained machine learning model while preserving its accuracy as much as possible. This is critical for deploying models on resource-constrained devices like edge devices or mobile phones. The key techniques include pruning (removing unnecessary weights), quantization (reducing the precision of weights, e.g., from 32-bit floats to 8-bit integers), distillation (training a smaller 'student' model to mimic a larger 'teacher' model), and factorization (decomposing large weight matrices into smaller ones).

Option B correctly lists these four core techniques.

Exam trap

The trap here is that candidates confuse model compression with general deployment or data optimization tasks, such as containerization (Option D) or data compression (Option A), because the word 'compression' is used broadly in Azure contexts.

How to eliminate wrong answers

Option A is wrong because compressing training data files is a data storage optimization technique, not a model compression technique; model compression specifically targets the model's architecture and parameters, not the input data. Option C is wrong because summarizing model documentation into a shorter model card is a documentation or governance practice, not a technical method for reducing model size or computational footprint. Option D is wrong because packaging model code and dependencies into a container image is a deployment and containerization step (e.g., using Docker), which does not reduce the model's size or complexity; it simply bundles the existing model for portability.

812
MCQmedium

A brand monitoring company wants to automatically detect the presence of specific logos (e.g., Apple, Coca-Cola) in social media images. The logos can appear in various orientations and sizes within the image. Which Azure Computer Vision capability is specifically designed to identify popular brands from their logos?

A.Image Classification
B.Object Detection
C.Brand Detection
D.Optical Character Recognition
AnswerC

Brand detection is a built-in feature of Azure Computer Vision that identifies thousands of global brands from their logos, handling variations in orientation and size.

Why this answer

Brand Detection is a specialized Azure Computer Vision capability that uses a pre-trained model to identify thousands of global brands from their logos in images. It is specifically designed to handle variations in logo orientation, size, and placement, making it the correct choice for this scenario.

Exam trap

The trap here is that candidates often confuse Object Detection (which finds generic objects) with Brand Detection (which is a specialized, pre-trained subset for logos), leading them to select Object Detection because it also uses bounding boxes.

How to eliminate wrong answers

Option A is wrong because Image Classification assigns a single label to the entire image (e.g., 'soda can') but does not locate or identify specific brand logos within the image. Option B is wrong because Object Detection identifies and locates generic objects (e.g., 'bottle', 'car') using bounding boxes, but it is not pre-trained to recognize specific brand logos like Apple or Coca-Cola. Option D is wrong because Optical Character Recognition (OCR) extracts printed or handwritten text from images, not visual logos or brand symbols.

813
MCQeasy

A manufacturing company uses overhead cameras on an assembly line to check that each part is present in the correct location on a circuit board. The system must not only confirm the part is there but also draw a box around each part to show its exact position. Which Azure Computer Vision capability should they use?

A.Optical Character Recognition (OCR)
B.Image Classification
C.Object Detection
D.Face Detection
AnswerC

Object Detection identifies objects and returns their bounding box coordinates, making it suitable for locating each part and drawing boxes around them.

Why this answer

Object Detection is the correct capability because it not only identifies whether a specific object (like a circuit board part) is present in an image but also returns bounding box coordinates that indicate the exact location of each detected object. This meets the requirement to both confirm the part's presence and draw a box around it.

Exam trap

The trap here is that candidates confuse Image Classification (which only labels the whole image) with Object Detection (which provides per-object localization), especially when the question emphasizes both 'confirm the part is there' and 'draw a box around each part'.

How to eliminate wrong answers

Option A is wrong because Optical Character Recognition (OCR) is designed to extract printed or handwritten text from images, not to detect or locate physical objects like circuit board parts. Option B is wrong because Image Classification assigns a single label to the entire image (e.g., 'circuit board with all parts') but does not provide bounding boxes or per-object localization. Option D is wrong because Face Detection is specialized for locating human faces and cannot be used to detect generic industrial parts on a circuit board.

814
MCQeasy

What is 'named entity recognition' (NER) in Azure AI Language?

A.Renaming database fields to follow a consistent naming convention
B.Identifying and classifying real-world entities (people, organisations, locations) mentioned in text
C.Recognising the named author of a document for copyright purposes
D.Detecting when a user provides their name in a chatbot conversation
AnswerB

NER extracts and categorises named entities from text — enabling structured information extraction from unstructured content.

Why this answer

Named entity recognition (NER) is a feature of Azure AI Language that identifies and categorizes real-world entities such as people, organizations, locations, dates, and quantities within unstructured text. It uses pre-trained machine learning models to extract these entities, enabling downstream tasks like information retrieval and content summarization. Option B correctly describes this core functionality.

Exam trap

The trap here is that candidates may confuse NER with other NLP tasks like sentiment analysis or key phrase extraction, or assume it only handles names, when in fact it classifies a wide range of entity types including dates, quantities, and URLs.

How to eliminate wrong answers

Option A is wrong because renaming database fields is a data governance or schema management task, not a natural language processing capability offered by Azure AI Language. Option C is wrong because while NER can identify a person's name, it does not specifically recognize the 'named author' for copyright purposes; copyright analysis is not a built-in feature of NER. Option D is wrong because detecting a user providing their name in a chatbot is a specific application of entity recognition, but NER is a broader technology that identifies many entity types, not just names in chatbot contexts.

815
MCQeasy

What is Azure Machine Learning?

A.A pre-built AI service for specific tasks like vision or language
B.A cloud platform for building, training, deploying, and monitoring ML models
C.A database service optimized for storing ML training data
D.A GPU-only service for deep learning training
AnswerB

Azure Machine Learning provides end-to-end ML lifecycle tools — experimentation, training, deployment, and monitoring.

Why this answer

Azure Machine Learning is a comprehensive cloud-based platform that provides end-to-end capabilities for the machine learning lifecycle, including building, training, deploying, and monitoring models. It supports various frameworks (e.g., TensorFlow, PyTorch, scikit-learn) and offers features like automated ML, pipelines, and MLOps integration. This distinguishes it from pre-built AI services or specialized infrastructure offerings.

Exam trap

The trap here is that candidates confuse Azure Machine Learning (a full ML platform) with Azure Cognitive Services (pre-built AI services), especially since both fall under the 'AI on Azure' umbrella, but the question specifically asks for the platform that enables custom model development.

How to eliminate wrong answers

Option A is wrong because it describes Azure Cognitive Services (now Azure AI Services), which are pre-built APIs for specific tasks like vision, language, or speech, not a platform for custom model development. Option C is wrong because Azure Machine Learning is not a database service; it can integrate with data stores like Azure Blob Storage or Azure SQL Database for training data, but it is not a database itself. Option D is wrong because Azure Machine Learning supports both CPU and GPU compute targets (e.g., Azure ML compute clusters, attached VMs), and is not limited to GPU-only workloads; it can run training on CPUs for many algorithms.

816
MCQeasy

A retail company wants to automatically determine whether customer reviews are positive, negative, or neutral. Which prebuilt Azure AI Language feature should they use?

A.Key phrase extraction
B.Language detection
C.Sentiment analysis
D.Entity recognition
AnswerC

Sentiment analysis is specifically designed to determine whether text is positive, negative, or neutral, making it the correct choice.

Why this answer

Sentiment analysis is the correct Azure AI Language feature because it is specifically designed to classify text into positive, negative, or neutral sentiment categories. This prebuilt capability analyzes customer reviews at the document and sentence level, returning a sentiment label and confidence scores, which directly meets the requirement of automatically determining review polarity.

Exam trap

The trap here is that candidates often confuse key phrase extraction with sentiment analysis because both deal with text content, but key phrase extraction only identifies terms without any polarity assessment, while sentiment analysis explicitly classifies emotional tone.

How to eliminate wrong answers

Option A is wrong because key phrase extraction identifies important words or phrases (e.g., 'great service', 'long wait') but does not assign a sentiment label like positive, negative, or neutral. Option B is wrong because language detection identifies the language of the text (e.g., English, Spanish) and has no capability to evaluate sentiment or polarity. Option D is wrong because entity recognition extracts named entities such as people, organizations, or locations, and does not perform sentiment classification.

817
MCQhard

A healthcare organization needs to automatically redact personally identifiable information (PII) such as patient names and addresses from unstructured medical notes, while keeping clinical terms like disease names and medications. They want to use prebuilt Azure AI Language features without any custom training. Which combination of features should they use?

A.PII detection and Text Analytics for Health
B.Key phrase extraction and sentiment analysis
C.Named Entity Recognition (NER) and language detection
D.Entity linking and conversational language understanding
AnswerA

PII detection finds personal data to redact; Text Analytics for Health extracts medical concepts, preserving clinical information. Both are prebuilt and require no custom training.

Why this answer

Option A is correct because PII detection identifies and redacts personally identifiable information like patient names and addresses, while Text Analytics for Health extracts clinical entities such as diseases and medications from unstructured medical notes. Both are prebuilt Azure AI Language features that require no custom training, making them ideal for this healthcare redaction scenario.

Exam trap

The trap here is that candidates may confuse general-purpose Named Entity Recognition (NER) with the specialized Text Analytics for Health feature, or assume that PII detection alone can handle clinical terms, when in fact two separate prebuilt features are needed for this specific healthcare redaction task.

How to eliminate wrong answers

Option B is wrong because key phrase extraction and sentiment analysis do not identify or redact PII or clinical terms; key phrase extraction returns general phrases and sentiment analysis evaluates emotional tone, neither of which meets the redaction requirement. Option C is wrong because Named Entity Recognition (NER) in Azure AI Language is a general-purpose feature that does not specifically handle clinical entities or PII redaction, and language detection only identifies the language of the text, not entities. Option D is wrong because entity linking maps entities to a knowledge base (e.g., Wikipedia) and conversational language understanding is designed for intent recognition in chatbots, neither of which provides PII redaction or clinical entity extraction for medical notes.

818
MCQmedium

What is the Azure AI Language feature 'custom named entity recognition' used for?

A.Automatically renaming Azure resources with appropriate names
B.Training models to recognize domain-specific entity types unique to your business
C.Replacing personally identifiable information with pseudonyms
D.Detecting when text contains company-specific named brands
AnswerB

Custom NER lets you define and train custom entity categories (e.g., product codes, medical terms, legal references) for your specific domain.

Why this answer

Custom named entity recognition (NER) in Azure AI Language allows you to train a model to identify and extract domain-specific entities that are not covered by the pre-built entity catalog. This is achieved by providing labeled example data, which the service uses to learn the unique entity types relevant to your business, such as product codes, internal document IDs, or specialized medical terms.

Exam trap

The trap here is that candidates confuse custom NER with the pre-built NER or PII detection features, assuming that 'custom' simply means recognizing any named entity, when in fact it specifically requires training a new model with your own labeled data to handle entities absent from the default catalog.

How to eliminate wrong answers

Option A is wrong because custom NER does not rename Azure resources; resource renaming is a management operation handled through the Azure portal, CLI, or PowerShell, not a language model feature. Option C is wrong because replacing PII with pseudonyms is the function of Azure AI Language's 'Personally Identifiable Information (PII) detection' feature with redaction, not custom NER. Option D is wrong because detecting company-specific brand names is a subset of what custom NER can do, but the option is too narrow and misleading—custom NER is designed for any domain-specific entity type, not just brands, and the pre-built NER already handles common brand names; the key value is recognizing entities that are unique and not in the default model.

819
MCQeasy

A marketing team uses Azure OpenAI to generate social media posts. They want to ensure the generated text maintains a consistent, predictable brand voice without being overly creative or random. Which parameter should they primarily adjust to control the randomness of the output?

A.Temperature
B.Max tokens
C.Frequency penalty
D.Top P
AnswerA

Correct. Lower temperature values make the output more deterministic and focused, which helps maintain a consistent brand voice.

Why this answer

Temperature controls the randomness of token selection by scaling the logits before applying the softmax function. A lower temperature (e.g., 0.2) makes the model more deterministic and conservative, producing outputs that stick closely to the most likely tokens—ideal for maintaining a consistent, predictable brand voice. Higher temperatures increase randomness, which the team wants to avoid.

Exam trap

The trap here is that candidates often confuse Top P (nucleus sampling) with temperature, thinking both control randomness equally, but temperature directly scales the logits for a more fine-grained control over determinism, whereas Top P dynamically selects a subset of tokens based on cumulative probability.

How to eliminate wrong answers

Option B (Max tokens) is wrong because it limits the length of the generated output, not the randomness or creativity of the text. Option C (Frequency penalty) is wrong because it reduces repetition by penalizing tokens that have already appeared, which affects diversity but does not directly control the overall randomness or predictability of the output. Option D (Top P) is wrong because it uses nucleus sampling to cut off the least likely tokens, which can influence creativity but is a different mechanism than temperature; adjusting Top P alone does not provide the same direct control over the deterministic vs. random trade-off that temperature offers.

820
MCQeasy

A hotel chain receives thousands of online guest reviews each month. The management wants to automatically determine whether the overall feedback for each review is positive, negative, or neutral to identify areas for improvement. Which Azure AI Language feature should they use?

A.Key phrase extraction
B.Named entity recognition
C.Sentiment analysis
D.Language detection
AnswerC

Sentiment analysis is designed to determine the emotional tone of text, such as positive, negative, or neutral.

Why this answer

Sentiment analysis is the correct Azure AI Language feature because it evaluates text to determine the overall sentiment—positive, negative, or neutral—at the document or sentence level. This directly matches the hotel chain's requirement to classify each review's feedback automatically, enabling them to identify areas for improvement based on sentiment trends.

Exam trap

The trap here is that candidates often confuse sentiment analysis with key phrase extraction, mistakenly thinking that extracting positive or negative phrases is equivalent to determining overall sentiment, but key phrase extraction does not assign a polarity score or classify the text as positive, negative, or neutral.

How to eliminate wrong answers

Option A is wrong because key phrase extraction identifies important words or phrases in text (e.g., 'comfortable beds', 'slow check-in') but does not classify the overall sentiment as positive, negative, or neutral. Option B is wrong because named entity recognition identifies and categorizes entities like people, places, or organizations (e.g., 'Hilton', 'New York') but does not evaluate the emotional tone or polarity of the text. Option D is wrong because language detection identifies the language of the text (e.g., English, Spanish) but provides no insight into the sentiment or opinion expressed in the review.

821
MCQmedium

What is the Azure AI Evaluation SDK used for in generative AI development?

A.Evaluating the environmental impact of AI model training
B.Systematically measuring quality (groundedness, relevance, coherence) and safety of generative AI responses
C.Evaluating Azure subscription costs for AI workloads
D.A peer review system for human evaluation of AI responses
AnswerB

The Evaluation SDK measures whether AI responses are grounded in context, relevant, coherent, and free from harmful content.

Why this answer

The Azure AI Evaluation SDK is specifically designed to systematically measure the quality and safety of generative AI responses. It evaluates key metrics such as groundedness (how well the response aligns with source data), relevance, and coherence, as well as safety aspects like content filtering and harm detection. This makes it essential for validating and improving generative AI applications before deployment.

Exam trap

The trap here is that candidates confuse the Evaluation SDK with general monitoring or cost tools, but the exam specifically tests that this SDK is for measuring response quality and safety in generative AI, not for environmental, cost, or human review purposes.

How to eliminate wrong answers

Option A is wrong because the Azure AI Evaluation SDK does not measure environmental impact; that is handled by tools like the Microsoft Sustainability Calculator or Azure Carbon Optimization. Option C is wrong because subscription cost evaluation is managed by Azure Cost Management + Billing, not the Evaluation SDK. Option D is wrong because the SDK provides automated, programmatic evaluation using built-in metrics and AI-assisted scoring, not a peer review system for human evaluators.

822
MCQeasy

A customer insights team at a retail company has collected thousands of product reviews. They want to automatically extract the most frequently mentioned topics or aspects from these reviews, such as 'price', 'durability', and 'customer service', without any custom training. Which prebuilt Azure AI Language feature should they use?

A.Text Analytics for Health
B.Key Phrase Extraction
C.Conversational Language Understanding
D.Entity Linking
AnswerB

Key Phrase Extraction identifies the main points or topics in text, making it suitable for extracting frequently mentioned aspects from customer reviews without custom training.

Why this answer

Key Phrase Extraction is the correct choice because it is a prebuilt Azure AI Language feature designed to automatically identify and extract the main points or topics from unstructured text, such as product reviews, without requiring any custom training or labeled data. It surfaces frequently mentioned aspects like 'price', 'durability', and 'customer service' by analyzing linguistic patterns and term frequency, making it ideal for this summarization task.

Exam trap

The trap here is that candidates often confuse Key Phrase Extraction with Entity Linking or Conversational Language Understanding, mistakenly thinking that extracting topics requires custom training or entity disambiguation, when in fact Key Phrase Extraction is a zero-shot, prebuilt feature specifically designed for this exact use case.

How to eliminate wrong answers

Option A is wrong because Text Analytics for Health is a specialized prebuilt model for extracting medical entities, relations, and concepts from clinical documents, not general product reviews. Option C is wrong because Conversational Language Understanding (CLU) is a custom, trainable feature for building intent and entity extraction models for conversational apps, requiring labeled training data, not a prebuilt zero-shot extraction tool. Option D is wrong because Entity Linking disambiguates named entities by linking them to a knowledge base (e.g., Wikipedia), but it does not extract general topics or aspects like 'price' or 'durability' from unstructured text.

823
MCQhard

A data scientist is training a logistic regression model to predict customer churn using a small dataset with 500 records and 200 features. The model achieves 97% accuracy on the training set but only 65% on a held-out test set, indicating severe overfitting. The data scientist wants to reduce overfitting by automatically eliminating irrelevant features. Which technique should the data scientist apply?

A.Apply L1 regularization (Lasso) to the model
B.Apply L2 regularization (Ridge) to the model
C.Use k-fold cross-validation to select the best model
D.Increase the number of training samples by data augmentation
AnswerA

L1 regularization adds a penalty term that can zero out coefficients of less important features, performing feature selection and reducing model complexity to combat overfitting.

Why this answer

L1 regularization (Lasso) adds a penalty equal to the absolute value of the magnitude of coefficients, which can shrink some coefficients exactly to zero. This performs automatic feature selection by eliminating irrelevant features, directly addressing the overfitting caused by having 200 features on only 500 records. The high training accuracy (97%) versus low test accuracy (65%) is a classic sign of overfitting that L1 regularization mitigates by reducing model complexity.

Exam trap

Microsoft often tests the distinction between L1 and L2 regularization: the trap here is that candidates confuse 'reducing overfitting' (which both can do) with 'eliminating features' (which only L1 does), leading them to pick L2 regularization or cross-validation instead.

How to eliminate wrong answers

Option B is wrong because L2 regularization (Ridge) penalizes the square of coefficients, shrinking them toward zero but never exactly to zero, so it does not eliminate features—it only reduces their impact, which is less effective for automatic feature selection. Option C is wrong because k-fold cross-validation is a model evaluation technique that helps estimate generalization error and tune hyperparameters, but it does not itself eliminate features or reduce overfitting; it would need to be combined with a regularization method. Option D is wrong because data augmentation increases the number of training samples, which can help reduce overfitting, but the question specifically asks for a technique that eliminates irrelevant features, and data augmentation does not perform feature selection—it only adds more data.

824
MCQmedium

A security company needs to identify individuals in a crowd by matching their faces against a database of known persons of interest. The system must detect faces, verify the identities, and provide a confidence score. Which Azure Computer Vision capability should they use?

A.Facial recognition
B.Optical character recognition (OCR)
C.Image classification
D.Object detection
AnswerA

Facial recognition uses face detection and matching against a known database to identify individuals, exactly as required for this security scenario.

Why this answer

Azure Computer Vision's facial recognition capability is specifically designed to detect human faces in images, match them against a known database of persons, and return a confidence score for each match. This directly aligns with the security company's requirement to identify individuals in a crowd by verifying their identities against a watchlist.

Exam trap

The trap here is that candidates confuse 'facial recognition' (identity verification against a database) with 'object detection' (locating faces as objects), but only facial recognition provides the identity matching and confidence score required for this scenario.

How to eliminate wrong answers

Option B is wrong because Optical Character Recognition (OCR) extracts text from images, not faces or identities. Option C is wrong because Image Classification assigns a single label to an entire image (e.g., 'crowd'), not individual face detection and matching. Option D is wrong because Object Detection locates and labels generic objects (e.g., 'person') with bounding boxes, but does not perform identity verification or confidence-based matching against a known database.

825
MCQeasy

A retail company wants to automatically group its customers into distinct segments based on their purchasing patterns, without having pre-defined categories. The goal is to discover natural groupings in the customer data to tailor marketing campaigns. Which type of machine learning task should the company use?

A.Supervised learning - Classification
B.Unsupervised learning - Clustering
C.Reinforcement learning
D.Supervised learning - Regression
AnswerB

Clustering is an unsupervised learning technique that groups similar data points together based on features, without needing labels. This fits the scenario of discovering natural customer segments from purchasing patterns.

Why this answer

The company wants to discover natural groupings in customer data without pre-defined categories, which is the definition of unsupervised learning. Clustering algorithms (e.g., K-Means, DBSCAN) automatically partition data into segments based on similarity in purchasing patterns, making it the correct choice for this scenario.

Exam trap

The trap here is that candidates confuse 'grouping without labels' with classification (which requires labels) or regression (which predicts numbers), but the key differentiator is the absence of pre-defined categories, pointing directly to unsupervised clustering.

How to eliminate wrong answers

Option A is wrong because supervised learning - Classification requires labeled data with predefined categories, but the problem explicitly states 'without having pre-defined categories.' Option C is wrong because reinforcement learning involves an agent learning from rewards/punishments through interaction with an environment, which is not applicable to static customer segmentation. Option D is wrong because supervised learning - Regression predicts continuous numerical values (e.g., sales amount), not discrete customer segments.

Page 10

Page 11 of 14

Page 12