Back to Microsoft Azure AI Fundamentals AI-900 questions

Scenario-based practice

Hard Difficulty Questions

Practise Microsoft Azure AI Fundamentals AI-900 practice questions — original exam-style scenarios covering every exam domain, with detailed explanations, wrong-answer analysis, and common exam traps.

20
scenario questions
AI-900
exam code
Microsoft
vendor

Scenario guide

How to approach hard difficulty questions

These are the questions most candidates get wrong. They require connecting multiple concepts, reading tricky output, or knowing edge-case behaviour that isn't on most study cards. Practising them trains you to operate under uncertainty — a necessary skill on the real exam.

Quick answer

Hard Difficulty Questions questions test whether you can apply the concept in context, not just recognise a definition.

How the topic appears in realistic exam-style scenarios.

Which detail in the question changes the correct answer.

How to eliminate plausible but wrong options.

How to connect the question back to the wider exam objective.

Related practice questions

Related AI-900 topic practice pages

Scenario questions usually connect to one or more exam topics. Use these links to review the underlying concepts behind the scenario.

Practice set

Practice scenarios

Question 1hardmultiple choice
Full question →

A medical research team wants to analyze MRI scans to identify and measure the precise boundaries of tumors. They need to assign each pixel in the image to a class (e.g., tumor, healthy tissue, background). Which Azure Computer Vision capability should they use?

Question 2hardmultiple choice
Full question →

A quality assurance team at a software company uses Azure OpenAI Service to generate compliance reports. They need the model to produce the exact same output for a given prompt every time the API is called, to ensure reproducibility during testing. Which parameter should they set to achieve this deterministic behavior?

Question 3hardmultiple choice
Full question →

A data scientist trains a regression model to predict house prices using features like bedrooms, square footage, and location. The model achieves an R-squared of 0.95 on the test set. However, when deployed to predict prices in a new city with different property characteristics, the predictions are very inaccurate. Which concept best explains this poor performance?

Question 4hardmultiple choice
Full question →

A manufacturing company wants to use Azure Computer Vision to inspect products on an assembly line for defects. They have a labeled dataset with images of defective and non-defective products. They need to not only classify products as defective or not, but also identify the exact location of the defect (e.g., a crack) in the image. Which Azure Computer Vision capability should they use?

Question 5hardmultiple choice
Full question →

A medical research team needs to analyze CT scans to identify and outline the exact boundaries of lung nodules. Which Azure Computer Vision capability should they use?

Question 6hardmultiple choice
Full question →

A medical research organization needs to process thousands of clinical trial documents to automatically extract specific medical entities such as diseases, symptoms, medications, and dosages. They want to use a prebuilt Azure AI Language capability that is already trained on medical domain data. Which Azure AI Language feature should they use?

Question 7hardmultiple choice
Full question →

An autonomous vehicle team needs a system that not only identifies objects like cars and pedestrians but also creates a precise pixel-level mask for each individual object instance, even when objects overlap. Which Azure Computer Vision capability should they use?

Question 8hardmultiple choice
Full question →

What is the 'bias-variance tradeoff' in machine learning?

Question 9hardmultiple choice
Read the full NAT/PAT explanation →

A bank deploys an AI system that uses a deep neural network to approve personal loan applications. A customer whose loan was rejected requests a detailed explanation of why the decision was made. The bank's AI team realizes that the model's internal workings are too complex to provide a simple, understandable reason. According to Microsoft's responsible AI principles, which principle is most directly violated by this situation?

Question 10hardmultiple choice
Read the full NAT/PAT explanation →

A customer support team receives thousands of emails daily. They want to automatically route each email to the appropriate department (Billing, Technical Support, or General Inquiry). They also want to extract the customer's account number and order ID from each email. Which combination of Azure AI Language features should they use?

Question 11hardmultiple choice
Full question →

A data science team trains a regression model to predict house prices. They evaluate the model using Mean Absolute Error (MAE). After deployment, they notice that the model occasionally produces large errors (e.g., underpredicting a luxury home by $500,000) while most predictions are within $20,000. The business is more concerned about the impact of these large errors than the average small error. Which additional metric should the team use to better capture the penalty for large errors?

Question 12hardmultiple choice
Full question →

A city deploys an AI-powered kiosk to help residents access government services. The kiosk uses a voice interface only, without any text or screen reader support. Which Microsoft responsible AI principle is most directly being ignored?

Question 13hardmultiple choice
Full question →

A company deploys an AI-powered voice assistant that only supports English. The assistant is used in a country where the official languages are English, French, and Dutch. Many users who speak French or Dutch cannot use the assistant effectively. Which Microsoft responsible AI principle is most directly relevant to this situation?

Question 14hardmultiple choice
Full question →

A company develops an AI system to screen job candidates based on their resumes. The system is trained on historical data. Analysis reveals that the model has an adverse impact against female candidates due to a proxy feature (e.g., 'years of continuous employment') that correlates with gender. The team removes the protected attribute 'gender' from the training data but the biased outcome persists. According to Microsoft's responsible AI principles, which additional step should the team take to address this unfairness?

Question 15hardmultiple choice
Full question →

A data scientist has a small dataset with only 200 labeled samples. They want to get a reliable estimate of model performance without using a separate validation set that would reduce the training data. Which technique should the data scientist use in Azure Machine Learning to obtain this reliable estimate?

Question 16hardmultiple choice
Full question →

A company develops an AI system to screen job resumes and rank candidates for interviews. The system is trained on historical hiring data that favored candidates from certain well-known universities. The company decides to deploy the system without any adjustments to address this bias. Which Microsoft responsible AI principle is most directly being violated?

Question 17hardmultiple choice
Full question →

A company uses a GPT-based model to generate marketing copy. They notice the model occasionally produces text that includes harmful stereotypes. They want to reduce these harmful outputs without retraining the model. Which approach is most appropriate?

Question 18hardmultiple choice
Full question →

A company uses a generative AI model to create blog posts. They want to ensure that the model's output never contains offensive or harmful language before the content is published. They implement a system that checks the generated text against a list of prohibited terms and blocks or edits the content if necessary. Which type of safety measure is this?

Question 19hardmultiple choice
Full question →

A data scientist is training a binary classification model to detect fraudulent transactions. The dataset contains only 1% fraudulent transactions. The model achieves 99% accuracy on the test set, but when deployed, it fails to detect most actual fraud cases. Which metric would best reveal this issue?

Question 20hardmultiple choice
Full question →

A data scientist is building a classification model to detect fraudulent transactions. The dataset has 1,000,000 legitimate transactions and only 1,000 fraudulent ones. The model achieves 99.9% accuracy on the test set, but it fails to catch most fraudulent cases. Which metric should the data scientist prioritize to better evaluate the model's performance on this imbalanced dataset?

These AI-900 practice questions are part of Courseiva's free Microsoft certification practice question bank. Courseiva provides original exam-style AI-900 questions with detailed explanations, topic-based practice, mock exams, readiness tracking, and study analytics.