AI-900Chapter 2 of 100Objective 1.1

Types of AI Workloads

This chapter covers the three main types of AI workloads you must understand for the AI-900 exam: supervised learning, unsupervised learning, and reinforcement learning. Additionally, we explore conversational AI, computer vision, natural language processing, and anomaly detection workloads. Approximately 30-35% of AI-900 exam questions touch on identifying these workloads and their appropriate use cases. Mastering this topic is critical because the exam tests your ability to match a business problem to the correct AI technique.

25 min read
Intermediate
Updated May 31, 2026

The Restaurant Kitchen Brigade

Imagine a restaurant kitchen preparing a complex meal. The head chef (supervised learning) trains the junior chefs by showing them labeled recipe cards – each card has a dish name and the exact steps to make it. After training, when a new order comes in, the junior chefs recognize the dish and follow the learned steps. A second team uses unsupervised learning: they are given a pile of ingredients and asked to group them by flavor profiles without any labels. They discover that tomatoes, basil, and mozzarella often appear together, forming a 'Caprese' cluster. A third team uses reinforcement learning: a new chef is given a stove and ingredients, and for every perfectly seared steak, he earns a reward; for a burnt one, he receives a penalty. Over many trials, he learns the optimal temperature and timing. The restaurant also uses a pre-trained model: a master chef's signature sauce recipe is used as a starting point for all new sauces, saving time. Each learning paradigm has its own kitchen, tools, and quality metrics, and the head chef chooses which to deploy based on the order type and available data.

How It Actually Works

What Are AI Workloads?

AI workloads are specific tasks or problem categories that artificial intelligence systems are designed to solve. On the AI-900 exam, you must differentiate between the three primary machine learning paradigms—supervised, unsupervised, and reinforcement learning—and recognize common AI workload types like computer vision, NLP, conversational AI, and anomaly detection. Each workload uses distinct algorithms, data requirements, and evaluation metrics.

Supervised Learning: Learning with Labels

Supervised learning is the most common workload. The algorithm is trained on a dataset where each example has an input (features) and a correct output (label). The goal is to learn a mapping from inputs to outputs so that it can predict labels for new, unseen data. - How it works: The model processes input features, makes a prediction, compares it to the true label using a loss function (e.g., mean squared error for regression, cross-entropy for classification), and adjusts its internal parameters via optimization (e.g., gradient descent) to minimize the loss. This process repeats over many epochs. - Key components: Features (independent variables), labels (target variable), training set, validation set, test set, loss function, optimizer. - Common algorithms: Linear regression, logistic regression, decision trees, random forests, support vector machines, neural networks. - Evaluation metrics: Accuracy, precision, recall, F1-score (classification); R-squared, mean absolute error (regression). - Azure services: Azure Machine Learning, Automated ML, Azure Cognitive Services (e.g., Custom Vision for classification).

Unsupervised Learning: Finding Hidden Patterns

Unsupervised learning works with unlabeled data. The algorithm must discover inherent structures, patterns, or groupings without any guidance. - How it works: The model identifies similarities or differences between data points based on their features. For clustering, it groups points so that intra-cluster similarity is high and inter-cluster similarity is low. For dimensionality reduction, it projects high-dimensional data into fewer dimensions while preserving variance. - Key components: No labels; only features. Distance metrics (e.g., Euclidean, cosine) for clustering. Number of clusters (k) must be specified or determined via heuristics. - Common algorithms: K-means clustering, hierarchical clustering, DBSCAN, principal component analysis (PCA), t-SNE. - Evaluation metrics: Silhouette score, inertia, Davies-Bouldin index (clustering). Explained variance (PCA). - Azure services: Azure Machine Learning (K-means module), Azure Cognitive Services (Anomaly Detector – uses unsupervised approach).

Reinforcement Learning: Learning from Rewards

Reinforcement learning involves an agent that interacts with an environment. The agent takes actions, receives rewards or penalties, and learns a policy that maximizes cumulative reward over time. - How it works: The agent observes the current state, selects an action using an exploration-exploitation strategy (e.g., epsilon-greedy), receives a reward and the next state, and updates its policy (e.g., via Q-learning or policy gradients). The process continues until the agent converges to an optimal policy. - Key components: Agent, environment, state, action, reward, policy, value function, Q-function, discount factor (gamma, typically 0.9–0.99), learning rate (alpha, e.g., 0.1). - Common algorithms: Q-learning, Deep Q-Networks (DQN), Policy Gradients, Proximal Policy Optimization (PPO). - Evaluation metrics: Cumulative reward, average reward per episode, convergence time. - Azure services: Azure Machine Learning (RL via Ray), Azure Cognitive Services (Personalizer – uses contextual bandits, a simplified RL).

Conversational AI: Understanding and Generating Dialogue

Conversational AI workloads power chatbots and virtual assistants. They combine natural language understanding (NLU) to interpret user intent and natural language generation (NLG) to produce responses. - How it works: The system receives text or speech, uses a language model to extract intent and entities (e.g., 'BookFlight' intent with 'destination' entity 'Paris'), retrieves or generates a response, and optionally sends it to a text-to-speech engine. - Key components: Intent recognition, entity extraction, dialogue management, response generation, language model. - Azure services: Azure Bot Service, Language Understanding (LUIS) – now part of Azure Cognitive Service for Language, QnA Maker (now part of Azure Cognitive Service for Language – custom question answering), Azure Speech Services. - Evaluation metrics: Intent accuracy, entity F1-score, user satisfaction, average conversation length.

Computer Vision: Interpreting Visual Data

Computer vision workloads process images and videos to extract information. Tasks include image classification, object detection, segmentation, optical character recognition (OCR), and facial recognition. - How it works: Convolutional neural networks (CNNs) extract hierarchical features from pixels. For classification, a CNN outputs probabilities for each class. For object detection, it also predicts bounding boxes. For segmentation, it assigns a class to every pixel. - Key components: Input image dimensions (e.g., 224x224x3 for ResNet), convolutional layers, pooling layers, fully connected layers, loss function (e.g., cross-entropy, IoU loss). - Azure services: Azure Computer Vision, Custom Vision, Face API, Form Recognizer (now Azure AI Document Intelligence). - Evaluation metrics: Accuracy, mean average precision (mAP) for detection, intersection over union (IoU) for segmentation.

Natural Language Processing (NLP): Understanding Text

NLP workloads analyze and generate human language. Key tasks include sentiment analysis, key phrase extraction, language detection, text classification, summarization, and translation. - How it works: Text is tokenized into words or subwords, converted to embeddings (e.g., word2vec, BERT embeddings), and processed by a model (e.g., transformer) to perform the task. For translation, an encoder-decoder architecture is common. - Key components: Tokenizer, embedding dimension (e.g., 768 for BERT), attention mechanism, transformer layers (e.g., 12 for BERT-base). - Azure services: Azure Cognitive Service for Language (includes sentiment analysis, key phrase extraction, language detection, custom text classification), Azure Translator, Azure Speech Services. - Evaluation metrics: Accuracy, F1-score, BLEU score (translation), ROUGE score (summarization).

Anomaly Detection: Identifying Outliers

Anomaly detection identifies data points that deviate significantly from the norm. It is used for fraud detection, equipment monitoring, and security. - How it works: The model learns the normal pattern of data (e.g., using statistical methods or autoencoders). New data points are scored based on their deviation; points with scores above a threshold are flagged as anomalies. - Key components: Training data (mostly normal), anomaly score, threshold (e.g., 95th percentile). - Common algorithms: One-class SVM, isolation forest, autoencoders, ARIMA (time series). - Azure services: Azure Anomaly Detector (part of Cognitive Services), Azure Machine Learning (built-in anomaly detection modules). - Evaluation metrics: Precision, recall, F1-score at various thresholds, area under ROC curve.

How These Workloads Relate to Azure

Azure offers a unified platform for these workloads through Azure Machine Learning (for custom model training) and Azure Cognitive Services (pre-built AI models). The exam expects you to know which Azure service maps to which workload. For example, if a scenario describes identifying objects in images, you should select Azure Computer Vision. If it describes training a model to classify customer feedback into positive/negative, you should select Azure Cognitive Service for Language (sentiment analysis) or Custom Text Classification.

Exam Relevance

AI-900 tests your ability to identify the correct workload type for a given business problem. Common questions present a scenario and ask which Azure service or ML paradigm to use. You must also distinguish between supervised, unsupervised, and reinforcement learning based on data labeling and goal. For example, if data is labeled and you want to predict a category, it's supervised classification. If data is unlabeled and you want to group customers, it's unsupervised clustering.

Walk-Through

1

Identify the Business Problem

Start by understanding the customer's goal. Ask: Is the goal to predict a known outcome (e.g., fraud yes/no)? Then it's supervised learning. Is the goal to discover hidden groups (e.g., customer segments)? Then it's unsupervised learning. Is the goal to learn a sequence of actions through trial and error (e.g., game playing, robot control)? Then it's reinforcement learning. Also, determine the data type: text, image, or numeric. This initial classification narrows down the workload type.

2

Determine Data Availability and Labeling

Check if labeled data exists. Supervised learning requires a large set of labeled examples. If labels are unavailable or too expensive to create, unsupervised learning or reinforcement learning (which generates its own rewards) may be more suitable. For reinforcement learning, you need a simulator or environment where the agent can interact and receive feedback. The exam often tests this: a scenario with no labeled data points to clustering or anomaly detection.

3

Match to Azure Cognitive Service

Once the workload type is clear, map it to the appropriate Azure service. For computer vision, use Azure Computer Vision or Custom Vision. For NLP, use Azure Cognitive Service for Language. For conversational AI, use Azure Bot Service with Language Understanding. For anomaly detection, use Azure Anomaly Detector. For custom machine learning, use Azure Machine Learning. The exam expects you to know these mappings without hesitation.

4

Consider Pre-built vs. Custom Models

Azure Cognitive Services offer pre-trained models that require no training data. If the problem fits a common use case (e.g., general object detection, sentiment analysis), use the pre-built service. If the problem requires domain-specific customization (e.g., detecting specific defects in manufacturing), use Custom Vision or Custom Text Classification. The exam emphasizes this trade-off: pre-built for speed, custom for accuracy on niche data.

5

Evaluate and Iterate

After deploying the model, monitor its performance using appropriate metrics. For classification, track accuracy and precision. For regression, track RMSE. For clustering, use silhouette score. For reinforcement learning, track cumulative reward. If performance is poor, consider more data, feature engineering, or switching to a different workload type. The exam may ask about metrics for specific workloads.

What This Looks Like on the Job

Enterprise Scenario 1: Retail Customer Segmentation

A large retail chain wants to group its customers based on purchasing behavior to tailor marketing campaigns. They have transaction data with features like total spend, frequency, and product categories purchased, but no customer labels. This is an unsupervised learning problem. The data science team uses Azure Machine Learning with K-means clustering. They set k=5 after elbow analysis. The model identifies five segments: budget shoppers, luxury buyers, frequent small purchasers, seasonal buyers, and inactive customers. The marketing team then designs targeted promotions. Common pitfall: choosing too many clusters leads to overfitting; too few misses important distinctions. The team uses silhouette score (target >0.5) to validate. Performance consideration: clustering on millions of customers requires scaling to distributed compute (e.g., Azure Spark ML). Misconfiguration: using Euclidean distance on unscaled features (e.g., income vs. number of visits) skews results.

Enterprise Scenario 2: Manufacturing Defect Detection

A factory produces circuit boards and needs to automatically detect soldering defects from camera images. They have thousands of labeled images: 'defective' and 'non-defective'. This is a supervised image classification problem. They use Azure Custom Vision to train a model with transfer learning (ResNet backbone). After training with 500 images per class, the model achieves 98% accuracy. It is deployed as a container on the factory edge device via Azure IoT Edge. Common issue: imbalanced classes (only 5% defective) – they use oversampling. Performance: inference must complete within 100ms per image; they optimize by reducing image resolution from 1024x1024 to 320x320. Misconfiguration: not retraining periodically as new defect types appear.

Enterprise Scenario 3: Chatbot for Customer Support

A telecom company wants a chatbot to handle common inquiries like bill payments and plan changes. They use Azure Bot Service with Language Understanding (LUIS) for intent recognition. They define intents: 'CheckBalance', 'PayBill', 'ChangePlan', and entities: 'amount', 'date'. They train with 50 example utterances per intent. The chatbot uses QnA Maker for FAQs. When a user says 'I want to pay my bill', LUIS identifies intent 'PayBill' with entity 'bill'. The bot then prompts for payment details. Common pitfalls: overlapping intents (e.g., 'PayBill' and 'CheckBalance' both triggered by 'bill') – they use phrase lists and add negative examples. Performance: latency must be under 2 seconds; they cache common responses. Misconfiguration: not handling out-of-scope queries – they add a fallback intent.

How AI-900 Actually Tests This

What AI-900 Tests on AI Workloads (Objective 1.1)

The exam focuses on identifying the correct workload type for a given scenario. You must be able to distinguish between: - Supervised learning: labeled data, prediction (classification or regression). - Unsupervised learning: unlabeled data, pattern discovery (clustering, anomaly detection). - Reinforcement learning: agent learning from rewards, sequential decision-making. - Computer vision: image/video analysis (object detection, OCR, facial recognition). - NLP: text analysis (sentiment, key phrases, translation). - Conversational AI: chatbots with intent and entity recognition. - Anomaly detection: outlier identification in numeric/time-series data.

Common Wrong Answers and Why

1.

Choosing 'Reinforcement learning' for recommendation systems: Many candidates think recommendation systems use RL because they learn from user feedback. However, most recommendation systems use collaborative filtering (unsupervised) or supervised learning on user-item interactions. RL is only used in advanced cases like news recommendation with exploration. For AI-900, stick to supervised for explicit ratings, unsupervised for implicit patterns.

2.

Selecting 'Computer Vision' for text analysis: A scenario describing 'extracting customer sentiment from product reviews' is NLP, not computer vision. Candidates confuse because text can be in images, but the task is about text content, not image processing.

3.

Confusing 'Anomaly Detection' with 'Supervised Classification': Anomaly detection often uses unsupervised learning (no labels for anomalies). If the scenario says 'no historical fraud data', it's unsupervised anomaly detection, not supervised.

4.

Picking 'Custom Vision' when 'Computer Vision' suffices: If the scenario needs a pre-built model (e.g., general object detection), the answer is Azure Computer Vision, not Custom Vision. Custom Vision is only when you need to train on custom images.

Specific Numbers, Values, and Terms

Azure Cognitive Service for Language includes sentiment analysis, key phrase extraction, language detection, and custom text classification.

Azure Computer Vision includes OCR, image analysis, and spatial analysis.

Azure Anomaly Detector works best on time-series data.

Azure Personalizer uses reinforcement learning (contextual bandits).

Azure Bot Service integrates with Language Understanding (LUIS) and QnA Maker.

Evaluation metrics: accuracy for classification, RMSE for regression, silhouette for clustering, BLEU for translation.

Edge Cases and Exceptions

Regression vs. classification: If the output is a continuous number (e.g., price), it's regression (supervised). If the output is a category, it's classification.

Time-series anomaly detection: Uses unsupervised approaches like ARIMA or autoencoders; Azure Anomaly Detector is designed for this.

Semi-supervised learning: Not explicitly tested, but know that some scenarios use a mix (e.g., a small labeled set with large unlabeled set).

How to Eliminate Wrong Answers

If the scenario mentions 'no labels', eliminate supervised learning.

If the scenario mentions 'trial and error' or 'reward', choose reinforcement learning.

If the scenario involves 'images' or 'video', choose computer vision.

If the scenario involves 'text' or 'language', choose NLP.

If the scenario involves 'chatbot' or 'conversation', choose conversational AI.

If the scenario involves 'unusual patterns' in numeric data, choose anomaly detection.

Key Takeaways

Supervised learning uses labeled data to predict outcomes; unsupervised learning finds patterns in unlabeled data; reinforcement learning learns from rewards.

Azure Cognitive Services are pre-built AI models; Azure Machine Learning is for custom model training.

For image tasks, use Computer Vision or Custom Vision; for text tasks, use Language Service.

For conversational AI, use Bot Service with Language Understanding (LUIS).

For anomaly detection on time-series data, use Azure Anomaly Detector.

Reinforcement learning is used for decision-making in environments with rewards (e.g., games, robotics).

Always match the workload to the data type and business goal: labeled data → supervised; unlabeled → unsupervised; reward-based → RL.

Easy to Mix Up

These come up on the exam all the time. Here's how to tell them apart.

Supervised Learning

Requires labeled data (input-output pairs)

Goal: predict labels for new data

Common algorithms: linear regression, decision trees, neural networks

Evaluation: accuracy, precision, recall, RMSE

Use cases: spam detection, price prediction, image classification

Unsupervised Learning

Works with unlabeled data

Goal: find hidden patterns or groupings

Common algorithms: K-means, PCA, DBSCAN

Evaluation: silhouette score, inertia, explained variance

Use cases: customer segmentation, anomaly detection, dimensionality reduction

Computer Vision

Processes images and videos

Tasks: classification, object detection, OCR, face detection

Azure services: Computer Vision, Custom Vision, Face API

Input: pixel data (arrays of numbers)

Output: labels, bounding boxes, recognized text

Natural Language Processing

Processes text or speech

Tasks: sentiment analysis, translation, summarization, entity extraction

Azure services: Language Service, Translator, Speech

Input: sequences of words or phonemes

Output: sentiment scores, translated text, key phrases

Watch Out for These

Mistake

Reinforcement learning is the same as supervised learning because both use labeled data.

Correct

Reinforcement learning does not use labeled data. The agent learns from rewards (scalar feedback) after each action, not from correct input-output pairs. The reward is not a label; it's a signal that guides the policy.

Mistake

Unsupervised learning cannot be used for anomaly detection because anomalies are rare and unknown.

Correct

Unsupervised anomaly detection is common. The model learns the 'normal' pattern from unlabeled data and flags deviations. Azure Anomaly Detector uses this approach.

Mistake

Computer vision and NLP are the same because both process text in images.

Correct

Computer vision processes images as pixel arrays; NLP processes text as sequences of words. Even if text appears in an image, extracting it requires OCR (computer vision), but understanding the meaning of that text is NLP.

Mistake

All Azure Cognitive Services require training custom models.

Correct

Many Cognitive Services are pre-trained and require no training data. For example, Azure Computer Vision can detect objects out-of-the-box. Only services like Custom Vision or Custom Text Classification require custom training.

Mistake

Conversational AI is just a chatbot that answers FAQs.

Correct

Conversational AI involves intent recognition, entity extraction, and dialogue management. Simple FAQ bots use QnA Maker, but true conversational AI uses LUIS or language models to understand and respond contextually.

Do You Actually Know This?

Reveal each answer, then mark whether you got it right. Score 60%+ to unlock the next chapter.

Frequently Asked Questions

What is the difference between supervised and unsupervised learning on the AI-900 exam?

Supervised learning requires labeled data (input-output pairs) and is used for prediction tasks like classification and regression. Unsupervised learning uses unlabeled data to discover patterns, such as clustering customers into segments. On the exam, if the scenario mentions 'historical data with known outcomes', it's supervised. If it says 'group similar items without labels', it's unsupervised.

Which Azure service should I use for object detection in images?

For pre-built object detection, use Azure Computer Vision (detects common objects). For custom object detection (e.g., detecting specific machine parts), use Azure Custom Vision. The exam distinguishes: if the scenario says 'pre-trained' or 'out-of-the-box', choose Computer Vision; if it says 'train on your own images', choose Custom Vision.

Is anomaly detection always unsupervised?

Not always, but on the AI-900 exam, anomaly detection is typically presented as unsupervised because labeled anomalies are rare. Azure Anomaly Detector uses unsupervised learning to model normal behavior. However, if labeled anomalies are available, you could use supervised classification. The exam focuses on unsupervised anomaly detection.

What is an example of reinforcement learning on Azure?

Azure Personalizer uses reinforcement learning (contextual bandits) to select the best action for a user based on feedback. For example, it can choose which article to show on a website to maximize clicks. It learns from reward signals (clicks) without requiring labeled data.

How do I know if a scenario requires conversational AI vs. NLP?

If the scenario involves a two-way dialogue with a user (e.g., a chatbot that asks clarifying questions), it's conversational AI. If it's a one-off analysis of text (e.g., extracting sentiment from a review), it's NLP. Conversational AI uses intent recognition and dialogue management, while NLP focuses on understanding the text content.

What is the role of QnA Maker in conversational AI?

QnA Maker (now part of Azure Cognitive Service for Language as custom question answering) provides a knowledge base of question-answer pairs. It is used for FAQ-style chatbots where the bot retrieves answers based on user questions. It does not handle complex dialogues or multi-turn conversations.

Can I use computer vision for text analysis?

Computer vision can extract text from images using OCR (Optical Character Recognition), but analyzing the meaning of that text (e.g., sentiment) is an NLP task. On the exam, if the scenario involves extracting text from a scanned document, it's computer vision. If it involves understanding the text, it's NLP.

Terms Worth Knowing

Ready to put this to the test?

You've just covered Types of AI Workloads — now see how well it sticks with free AI-900 practice questions. Full explanations included, no account needed.

Done with this chapter?