AI-900Chapter 89 of 100Objective 1.1

Azure AI Services Overview

This chapter provides a comprehensive overview of Azure AI Services, a portfolio of pre-built, customizable AI models and APIs that enable developers to integrate cognitive capabilities into applications without machine learning expertise. For the AI-900 exam, this topic is foundational, appearing in roughly 25–30% of questions across domains 1, 2, and 3. You will need to understand the purpose, capabilities, and appropriate use cases for each service, as well as how to provision and consume them. This chapter covers exactly what the exam tests, including service categories, key features, pricing tiers, and security considerations.

25 min read

Intermediate

Updated May 31, 2026

Reviewed by Johnson Ajibi· Senior Network & Security Engineer · MSc IT Security

Jump to a section

Explain it to me simply Where people get tripped up Test what I know Look up key terms

Azure AI Services as a Specialized Toolbox

Imagine you are a builder with a large toolbox. Inside, you have many specialized power tools: a hammer drill for concrete, a circular saw for lumber, a jigsaw for curves, a sander for finishing, and a nail gun for fastening. Each tool is purpose-built for a specific task, has its own power requirements (battery or cord), and comes with a manual. You don't need to know how to forge steel or wind copper wire; you just pick the right tool, read the instructions, and use it safely. Azure AI Services work the same way. Instead of building machine learning models from scratch, you access pre-built APIs and services for vision, speech, language, decision, and search. Each service is a specialized 'tool' with its own endpoint, authentication key, and pricing tier. You send it input data (like an image or text), and it returns a result (like detected objects or sentiment). The underlying AI is already trained and optimized by Microsoft; you just need to configure the service correctly and call it via REST or SDK. For example, Computer Vision is like the camera tool – you point it at an image and it tells you what it sees. Language Understanding (LUIS) is like a translator that extracts intent from sentences. You don't build the neural network; you use the tool. This analogy captures the essence: Azure AI Services provide ready-to-use AI capabilities without requiring deep data science expertise, mirroring how a builder leverages specialized tools rather than inventing new ones.

How It Actually Works

What Are Azure AI Services?

Azure AI Services are a collection of cloud-based APIs and SDKs that provide intelligent capabilities such as vision, speech, language, decision, and search. They are part of Microsoft's Azure AI platform, which also includes Azure Machine Learning (for custom model building) and Azure Cognitive Search (for AI-powered search). The key differentiator of AI Services is that they are pre-trained – Microsoft has trained large-scale models on vast datasets, so you don't need to provide training data or manage model training infrastructure. You simply call an endpoint with input data and receive a prediction.

Why They Exist

Building AI from scratch requires significant expertise in data science, deep learning, and infrastructure management. Many applications need common AI capabilities – like detecting objects in images, understanding sentiment in text, or transcribing speech – but building custom models for these tasks is expensive and time-consuming. Azure AI Services solve this by offering generalized models that work well out-of-the-box for many scenarios, with options to customize for specific domains. The exam expects you to recognize when to use a pre-built service versus when to build a custom model with Azure Machine Learning.

How They Work Internally

Each Azure AI Service is deployed as a RESTful API endpoint, accessible via HTTP/HTTPS. You authenticate using either a subscription key (passed in the Ocp-Apim-Subscription-Key header) or Azure Active Directory (AAD) token. The service receives input (e.g., image bytes, text string, audio stream) and processes it through a deep neural network that was trained on massive datasets. For example, Computer Vision uses convolutional neural networks (CNNs) trained on ImageNet and other datasets; Text Analytics uses transformer models like BERT. The output is a JSON response containing predictions, confidence scores, and metadata. Services can be provisioned in two ways: multi-service resource (one endpoint for multiple services) or single-service resource (dedicated endpoint per service). The exam tests that you know the difference: multi-service uses a combined key and endpoint, while single-service provides isolation and separate billing.

Key Components and Defaults

Endpoint: The URL for API calls. Format: https://<region>.api.cognitive.microsoft.com/ for some services, or https://<name>.cognitiveservices.azure.com/ for others.

Subscription Key: Two keys are provided per resource (Key1 and Key2) to allow key rotation without downtime.

Region: Services are deployed in specific Azure regions (e.g., West US, West Europe). The endpoint includes the region.

Pricing Tier: Choose from Free (F0) – limited calls per month (e.g., 20 calls per minute, 5K calls per month) – and Standard (S0) – pay-as-you-go with higher limits. Some services have additional tiers like S1, S2.

Rate Limits: Defined per tier. For example, Computer Vision Free tier: 20 transactions per minute. Standard: 10–30 per second depending on feature.

Data Residency: Some services offer regional processing; for example, Text Analytics in Europe processes data within Europe.

Configuration and Verification

To provision a service: 1. In Azure Portal, search for "Cognitive Services" and click "Create". 2. Choose the service (e.g., Computer Vision). 3. Select subscription, resource group, region, and pricing tier. 4. After deployment, go to "Keys and Endpoint" to get the endpoint URL and keys.

To verify connectivity, use curl:

curl -H "Ocp-Apim-Subscription-Key: <key>" "<endpoint>/vision/v3.2/analyze?visualFeatures=Categories,Description&language=en" -H "Content-Type: application/json" --data-binary "{\"url\":\"https://example.com/image.jpg\"}"

Alternatively, use the Azure AI Services SDK for .NET, Python, Java, etc.

Interaction with Related Technologies

Azure Bot Service: Uses Language Understanding (LUIS) and QnA Maker to create conversational bots.

Azure Cognitive Search: Can integrate with Computer Vision and Text Analytics to enrich index content.

Azure Machine Learning: For custom models, you may use AI Services as a baseline or to preprocess data.

Azure Functions / Logic Apps: Trigger AI Services calls in serverless workflows.

Azure Security: Use Azure Private Link to access services over a private endpoint, and Azure Key Vault to store keys.

Categories of AI Services

The exam expects you to know the main service categories and their flagship services:

Vision:

Computer Vision: Analyze images for objects, faces, text (OCR), landmarks.

Custom Vision: Train custom image classifiers/object detectors with your own images.

Face: Detect, identify, and analyze human faces.

Form Recognizer: Extract text, key-value pairs, and tables from documents.

Speech:

Speech-to-Text: Transcribe audio to text.

Text-to-Speech: Convert text to natural-sounding speech.

Speech Translation: Real-time translation of speech.

Speaker Recognition: Identify who is speaking.

Language:

Text Analytics: Sentiment, key phrases, named entities, language detection.

Language Understanding (LUIS): Extract intent and entities from natural language.

QnA Maker: Create a question-and-answer bot from FAQ or documents.

Translator: Text translation between languages.

Decision:

Anomaly Detector: Detect anomalies in time-series data.

Content Moderator: Moderate text, images, and videos for offensive content.

Personalizer: Provide personalized recommendations (reinforcement learning).

Search:

Azure Cognitive Search: AI-powered search with indexing and querying over your own data.

Exam-Relevant Details

Pricing: Free tier (F0) is limited but good for testing. Standard (S0) is production. Some services have additional tiers (e.g., Computer Vision S1, S2).

Rate Limits: Free tier typically 20 calls per minute. Standard varies; e.g., Computer Vision S0: 10–30 calls per second.

Authentication: Key-based (Ocp-Apim-Subscription-Key header) or AAD token. Multi-service resource uses a single key for all included services.

Customization: Custom Vision and LUIS allow training custom models. Form Recognizer has custom models for specific document types.

Containers: Some services can be deployed as Docker containers for on-premises or edge scenarios (e.g., Computer Vision, Text Analytics). The exam may test that containers enable offline use.

Common Exam Traps

Confusing Azure Machine Learning with AI Services: Azure ML is for custom model training; AI Services are pre-built APIs.

Assuming all services are available in all regions: Some services are region-restricted; check documentation.

Mixing up LUIS and QnA Maker: LUIS extracts intent from free-form text; QnA Maker provides direct answers from a knowledge base.

Overlooking the Free tier limits: Many questions test that Free tier has strict rate and volume limits.

Security and Compliance

Data is encrypted in transit (HTTPS) and at rest (Azure Storage encryption).

Some services offer data residency – e.g., Text Analytics can process data within a specific geography.

Use Azure Private Link to keep traffic within Microsoft's network.

Customer-managed keys (CMK) are available for some services to control encryption keys.

Responsible AI: Microsoft provides guidelines for fairness, privacy, and transparency. The exam may ask about the Responsible AI principles.

Walk-Through

Identify the AI Requirement

First, determine what cognitive capability your application needs. For example, do you need to detect objects in images (Computer Vision), understand spoken commands (Speech-to-Text and LUIS), or moderate user-generated content (Content Moderator)? The exam expects you to map business problems to the correct service. Consider whether the task is general (pre-built works) or domain-specific (custom training needed). This step is crucial because choosing the wrong service wastes time and money.

Provision the Resource

In the Azure Portal, create a Cognitive Services resource. You can create a multi-service resource (one endpoint for multiple services) or a single-service resource (dedicated). Select the region closest to your users to minimize latency. Choose a pricing tier: Free (F0) for testing with limited calls, Standard (S0) for production. After creation, note the endpoint URL and one of the two subscription keys. The exam may ask about the difference between multi-service and single-service resources.

Authenticate and Call the API

Authentication is done via HTTP headers. For key-based auth, include `Ocp-Apim-Subscription-Key: <your-key>` in every request. For AAD, obtain a token and pass it in the `Authorization: Bearer <token>` header. The API endpoint URL includes the service name and version (e.g., `/vision/v3.2/analyze`). The request body depends on the service: for Computer Vision, you send an image URL or binary data; for Text Analytics, you send a JSON array of documents. The response is always JSON.

Process the Response

The API returns a JSON object containing predictions, confidence scores, and metadata. For example, Computer Vision returns `categories`, `description`, `objects`, and `tags`. Text Analytics returns `sentiment`, `keyPhrases`, and `entities`. Your application must parse this JSON and extract the relevant information. Handle errors like rate limiting (HTTP 429) or invalid input (HTTP 400). The exam may test that you understand the structure of common responses.

Scale and Secure the Solution

For production, monitor usage and scale by upgrading to a higher tier or using multiple resources. Implement security best practices: rotate keys regularly, use Azure Key Vault to store secrets, restrict network access with firewalls or Private Link, and enable logging and monitoring. Consider using containers for offline or edge scenarios. The exam may ask about scaling options and security features like private endpoints.

What This Looks Like on the Job

Enterprise Scenario 1: Customer Service Chatbot

A large retail company wants to build a multilingual customer service chatbot that can handle order inquiries, returns, and product recommendations. They use a combination of Azure AI Services: - Language Understanding (LUIS) to extract user intents (e.g., 'check order status', 'return item') and entities (e.g., order number, product name). - QnA Maker to answer frequently asked questions from a knowledge base built from their FAQ page. - Translator to support 60+ languages. - Text-to-Speech to provide voice responses for phone integration.

In production, they deploy a multi-service Cognitive Services resource in West Europe to be close to their customers. They use Azure Bot Service to orchestrate the conversation. The Free tier is used for development; Standard tier handles millions of requests per month. They implement rate limiting and use Azure Monitor to track errors and latency. A common mistake is assuming LUIS can answer questions directly (it only extracts intent), leading to poor user experience. They also use Content Moderator to filter inappropriate user messages.

Enterprise Scenario 2: Document Processing Pipeline

A financial services firm needs to automate extraction of data from invoices and contracts. They use: - Form Recognizer with a custom model trained on their invoice templates. - Computer Vision for OCR on scanned documents. - Text Analytics to extract key entities like dates, amounts, and vendor names.

They deploy Form Recognizer as a container in their on-premises data center to comply with data residency regulations. The container runs on Azure Stack Edge. They process thousands of documents daily. A challenge is handling varied layouts; they train multiple custom models for different document types. They use Azure Logic Apps to trigger the pipeline when a new document arrives in Blob Storage. The exam may test that containers allow offline processing and data sovereignty.

Enterprise Scenario 3: Real-time Content Moderation

A social media platform uses Content Moderator to automatically flag offensive images, text, and videos. They integrate the API into their upload pipeline. For images, they use the Image Moderation API to detect adult or racy content. For text, they use Text Moderation to detect profanity, personally identifiable information (PII), and unwanted language. They also use Custom Vision to detect brand logos or copyrighted material. In production, they use the Standard tier with a high rate limit. They set up a human review loop using the Review API for borderline cases. A common pitfall is relying solely on automated moderation; false positives/negatives require human oversight. They also use Anomaly Detector to monitor for unusual spikes in flagged content.

How AI-900 Actually Tests This

What AI-900 Tests on This Topic (Objective 1.1 – Identify Features of AI Services)

The AI-900 exam focuses on your ability to match business problems to the correct Azure AI Service. You will not be asked to write code or interpret complex JSON. Instead, you must know:

The purpose of each service (e.g., Computer Vision for image analysis, Text Analytics for sentiment).

The category (Vision, Speech, Language, Decision, Search).

When to use a pre-built API vs. Azure Machine Learning.

Pricing tiers (Free vs. Standard) and their limitations.

Authentication methods (key vs. AAD).

Multi-service vs. single-service resources.

Container support for offline/edge scenarios.

Responsible AI principles (fairness, privacy, transparency).

Common Wrong Answers and Why Candidates Choose Them

Confusing LUIS with QnA Maker: Candidates see "understand natural language" and pick LUIS, but the question asks for answering FAQs – that's QnA Maker. LUIS extracts intent, not answers.

Choosing Azure Machine Learning for pre-built tasks: When the question says "no training data needed", the answer is AI Services, not Azure ML. Candidates often overcomplicate.

Assuming all services are available globally: Some services have regional restrictions (e.g., Text Analytics for Healthcare in certain regions). The exam may test that you check regional availability.

Picking the wrong tier: A question about a prototype with low usage – the answer is Free tier (F0). Candidates pick Standard because they think Free is too limited, but the exam tests that Free is for testing.

Forgetting authentication: Questions about security often have answers like "use Azure AD" or "use keys stored in Key Vault". Candidates might choose "use a single key in code" which is insecure.

Numbers and Terms That Appear Verbatim

Free tier: 20 calls per minute, 5,000 calls per month (typical).

Standard tier: pay-as-you-go, higher limits.

Multi-service resource: one key for multiple services (e.g., Vision, Speech, Language).

Single-service resource: dedicated key per service.

Container support: Computer Vision, Text Analytics, Form Recognizer, LUIS, etc.

Authentication: Ocp-Apim-Subscription-Key header or Authorization: Bearer <token>.

Endpoint format: https://<region>.api.cognitive.microsoft.com/ or https://<name>.cognitiveservices.azure.com/.

Responsible AI: fairness, reliability, privacy, transparency, accountability.

Edge Cases and Exceptions

Custom Vision: You can train with your own images, but the base model is pre-trained. This blurs the line between AI Services and Azure ML – the exam expects you to know Custom Vision is an AI Service.

Form Recognizer: Has pre-built models (e.g., for invoices) and custom models. The exam may ask which to use for a specific document type.

Personalizer: Uses reinforcement learning, not supervised learning. It learns from feedback in real-time.

Anomaly Detector: Works best with time-series data; batch vs. streaming modes.

Speech Services: Custom Speech allows training on domain-specific vocabulary.

How to Eliminate Wrong Answers

If the scenario says "no custom training", eliminate Azure ML and Custom Vision.

If the scenario involves "conversational understanding", consider LUIS; for "FAQ", consider QnA Maker.

If the scenario requires "offline processing", look for container support.

If the scenario mentions "cost", Free tier is for development, Standard for production.

If the scenario discusses "security", think about key rotation, Key Vault, Private Link.

Key Takeaways

Azure AI Services provide pre-built AI capabilities via REST APIs; no ML expertise needed.

Main categories: Vision, Speech, Language, Decision, Search.

Free tier (F0) is for testing; Standard (S0) is for production.

Multi-service resource uses one key for multiple services; single-service uses dedicated keys.

Authentication uses subscription key (Ocp-Apim-Subscription-Key) or Azure AD token.

Some services support containers for offline/edge scenarios.

Responsible AI principles: fairness, reliability, privacy, transparency, accountability.

Custom Vision and LUIS allow custom training within the AI Service framework.

Easy to Mix Up

These come up on the exam all the time. Here's how to tell them apart.

Azure AI Services

Pre-built models, no training required.

Accessed via REST API or SDK.

Use for common AI tasks like vision, speech, language.

Pricing based on API calls (per transaction).

Limited customization (except Custom Vision, LUIS).

Azure Machine Learning

Build and train custom models using your own data.

Requires ML expertise and data preparation.

Use for unique or domain-specific AI tasks.

Pricing based on compute resources (training and inference).

Full control over model architecture and training.

Watch Out for These

Mistake

Azure AI Services require machine learning expertise to use.

Correct

No, they are designed to be used by developers with no ML background. You only need to know how to call REST APIs or use SDKs. The models are pre-trained by Microsoft.

Mistake

All Azure AI Services are available in every Azure region.

Correct

Not all services are available in all regions. For example, some preview services may be limited to specific regions. Always check regional availability documentation.

Mistake

The Free tier (F0) can be used in production.

Correct

The Free tier has strict rate limits (e.g., 20 calls per minute) and volume caps (e.g., 5,000 calls per month). It is intended for testing and development only. Use Standard tier for production.

Mistake

You can train custom models using Azure Machine Learning and then use them via AI Services endpoints.

Correct

Custom models trained in Azure ML are deployed as web services, not as AI Services endpoints. AI Services are pre-built; Custom Vision is an exception that allows training but within the service itself.

Mistake

Multi-service resource provides separate keys for each service.

Correct

A multi-service resource provides a single key and endpoint that works for all included services. Single-service resources provide dedicated keys per service.

Do You Actually Know This?

Reveal each answer, then mark whether you got it right. Score 60%+ to unlock the next chapter.

Frequently Asked Questions

What is the difference between Azure AI Services and Azure Machine Learning?

Azure AI Services are pre-built APIs that you can call directly without training models. Azure Machine Learning is a platform for building, training, and deploying custom machine learning models. Use AI Services for common tasks like image recognition or sentiment analysis; use Azure ML when you need a model trained on your own data for a specific problem.

Which Azure AI Service should I use to extract text from images?

Use Computer Vision's OCR capability (Read API) or Form Recognizer for structured documents. Computer Vision works well for general images; Form Recognizer is optimized for forms and invoices.

Can I use Azure AI Services offline?

Yes, several services offer Docker containers that can run on-premises or at the edge. Examples include Computer Vision, Text Analytics, Form Recognizer, and LUIS. Containers allow processing without internet connectivity.

How do I secure my Azure AI Services keys?

Store keys in Azure Key Vault and retrieve them at runtime. Rotate keys regularly. Use Azure AD authentication instead of keys when possible. Restrict network access using firewalls or Azure Private Link.

What is the pricing model for Azure AI Services?

Most services have a Free tier (F0) with limited calls per month (e.g., 5,000) and rate limits (e.g., 20 per minute). Standard tier (S0) is pay-as-you-go with higher limits. Some services have additional tiers. You are billed per transaction (API call).

What is the difference between LUIS and QnA Maker?

LUIS (Language Understanding) extracts intents and entities from natural language. QnA Maker provides direct answers from a knowledge base. Use LUIS when you need to understand what the user wants to do; use QnA Maker when you need to answer factual questions.

How do I choose between multi-service and single-service Cognitive Services resource?

Use a multi-service resource to manage multiple services with one key and endpoint, simplifying management. Use single-service resources for isolation, separate billing, or when you need different regions or tiers per service.

Terms Worth Knowing

Artificial intelligence Machine learning Responsible AI

Ready to put this to the test?

You've just covered Azure AI Services Overview — now see how well it sticks with free AI-900 practice questions. Full explanations included, no account needed.

Try AI-900 practice questions Back to all chapters

Done with this chapter?

Azure AI Search for RAG Applications

Cognitive Services Pricing and Tiers

See the full AI-900 study guide