What does the Azure AI Vision 'Image Analysis' capability return when analyzing an image?
Image Analysis returns natural language descriptions, detected objects, tags, categories, and other semantic information about what's in the image.
Why this answer
Azure AI Vision's Image Analysis capability uses pre-trained deep learning models to extract rich semantic information from images, including human-readable descriptions, a list of detected objects with bounding boxes, and a set of relevant tags. This goes far beyond basic metadata, making option B correct because it accurately captures the breadth of semantic outputs the service provides.
Exam trap
The trap here is that candidates confuse basic image metadata (file size, dimensions) with the semantic analysis outputs of Azure AI Vision, leading them to choose option A, or they assume the service only returns a single label (option C) because they think of simpler classification models rather than the multi-output analysis capability.
How to eliminate wrong answers
Option A is wrong because Image Analysis does not return file size or dimensions; those are basic metadata properties handled by storage services, not the computer vision API. Option C is wrong because the service returns multiple category labels, tags, and descriptions, not just a single category label for the entire image. Option D is wrong because Azure AI Vision does not generate 3D point clouds; that capability is associated with depth-sensing cameras or specialized 3D reconstruction services, not the 2D image analysis API.