A retail company stores three types of customer data: (1) a table with columns for CustomerID, Name, and Email; (2) product reviews as JSON documents with varying fields such as rating and comment; (3) product demonstration videos stored in MP4 format. Which of the following correctly classifies these data types in order from first to third?
Correct. The table is structured (fixed schema), JSON documents are semi-structured (flexible schema), and videos are unstructured (no schema).
Why this answer
Option C is correct because the customer table with fixed columns (CustomerID, Name, Email) is structured data, product reviews as JSON documents with varying fields are semi-structured data (schema-on-read, flexible fields), and MP4 video files are unstructured data (no schema, binary format). This ordering matches the standard classification in Azure Data Fundamentals: structured → semi-structured → unstructured.
Exam trap
The trap here is that candidates often confuse semi-structured data (like JSON) with unstructured data, or assume all non-tabular data is unstructured, when in fact JSON's key-value pairs with varying fields make it semi-structured.
How to eliminate wrong answers
Option A is wrong because it incorrectly places unstructured before semi-structured; product reviews as JSON are semi-structured, not unstructured, and MP4 videos are unstructured, not semi-structured. Option B is wrong because it starts with semi-structured for the customer table, which is clearly structured with a fixed schema; it also misorders the remaining types. Option D is wrong because it begins with unstructured for the customer table, which is structured, and then places semi-structured before structured, reversing the correct order.