Courseiva
Knowledge + Practice
CertificationsVendorsCareer RoadmapsLabs & ToolsStudy GuidesGlossaryPractice Questions
C
Courseiva

Free IT certification practice questions with explained answers for CCNA, CompTIA, AWS, Azure, Google Cloud, and more.

Certification Practice Questions

CCNA practice questionsSecurity+ SY0-701 practice questionsAWS SAA-C03 practice questionsAZ-104 practice questionsAZ-900 practice questionsCLF-C02 practice questionsA+ Core 1 practice questionsGoogle Cloud ACE practice questionsCySA+ CS0-003 practice questionsNetwork+ N10-009 practice questions
View all certifications →

Product

CertificationsCertification PathsExam TopicsPractice TestsExam Dumps vs Practice TestsStudy HubComparisons

Company

AboutContactEditorial PolicyQuestion Writing PolicyTrust Center

Legal

Privacy PolicyTerms of Service

Courseiva is a free IT certification practice platform offering original exam-style practice questions, detailed explanations, topic-based practice, mock exams, readiness tracking, and study analytics for Cisco, CompTIA, Microsoft, AWS, and other technology certifications.

© 2026 Courseiva. Courseiva is operated by JTNetSolutions Ltd. All rights reserved.

Courseiva is an independent certification practice platform and is not affiliated with, endorsed by, or sponsored by Cisco, Microsoft, AWS, CompTIA, Google, ISC2, ISACA, or any other certification vendor. Vendor names and certification marks are used only to identify the exams learners are preparing for.

← Comparing and Contrasting Data Concepts practice sets

DA0-001 Comparing and Contrasting Data Concepts • Complete Question Bank

DA0-001 Comparing and Contrasting Data Concepts — All Questions With Answers

Complete DA0-001 Comparing and Contrasting Data Concepts question bank — all 0 questions with answers and detailed explanations.

115
Questions
Free
No signup
Certifications/DA0-001/Practice Test/Comparing and Contrasting Data Concepts/All Questions
Question 1mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail company stores customer purchase history in a relational database. The database contains a table 'transactions' with columns: transaction_id, customer_id, product_id, quantity, price, and transaction_date. A data analyst needs to create a report that shows total revenue per customer for the last quarter. Which data concept describes the relationship between customer_id and total revenue?

Question 2easymultiple choice
Read the full NAT/PAT explanation →

A healthcare database stores patient records. Each patient has a unique patient_id, and the database includes a table 'visits' with visit_id, patient_id, visit_date, and diagnosis_code. To ensure data integrity, which constraint should be applied to the patient_id column in the 'visits' table?

Question 3hardmultiple choice
Read the full NAT/PAT explanation →

A data engineer is designing a data warehouse for a multinational corporation. The company has sales data from different regions with varying currencies and date formats. To ensure consistency, which data concept should be applied to standardize the data before loading into the warehouse?

Question 4mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

An e-commerce company uses a star schema for its data warehouse. The fact table 'sales_fact' contains foreign keys to dimension tables: customer_dim, product_dim, time_dim, and store_dim. A business user wants to know the total sales for each product category in the last month. Which join operation is required to retrieve this data?

Question 5easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is working with a dataset containing customer information. The dataset includes a column 'full_name' which stores first and last names together. To perform analysis on first names separately, which data concept describes the process of splitting 'full_name' into 'first_name' and 'last_name'?

Question 6hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data scientist is building a machine learning model to predict customer churn. The dataset includes both numerical features (age, income) and categorical features (gender, marital status). Which data concept describes the process of converting categorical features into numerical values that can be used by the algorithm?

Question 7mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company's database has a table 'orders' with columns: order_id, customer_id, order_date, and total_amount. A data analyst needs to identify customers who have placed more than 5 orders in the past year. Which data concept should be used to group orders by customer and count them?

Question 8easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst receives a dataset with a column 'salary' that contains values like '45,000', '55,000', and '65,000'. The analyst notices that the values are stored as text. Which data concept should be applied to convert the salary column from text to numeric format for analysis?

Question 9mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are characteristics of structured data? (Choose TWO.)

Question 10hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are valid data quality dimensions? (Choose THREE.)

Question 11easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are examples of data transformation? (Choose TWO.)

Question 12hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A financial services company is migrating its customer data from a legacy on-premises relational database to a cloud-based data warehouse. The legacy database uses a denormalized schema with a single table 'customer_master' that contains all customer attributes, including repeated groups for multiple accounts per customer (account1_type, account1_balance, account2_type, account2_balance, etc.). The data warehouse team wants to implement a normalized star schema with separate dimension and fact tables. During the ETL process, the team encounters an error: 'Data truncation: string data right truncation' when loading account_type values into the dim_account table. The account_type column in dim_account is defined as VARCHAR(10), but the source data contains account types like 'SavingsPlus' (11 characters) and 'CheckingPremium' (15 characters). The team must resolve this issue without losing data. Which course of action should the team take?

Question 13mediummultiple choice
Read the full NAT/PAT explanation →

A healthcare organization maintains a database of patient records. The database has a table 'patients' with columns: patient_id (primary key), first_name, last_name, date_of_birth, gender, and last_visit_date. A data analyst is tasked with creating a report that lists all patients who have not visited in the last two years. The analyst writes a query: SELECT * FROM patients WHERE last_visit_date < DATEADD(year, -2, GETDATE()); However, the query returns zero rows, even though the analyst knows there are patients who have not visited for over two years. Upon inspection, the analyst discovers that the last_visit_date column contains NULL values for patients who have never visited. Which modification to the query should the analyst make to include patients with NULL last_visit_date?

Question 14easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst needs to ensure that a customer's address is stored in a consistent format across multiple databases. Which data quality dimension is the analyst primarily concerned with?

Question 15mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data engineer is designing a data warehouse for a retail company. The fact table must record each sale transaction, including product ID, store ID, date, and quantity sold. The product details (name, category, price) are stored in a separate table. This design is an example of which data modeling concept?

Question 16hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is troubleshooting a report that shows unusually high sales for a specific product. Upon investigation, the analyst finds that the product was returned by several customers, but the returns were recorded in a separate system and not reflected in the sales data. Which data integration concept was likely missing?

Question 17mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

When the analyst runs the query, it fails. What is the most likely reason?

Network Topology
+Refer to the exhibit.FROM ProductsWHERE TotalValue > 2000
Question 18hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is reviewing the error log from a nightly batch load. What is the most likely cause of the error?

Exhibit

Refer to the exhibit.

Error log from a data pipeline:

[2025-03-15 10:32:14] ERROR: Duplicate key value violates unique constraint 'order_pkey'
[2025-03-15 10:32:14] Detail: Key (order_id)=(12345) already exists.
[2025-03-15 10:32:15] WARNING: Batch load incomplete. 4999 of 5000 rows inserted.
Question 19easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is creating a report for a marketing campaign. The campaign data includes customer names, email addresses, and purchase history. Which of the following best describes the 'customer name' data type?

Question 20mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are considered structured data?

Question 21hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are examples of data governance best practices?

Question 22mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are characteristics of a relational database?

Question 23hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are valid methods for handling missing data?

Question 24hardmultiple choice
Read the full NAT/PAT explanation →

A mid-sized e-commerce company stores customer data in a relational database. The database has a table named 'Customers' with columns: CustomerID (primary key), FirstName, LastName, Email, Phone, Address, City, State, ZipCode, and SignUpDate. The company is migrating to a new CRM system that requires a denormalized structure for performance reasons. The new system expects a single table 'CustomerDetails' with columns: CustomerID, FullName (concatenation of first and last name), ContactInfo (JSON object containing email, phone, and address), SignUpDate, and Region (derived from state). The data analyst must design an ETL process to transform the data. During a test run, the analyst notices that some records have missing Phone or Address values. Which of the following is the best approach to handle missing data in the ContactInfo JSON object?

Question 25mediumdrag order
Read the full Comparing and Contrasting Data Concepts explanation →

Drag and drop the steps for the ETL (Extract, Transform, Load) process in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 26mediumdrag order
Read the full Comparing and Contrasting Data Concepts explanation →

Drag and drop the steps to create a data visualization dashboard in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 27mediummatching
Read the full Comparing and Contrasting Data Concepts explanation →

Match each data quality dimension to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Degree to which data correctly reflects real-world values

Extent to which all required data is present

Absence of contradictions across data sources

Data is up-to-date and available when needed

No duplicate records exist within the dataset

Question 28mediummatching
Read the full Comparing and Contrasting Data Concepts explanation →

Match each data visualization type to its best use case.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Compare quantities across categories

Show relationship between two numeric variables

Display distribution of a single continuous variable

Show magnitude of values across two dimensions

Summarize distribution and identify outliers

Question 29easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company needs to store raw, unprocessed data from IoT sensors for future machine learning experiments. The data is in various formats and schemas are not yet defined. Which storage solution is most appropriate?

Question 30mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A financial application requires fast query performance for aggregations on large historical datasets. The schema has many lookup tables. Which schema design is most efficient for this workload?

Question 31hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A database table has columns: OrderID (primary key), ProductID, CustomerID, CustomerName, OrderDate, ProductName. All products are purchased only by the customer who placed the order. Which normal form violation exists if CustomerName depends on CustomerID?

Question 32mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data engineer needs to store logs from web servers that have varying fields. The logs are in JSON format. Which data type describes this JSON data?

Question 33easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail company processes daily transactions. The current system transforms data before loading it into the data warehouse. The volume is growing rapidly, and they want to load raw data first to reduce processing time. Which approach should they adopt?

Question 34hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A table Orders has OrderID (primary key), CustomerID, and CustomerEmail. During analysis, it is found that CustomerID uniquely identifies CustomerEmail. Which normal form is violated if both CustomerID and CustomerEmail are stored in this table?

Question 35mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company requires real-time masking of credit card numbers for customer support agents while allowing full access for accountants. Which technique should be implemented?

Question 36easymultiple choice
Read the full NAT/PAT explanation →

A hospital's patient records system must process thousands of small transactions per second. Which type of database system is best suited for this workload?

Question 37mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data quality report shows that 95% of records have all required fields completed, but 20% of the completed fields contain values that are outside valid ranges. Which data quality dimension is most affected?

Question 38mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO roles are primarily responsible for defining and enforcing data governance policies within an organization?

Question 39hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are NoSQL database types?

Question 40mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

A data team must implement a data retention policy to reduce storage costs while meeting legal requirements. Which TWO actions best achieve this?

Question 41mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A data analyst is trying to understand access permissions for the company-data bucket. Which statement accurately describes the effective permissions?

Exhibit

{"Version":"2012-10-17","Statement":[{"Effect":"Allow","Principal":{"AWS":"arn:aws:iam::123456789012:role/DataAnalyst"},"Action":"s3:GetObject","Resource":"arn:aws:s3:::company-data/prod/*"},{"Effect":"Deny","Principal":"*","Action":"s3:GetObject","Resource":"arn:aws:s3:::company-data/prod/sensitive/*"}]}
Question 42hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A database administrator notices that queries filtering on both CustomerID and OrderDate are slow. Which single change would most likely improve performance for such queries?

Exhibit

CREATE TABLE Orders (
  OrderID INT PRIMARY KEY,
  CustomerID INT,
  OrderDate DATE,
  TotalAmount DECIMAL(10,2),
  INDEX idx_cust (CustomerID),
  INDEX idx_date (OrderDate)
);
Question 43easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A data pipeline is failing to parse this log entry. What is the most likely cause of the error?

Exhibit

2023-08-15 14:23:45, ERROR: Invalid JSON: {"user": "John", "age": 30 "country": "USA"}
Question 44easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail analyst needs to determine the most popular product category. The dataset includes columns: ProductID, Category, SalesDate, QuantitySold, UnitPrice. Which column contains qualitative data?

Question 45mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data scientist is building a model to predict customer churn. The company's internal CRM system provides customer demographics and transaction history. They also purchase demographic data from a third-party vendor. How should the purchased data be classified?

Question 46hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A logistics company is analyzing truck delivery times. Which variable is discrete?

Question 47easymultiple choice
Read the full NAT/PAT explanation →

A hospital wants to analyze patient readmission rates. The data contains daily patient visits. What is the level of granularity?

Question 48mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst finds that the "Age" column contains values like "N/A", "unknown", and negative numbers. Which data quality dimension is primarily affected?

Question 49hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

An e-commerce company stores customer support emails in a text database, product images in a blob store, and sales transactions in a SQL table. Which data store holds only structured data?

Question 50easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A market researcher conducts a survey with questions like "What is your favorite brand?" and "How many units do you purchase per year?" Which data types correspond?

Question 51mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

In a customer database, each row represents a customer with columns: CustomerID, Name, Address, Phone. What does the column "Name" represent?

Question 52hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data audit reveals that some numbers in the "Revenue" column were manually entered from PDF invoices. This introduces potential errors. Which data concept is being addressed?

Question 53mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO data types are considered quantitative? (Select two.)

Question 54hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE characteristics describe unstructured data? (Select three.)

Question 55easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO are examples of primary data? (Select two.)

Question 56mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. Which type of data is the field "region"?

Exhibit

{
  "report": "Analytics",
  "filters": [
    {"field": "transaction_date", "operator": ">=", "value": "2023-01-01"},
    {"field": "region", "operator": "=", "value": "West"}
  ],
  "metrics": ["revenue", "units_sold"]
}
Question 57hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. Based on the data profiling results, what is a likely data quality issue?

Exhibit

Column: Age
Null Count: 50
Unique Values: 23
Min: 0
Max: 150
Mean: 45.2
Median: 42
Question 58easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. Which data quality dimension is compromised by the missing value for Charlie's salary?

Exhibit

ID, Name, Salary, Department
1, Alice, 60000, Sales
2, Bob, 70000, IT
3, Charlie, , IT
Question 59easymultiple choice
Read the full NAT/PAT explanation →

A retail company stores customer transaction data in a relational database. They want to analyze purchasing patterns over time. Which type of data structure best supports this analysis?

Question 60mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst receives a dataset with inconsistent date formats (e.g., "01/02/2023", "2023-01-02", "Jan 2, 2023"). Which data quality dimension is most directly affected?

Question 61hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company is designing a data lake to store raw sensor data from IoT devices. The data arrives as JSON objects with varying schemas. Which storage approach is most appropriate?

Question 62easymultiple choice
Read the full NAT/PAT explanation →

A healthcare provider needs to integrate patient data from multiple clinics into a single data warehouse. Which process is used to extract, transform, and load the data?

Question 63mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is working with a dataset that contains customer names and addresses. Some records have missing state codes. Which data quality issue is this?

Question 64hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A financial institution wants to analyze transaction networks to detect fraud rings. Which database type is best suited for this analysis?

Question 65easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data architect is designing a schema for a product catalog where each product has a variable number of attributes. Which NoSQL database type is most appropriate?

Question 66mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data engineer is comparing data warehouses and data lakes. Which statement accurately describes a data warehouse?

Question 67hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

During an ETL process, a data quality check fails due to duplicate customer IDs. Which data quality dimension is violated?

Question 68mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are examples of semi-structured data?

Question 69hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE data quality dimensions are commonly assessed in a data profiling task?

Question 70easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are characteristics of a data lake?

Question 71mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A data analyst notices that direct S3 access to files outside the "incoming/" prefix is blocked. Which data governance principle does this policy enforce?

Exhibit

{
  "policy": {
    "effect": "Allow",
    "action": ["s3:GetObject"],
    "resource": "arn:aws:s3:::data-bucket/*",
    "condition": {
      "StringEquals": {"s3:prefix": "incoming/"}
    }
  }
}
Question 72hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A data analyst runs this query to identify high-value customers. However, the result does not include customers with exactly 5 orders. Which data concept does the HAVING clause illustrate?

Network Topology
+| cust_id | orders || 101 | 7 || 203 | 9 |
Question 73easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. An Avro schema is defined as shown. Which data design concept does this represent?

Exhibit

{
  "schema": {
    "type": "struct",
    "fields": [
      { "name": "id", "type": "int", "nullable": false },
      { "name": "name", "type": "string", "nullable": true },
      { "name": "email", "type": "string", "nullable": true }
    ]
  }
}
Question 74easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A marketing team needs to store customer feedback from social media posts, including text, images, and emojis. Which data concept is most appropriate for this storage?

Question 75easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

An organization wants to assign responsibility for data quality and metadata management. Which role is primarily accountable for defining data standards and ensuring data quality across a specific domain?

Question 76easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst notices that customer addresses in the database contain invalid ZIP codes. Which data quality dimension is being violated?

Question 77mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company is implementing a data lifecycle management policy. Which stage occurs immediately after data is created?

Question 78mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

To consolidate data from multiple operational databases into a central repository for reporting, a company decides to transform data before loading it into the target system. Which data integration approach is being used?

Question 79mediummultiple choice
Read the full NAT/PAT explanation →

A business needs to store large volumes of raw data in its native format for future analytics. Which storage architecture is most appropriate?

Question 80hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data modeler is designing a dimensional model for a sales analytics system. The fact table contains sales transactions, and the dimension tables include product, customer, and time. To reduce data redundancy, the modeler normalizes the dimension tables into multiple related tables. Which schema is being implemented?

Question 81hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

An organization has multiple systems that store customer information inconsistently. To create a single authoritative view of customer data, they implement a process that identifies and merges duplicate records. This is an example of which data management discipline?

Question 82hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data governance team is drafting a policy for handling personally identifiable information (PII). According to data governance best practices, which document should define the classification levels and handling procedures?

Question 83easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are considered internal data sources within an organization?

Question 84mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are common characteristics of unstructured data?

Question 85hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are primary benefits of implementing a data governance program?

Question 86easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. The data shown is an example of which data concept?

Exhibit

Refer to the exhibit.
Exhibit:
ID,Name,Age,Salary
1,John,32,50000
2,Jane,28,60000
3,Bob,45,55000
Question 87mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. Which data concept does this exhibit best represent?

Exhibit

Refer to the exhibit.
Exhibit:
{
  "type": "object",
  "properties": {
    "customerId": { "type": "integer" },
    "name": { "type": "string" },
    "orders": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "orderId": { "type": "integer" },
          "total": { "type": "number" }
        }
      }
    }
  }
}
Question 88hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. Which conclusion can be drawn from this data quality report?

Exhibit

Refer to the exhibit.
Exhibit:
Table: Customer_Master
Column: Email_Address
Completeness: 92%
Validity: 85%
Uniqueness: 97%
Consistency: 100%
Question 89easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A market research firm collects survey responses where customers rate satisfaction on a scale of 'Very Unsatisfied', 'Unsatisfied', 'Neutral', 'Satisfied', 'Very Satisfied'. What type of data is being collected?

Question 90mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst needs to compare sales data from the company's internal CRM with public demographic data from a government census. Which data concept best describes this scenario?

Question 91hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A sensor records temperature readings in Celsius and a separate sensor records wind speed in meters per second. A data scientist wants to combine these datasets for analysis. Which statement accurately compares these data types?

Question 92easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A company stores customer data in a relational database with tables for orders, products, and customers. Which type of data best describes this?

Question 93mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A researcher wants to study the effect of a new drug. She collects data directly from clinical trial participants. Later, she compares her findings with historical data from medical journals. Which contrast best describes her data sources?

Question 94hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst notices that a column labeled 'Income' contains values like '$50,000' and '$75,000', but also 'High' and 'Low'. What data concept issue is occurring?

Question 95easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Which of the following is an example of qualitative data?

Question 96mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data team is building a predictive model. They have data on 'Number of employees' (whole numbers) and 'Revenue' (currency). Which statement correctly compares these data types?

Question 97hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A dataset contains a column 'Education Level' with values: 'High School', 'Bachelor', 'Master', 'PhD'. An analyst computes the average by assigning numbers 1-4. Which data concept is being violated?

Question 98easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are characteristics of structured data? (Choose TWO.)

Question 99mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which TWO of the following are examples of quantitative data? (Choose TWO.)

Question 100hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

Which THREE of the following are properties of ratio data? (Choose THREE.)

Question 101mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

The exhibit shows a JSON schema for a dataset. Which statement correctly describes the data types represented?

Exhibit

Refer to the exhibit.

{
  "fields": [
    {"name": "customer_id", "type": "integer"},
    {"name": "age", "type": "integer"},
    {"name": "income", "type": "float"},
    {"name": "education", "type": "string"}
  ]
}
Question 102hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail company has merged with another firm and now needs to create a unified customer data warehouse. The existing systems use different data classification methods: System A stores customer income as a categorical range (e.g., '$0-$50k', '$50k-$100k', '$100k+') while System B stores exact income as a decimal number. A data analyst must combine these into a single table. The goal is to perform statistical analysis that includes calculating average income, but the categorical data from System A loses precision. The analyst proposes converting System B's exact values into the same ranges as System A to ensure consistency. However, the data governance team wants to preserve as much detail as possible. Which course of action should the analyst recommend?

Question 103mediummultiple choice
Read the full NAT/PAT explanation →

A healthcare analytics team is building a dashboard to monitor patient vitals. They receive data from two sources: Source 1 provides 'heart rate' as an integer (beats per minute), and Source 2 provides 'blood pressure' as a ratio (systolic/diastolic, e.g., 120/80). The team wants to create a combined metric called 'cardiac stress index' that uses both heart rate and systolic blood pressure. However, they notice that heart rate data occasionally contains negative values due to sensor errors. The data governance policy requires that all data be valid and meaningful. Which action best addresses the data quality issue while preserving the data types?

Question 104mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail company analyzes customer purchase data to improve inventory management. They store daily transaction records in a relational database and monthly aggregate reports in a data warehouse. Which difference between these storage methods best explains why the warehouse is more suitable for trend analysis?

Question 105hardmulti select
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is evaluating data quality issues in a customer database. Which TWO actions are best practices for ensuring data consistency?

Question 106easymulti select
Read the full Comparing and Contrasting Data Concepts explanation →

An organization is implementing a data lake to store raw data from various sources. Which THREE characteristics are typically associated with a data lake compared to a data warehouse?

Question 107mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A manufacturing company has two primary data systems: an ERP system that stores production orders with fields like OrderID, ProductID, Quantity, and ProductionDate, and a CRM system that stores customer sales with fields like SaleID, CustomerID, ProductID, SaleDate, and Amount. The data analyst needs to create a unified view of product performance by joining these tables. However, the ProductID field in the ERP uses a 5-character alphanumeric code (e.g., 'P1234'), while the CRM uses a 6-character code (e.g., 'PR1234'). Additionally, some products have multiple entries due to slight variations in naming. The analyst wants to ensure accurate matching without losing data. Which action should the analyst take first to address the data inconsistency?

Question 108hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A large financial institution is implementing a data governance framework to comply with new regulations requiring strict control over sensitive customer data. The data governance committee has identified several domains, including customer master data, transaction data, and risk assessment data. They need to decide on a master data management (MDM) approach that ensures a single, authoritative source of customer information across all systems. However, the current environment has multiple legacy systems with conflicting customer records. The committee is concerned about downtime and business disruption during the transition. Which MDM approach best balances data consistency with minimal operational impact?

Question 109easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst at a marketing agency is working with a dataset containing customer demographics, purchase history, and social media engagement metrics. The agency wants to perform sentiment analysis on unstructured social media comments to identify brand perception. The dataset also includes structured fields like age, income, and purchase amounts. The analyst needs to choose a storage and processing platform that can handle both structured and unstructured data efficiently without requiring extensive schema definition upfront. Which platform should the analyst recommend?

Question 110mediummultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A telecommunications company is experiencing issues with its customer satisfaction survey data. The data is collected from multiple channels: phone, email, and web forms. Each channel uses a different scale for ratings: phone uses 1-10, email uses 1-5, and web uses 1-7. Additionally, some survey responses contain missing values for demographic fields. The data analyst needs to calculate an overall satisfaction score that is comparable across all channels. The company's leadership wants a single metric that minimizes distortion from the different scales. Which approach should the analyst use to standardize the ratings?

Question 111hardmultiple choice
Read the full NAT/PAT explanation →

A healthcare organization is subject to strict data privacy regulations requiring the classification of all data assets. The data governance team has identified three data sensitivity levels: Public, Internal, and Restricted. They have a new data pipeline importing patient health records from multiple clinics. The records include patient names, diagnoses, treatment codes, and insurance information. The team must ensure that the classification is applied correctly and that restricted data (e.g., diagnoses) is not exposed to unauthorized personnel. However, the pipeline uses automated tagging based on metadata rules, and some fields are misclassified. What is the most effective immediate action to improve classification accuracy?

Question 112easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

An e-commerce company wants to provide real-time personalized product recommendations based on customer browsing behavior. Currently, they have a traditional data warehouse that processes batch updates every night. The marketing team complains that recommendations are outdated within hours because customers see yesterday's data. The data engineer needs to modify the architecture to support near-real-time analytics. The budget is limited, and the existing warehouse infrastructure must be reused as much as possible. Which architectural change would best meet the requirement?

Question 113mediummulti select
Read the full Comparing and Contrasting Data Concepts explanation →

A data analyst is comparing characteristics of structured and unstructured data. Which TWO of the following are characteristics of structured data? (Choose two.)

Question 114hardmultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

Refer to the exhibit. A data architect is designing a data dictionary for a relational database. Based on the exhibit, which data concept is being illustrated?

Exhibit

{
  "employee": {
    "emp_id": { "type": "integer", "nullable": false, "primary_key": true },
    "name": { "type": "string", "nullable": false },
    "email": { "type": "string", "nullable": true, "unique": true },
    "department": { "type": "string", "nullable": true }
  }
}
Question 115easymultiple choice
Read the full Comparing and Contrasting Data Concepts explanation →

A retail company is merging customer data from three separate systems: an e-commerce platform, a point-of-sale (POS) system, and a loyalty program. The e-commerce platform stores customer names in "FirstName LastName" format, the POS system stores names as "LastName, FirstName", and the loyalty program stores names in separate "first_name" and "last_name" fields. The data analyst needs to create a unified customer master table. After initial merging, there are 20% more records than expected, including duplicates with slight name variations (e.g., "John Smith" vs "John A. Smith"). To ensure accurate consolidation, which data concept should the analyst prioritize applying first?

Practice tests

Scored 10-question sessions with instant feedback and explanations.

DA0-001 Practice Test 1 — 10 Questions→DA0-001 Practice Test 2 — 10 Questions→DA0-001 Practice Test 3 — 10 Questions→DA0-001 Practice Test 4 — 10 Questions→DA0-001 Practice Test 5 — 10 Questions→DA0-001 Practice Exam 1 — 20 Questions→DA0-001 Practice Exam 2 — 20 Questions→DA0-001 Practice Exam 3 — 20 Questions→DA0-001 Practice Exam 4 — 20 Questions→Free DA0-001 Practice Test 1 — 30 Questions→Free DA0-001 Practice Test 2 — 30 Questions→Free DA0-001 Practice Test 3 — 30 Questions→DA0-001 Practice Questions 1 — 50 Questions→DA0-001 Practice Questions 2 — 50 Questions→DA0-001 Exam Simulation 1 — 100 Questions→

Practice by domain

Each domain maps to a weighted exam section. Focus on the domain where you are weakest.

Comparing and Contrasting Data ConceptsMining and Acquiring DataAnalyzing and Modeling DataVisualizing DataCommunicating Data Insights

Practice by scenario

Filter questions by type — troubleshooting, exhibit, drag-and-drop, PBQ, ACLs, OSPF, and more.

Browse scenarios→

Continue studying

All Comparing and Contrasting Data Concepts setsAll Comparing and Contrasting Data Concepts questionsDA0-001 Practice Hub