DA0-001 · topic practice

Mining Data practice questions

Practise CompTIA Data+ DA0-001 Mining Data practice questions — original exam-style scenarios with answer choices, explanations, and analysis of common mistakes.

Courseiva uses original exam-style practice questions designed for learning and revision. The goal is to understand the concepts, recognise exam patterns, and improve through explanations — not memorise copied exam dumps.

Reviewed byJohnson Ajibi· MSc IT Security
20 questionsDomain: Mining Data

What the exam tests

What to know about Mining Data

Mining Data questions test whether you can apply the concept in context, not just recognise a definition.

How the topic appears in realistic exam-style scenarios.

Which detail in the question changes the correct answer.

How to eliminate plausible but wrong options.

How to connect the question back to the wider exam objective.

Watch out for

Common Mining Data exam traps

  • Answering from memory before reading the full scenario.
  • Missing a constraint such as cost, availability, security, scope or command context.
  • Choosing a broad answer when the question asks for the most specific fix.
  • Ignoring why the wrong options are tempting.

Practice set

Mining Data questions

20 questions · select your answer, then reveal the explanation

Question 1easymultiple choice
Read the full Mining Data explanation →

A data analyst needs to retrieve all unique job titles from an employees table. Which SQL keyword should be used in the SELECT clause?

Question 2mediummultiple choice
Read the full Mining Data explanation →

A data analyst is profiling a new dataset containing customer information. When assessing data quality, which metric would be most appropriate to determine if the 'email' column contains valid email addresses?

Question 3hardmultiple choice
Read the full Mining Data explanation →

A data analyst is working with a sales table that contains columns: sale_id, product_id, sale_date, and amount. They need to calculate a 7-day moving average of sales amount for each product, ordered by sale_date. Which window function syntax should they use?

Question 4mediummultiple choice
Read the full Mining Data explanation →

A data analyst needs to extract the year from a column named 'order_date' in a SQL database. The database supports standard SQL functions. Which function should they use?

Question 5easymultiple choice
Read the full Mining Data explanation →

A data analyst wants to retrieve the top 5 highest-paid employees from a table named 'employees' that has columns 'employee_id', 'salary', and 'name'. Which SQL query should they use?

Question 6mediummultiple choice
Read the full Mining Data explanation →

A data analyst is performing exploratory data analysis on a dataset containing house prices. They want to identify outliers in the 'price' column using the IQR method. The first quartile (Q1) is $200,000, the third quartile (Q3) is $350,000, and the IQR is $150,000. What is the upper bound for identifying outliers?

Question 7hardmultiple choice
Read the full Mining Data explanation →

A data analyst needs to perform stratified sampling on a customer database to ensure proportional representation across three regions: North (40%), South (30%), and West (30%). The total sample size required is 1,000. How many customers should be sampled from the North region?

Question 8mediummultiple choice
Read the full Mining Data explanation →

A data analyst uses a CTE to find employees who earn more than the average salary in their department. Which SQL clause is used to define the CTE?

Question 9mediummultiple choice
Study the full Python automation breakdown →

A data analyst is using pandas in Python to merge two DataFrames: sales (columns: sale_id, product_id, amount) and products (columns: product_id, product_name). Which pandas function should they use to combine these DataFrames on the 'product_id' column?

Question 10easymultiple choice
Read the full Mining Data explanation →

In SQL, which string function would you use to remove leading and trailing spaces from a column named 'city'?

Question 11hardmultiple choice
Read the full Mining Data explanation →

A data analyst is writing a query to rank products by total sales amount within each category. They want ties to have the same rank and no gaps in the ranking sequence. Which window function should they use?

Question 12mediummultiple choice
Read the full Mining Data explanation →

A data analyst needs to count the number of orders placed by each customer, but only for customers who have placed more than 5 orders. Which SQL clause should be used to filter the aggregated results?

Question 13mediummulti select
Read the full Mining Data explanation →

A data analyst is performing data profiling on a customer table. Which TWO of the following are key metrics to assess data quality? (Select TWO.)

A data analyst is using a recursive CTE to traverse a hierarchical organizational chart. Which THREE components are required to define a recursive CTE? (Select THREE.)

Question 15mediummulti select
Read the full Mining Data explanation →

A data analyst wants to sample a large dataset of customer transactions. Which TWO sampling methods are probability-based and ensure every element has a known chance of being selected? (Select TWO.)

Question 16easymultiple choice
Read the full Mining Data explanation →

In a dataset of customer orders, you need to count the number of distinct customers who have placed orders. Which SQL aggregate function should you use?

You are using pandas in Python to clean a dataset. You notice several rows with missing values in the 'age' column. Which method would you use to remove those rows?

Question 18mediummultiple choice
Read the full Mining Data explanation →

A data analyst wants to retrieve the top 5 highest-paid employees from an 'employees' table, including ties. Which SQL clause should be used?

Question 19mediummultiple choice
Read the full Mining Data explanation →

You are analyzing sales data and need to calculate the moving average of monthly sales over the previous 3 months for each month. Which type of function is best suited for this task?

Question 20mediummultiple choice
Read the full Mining Data explanation →

A data quality assessment reveals that a column named 'email' contains values like 'user@example' (missing domain extension). Which data profiling technique would best identify such pattern violations?

Free account

Track your progress over time

Create a free account to save your results and see which topics improve across sessions.

Focused Mining Data sessions

Start a Mining Data only practice session

Every question in these sessions is drawn from the Mining Data domain — nothing else.

Related practice questions

Related DA0-001 topic practice pages

Move into related areas when this topic feels solid.

Frequently asked questions

What does the DA0-001 exam test about Mining Data?
Mining Data questions test whether you can apply the concept in context, not just recognise a definition.
How should I use these practice questions?
Select your answer before revealing the explanation. Then read why each option is right or wrong — this active recall approach builds retention far faster than re-reading notes.
Can I practise just Mining Data questions in a focused session?
Yes — the session launcher on this page draws every question from the Mining Data domain. Use a 10-question session first to gauge your baseline, then move to 20 or 30 once the weak spots are clear.
Where can I practise other DA0-001 topics?
Use the topic links above to move to related areas, or go back to the DA0-001 question bank to see all topics.
Are these real exam questions or dumps?
These are original practice questions written to test the same concepts the DA0-001 exam covers. They are not copied from any real exam or dump site.