Courseiva
Knowledge + Practice
CertificationsVendorsCareer RoadmapsLabs & ToolsStudy GuidesGlossaryPractice Questions
C
Courseiva

Free IT certification practice questions with explained answers for CCNA, CompTIA, AWS, Azure, Google Cloud, and more.

Certification Practice Questions

CCNA practice questionsSecurity+ SY0-701 practice questionsAWS SAA-C03 practice questionsAZ-104 practice questionsAZ-900 practice questionsCLF-C02 practice questionsA+ Core 1 practice questionsGoogle Cloud ACE practice questionsCySA+ CS0-003 practice questionsNetwork+ N10-009 practice questions
View all certifications →

Product

CertificationsCertification PathsExam TopicsPractice TestsExam Dumps vs Practice TestsStudy HubComparisons

Company

AboutContactEditorial PolicyQuestion Writing PolicyTrust Center

Legal

Privacy PolicyTerms of Service

Courseiva is a free IT certification practice platform offering original exam-style practice questions, detailed explanations, topic-based practice, mock exams, readiness tracking, and study analytics for Cisco, CompTIA, Microsoft, AWS, and other technology certifications.

© 2026 Courseiva. Courseiva is operated by JTNetSolutions Ltd. All rights reserved.

Courseiva is an independent certification practice platform and is not affiliated with, endorsed by, or sponsored by Cisco, Microsoft, AWS, CompTIA, Google, ISC2, ISACA, or any other certification vendor. Vendor names and certification marks are used only to identify the exams learners are preparing for.

← Mining and Acquiring Data practice sets

DA0-001 Mining and Acquiring Data • Complete Question Bank

DA0-001 Mining and Acquiring Data — All Questions With Answers

Complete DA0-001 Mining and Acquiring Data question bank — all 0 questions with answers and detailed explanations.

99
Questions
Free
No signup
Certifications/DA0-001/Practice Test/Mining and Acquiring Data/All Questions
Question 1mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is pulling data from a production database for a report. The database contains customer orders with a column 'order_date'. The analyst notices that some orders have dates in the future. Which data quality issue does this represent?

Question 2hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data engineer is designing a data pipeline to ingest streaming data from IoT sensors. The sensors send data every second, and the pipeline must handle bursts of up to 10,000 messages per second. Which approach is most appropriate for capturing this data before processing?

Question 3easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst needs to combine two datasets: one contains customer information (customer_id, name, address) and the other contains order information (order_id, customer_id, order_date). The analyst wants to include all customers, even those who have not placed orders. Which type of join should be used?

Question 4mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is tasked with extracting data from a legacy system that outputs fixed-width text files. The analyst needs to parse these files into a structured format. Which tool or method is most appropriate for this task?

Question 5hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A company is merging two databases from different departments. In Database A, customer IDs are integers. In Database B, customer IDs are alphanumeric strings. To merge, the data analyst must reconcile these differences. Which step should be taken first?

Question 6easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst needs to extract data from an API that returns JSON. The analyst wants to convert the JSON output into a tabular format for analysis. Which function in a scripting language is commonly used for this purpose?

Question 7mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is building a dataset from multiple sources and needs to ensure data quality. During the data acquisition phase, which activity is most important to perform?

Question 8hardmultiple choice
Read the full Mining and Acquiring Data explanation →

An organization needs to acquire data from a third-party vendor. The data will be used for regulatory reporting. Which of the following should be the primary consideration before acquiring the data?

Question 9easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is using SQL to extract data. The analyst wants to retrieve all records from a table named 'sales' where the 'amount' column is greater than 100. Which SQL clause should be used?

Question 10mediummulti select
Read the full Mining and Acquiring Data explanation →

Which TWO of the following are common methods for acquiring data from external sources?

Question 11hardmulti select
Read the full Mining and Acquiring Data explanation →

Which THREE of the following are best practices when performing data extraction for a data pipeline?

Question 12easymulti select
Read the full Mining and Acquiring Data explanation →

Which TWO of the following are valid SQL clauses used to filter and sort data?

Question 13mediummultiple choice
Read the full Mining and Acquiring Data explanation →

What is the primary purpose of the HAVING clause in the query shown?

Exhibit

Refer to the exhibit.

```
SELECT customer_id, COUNT(order_id) AS order_count
FROM orders
GROUP BY customer_id
HAVING COUNT(order_id) > 5
ORDER BY order_count DESC;
```
Question 14hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst sees this error in the ETL logs. What is the most likely cause?

Exhibit

Refer to the exhibit.

Error log:

```
2025-03-15 10:23:45 ERROR: ORA-12034: materialized view log on "SCHEMA"."SALES" is newer than last refresh
2025-03-15 10:23:45 INFO: Refresh of materialized view "SALES_MV" failed
```
Question 15easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data engineer is configuring access to a data lake in Amazon S3. What does the JSON policy shown allow?

Exhibit

Refer to the exhibit.

```
{
  "policy": {
    "Statement": [
      {
        "Effect": "Allow",
        "Action": ["s3:GetObject"],
        "Resource": "arn:aws:s3:::data-bucket/*"
      }
    ]
  }
}
```
Question 16hardmultiple choice
Read the full NAT/PAT explanation →

A healthcare organization is building a data warehouse to support population health analytics. The data sources include: (1) an electronic health record (EHR) system with a relational database containing patient demographics, diagnoses, and medications; (2) a claims system that generates CSV files daily; (3) patient-generated health data from mobile apps via a REST API returning JSON. The data engineer needs to design a data acquisition process that runs nightly. The EHR system has a change tracking mechanism that logs changes with timestamps. The claims CSV files are appended daily. The API supports filtering by date. The data warehouse uses a star schema with fact and dimension tables. The engineer must ensure data consistency and minimize load times. Which approach should the engineer take?

Question 17mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A retail company is migrating its on-premises data warehouse to a cloud data warehouse. The current ETL process extracts data from a transactional database (SQL Server) and a web analytics system (JSON logs). The ETL runs nightly and takes 6 hours. The business requires that the new cloud warehouse support real-time reporting with data latency of less than 15 minutes. The data engineer proposes using change data capture (CDC) from the SQL Server database and streaming the JSON logs via a message queue. However, management is concerned about cost and complexity. The engineer must design a solution that meets the latency requirement while minimizing operational overhead. Which approach should the engineer recommend?

Question 18mediummulti select
Read the full Mining and Acquiring Data explanation →

A data analyst is merging two datasets from different departments. The analyst notices that the 'CustomerID' field in the first dataset is stored as an integer, while in the second dataset it is stored as a string with leading zeros. Which TWO steps should the analyst take to ensure successful data integration?

Question 19hardmultiple choice
Read the full Mining and Acquiring Data explanation →

Based on the exhibit, what is the most likely cause of the import failure?

Exhibit

Refer to the exhibit.

Data Import Log:
[2024-03-15 10:22:34] INFO: Starting import from source 'sales_raw.csv'
[2024-03-15 10:22:35] WARN: Row 1502: 'price' field contains non-numeric value '12.5A'. Skipping row.
[2024-03-15 10:22:36] ERROR: Row 3450: 'date' field value '2024-02-30' is invalid. Import halted.
[2024-03-15 10:22:36] INFO: Import process terminated with errors.
Question 20easymultiple choice
Read the full Mining and Acquiring Data explanation →

A marketing company is building a customer segmentation model. The data team has access to two sources: a CRM database with customer demographics and purchase history, and a third-party data provider that offers social media activity scores. The CRM data is updated daily, while the third-party data is refreshed weekly on Sundays. The analyst needs to create a unified dataset for the model training scheduled for Wednesday morning. The analyst runs a SQL query to join the two tables on CustomerID, but the resulting dataset has far fewer rows than expected. Upon investigation, the analyst finds that many customers in the CRM do not have matching records in the third-party data. Additionally, some customers in the third-party data have multiple entries due to unresolved duplicates. The analyst must produce the most complete dataset possible while maintaining data quality. Which course of action should the analyst take?

Question 21mediumdrag order
Read the full Mining and Acquiring Data explanation →

Drag and drop the steps to perform a data backup using the 3-2-1 rule in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 22mediumdrag order
Read the full Mining and Acquiring Data explanation →

Drag and drop the steps to perform a data audit in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 23mediummatching
Read the full Mining and Acquiring Data explanation →

Match each data analysis technique to its primary purpose.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Model relationships between variables

Group similar data points without labels

Analyze data points collected over time

Compare means across multiple groups

Test association between categorical variables

Question 24mediummatching
Read the full Mining and Acquiring Data explanation →

Match each database concept to its definition.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Unique identifier for each record in a table

Field that links to primary key in another table

Structure to speed up data retrieval

Virtual table based on a query result

Process to reduce data redundancy

Question 25easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst needs to collect customer sentiment data from social media platforms. Which data acquisition method is most appropriate?

Question 26mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A company is merging two customer databases from different acquisitions. They need to identify duplicate records. Which data profiling technique is most effective?

Question 27hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data architect is designing an ETL pipeline to ingest streaming data from IoT sensors. The data must be available for real-time analytics. Which acquisition method is best?

Question 28easymultiple choice
Read the full Mining and Acquiring Data explanation →

A marketing team wants to collect data on competitor pricing for similar products. Which data source is most appropriate?

Question 29mediummultiple choice
Read the full Mining and Acquiring Data explanation →

During data acquisition, an analyst notices that the data from an external vendor has inconsistent date formats. What is the first step the analyst should take?

Question 30hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data engineer needs to acquire data from a legacy mainframe system that does not support modern APIs or direct database connectivity. Which approach is most feasible?

Question 31easymultiple choice
Read the full Mining and Acquiring Data explanation →

A small business wants to acquire customer feedback through a short questionnaire emailed after purchase. Which data acquisition method does this represent?

Question 32mediummultiple choice
Read the full Mining and Acquiring Data explanation →

An organization is integrating data from multiple sources into a data warehouse. They need to handle differences in data granularity (e.g., daily vs. hourly sales data). Which technique is most appropriate?

Question 33hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is using a public API to collect historical weather data. The API has a rate limit of 100 requests per minute, but the analyst needs to retrieve 10,000 records as quickly as possible. What strategy should be used?

Question 34easymulti select
Read the full Mining and Acquiring Data explanation →

Which TWO are common methods for acquiring internal data? (Choose two.)

Question 35mediummulti select
Read the full Mining and Acquiring Data explanation →

Which THREE are best practices for data profiling during acquisition? (Choose three.)

Question 36hardmulti select
Read the full Mining and Acquiring Data explanation →

Which THREE are common challenges when acquiring data from external APIs? (Choose three.)

Question 37easymultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. An analyst runs this query before acquiring data from a PostgreSQL database. What is the primary purpose of this query?

Network Topology
+Refer to the exhibit.```SELECT table_name, num_rows, last_analyzedFROM information_schema.tablesWHERE table_schema = 'public'table_name | num_rows | last_analyzedcustomers | 15234 | 2024-06-01orders | 89732 | 2024-05-30products | 456 | 2024-06-02
Question 38mediummultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. A data engineer is setting up data acquisition from an S3 bucket with this policy. What does the policy enforce?

Exhibit

Refer to the exhibit.
```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": "s3:GetObject",
      "Resource": "arn:aws:s3:::data-bucket/*",
      "Condition": {
        "IpAddress": {"aws:SourceIp": "10.0.0.0/8"}
      }
    }
  ]
}
```
Question 39hardmultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. An analyst sees this log during data acquisition. What action should be taken first?

Exhibit

Refer to the exhibit.
```
INFO: Starting extraction from source 'SalesDB'
INFO: Connected successfully.
WARN: Data type mismatch in column 'Price': expected NUMERIC, found VARCHAR
ERROR: Row 1523: failed to parse date '2024-02-30'
INFO: Extraction completed with 1 warning and 1 error.
```
Question 40mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is tasked with combining customer data from a CRM system and a billing system. The CRM uses a GUID for customer ID, while billing uses an integer. Which approach should the analyst use to ensure a reliable merge?

Question 41easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data team needs to extract data from a legacy system that only supports flat file exports. Which data acquisition method is most appropriate?

Question 42hardmultiple choice
Read the full Mining and Acquiring Data explanation →

During a data mining project, an analyst discovers that a significant number of records have a negative value for the age field. What is the most appropriate first step?

Question 43mediummultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. What does the query return?

Exhibit

SELECT customer_id, COUNT(*) as order_count FROM orders GROUP BY customer_id HAVING COUNT(*) > 5;
Question 44easymultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. What data quality issue is indicated?

Exhibit

2023-08-15 14:32:10 ERROR: Data conversion failed for column 'salary' in row 45: value 'N/A' cannot be converted to numeric.
Question 45mediummultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. If the date column is stored as a string in 'MM/DD/YYYY' format, what will be the result?

Exhibit

SELECT * FROM sales WHERE date BETWEEN '2023-01-01' AND '2023-12-31';
Question 46mediummulti select
Read the full Mining and Acquiring Data explanation →

A data analyst needs to identify duplicate customer records. Which TWO methods are commonly used? (Select two.)

Question 47hardmulti select
Read the full Mining and Acquiring Data explanation →

After merging two datasets, an analyst finds that the resulting dataset has many null values in some columns. Which TWO steps should the analyst take to address this? (Select two.)

Question 48easymulti select
Read the full Mining and Acquiring Data explanation →

Which THREE data sources are suitable for web scraping? (Select three.)

Question 49easymultiple choice
Read the full NAT/PAT explanation →

A retail company wants to analyze customer purchase patterns to identify products frequently bought together. Which data mining technique is most appropriate?

Question 50easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is importing a CSV file that contains a mixture of numeric and text fields. What is the most common issue when importing?

Question 51mediummultiple choice
Read the full Mining and Acquiring Data explanation →

During data acquisition, a data engineer uses a tool to extract data from a source system incrementally based on a timestamp column. Which method is being used?

Question 52hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst discovers that a dataset contains multiple records for the same customer with different spellings (e.g., 'Jon' vs 'John'). Which data preparation step should be applied first?

Question 53hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A financial institution is merging transaction data from two different systems. System A stores currency amounts as integers in cents, and System B stores as decimals in dollars. What is the best way to integrate the data?

Question 54hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data team is integrating customer data from three sources. After joining, they find that the count of unique customers is lower than expected. What is the most likely cause?

Question 55easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst needs to merge two customer tables from different sources. One table uses 'CUST_ID' as the primary key, the other uses 'CustomerID'. To ensure accurate merging, the analyst should first:

Question 56easymultiple choice
Read the full Mining and Acquiring Data explanation →

A company receives daily sales data in CSV format. The data includes a 'Date' column in MM/DD/YYYY format. To load this into a database that expects YYYY-MM-DD, the analyst should:

Question 57easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is tasked with collecting data from a web API that returns JSON. The API requires an API key in the header. Which method should be used to authenticate?

Question 58mediummultiple choice
Read the full Mining and Acquiring Data explanation →

An analyst needs to combine two datasets from different sources that share a common key but have different levels of granularity. Dataset A has daily sales per store, Dataset B has hourly foot traffic per store. The analyst wants to analyze correlation. Which approach is appropriate?

Question 59mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data team is designing an ETL process to extract data from an operational database daily. The database experiences heavy write loads during business hours. What is the best practice to minimize impact on operations?

Question 60mediummultiple choice
Read the full NAT/PAT explanation →

A healthcare organization acquires data from multiple hospitals with different patient record systems. The data includes patient IDs but no common identifier across systems. Which technique should be used to link records?

Question 61hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A financial analyst is integrating data from multiple stock exchanges. One exchange provides trade timestamps in UTC, another in Eastern Time. The analyst needs accurate time synchronization for time-series analysis. What is the best approach?

Question 62hardmultiple choice
Read the full Mining and Acquiring Data explanation →

An e-commerce company is merging customer data from three legacy systems. Two systems use email as unique identifier, but one system allows multiple customers per email. The third uses phone number. To create a unified customer view, the analyst should first:

Question 63hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data engineer is tasked with acquiring data from a third-party vendor that provides daily file drops via SFTP. The files are large (10 GB each). The pipeline must load data into a data warehouse. Which approach optimizes for speed and reliability?

Question 64easymulti select
Read the full Mining and Acquiring Data explanation →

A data analyst is validating a dataset acquired from an external source. Which TWO actions are appropriate for data quality assessment?

Question 65mediummulti select
Read the full Mining and Acquiring Data explanation →

A company is acquiring social media data via a public API. Which TWO considerations are important for ensuring ethical and legal compliance?

Question 66hardmulti select
Read the full Mining and Acquiring Data explanation →

A data scientist is merging retail transaction data from online and in-store sources. Which THREE steps are required to ensure data consistency?

Question 67easymultiple choice
Read the full NAT/PAT explanation →

A data analyst receives the above JSON snippet from a web API. The analyst needs to extract the email addresses for all customers. Which JSONPath expression should be used?

Exhibit

Refer to the exhibit.
{
  "customers": [
    {"id": 1, "name": "John Doe", "email": "john@example.com"},
    {"id": 2, "name": "Jane Smith", "email": "jane@example.com"}
  ],
  "status": "success"
}
Question 68mediummultiple choice
Read the full Mining and Acquiring Data explanation →

An analyst is reviewing the above SQL query used to acquire data. What does this query retrieve?

Exhibit

Refer to the exhibit.
SELECT customer_id, COUNT(order_id) AS order_count
FROM orders
WHERE order_date BETWEEN '2023-01-01' AND '2023-12-31'
GROUP BY customer_id
HAVING COUNT(order_id) > 5;
Question 69hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data pipeline log shows the above error. Which data transformation should be applied during acquisition?

Exhibit

Refer to the exhibit.
ERROR: Data type mismatch for column 'transaction_amount' at row 342. Expected DECIMAL(10,2), received VARCHAR.
The source system sends transaction_amount as a string with a currency symbol (e.g., '$123.45').
Question 70easymultiple choice
Read the full Mining and Acquiring Data explanation →

A marketing team wants to analyze customer sentiment from social media posts. Which data acquisition method is most appropriate?

Question 71mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst needs to combine sales data from multiple regional databases with different schemas. Which process is best?

Question 72hardmultiple choice
Read the full Mining and Acquiring Data explanation →

An organization is acquiring data from an external vendor. The vendor provides a flat file with inconsistent delimiters and missing values. Which step should be performed first in data acquisition?

Question 73easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is tasked with gathering data from a legacy system that only exports CSV files. The files contain headers but no data types. Which tool would best facilitate initial data exploration?

Question 74mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A company wants to collect real-time clickstream data from its website. Which acquisition method is most suitable?

Question 75hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A financial institution needs to acquire credit transaction data from multiple sources while ensuring compliance with data privacy regulations. What is the most critical step?

Question 76easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is extracting data from a relational database using SQL. Which clause is essential for limiting the rows retrieved to only those needed?

Question 77mediummultiple choice
Read the full Mining and Acquiring Data explanation →

An e-commerce company is acquiring product data from multiple supplier APIs. The APIs return JSON with inconsistent field naming conventions. Which data acquisition technique should be applied?

Question 78hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A data team is using web scraping to collect competitor pricing data. The target website has anti-scraping measures like CAPTCHAs and rate limiting. Which approach is most effective?

Question 79easymulti select
Read the full Mining and Acquiring Data explanation →

Which TWO are examples of internal data sources? (Select exactly 2)

Question 80mediummulti select
Read the full Mining and Acquiring Data explanation →

A data analyst is evaluating data quality issues during acquisition. Which TWO issues are most likely to arise from merging data from different sources? (Select exactly 2)

Question 81hardmulti select
Read the full Mining and Acquiring Data explanation →

Which THREE are best practices for acquiring data via web scraping? (Select exactly 3)

Question 82mediummultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. What is the most likely issue causing the unexpectedly low count?

Exhibit

SELECT COUNT(*) FROM customers WHERE signup_date > '2023-01-01';
Result: 15000
The analyst notices that the result seems low because they expected more signups from that period.
Question 83hardmultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. What is the most likely cause of the extraction failure?

Exhibit

2023-05-01 14:32:01 INFO: Starting extraction from source 'SalesDB'
2023-05-01 14:32:05 ERROR: Connection timeout after 30s
2023-05-01 14:32:06 WARN: Retry attempt 1
2023-05-01 14:32:36 ERROR: Connection timeout after 30s
2023-05-01 14:32:37 WARN: Retry attempt 2
2023-05-01 14:33:07 ERROR: Connection timeout after 30s
2023-05-01 14:33:08 WARN: Retry attempt 3
2023-05-01 14:33:38 ERROR: Maximum retries exceeded
Question 84hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A retail company is acquiring sales data from 150 stores worldwide. Each store sends daily CSV files via email to a central email address. The data acquisition process is manual: an intern downloads each attachment and copies it into a shared folder. The shared folder is then accessed by an ETL tool that loads data into a data warehouse. Recently, the data warehouse has been missing records for several stores. The intern reports that some emails are not being received or are delayed. The company needs to improve the reliability and timeliness of data acquisition. Which course of action should be taken first?

Question 85easymultiple choice
Read the full Mining and Acquiring Data explanation →

A marketing analyst needs to combine customer data from a CRM database with social media engagement data from a third-party API. Which data acquisition method is most appropriate?

Question 86easymultiple choice
Read the full Mining and Acquiring Data explanation →

A data analyst is tasked with collecting data from multiple spreadsheets provided by different departments. Each spreadsheet has different column names and formats. What is the best first step?

Question 87mediummultiple choice
Read the full Mining and Acquiring Data explanation →

A data engineer is designing an ETL pipeline to extract sales data from a legacy on-premise database and load it into a cloud data warehouse. The database is slow and queries during business hours affect performance. Which extraction strategy minimizes impact?

Question 88hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A research firm is acquiring data from public government databases via API. The API rate limits at 100 requests per minute. They need to download 10,000 records, but each request returns a maximum of 100 records. What is the most efficient approach to ensure complete acquisition without being blocked?

Question 89mediummulti select
Read the full Mining and Acquiring Data explanation →

Which TWO are valid data acquisition methods? (Select two.)

Question 90hardmulti select
Read the full Mining and Acquiring Data explanation →

Which THREE are challenges in acquiring data from external sources? (Select three.)

Question 91easymultiple choice
Read the full Mining and Acquiring Data explanation →

A retail company's data analytics team needs to acquire point-of-sale (POS) transaction data from 200 stores daily. Each store sends a CSV file via email at the end of the day. The files often arrive late, have inconsistent column names (e.g., "StoreID", "Store_ID", "store_id"), and occasionally contain corrupted rows. The team manually processes these files, leading to frequent errors and delays. The company wants to automate the acquisition process to ensure data is available by 9 AM the next business day with high quality. Which approach best addresses these issues?

Question 92easymultiple choice
Read the full NAT/PAT explanation →

A healthcare organization collects patient questionnaire data via paper forms at clinics. The forms are scanned and sent to a central office, where staff manually enter data into an electronic system. This process is slow and error-prone. The organization wants to reduce manual entry errors and speed up data availability. Which method should they adopt?

Question 93mediummultiple choice
Review the full routing breakdown →

A logistics company receives GPS tracking data from fleet vehicles at 1-second intervals via a cellular network. The data is used to optimize routes and monitor driver behavior. Recently, the data acquisition system has been missing updates for some vehicles when they pass through tunnels or remote areas. The data team notices gaps during these periods. The company needs a solution to ensure near-real-time data continuity. What should they do?

Question 94mediummultiple choice
Read the full Mining and Acquiring Data explanation →

An e-commerce company wants to integrate product pricing data from competitor websites to adjust its own prices dynamically. They plan to scrape pricing pages every hour. However, the competitors' websites have anti-scraping measures such as IP blocking and CAPTCHAs. The company's legal team also advises caution regarding terms of service. Which data acquisition strategy is both effective and compliant?

Question 95hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A financial analytics firm needs to acquire historical stock market tick data (millions of records per day) from a data vendor. The vendor provides data via FTP in binary format. The firm's existing infrastructure uses on-premise servers with limited storage and processing power. They need to stream the data into a cloud data lake for analysis. However, the binary format is proprietary and requires a licensed decoder. The budget is constrained. Which approach best meets the data acquisition requirements?

Question 96hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A social media monitoring company collects public tweets using the Twitter API. The API has a tiered access: free tier allows 500,000 tweets per month, and paid tier allows 2 million tweets per month. The company needs to collect 1.5 million tweets per month for analysis. They are on a free tier but have been exceeding the limit, causing account suspension. They need a sustainable solution without significantly increasing costs. What should they do?

Question 97easymulti select
Read the full Mining and Acquiring Data explanation →

A data analyst is performing data acquisition from multiple source files. Which TWO data profiling tasks should the analyst complete before loading the data into the target system?

Question 98mediummultiple choice
Read the full Mining and Acquiring Data explanation →

Refer to the exhibit. A data analyst is trying to extract data from a SQL Server database but receives the error. Which configuration change should the analyst recommend to the database administrator?

Exhibit

Error: 2024-03-15 10:32:45 ERROR [DataExtractor] Failed to connect to source database 'SalesDB' on server 10.0.0.50:1433. Error: Login failed for user 'dataminer'. Reason: An attempt to login using SQL authentication failed. Server is configured for Windows authentication only.
Question 99hardmultiple choice
Read the full Mining and Acquiring Data explanation →

A large retail company is integrating customer data from two separate CRM systems into a new data warehouse. System A stores customer IDs as integers (e.g., 12345), while System B stores them as alphanumeric strings (e.g., 'CUST-12345-X'). Additionally, some customers exist in both systems but with slight name variations (e.g., 'John Smith' vs 'Jon Smith'). The data warehouse requires a unified customer table with a single unique identifier for each customer. The analyst needs to design the data acquisition process. Which of the following is the most appropriate first step?

Practice tests

Scored 10-question sessions with instant feedback and explanations.

DA0-001 Practice Test 1 — 10 Questions→DA0-001 Practice Test 2 — 10 Questions→DA0-001 Practice Test 3 — 10 Questions→DA0-001 Practice Test 4 — 10 Questions→DA0-001 Practice Test 5 — 10 Questions→DA0-001 Practice Exam 1 — 20 Questions→DA0-001 Practice Exam 2 — 20 Questions→DA0-001 Practice Exam 3 — 20 Questions→DA0-001 Practice Exam 4 — 20 Questions→Free DA0-001 Practice Test 1 — 30 Questions→Free DA0-001 Practice Test 2 — 30 Questions→Free DA0-001 Practice Test 3 — 30 Questions→DA0-001 Practice Questions 1 — 50 Questions→DA0-001 Practice Questions 2 — 50 Questions→DA0-001 Exam Simulation 1 — 100 Questions→

Practice by domain

Each domain maps to a weighted exam section. Focus on the domain where you are weakest.

Comparing and Contrasting Data ConceptsMining and Acquiring DataAnalyzing and Modeling DataVisualizing DataCommunicating Data Insights

Practice by scenario

Filter questions by type — troubleshooting, exhibit, drag-and-drop, PBQ, ACLs, OSPF, and more.

Browse scenarios→

Continue studying

All Mining and Acquiring Data setsAll Mining and Acquiring Data questionsDA0-001 Practice Hub