CCNA Communicating Data Insights Questions

52 of 127 questions · Page 2/2 · Communicating Data Insights · Answers revealed

76
MCQeasy

A data analyst wants to compare the sales performance of four different stores over the same time period. Which chart type is most suitable?

A.Line chart with multiple lines
B.Grouped bar chart
C.Stacked bar chart
D.Pie chart with multiple pies
AnswerB

Grouped bars allow side-by-side comparison of stores.

Why this answer

A grouped bar chart is the most suitable choice because it allows direct comparison of discrete categories (four stores) across a common time period, with each group representing a time interval and individual bars representing each store's sales. This chart type excels at side-by-side comparisons of multiple entities over the same categorical axis, making it ideal for the analyst's goal.

Exam trap

The trap here is that candidates often choose a line chart (Option A) because they associate 'time period' with trend analysis, but the question explicitly asks to compare sales performance over the same time period, not to show changes over time, making the grouped bar chart the correct choice for direct comparison of discrete entities.

How to eliminate wrong answers

Option A is wrong because a line chart with multiple lines is designed to show trends over continuous time, but here the time period is fixed and the focus is on comparing discrete stores, not tracking changes over time. Option C is wrong because a stacked bar chart shows part-to-whole relationships within each time period, which obscures individual store performance and makes direct comparisons between stores difficult. Option D is wrong because a pie chart with multiple pies is meant to show proportions of a whole for each store separately, but comparing sales across stores requires a common baseline, which pie charts do not provide.

77
MCQhard

A data analyst is reviewing an S3 bucket policy that controls access to a data lake. The analyst wants to communicate that the current policy restricts data downloads to a specific IP range. Which of the following best describes the policy's effect?

A.The policy allows all users to download data.
B.The policy denies data downloads to all users.
C.The policy restricts data downloads to a specific IP range.
D.The policy allows data uploads from a specific IP range.
AnswerC

The Deny with condition effectively allows only that IP range.

Why this answer

Option C is correct because the S3 bucket policy explicitly restricts data downloads to a specific IP range by using a condition in the policy statement, such as `aws:SourceIp` with a CIDR block. This means only requests originating from IP addresses within that range are allowed to perform the `s3:GetObject` action, effectively limiting downloads to authorized network locations.

Exam trap

The trap here is that candidates may confuse the policy's effect with a blanket denial (Option B) or misinterpret the restriction as applying to uploads (Option D), when the policy specifically targets downloads and allows them only from a defined IP range.

How to eliminate wrong answers

Option A is wrong because it describes an allow-all effect, which contradicts the explicit IP-based restriction in the policy; a policy that restricts to a specific IP range does not allow all users. Option B is wrong because it states the policy denies downloads to all users, but the policy actually allows downloads from the specified IP range while implicitly or explicitly denying others—it does not universally deny. Option D is wrong because it refers to data uploads (e.g., `s3:PutObject`), but the question specifically addresses downloads (`s3:GetObject`), and the policy effect is about restricting downloads, not uploads.

78
MCQmedium

A report uses red and green colors to indicate performance; a stakeholder who is colorblind complains. What is the BEST remedy?

A.Remove color entirely
B.Change colors to blue and orange
C.Add patterns or labels in addition to color
D.Use a different chart type
AnswerC

Patterns and labels provide redundant encoding that works for all users.

Why this answer

Option C is correct because adding patterns or labels alongside color ensures that the data encoding is accessible to colorblind viewers without removing the informational value of color for those who can see it. This approach follows WCAG (Web Content Accessibility Guidelines) 2.1 Success Criterion 1.4.1, which requires that color is not the sole means of conveying information. In data visualization tools like Tableau or Power BI, this is implemented by adding shape markers or text annotations to series, preserving the original color scheme for non-colorblind users while providing redundant encoding.

Exam trap

The trap here is that candidates assume the best remedy is to simply change colors to a more colorblind-friendly palette (like blue and orange), but CompTIA often tests the deeper principle that color alone is never sufficient for accessibility—redundant encoding via patterns or labels is required to meet WCAG standards.

How to eliminate wrong answers

Option A is wrong because removing color entirely eliminates a key visual encoding channel, which can reduce the chart's effectiveness for the majority of viewers and may violate best practices for data visualization that leverage color for quick pattern recognition. Option B is wrong because while blue and orange are more colorblind-friendly than red and green, they still rely solely on color differentiation, which fails for users with complete color blindness (achromatopsia) or other deficiencies; the WCAG requirement is to not use color as the only means of distinction. Option D is wrong because changing the chart type does not address the core issue of color-dependent encoding; a different chart type (e.g., bar chart instead of pie chart) still uses color to differentiate categories, so the accessibility problem persists unless patterns or labels are added.

79
MCQhard

Refer to the exhibit. What is the best course of action for the data analyst?

A.Change the default value to 'NA'
B.Delete rows with missing 'Region'
C.Investigate source data to understand why 'Region' is missing
D.Ignore the warning as it is handled
AnswerC

Root cause analysis helps improve data quality at the source.

Why this answer

Option C is correct because missing data in a critical field like 'Region' indicates a potential data quality issue that requires root cause analysis. The data analyst should investigate the source system to determine whether the missing values result from a collection error, a transformation bug, or a legitimate null, rather than making assumptions by deleting rows or filling defaults. This aligns with the principle of data provenance and ensures that downstream insights are based on accurate, traceable data.

Exam trap

The trap here is that candidates assume missing data should be immediately cleaned by deletion or default values, but Cisco tests the understanding that a data analyst's first responsibility is to investigate the source of the anomaly before applying any transformation.

How to eliminate wrong answers

Option A is wrong because blindly changing the default value to 'NA' introduces artificial data that may skew aggregations or filter logic, and it does not address the underlying reason for the missing values. Option B is wrong because deleting rows with missing 'Region' could discard valid records if the missing data is due to a temporary system glitch or a mapping error, leading to biased analysis and loss of potentially important context. Option D is wrong because ignoring the warning assumes the missing data is harmless, but in practice, missing region information can break geographic segmentation, violate data integrity constraints, and cause incorrect reporting in dashboards or ETL pipelines.

80
Multi-Selecthard

When presenting data insights, which three practices help ensure clarity? (Choose THREE.)

Select 3 answers
A.Include all data points in the chart
B.Overlay multiple chart types on one axis
C.Use consistent color coding
D.Label axes clearly
E.Provide context and benchmarks
AnswersC, D, E

Consistency helps audiences quickly interpret visual elements.

Why this answer

Option C is correct because consistent color coding across charts and dashboards reduces cognitive load, allowing viewers to quickly associate colors with specific categories or metrics. This practice aligns with data visualization best practices for clarity, as it leverages pre-attentive processing to speed up interpretation without requiring the audience to re-learn the mapping for each chart.

Exam trap

The trap here is that candidates may think including all data points (Option A) ensures completeness, but the exam tests the understanding that clarity often requires simplification and focus, not exhaustive detail.

81
MCQeasy

A data analyst is asked to create a visualization showing the trend of monthly revenue over the last five years. Which chart type is most suitable?

A.Pie chart.
B.Histogram.
C.Line chart.
D.Bar chart.
AnswerC

Line charts effectively show changes over time with continuous data.

Why this answer

A line chart is the most suitable choice because it explicitly shows the trend of monthly revenue over a continuous time period (five years). The x-axis represents time (months), and the y-axis represents revenue, allowing the viewer to easily identify patterns, seasonality, and overall direction. This makes it ideal for time-series data visualization.

Exam trap

The trap here is that candidates often confuse a histogram with a line chart because both use bars or lines, but a histogram bins continuous data into ranges and loses the temporal sequence, while a line chart preserves the order of time-series data.

How to eliminate wrong answers

Option A is wrong because a pie chart is used to show parts of a whole (proportions) at a single point in time, not trends over a continuous period. Option B is wrong because a histogram displays the distribution of a continuous variable by binning data into intervals, but it does not inherently show a trend over time—it would obscure the sequential order of months. Option D is wrong because a bar chart compares discrete categories or values at specific points, but it does not emphasize the continuous flow and trend direction as effectively as a line chart for time-series data.

82
MCQeasy

When creating a data story, what should be the FIRST step?

A.Design the layout of the report
B.Gather all available data
C.Choose visualization tools
D.Identify the key message
AnswerD

The key message frames the story and directs the entire process.

Why this answer

The first step in creating a data story is to identify the key message because the entire narrative, data selection, and visualization choices must align with a central insight. Without a clear message, subsequent steps like gathering data or designing layouts risk being unfocused and failing to communicate actionable findings. This ensures the story is driven by the insight, not by the data or tools.

Exam trap

CompTIA often tests the misconception that data gathering or tool selection is the logical starting point, tempting candidates to skip the critical step of defining the core insight first.

How to eliminate wrong answers

Option A is wrong because designing the layout of the report should occur after the key message is defined; layout is a presentation concern, not the foundational step. Option B is wrong because gathering all available data without a guiding message leads to analysis paralysis and irrelevant information; data should be collected with the key message in mind. Option C is wrong because choosing visualization tools depends on the data and the story to be told; selecting tools prematurely can limit the narrative or force inappropriate chart types.

83
MCQeasy

A data analyst is preparing a presentation for executive leadership. The analyst wants to highlight the correlation between marketing spend and revenue over the past year. Which visualization type is most appropriate for showing this relationship?

A.Scatter plot
B.Pie chart
C.Bar chart
D.Histogram
AnswerA

Scatter plots effectively display correlations between two continuous variables.

Why this answer

A scatter plot is the most appropriate visualization for showing the relationship between two continuous variables—marketing spend and revenue—because it plots individual data points on an X-Y axis, allowing the analyst to visually assess correlation, trends, and outliers. This directly supports the goal of highlighting correlation, as the pattern of points (e.g., upward slope) indicates the strength and direction of the relationship.

Exam trap

The trap here is that candidates often confuse a bar chart or histogram with a scatter plot because they think any chart with axes can show relationships, but only a scatter plot directly plots paired continuous data to reveal correlation without aggregation.

How to eliminate wrong answers

Option B (Pie chart) is wrong because pie charts are designed to show parts of a whole (proportions) for categorical data, not the relationship between two continuous variables. Option C (Bar chart) is wrong because bar charts compare discrete categories or aggregated values, not the correlation between two continuous metrics; they would require binning or summarizing the data, losing the granularity needed for correlation analysis. Option D (Histogram) is wrong because histograms display the distribution of a single continuous variable (e.g., frequency of revenue values), not the relationship between two variables.

84
MCQhard

A data team is preparing a quarterly business review for the CEO. The report must include both high-level summaries and the ability for the CEO to drill down into specific departments. Which reporting technique best meets this requirement?

A.A slide deck with one slide per department.
B.An interactive dashboard with drill-down capabilities.
C.A static PDF with a summary page and appendices.
D.A data dump in Excel with filters.
AnswerB

Interactive dashboards allow users to start with a summary and click to see underlying details for specific departments.

Why this answer

An interactive dashboard with drill-down capabilities (Option B) is the correct choice because it directly addresses the requirement for both high-level summaries and the ability to explore specific departments. Dashboards allow the CEO to view aggregated KPIs at a glance and then click through to detailed views for each department, providing a seamless, user-driven exploration experience without switching between separate reports or slides.

Exam trap

The trap here is that candidates often choose a static PDF (Option C) or a slide deck (Option A) because they associate 'report' with printed or presentation materials, but the question explicitly requires 'drill-down' capability, which is a hallmark of interactive business intelligence tools, not static documents.

How to eliminate wrong answers

Option A is wrong because a slide deck with one slide per department forces a linear, static presentation; the CEO cannot dynamically drill down from a summary view into a specific department without manually navigating slides, which breaks the requirement for interactive drill-down. Option C is wrong because a static PDF with a summary page and appendices is non-interactive; the CEO would have to jump to appendix pages manually, which is not a true drill-down capability and lacks the real-time filtering or cross-filtering that an interactive dashboard provides. Option D is wrong because a data dump in Excel with filters is a raw data file that requires the CEO to understand the data structure and apply filters manually; it does not offer a curated high-level summary or a guided drill-down path, and it risks overwhelming the user with granular data without pre-built aggregations.

85
MCQmedium

A data analyst creates a report and wants to ensure it tells a compelling story. Which element is most important for data storytelling?

A.Using only one chart type for consistency.
B.Avoiding any visual elements to keep focus on text.
C.Including all data without filtering.
D.Using a narrative arc with context.
AnswerD

Engages audience and makes findings memorable.

Why this answer

Option D is correct because data storytelling relies on a narrative arc—introducing context, building tension through data insights, and resolving with actionable conclusions—to engage the audience and make the data memorable. Without a narrative, even the most accurate data fails to drive understanding or decision-making. This aligns with the DA0-001 objective of communicating data insights effectively.

Exam trap

The trap here is that candidates often confuse 'data storytelling' with 'data presentation' and assume that including all data (Option C) is thorough, when in fact the exam emphasizes that a compelling story requires filtering and context to avoid overwhelming the audience.

How to eliminate wrong answers

Option A is wrong because using only one chart type ignores the fact that different data relationships (e.g., trends vs. distributions) require different visual encodings; forcing consistency sacrifices clarity. Option B is wrong because avoiding visual elements contradicts the principle that humans process visual information faster than text; data storytelling relies on charts, graphs, and annotations to highlight key patterns. Option C is wrong because including all data without filtering leads to cognitive overload and obscures the main message; effective storytelling requires selective inclusion based on the narrative's focus.

86
MCQeasy

A data analytics team has created a report for stakeholders. The report includes complex statistical terms and raw data tables. Stakeholders are confused and ask for clarification. Which of the following should the team do to improve communication?

A.Schedule a follow-up meeting to explain each term individually.
B.Provide a glossary of statistical terms and keep the report as is.
C.Remove all data and only give conclusions.
D.Simplify the report by using clear visualizations and plain language summaries.
AnswerD

Correct. This directly addresses the confusion by making the report accessible.

Why this answer

Option D is correct because effective data communication requires tailoring the message to the audience. By replacing complex statistical terms and raw data tables with clear visualizations and plain language summaries, the team makes insights accessible to stakeholders who may lack technical expertise, directly addressing the confusion.

Exam trap

The trap here is that candidates may think providing more explanation (Option A) or more data (Option B) is always better, but the DA0-001 exam emphasizes that communication must be tailored to the audience's level of understanding, not just the completeness of the information.

How to eliminate wrong answers

Option A is wrong because scheduling a follow-up meeting to explain each term individually is inefficient and does not improve the report itself; stakeholders should be able to understand the report without needing a separate tutorial. Option B is wrong because providing a glossary while keeping the report as is forces stakeholders to constantly cross-reference terms, which does not simplify the communication and still leaves raw data tables that are hard to interpret. Option C is wrong because removing all data and only giving conclusions removes the evidence and context needed for stakeholders to trust and verify the insights, which undermines transparency and data-driven decision-making.

87
MCQmedium

A data team needs to communicate insights about customer churn to the sales team. The insights include confidence intervals and p-values. The sales team is not familiar with statistics. Which of the following should the data team do?

A.Explain the practical implications of the results without statistical jargon.
B.Assume the sales team will learn the terms over time.
C.Use technical terms but provide written definitions.
D.Include a detailed statistical appendix.
AnswerA

Correct. This makes the insights accessible and actionable for the sales team.

Why this answer

Option A is correct because the sales team lacks statistical background, so presenting confidence intervals and p-values directly would cause confusion. The data team should translate these results into practical business implications—such as 'customers with a 30-day inactivity are 40% more likely to churn'—without using terms like p-value or confidence interval. This aligns with the DA0-001 objective of tailoring communication to the audience's expertise level.

Exam trap

The trap here is that candidates often choose Option C (providing definitions) thinking it balances accuracy and clarity, but the DA0-001 exam emphasizes audience adaptation—definitions still require the audience to learn technical terms, which is less effective than plain-language explanations.

How to eliminate wrong answers

Option B is wrong because assuming the sales team will learn statistical terms over time is unrealistic and risks misinterpretation of critical insights, leading to poor business decisions. Option C is wrong because providing written definitions of technical terms still forces the sales team to process unfamiliar jargon, which can slow understanding and reduce engagement. Option D is wrong because a detailed statistical appendix is excessive for a non-technical audience and may overwhelm them, defeating the purpose of clear communication.

88
MCQhard

A data analyst at a retail company is building a dashboard for store managers to track sales performance. The data comes from three sources: point-of-sale (POS) systems, inventory, and customer loyalty. The POS table contains columns transaction_id, store_id, date, product_id, quantity, and price. The inventory table has product_id, store_id, stock_level, and reorder_point. The loyalty table has customer_id, transaction_id, and points_earned. The analyst creates a star schema with a sales_fact fact table containing all rows from POS, dimension tables for store, product, date, and customer. To calculate average transaction value, the analyst uses the formula SUM(quantity * price) / COUNT(*). Store managers report that the average transaction value appears too low, especially for stores with multiple registers. The analyst realizes that because each product sold in a transaction creates a separate row in sales_fact, a single transaction with multiple items contributes multiple rows. The current calculation divides by the number of rows rather than the number of distinct transactions. Which of the following is the best course of action to correct the average transaction value metric? (Choose one.)

A.Use the MEDIAN function instead of AVG
B.Aggregate the data at the transaction level before calculating the average
C.Use a different data model that denormalizes transaction totals into a new fact table
D.Create a calculated field that sums sales per transaction (quantity * price) and then averages across distinct transaction IDs
AnswerD

This correctly computes average transaction value by first summing per transaction.

Why this answer

Option A is correct because it calculates total sales per transaction (summing product-level rows) and then averages across distinct transactions, fixing the over-counting issue. Option B is too vague and does not specify how to aggregate. Option C is not required since the star schema is appropriate.

Option D uses median, which does not address the counting issue.

89
MCQmedium

A data analyst creates a scatter plot showing the relationship between advertising spend and revenue. The plot shows a strong positive correlation. Which of the following should the analyst include in the report to ensure accurate communication?

A.Include a note that correlation does not imply causation.
B.Replace the scatter plot with a bar chart.
C.Remove any outliers from the plot.
D.Add a trend line to the scatter plot.
AnswerA

This prevents misinterpretation of the relationship.

Why this answer

Option B is correct because correlation does not imply causation, and this caveat is essential. Option A is wrong because the scatter plot already shows the relationship. Option C is wrong because removing points could bias the analysis.

Option D is wrong because a regression line is not necessary for every scatter plot.

90
MCQhard

While reviewing a dashboard, an analyst notices that the data in a trend line chart does not match the underlying data due to a filter setting. The dashboard is used for weekly executive meetings. What should the analyst do?

A.Ignore the discrepancy if it is small.
B.Wait for someone to complain before acting.
C.Immediately remove the dashboard and send raw data.
D.Document the issue and fix the filter before the next meeting.
AnswerD

Proactive approach maintains data integrity and trust.

Why this answer

Option D is correct because the analyst has identified a data integrity issue caused by a filter setting that directly impacts the accuracy of the trend line chart. The dashboard is used for weekly executive meetings, so the analyst must document the discrepancy and correct the filter before the next meeting to ensure data-driven decisions are based on accurate visualizations. This aligns with best practices in data governance and the principle of maintaining trust in reporting tools.

Exam trap

The trap here is that candidates may assume small discrepancies are acceptable or that waiting for complaints is a valid approach, but the exam emphasizes proactive data integrity and the importance of maintaining accurate visualizations for scheduled stakeholder meetings.

How to eliminate wrong answers

Option A is wrong because ignoring even a small discrepancy in a dashboard used for executive decision-making can lead to compounded errors in trend analysis and erode trust in the data; any deviation from the underlying data must be investigated and corrected. Option B is wrong because waiting for someone to complain is reactive and unprofessional; the analyst should proactively ensure data accuracy, especially for a recurring weekly meeting where stakeholders rely on consistent, correct visualizations. Option C is wrong because immediately removing the dashboard and sending raw data disrupts the established reporting workflow and forces executives to interpret unaggregated data, which is inefficient and likely to introduce new errors; the proper action is to fix the filter and restore the correct trend line chart.

91
MCQmedium

A data analyst notices that a line chart showing monthly sales over the past two years has a steep drop in one month. Upon investigation, the analyst discovers that a new sales region was added mid-month and the data was not normalized. Which of the following best practices should the analyst apply to communicate this insight accurately?

A.Remove the month with the drop from the report.
B.Use a bar chart instead to show the data.
C.Normalize the sales data by region and explain the data anomaly in the report.
D.Present the data as-is and let stakeholders interpret the drop.
AnswerC

Normalization corrects the artifact, and explanation provides transparency.

Why this answer

Option C is correct because the core issue is that the sales data is not normalized by region, causing a misleading drop when a new region was added mid-month. By normalizing the data (e.g., calculating per-region averages or percentages) and explicitly noting the anomaly in the report, the analyst ensures accurate communication of insights. This aligns with the DA0-001 domain of Communicating Data Insights, where transparency and data integrity are paramount.

Exam trap

CompTIA often tests the misconception that simply changing the chart type (Option B) or omitting problematic data (Option A) is acceptable, when the correct approach is to normalize and explain the anomaly to maintain data integrity and clarity.

How to eliminate wrong answers

Option A is wrong because removing the month with the drop would hide the anomaly and distort the trend, violating the principle of data integrity and transparency. Option B is wrong because switching to a bar chart does not address the underlying issue of unnormalized data; the same misleading drop would still appear, just in a different visual format. Option D is wrong because presenting the data as-is without explanation shifts the burden of interpretation to stakeholders, who may draw incorrect conclusions due to the unnormalized data, which is a failure in clear communication.

92
MCQeasy

Refer to the exhibit. A data analyst is unable to access the dataset during weekends. Based on the policy, what is the most likely reason?

A.The analyst's role is not in the list
B.The policy is misconfigured
C.The access condition requires being on the internal network
D.The condition restricts access to business hours only
AnswerD

Weekends are outside business hours, so access is denied.

Why this answer

Option D is correct because the policy explicitly includes a condition that restricts access to business hours only (e.g., Monday–Friday, 9 AM–5 PM). Since the analyst is attempting to access the dataset during a weekend, the condition denies access regardless of the analyst's role or network location. This is a common attribute-based access control (ABAC) rule where time-based conditions are evaluated before granting permissions.

Exam trap

The trap here is that candidates may assume the issue is a missing role or network restriction, but the policy explicitly shows a time-based condition that overrides other permissions, and the weekend access attempt directly violates that condition.

How to eliminate wrong answers

Option A is wrong because the exhibit shows the analyst's role is listed in the policy (e.g., 'Data Analyst' is present in the role list), so the issue is not a missing role. Option B is wrong because the policy is correctly configured with a time-based condition; misconfiguration would imply a syntax error or invalid attribute, but the policy is syntactically valid and the denial is expected behavior. Option C is wrong because the policy does not include a network location condition (e.g., 'sourceIp' or 'internalNetwork' attribute); the only condition is time-based, so being on the internal network would not grant access during weekends.

93
MCQeasy

Which type of chart is most suitable for showing the distribution of a single continuous variable?

A.Histogram
B.Line chart
C.Scatter plot
D.Bar chart
AnswerA

Histograms display the frequency distribution of a continuous variable.

Why this answer

A histogram is the most suitable chart for showing the distribution of a single continuous variable because it groups the data into bins (intervals) along the x-axis and displays the frequency or count of observations within each bin on the y-axis. This allows you to see the shape, spread, and central tendency of the data, such as whether it is normally distributed, skewed, or has multiple modes. For a single continuous variable, a histogram directly visualizes the probability density or frequency distribution, which is its primary purpose in data analysis.

Exam trap

The trap here is that candidates often confuse a histogram with a bar chart, assuming both are interchangeable for showing frequencies, but a histogram is specifically for continuous data with no gaps between bars, while a bar chart is for discrete categories with gaps.

How to eliminate wrong answers

Option B is wrong because a line chart is designed to show trends over time or ordered categories by connecting data points with lines, not to display the distribution of a single continuous variable. Option C is wrong because a scatter plot is used to show the relationship between two continuous variables by plotting points on a Cartesian plane, not to visualize the distribution of a single variable. Option D is wrong because a bar chart is used for categorical data, where each bar represents a distinct category, and it does not handle continuous variables properly since it would require arbitrary binning without the statistical properties of a histogram.

94
MCQeasy

A data analyst is presenting findings to a non-technical audience. Which data visualization technique is most appropriate?

A.Box plot
B.Heat map
C.Scatter plot
D.Bar chart
AnswerD

Bar charts are straightforward and show comparisons clearly, making them suitable for non-technical audiences.

Why this answer

Bar charts are the most appropriate visualization for a non-technical audience because they present categorical comparisons in a simple, intuitive format that requires no statistical background to interpret. Unlike box plots, heat maps, or scatter plots, bar charts clearly show relative magnitudes with minimal cognitive load, making them ideal for communicating data insights to stakeholders who may not be familiar with data analysis conventions.

Exam trap

The trap here is that candidates often choose scatter plots or box plots because they seem more 'analytical,' but the question specifically targets audience comprehension, not analytical depth — Cisco tests whether you prioritize clarity over complexity when the audience lacks technical background.

How to eliminate wrong answers

Option A is wrong because box plots display data distribution through quartiles and outliers, which require understanding of statistical concepts like median, interquartile range, and whiskers — concepts that are not intuitive for a non-technical audience. Option B is wrong because heat maps use color gradients to represent data density or correlation, which can be ambiguous without a legend and are better suited for identifying patterns in large matrices rather than straightforward comparisons. Option C is wrong because scatter plots show relationships between two continuous variables and rely on the audience's ability to interpret correlation or clustering, which is not immediately accessible to non-technical viewers.

95
MCQhard

A data analyst discovers a data discrepancy in a report that has already been distributed to executives. The discrepancy is due to a known bug that will be fixed next month. What is the most appropriate action?

A.Send a corrected report immediately with an apology
B.Wait for the next scheduled update
C.Ignore it since it will be fixed soon
D.Inform the executives of the error and its impact
AnswerD

Transparency builds trust and allows informed decision-making.

Why this answer

Option B is correct because informing executives of the error and its impact maintains transparency and trust. Option A (ignore) is unethical. Option C (send corrected report immediately) may cause confusion without context.

Option D (wait) delays communication.

96
MCQhard

A data team is preparing a dashboard for executives. The team wants to highlight key performance indicators (KPIs) that are below target. Which of the following visualization techniques would most effectively draw attention to underperforming metrics without causing confusion?

A.Remove underperforming KPIs from the dashboard to avoid confusion.
B.Use a scatter plot to show the relationship between KPIs.
C.Apply conditional formatting to turn KPI values red when below target.
D.Use a pie chart showing the proportion of each KPI.
AnswerC

Red highlights call attention to issues immediately.

Why this answer

Conditional formatting that turns KPI values red when below target is the most effective technique because it leverages pre-attentive visual processing — the human eye naturally notices color changes (especially red) before other visual elements. This allows executives to instantly identify underperforming metrics without needing to interpret complex chart types, reducing cognitive load and confusion.

Exam trap

CompTIA often tests the misconception that removing or hiding negative data simplifies communication, when in fact effective dashboards must highlight both successes and failures to drive data-driven decisions.

How to eliminate wrong answers

Option A is wrong because removing underperforming KPIs from the dashboard hides critical negative information, violating the principle of transparency in data communication and preventing executives from taking corrective action. Option B is wrong because a scatter plot is designed to show relationships or correlations between two variables, not to highlight individual KPI performance against a target; it would introduce unnecessary complexity and fail to draw immediate attention to underperforming metrics. Option D is wrong because a pie chart shows proportions of a whole, which is irrelevant for comparing KPIs against targets; it cannot effectively indicate which metrics are below target and often leads to misinterpretation due to difficulty in comparing angles and areas.

97
MCQhard

A data analyst is building a dashboard that will be used by both executives and operational managers. The executives need high-level summaries, while managers need granular details. Which dashboard design principle should the analyst apply?

A.Use a single chart that shows both summary and detail simultaneously.
B.Display all available data on one page for transparency.
C.Design the dashboard with drill-down capabilities from summary to detail.
D.Create two separate dashboards for each audience.
AnswerC

Drill-down allows executives to see overview and managers to access details on demand.

Why this answer

Option C is correct because drill-down capabilities allow users to start with a high-level summary (e.g., total revenue by region) and then interactively navigate to granular details (e.g., individual transactions) without overwhelming either audience. This design principle supports both executive and operational manager needs within a single dashboard, maintaining clarity and performance by loading only the required level of detail on demand.

Exam trap

The trap here is that candidates often confuse 'drill-down' with 'separate dashboards' (Option D), thinking that different audiences require entirely different views, when in fact a single interactive dashboard with drill-down capabilities is the standard best practice for serving both summary and detail needs efficiently.

How to eliminate wrong answers

Option A is wrong because a single chart attempting to show both summary and detail simultaneously would likely become cluttered, violate data-ink ratio principles, and fail to provide clear high-level insights for executives or actionable granularity for managers. Option B is wrong because displaying all available data on one page creates information overload, reduces readability, and violates the principle of progressive disclosure, making it difficult for both audiences to quickly find relevant insights. Option D is wrong because creating two separate dashboards introduces maintenance overhead, potential data inconsistency, and forces users to switch contexts, whereas a single dashboard with drill-down capabilities is more efficient and user-friendly.

98
Matchingmedium

Match each data security concept to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Convert data into code to prevent unauthorized access

Replace sensitive data with non-sensitive placeholder

Hide parts of data (e.g., show only last 4 digits)

Restrict data access based on user permissions

Log of all data access and modifications

Why these pairings

Data security is critical for protecting sensitive information.

99
MCQhard

A large e-commerce company has a data warehouse that stores daily sales transactions. The data analytics team creates a weekly report for the executive team that includes total revenue, number of orders, and average order value. Recently, the report showed a sudden 20% drop in total revenue compared to the previous week. Upon investigation, the data analyst discovers that a major promotional event ended the previous week, and the current week's data includes a new product launch with lower prices. The analyst needs to present this data to the executives in a way that accurately reflects the business situation. The analyst must also provide recommendations for actions. Which of the following is the best course of action?

A.Remove the new product launch data from the report to show a clearer comparison.
B.Present only the data from the new product launch to show growth potential.
C.Present the data with annotations explaining the promotion end and new product launch, and recommend analyzing customer retention and cross-selling opportunities.
D.Present the drop as-is and recommend cutting costs to compensate for the revenue decline.
AnswerC

This provides full context and actionable insights.

Why this answer

Option C is correct because it adheres to the principle of transparent data communication: the analyst must present the full dataset with contextual annotations (promotion end, new product launch) to avoid misleading executives. This approach enables informed decision-making, and the recommendation to analyze customer retention and cross-selling opportunities directly addresses the underlying business dynamics—understanding whether the revenue drop is temporary or signals a shift in customer behavior. In a data warehouse context, this ensures that the weekly report reflects the true state of the business rather than a manipulated or incomplete view.

Exam trap

The trap here is that candidates may think removing or cherry-picking data (Options A and B) simplifies the story for executives, but the exam tests the ethical and professional standard of presenting complete, contextualized data—even when it shows a negative trend—while offering actionable recommendations based on the full picture.

How to eliminate wrong answers

Option A is wrong because removing the new product launch data constitutes data manipulation that violates the integrity of the report; it would hide a significant business event and prevent executives from understanding the full revenue picture. Option B is wrong because presenting only the new product launch data is a form of selection bias that ignores the overall revenue decline and the impact of the promotion end, leading to an overly optimistic and incomplete analysis. Option D is wrong because presenting the drop as-is without context or analysis fails to provide actionable insights; recommending cost cuts without understanding the root cause (promotion end and new product pricing) could lead to misguided strategic decisions.

100
MCQeasy

A data analyst needs to present the distribution of customer ages to a non-technical audience. Which type of chart would be most appropriate?

A.Scatter plot
B.Histogram
C.Pie chart
D.Line chart
AnswerB

Histograms show distribution of continuous data.

Why this answer

A histogram is the most appropriate chart for displaying the distribution of a single continuous variable, such as customer ages, to a non-technical audience. It groups ages into bins and shows the frequency of customers within each bin, making the shape, center, and spread of the distribution immediately visible. This aligns with the DA0-001 objective of selecting visualizations that clearly communicate data insights to stakeholders.

Exam trap

The trap here is that candidates often confuse a histogram with a bar chart, thinking a bar chart can show distribution, but a bar chart is for categorical data, not continuous numeric data grouped into bins.

How to eliminate wrong answers

Option A is wrong because a scatter plot is used to show the relationship between two continuous variables, not the distribution of a single variable like age. Option C is wrong because a pie chart is designed for parts-of-a-whole categorical data, not for displaying the frequency distribution of a continuous numeric variable. Option D is wrong because a line chart is typically used to show trends over time or ordered categories, not the distribution of a single continuous variable.

101
MCQeasy

A company must comply with GDPR when reporting customer data. What must be included in the report?

A.Customer names and emails
B.The source of the data
C.A notice about data processing purposes
D.Data retention period
AnswerC

GDPR mandates that data subjects be informed of the purposes of processing, especially when data is shared.

Why this answer

Under GDPR, when reporting customer data, the controller must provide a notice about the purposes of data processing to ensure transparency and lawful basis for processing. This is required by Article 13 of the GDPR, which mandates that data subjects be informed of the specific purposes for which their personal data is being processed. Including this notice in the report demonstrates compliance with the accountability principle.

Exam trap

The trap here is that candidates often confuse operational data elements (like names, source, or retention) with the mandatory transparency obligations under GDPR, assuming that including any data field satisfies compliance, whereas the core requirement is the notice about processing purposes.

How to eliminate wrong answers

Option A is wrong because customer names and emails are personal data that may be included in the report, but GDPR does not require their inclusion; in fact, pseudonymization or anonymization is often preferred to minimize data exposure. Option B is wrong because while the source of data may be relevant for transparency, GDPR does not mandate including the data source in every report; the focus is on processing purposes and lawful basis. Option D is wrong because the data retention period is a policy requirement under GDPR Article 5(1)(e), but it is not a mandatory element to be included in a customer data report; it is typically documented in the privacy notice or data retention policy, not the report itself.

102
Multi-Selectmedium

A data analyst is troubleshooting a dashboard that displays slowly. Which two of the following are common causes of dashboard performance issues? (Select TWO.)

Select 2 answers
A.Minimal use of filters.
B.Use of summary tables.
C.Large unaggregated datasets.
D.Complex calculated fields with many dependencies.
E.Excessive use of live data connections.
AnswersC, D

Correct. Large unaggregated datasets increase load on the dashboard.

Why this answer

Option C is correct because large unaggregated datasets force the dashboard to process every row of raw data at query time, which dramatically increases load times and memory consumption. Aggregated or pre-summarized tables reduce the volume of data scanned, improving performance.

Exam trap

The trap here is that candidates often confuse 'summary tables' (which improve performance) with 'large unaggregated datasets' (which degrade it), and they may overlook that complex calculated fields with dependencies are a hidden performance killer.

103
Multi-Selecthard

A data team is creating a report for the board of directors. The board wants to see both the big picture and the ability to explore details for specific regions. Which TWO approaches should the team implement?

Select 2 answers
A.Use a single summary metric for all regions.
B.Include an interactive dashboard with filters by region.
C.Embed a table of data for each region in an appendix.
D.Provide a static executive summary with no drill-down.
E.Create a hierarchical report with linked pages.
AnswersB, E

Interactive dashboards allow users to filter and drill into specific regions on demand.

Why this answer

Option B is correct because interactive dashboards with filters allow the board to see high-level summaries while drilling into region-specific details on demand. This approach aligns with modern BI tools like Tableau or Power BI, which support dynamic filtering and cross-filtering without requiring separate report pages.

Exam trap

CompTIA often tests the distinction between static data presentation (tables, summaries) and interactive exploration features (filters, drill-through), trapping candidates who choose static options like an appendix or single metric when the question explicitly asks for both big-picture and detail exploration.

104
MCQhard

A data analyst is presenting a weekly sales report to a diverse audience that includes sales managers, marketing team, and finance. The analyst wants to ensure the data is understood correctly. Which communication strategy is most effective?

A.Use a single complex chart to show all data at once.
B.Create a slide deck with key insights and contextual explanations.
C.Email the raw data table to all stakeholders.
D.Use technical terms such as 'variance' and 'standard deviation'.
AnswerB

Contextual explanations cater to different roles and improve understanding.

Why this answer

Option B is correct because a slide deck with key insights and contextual explanations tailors the presentation to a diverse audience, ensuring that sales managers, marketing, and finance can each grasp the relevant data without being overwhelmed. This approach aligns with best practices for data communication, where visual summaries and narrative context improve comprehension over raw data or overly technical jargon.

Exam trap

The trap here is that candidates may overvalue technical precision (Option D) or data completeness (Option C), forgetting that the primary goal is audience comprehension, not data exhaustiveness.

How to eliminate wrong answers

Option A is wrong because a single complex chart (e.g., a multi-series radar or stacked area chart) can obscure individual trends and overload viewers, leading to misinterpretation across roles with different analytical backgrounds. Option C is wrong because emailing raw data tables forces stakeholders to derive insights themselves, risking errors and inefficiency, especially for non-technical teams like marketing. Option D is wrong because using technical terms like 'variance' and 'standard deviation' without explanation alienates audience members who lack statistical training, violating the principle of audience-appropriate communication.

105
MCQmedium

A data analyst is troubleshooting a dashboard that displays incorrect totals for sales by region. The data source queries are correct. Which of the following is the most likely cause?

A.The visualizations are using a different aggregation level.
B.The data model includes duplicate records.
C.The dashboard is using a live connection instead of an extract.
D.The filter context is inadvertently excluding some regions.
AnswerD

Correct. Filters can exclude data without obvious indication, causing incorrect totals.

Why this answer

Option D is correct because filter context can inadvertently exclude certain regions, leading to incorrect totals. Options A, B, and C are less likely: A affects performance not accuracy; B would be caught by correct queries; C could cause aggregation differences but filters are a common issue.

106
MCQmedium

A data analyst encounters the above error log when trying to connect to a database. The analyst needs to explain the issue to the database administrator. Which of the following correctly describes the problem?

A.The database connection pool has reached its maximum limit.
B.The database table is corrupted.
C.The database server is out of disk space.
D.The database authentication credentials are invalid.
AnswerA

The log explicitly says 'Connection pool exhausted'.

Why this answer

The error log indicates a 'connection pool exhausted' or 'too many connections' message, which occurs when the database connection pool has reached its maximum limit. This means all available connections are in use, and no new connections can be established until existing ones are released. The analyst should explain to the DBA that the application is attempting to open more connections than the pool allows, often due to a connection leak or insufficient pool size.

Exam trap

The trap here is that candidates confuse connection pool exhaustion with authentication or disk space issues, but the error log's specific wording (e.g., 'cannot acquire connection from pool') directly points to a connection limit problem.

How to eliminate wrong answers

Option B is wrong because a corrupted table typically produces errors like 'table corruption' or 'index corruption', not connection pool exhaustion. Option C is wrong because out-of-disk-space errors manifest as 'disk full' or 'no space left on device', not connection limit errors. Option D is wrong because invalid authentication credentials result in 'access denied' or 'login failed' errors, not connection pool limit messages.

107
MCQhard

Refer to the exhibit. A data analyst is unable to run a query on the customers table after October 1, 2023. What is the reason?

A.The resource name is incorrect
B.The policy allows access only before October 1, 2023
C.The action should be READ not SELECT
D.The policy denies access after October 1, 2023
AnswerB

After October 1, the condition fails, and the Allow effect no longer applies, resulting in denial.

Why this answer

The policy explicitly allows access only before October 1, 2023, meaning any query attempt on or after that date is denied. This is a time-based access control condition, often implemented using AWS IAM or Azure RBAC policies with a `Condition` block that checks the `aws:CurrentTime` or equivalent attribute. Since the query fails after October 1, 2023, the policy's effective date restriction is the direct cause.

Exam trap

CompTIA often tests the distinction between an explicit deny and an implicit deny caused by a missing allow condition; the trap here is that candidates mistakenly think the policy contains an explicit 'deny after date' statement, when in reality it simply grants access only before the date, relying on the default implicit deny for all other times.

How to eliminate wrong answers

Option A is wrong because the resource name being incorrect would cause a different error (e.g., 'Table not found' or 'Invalid resource'), not a time-based denial. Option C is wrong because `SELECT` is the correct SQL action for reading data; the policy uses `SELECT` as the action identifier, not `READ`, and changing it would not resolve the time restriction. Option D is wrong because it misstates the policy logic: the policy does not explicitly deny access after October 1, 2023; instead, it grants access only before that date, which implicitly denies access after it.

The distinction matters because an explicit deny would override any allow, but here the absence of an allow after the date is the issue.

108
Multi-Selectmedium

Which TWO of the following are best practices for designing a data dashboard?

Select 2 answers
A.Include animated transitions between data views.
B.Use consistent color schemes to indicate performance levels.
C.Use 3D effects to make charts more visually appealing.
D.Place the most important KPIs at the top of the dashboard.
E.Include as many charts as possible to provide comprehensive data.
AnswersB, D

Consistent colors help users quickly interpret data.

Why this answer

Option B is correct because consistent color schemes (e.g., red for critical, yellow for warning, green for normal) allow users to instantly interpret performance levels without cognitive overload. This aligns with dashboard design principles that prioritize clarity and rapid pattern recognition over decorative elements.

Exam trap

The trap here is that candidates confuse 'visually appealing' with 'effective communication' — CompTIA often tests that decorative elements like 3D effects and animations reduce data accuracy and user comprehension, even though they may look impressive.

109
MCQmedium

A data analyst needs to create a dashboard that updates automatically every hour. The data source is a large database. Which approach minimizes performance impact?

A.Query the entire database each time
B.Use incremental refresh only for new or changed data
C.Export the data to Excel and import
D.Create a static report monthly
AnswerB

Incremental refresh minimizes database load.

Why this answer

Option B is correct because incremental refresh queries only new or changed records since the last refresh, drastically reducing data transfer and processing load on the large database. This approach uses change-tracking mechanisms (e.g., timestamps, CDC) to avoid full-table scans, minimizing performance impact while maintaining near-real-time updates.

Exam trap

CompTIA often tests the misconception that 'more data is better' or that full refreshes are simpler and equally acceptable, but the trap here is ignoring the performance cost of full database scans on large datasets in favor of the more efficient incremental approach.

How to eliminate wrong answers

Option A is wrong because querying the entire database each hour performs a full table scan on a large database, causing excessive I/O, CPU, and memory usage that degrades performance for all users. Option C is wrong because exporting the entire database to Excel and importing it adds unnecessary data transformation overhead, loses real-time capability, and still requires a full data pull. Option D is wrong because a static monthly report does not meet the requirement for automatic hourly updates and provides stale data, making it functionally incorrect for the use case.

110
MCQhard

Refer to the exhibit. A data analyst is reviewing a data quality report. Which of the following actions should the analyst take first?

A.Delete the 1200 records with null emails.
B.Fill null emails with a placeholder.
C.Investigate the source system to understand why emails are missing.
D.Ignore the nulls as they are not critical.
AnswerC

Correct. Root cause analysis should precede any corrective action.

Why this answer

Option C is correct because the first step in data quality remediation is root cause analysis. Without understanding why 1200 records have null emails (e.g., a source system bug, a failed ETL join, or a missing required field), any corrective action like deletion or placeholder insertion risks introducing bias or masking a systemic issue. Investigating the source system aligns with the data governance principle of 'fix the source, not the symptom.'

Exam trap

CompTIA often tests the principle that 'fix the source, not the symptom'—the trap here is that candidates jump to data cleansing actions (delete, fill, ignore) without first diagnosing why the nulls exist, which is a classic data quality management mistake.

How to eliminate wrong answers

Option A is wrong because deleting 1200 records with null emails reduces dataset size and may discard valid records if the nulls are due to a temporary system glitch, not actual missing data. Option B is wrong because filling null emails with a placeholder (e.g., 'unknown@domain.com') introduces false data that can skew analysis, violate email format constraints, and mislead downstream processes. Option D is wrong because ignoring nulls assumes they are non-critical without verification; in many contexts (e.g., customer communications, deduplication), missing emails are critical and can lead to incomplete insights or compliance issues.

111
MCQhard

A healthcare organization must ensure patient data privacy when sharing reports with external auditors. Which practice is most important?

A.Encrypt the report file
B.Obtain consent from patients
C.Aggregate data at low granularity
D.Use pseudonymization
AnswerD

Pseudonymization de-identifies data while retaining analytical value, meeting HIPAA requirements for sharing with auditors.

Why this answer

Pseudonymization replaces identifying information with pseudonyms, allowing data utility while protecting privacy. Aggregation reduces granularity but may still reveal identities; encryption secures transport but not the content; obtaining consent is impractical for large datasets.

112
MCQhard

During a presentation, a stakeholder questions the validity of a data insight because the sample size appears small. The analyst knows the sample is statistically significant. What is the best way to address this concern?

A.Ignore the question and continue the presentation.
B.Explain the margin of error and confidence interval used.
C.Ask the stakeholder to trust the analysis and move on.
D.Agree to collect more data before finalizing the report.
AnswerB

This provides statistical context to reassure the stakeholder.

Why this answer

Option B is correct because it directly addresses the stakeholder's concern by explaining the statistical concepts of margin of error and confidence interval, which demonstrate that the sample size is sufficient for the desired level of precision. This approach validates the data insight's reliability without dismissing the stakeholder's valid question, aligning with best practices in communicating data insights.

Exam trap

The trap here is that candidates may assume a small sample size is always invalid, but the DA0-001 exam tests understanding that statistical significance depends on the margin of error and confidence interval, not just sample size alone.

How to eliminate wrong answers

Option A is wrong because ignoring the question undermines trust and fails to address a legitimate concern about data validity, which is critical in data-driven presentations. Option C is wrong because asking for blind trust is unprofessional and does not provide the technical justification needed to alleviate doubts about sample size significance. Option D is wrong because agreeing to collect more data is unnecessary when the sample is already statistically significant, and it delays decision-making without addressing the underlying statistical reasoning.

113
MCQmedium

An analyst is presenting findings to a non-technical audience. The data shows a 20% increase in customer churn after a price change. Which presentation approach is BEST?

A.Explain the p-value
B.Provide the raw data table
C.Use a simple bar chart comparing churn before and after
D.Show a complex statistical model
AnswerC

A bar chart is simple, visual, and directly shows the comparison without jargon.

Why this answer

Option C is correct because a simple bar chart visually and intuitively communicates the 20% increase in churn to a non-technical audience without requiring statistical literacy. This approach aligns with best practices for presenting data insights to stakeholders who need clear, actionable takeaways rather than technical details.

Exam trap

The trap here is that candidates often overcomplicate the presentation by choosing technical options (like p-values or models) to demonstrate rigor, forgetting that the exam prioritizes audience-appropriate communication over statistical depth.

How to eliminate wrong answers

Option A is wrong because explaining a p-value introduces statistical significance testing, which is unnecessary and confusing for a non-technical audience that only needs to understand the magnitude of the change. Option B is wrong because providing the raw data table overwhelms the audience with numbers and fails to highlight the key insight (the 20% increase) effectively. Option D is wrong because showing a complex statistical model is inappropriate for a non-technical audience, as it obscures the simple before-and-after comparison and may lead to misinterpretation or disengagement.

114
Multi-Selectmedium

A data analyst is preparing a report to present to a mixed audience of technical and non-technical stakeholders. Which THREE techniques should the analyst use to ensure effective communication? (Choose three.)

Select 3 answers
A.Tailor the narrative to address different concerns
B.Use only one chart type for consistency
C.Use technical jargon to demonstrate expertise
D.Provide high-level summaries for non-technical audience
E.Include detailed technical appendices for those interested
AnswersA, D, E

Addressing diverse interests makes the report relevant to all.

Why this answer

Option A is correct because tailoring the narrative to address different concerns ensures that both technical and non-technical stakeholders receive relevant insights. For non-technical audiences, the analyst should focus on business impact and high-level trends, while for technical audiences, deeper data nuances can be included. This approach aligns with the DA0-001 domain of Communicating Data Insights, where audience analysis is critical for effective data storytelling.

Exam trap

The trap here is that candidates often confuse 'consistency' with 'clarity,' mistakenly believing that using a single chart type (Option B) simplifies the message, when in fact it can hide critical patterns that require different visual encodings.

115
MCQhard

A financial analyst at a bank is preparing a report on loan default risks to the risk management committee. The committee includes both technical (quantitative analysts) and non-technical (business managers) members. The analyst has built a logistic regression model that outputs probability scores for default. The model's performance is good, but the committee wants to understand the key drivers of default. The analyst needs to communicate both the model's accuracy and the impact of each feature. The report should be concise and persuasive, leading to policy changes. What is the best approach?

A.Provide a technical white paper.
B.Use a waterfall chart showing the impact of each feature on a sample prediction.
C.Present a feature importance bar chart and a table of coefficients.
D.Show the confusion matrix and AUC-ROC curve.
AnswerB

Intuitive visualization that explains contributions clearly to all audiences.

Why this answer

Option B is correct because a waterfall chart visually decomposes a single prediction into the additive contributions of each feature, making it intuitive for both technical and non-technical stakeholders to see which factors drive default risk. This approach directly addresses the committee's need to understand key drivers while keeping the report concise and persuasive for policy changes, unlike abstract metrics or tables.

Exam trap

The trap here is that candidates often pick Option C (feature importance bar chart and coefficients) thinking it is the most direct way to show feature impact, but they overlook that coefficients are on the log-odds scale and not easily interpretable by non-technical managers, whereas a waterfall chart provides a concrete, additive explanation for a single prediction.

How to eliminate wrong answers

Option A is wrong because a technical white paper is too detailed and jargon-heavy for non-technical business managers, failing the requirement for a concise and persuasive report. Option C is wrong because a feature importance bar chart and coefficient table require statistical literacy to interpret correctly, and coefficients in logistic regression are on the log-odds scale, which is not intuitive for non-technical audiences. Option D is wrong because a confusion matrix and AUC-ROC curve only communicate overall model accuracy and discrimination, not the impact of individual features on predictions, which is what the committee explicitly asked for.

116
MCQmedium

An analyst is creating a report to show the relationship between advertising spend and website traffic over the past 12 months. The data has a few outliers due to special promotional events. Which chart type should the analyst use to clearly show the trend while minimizing the impact of outliers?

A.Pie chart
B.Bar chart
C.Heatmap
D.Scatter plot with a trend line
AnswerD

Scatter plots show the relationship and outliers; a trend line summarizes the pattern.

Why this answer

A scatter plot with a trend line (Option D) is the best choice because it plots each data point individually, allowing the analyst to see the overall relationship between advertising spend and website traffic while the trend line (often a linear regression line) smooths out the influence of outliers. This chart type minimizes the visual impact of extreme values by focusing on the central tendency and direction of the data, making it ideal for identifying trends over 12 months despite promotional event spikes.

Exam trap

The trap here is that candidates often choose a bar chart (Option B) thinking it clearly shows trends over time, but they overlook that bar charts treat each period as a separate category and do not inherently reduce outlier impact, whereas a scatter plot with a trend line explicitly models the relationship and dampens outlier effects.

How to eliminate wrong answers

Option A is wrong because a pie chart shows proportions of a whole at a single point in time, not a trend over 12 months, and it cannot handle outliers or continuous variables like advertising spend and traffic. Option B is wrong because a bar chart compares discrete categories or time periods, but it treats each bar independently and does not inherently minimize outlier impact; outliers can distort the scale and make normal variations hard to see. Option C is wrong because a heatmap visualizes density or intensity across two dimensions using color gradients, which is useful for correlation matrices or geographic data, but it does not effectively show a continuous trend over time and can obscure the specific relationship between spend and traffic.

117
MCQhard

You are a data analyst at a retail company. Your team has deployed a new dashboard that displays real-time inventory levels across all warehouses. The dashboard uses a traffic light color scheme (green = sufficient, yellow = low, red = out of stock). After one week, the warehouse managers complain that the dashboard is overwhelming because it shows all 500+ products at once, and they cannot quickly identify which items need immediate attention. They request a way to prioritize items that are red or yellow. Additionally, the dashboard updates every second, causing the screen to flicker constantly, which they find distracting. You need to redesign the dashboard to address these issues. Which course of action should you take?

A.Add a filter to show only items with red or yellow status, and reduce the refresh rate to every 5 minutes.
B.Replace the dashboard with a static table that shows all products sorted alphabetically, and increase the refresh rate to every 2 seconds.
C.Remove the color scheme and display only numerical values, and keep the refresh rate at 1 second.
D.Add additional colors to distinguish more inventory levels and increase the refresh rate to every 500 milliseconds.
AnswerA

Filter reduces clutter; slower refresh eliminates flicker.

Why this answer

Option A is correct because it directly addresses both user complaints: adding a filter to show only red/yellow items reduces cognitive overload by hiding the 500+ products that are not critical, and reducing the refresh rate from every second to every 5 minutes eliminates distracting screen flicker while still providing near-real-time inventory updates. This approach balances usability with the need for timely alerts without overwhelming the user.

Exam trap

The trap here is that candidates may think increasing refresh rate improves data accuracy, but the real issue is user distraction and information overload, not data latency.

How to eliminate wrong answers

Option B is wrong because sorting alphabetically does not help prioritize critical items (red/yellow), and increasing the refresh rate to every 2 seconds would worsen the flicker issue. Option C is wrong because removing the color scheme removes the quick visual prioritization that the traffic light system provides, and keeping the 1-second refresh rate does not solve the flicker problem. Option D is wrong because adding more colors would increase visual complexity, not reduce it, and increasing the refresh rate to every 500 milliseconds would make the flicker even more distracting.

118
Multi-Selecthard

Which TWO of the following are best practices when creating a data visualization for a presentation to stakeholders?

Select 2 answers
A.Including interactive filters when possible to allow stakeholders to explore data
B.Using a consistent color scheme that aligns with company branding
C.Placing the legend at the bottom of the chart
D.Using 3D effects to make the chart visually appealing
E.Using only one type of chart throughout the presentation for consistency
AnswersA, B

Interactivity lets stakeholders focus on relevant data.

Why this answer

Option A is correct because interactive filters (e.g., slicers, parameter controls) empower stakeholders to drill down into specific data subsets, ask ad-hoc questions, and uncover insights without overwhelming the initial view. This aligns with the principle of user-driven exploration, which is a best practice for stakeholder presentations where different audience members may have varying interests. Including interactivity also reduces the need for multiple static charts, keeping the presentation concise while still offering depth.

Exam trap

CompTIA often tests the misconception that visual embellishments like 3D effects or consistent chart types are always beneficial, when in fact they can obscure data accuracy and limit communicative effectiveness.

119
MCQhard

An analyst is creating a data story about the decline in customer satisfaction scores. The data shows that satisfaction dropped after a software update. Which narrative structure best helps the audience understand the cause and effect?

A.Start with the current scores, then show historical trend
B.Begin with the software update event, then show satisfaction before and after
C.Present all data points without a story
D.Use a pie chart of satisfaction categories
AnswerB

This chronological structure clearly shows the cause and its effect.

Why this answer

Option B is correct because it follows a chronological cause-and-effect narrative, starting with the software update event and then showing satisfaction scores before and after. This structure directly maps to the audience's need to understand the causal relationship, as it highlights the intervention point and the resulting change in the metric. In data storytelling, this is known as the 'before-and-after' or 'change-over-time' narrative, which is most effective for demonstrating impact.

Exam trap

The trap here is that candidates often choose Option A because they think starting with the current state is more engaging, but Cisco tests the understanding that a chronological cause-and-effect narrative is required to clearly demonstrate the impact of a specific event, not just a general trend.

How to eliminate wrong answers

Option A is wrong because starting with current scores and then showing historical trend presents the data in reverse chronological order, which obscures the cause-and-effect relationship and forces the audience to infer the impact of the software update from the end of the story. Option C is wrong because presenting all data points without a story fails to provide context or a narrative arc, leaving the audience to interpret the cause-and-effect on their own, which is inefficient and can lead to misinterpretation. Option D is wrong because a pie chart of satisfaction categories shows a static distribution of current satisfaction levels, not the temporal change before and after the software update, and thus cannot illustrate cause and effect.

120
MCQmedium

After presenting a data analysis to stakeholders, they ask for additional details on a specific trend. The analyst needs to provide a follow-up report. What is the best approach?

A.Create a new dashboard with all the data.
B.Send the raw data file to the stakeholders.
C.Provide a detailed written report with charts and explanations.
D.Hold another meeting to verbally explain the trend.
AnswerC

Allows self-paced review and provides clear context.

Why this answer

Option C is correct because a detailed written report with charts and explanations allows stakeholders to review at their own pace and provides depth. Option A is wrong because creating a new dashboard for every request is inefficient. Option B is wrong because raw data files lack context and interpretation.

Option D is wrong because holding another meeting delays information.

121
MCQeasy

Refer to the exhibit. An ETL pipeline failed. What is the most likely root cause?

A.Data format mismatch
B.Authentication failure
C.Insufficient disk space
D.Network connectivity issues
AnswerD

Timeouts and connection resets are classic signs of network problems.

Why this answer

The exhibit shows an ETL pipeline failure with a timeout error, which is a classic symptom of network connectivity issues. When the pipeline cannot reach the source or target system due to a network interruption, the process hangs until a timeout threshold is exceeded, causing the job to fail. This is the most likely root cause because timeouts are directly associated with network problems, not with data format, authentication, or storage issues.

Exam trap

The trap here is that candidates may confuse a timeout error with a data format mismatch or authentication failure, but Cisco tests the understanding that timeouts are uniquely tied to network connectivity issues, not to data content or access permissions.

How to eliminate wrong answers

Option A is wrong because a data format mismatch would typically result in a parsing or schema validation error, not a timeout. Option B is wrong because an authentication failure would produce a 401 Unauthorized or 403 Forbidden error code, not a timeout. Option C is wrong because insufficient disk space would generate a 'disk full' or 'no space left on device' error, not a timeout.

122
MCQhard

A data analyst is tasked with presenting the results of a customer churn analysis to a group of marketing managers. The model identified that customers with low engagement scores are 3 times more likely to churn. The analyst wants to effectively communicate this finding to persuade the managers to focus on engagement. Which of the following is the most compelling way to present this insight?

A.Show a table with raw churn rates by engagement score
B.Use a bar chart comparing churn rates of low vs. high engagement
C.Show a scatter plot of engagement score vs. churn
D.Calculate the potential revenue increase if engagement improves by 10%
AnswerD

Revenue impact directly connects to business objectives.

Why this answer

Option A is correct because translating the finding into potential revenue impact resonates with managers' business goals. Option B (bar chart) is factual but less persuasive. Option C (scatter plot) is not compelling.

Option D (table) is less engaging.

123
MCQmedium

A data analyst creates a report showing sales by product category. The analyst notices that one category has a very high sales figure due to a one-time bulk order. Which of the following is the best way to communicate this insight to stakeholders?

A.Delete the bulk order from the dataset.
B.Add a note to the chart explaining the bulk order.
C.Remove the category with the bulk order from the report.
D.Use a pie chart to show the proportion of each category.
AnswerB

Annotation provides context for the anomaly.

Why this answer

Option B is correct because it maintains data integrity while providing necessary context. Adding a note to the chart allows stakeholders to understand the anomaly without distorting the underlying data. This approach aligns with best practices in data communication, where transparency about outliers is critical for accurate interpretation.

Exam trap

The trap here is that candidates may think removing or hiding data (options A and C) simplifies the message, but the exam tests the principle that data communication must be transparent and preserve the full context for decision-making.

How to eliminate wrong answers

Option A is wrong because deleting the bulk order from the dataset introduces bias and misrepresents actual sales performance, violating data integrity principles. Option C is wrong because removing the entire category hides the outlier entirely, preventing stakeholders from understanding its impact on overall trends. Option D is wrong because a pie chart would visually exaggerate the proportion of the bulk order category, potentially misleading stakeholders about normal sales distribution.

124
MCQhard

A data analyst is presenting results from a customer segmentation analysis to a non-technical audience. The segmentation was performed using k-means clustering, and the analyst wants to explain the characteristics of each cluster. Which visualization is most appropriate for this audience?

A.A parallel coordinates plot displaying all variables used in clustering.
B.A heatmap of the distance matrix between cluster centroids.
C.A bar chart comparing the average values of key metrics for each cluster.
D.A scatter plot with two principal components showing cluster boundaries.
AnswerC

Bar charts are simple, familiar, and effectively compare averages across categories.

Why this answer

A bar chart comparing average values of key metrics per cluster is the most appropriate choice because it directly translates the numerical output of k-means clustering into a simple, intuitive visual that non-technical audiences can easily interpret. Unlike complex multivariate plots, a bar chart focuses on the distinct characteristics of each cluster without requiring knowledge of clustering algorithms or dimensionality reduction. This aligns with the DA0-001 objective of communicating data insights effectively to stakeholders.

Exam trap

The trap here is that candidates may choose a technically sophisticated visualization like a parallel coordinates plot or PCA scatter plot, mistakenly believing it demonstrates deeper analytical skill, when the question specifically tests the ability to tailor visualizations to a non-technical audience's comprehension level.

How to eliminate wrong answers

Option A is wrong because a parallel coordinates plot displays all variables simultaneously, which overwhelms non-technical audiences with overlapping lines and multidimensional complexity, making it unsuitable for clear communication of cluster characteristics. Option B is wrong because a heatmap of the distance matrix between cluster centroids shows abstract mathematical relationships (e.g., Euclidean distances) that are meaningless to a non-technical audience and does not convey the actual metric values defining each cluster. Option D is wrong because a scatter plot with two principal components requires understanding of PCA and dimensionality reduction, which is not intuitive for non-technical stakeholders, and the cluster boundaries may be ambiguous without proper labeling of the original metrics.

125
MCQeasy

A retail company has a dashboard that tracks daily sales across all stores. The dashboard is used by the CEO during monthly board meetings. Recently, the CEO complained that the dashboard is confusing and takes too long to interpret. The data analyst is asked to redesign the dashboard to improve usability. The analyst has access to all sales data. The CEO prefers a high-level overview with the ability to drill down into regions if needed. The analyst must decide on a course of action. What should the analyst do first?

A.Revamp the dashboard with as many visualizations as possible.
B.Conduct a survey of all employees to understand their needs.
C.Create a static PDF report instead of a dashboard.
D.Interview the CEO to determine key performance indicators and preferred layout.
AnswerD

Directly addresses the user's frustration and gathers precise requirements.

Why this answer

The correct first step is to interview the CEO to determine key performance indicators and preferred layout. This ensures the dashboard is designed around the specific high-level metrics the CEO needs for monthly board meetings, with the ability to drill down into regions. Starting with stakeholder requirements prevents wasted effort on irrelevant visualizations or formats.

Exam trap

The trap here is that candidates may assume more data or broader input (options A or B) is always better, when in fact the correct approach is to focus on the specific needs of the primary user to reduce confusion and improve usability.

How to eliminate wrong answers

Option A is wrong because revamping with as many visualizations as possible would increase clutter and confusion, directly contradicting the CEO's complaint about the dashboard being confusing and taking too long to interpret. Option B is wrong because surveying all employees is unnecessary; the primary user is the CEO, and the dashboard is for board meetings, not general employee use. Option C is wrong because creating a static PDF report removes interactivity and the ability to drill down into regions, which the CEO explicitly prefers.

126
Drag & Dropmedium

Drag and drop the steps to perform a root cause analysis on data quality issues in the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order

Why this order

Root cause analysis involves defining the problem, gathering data, identifying causes, finding root cause, and acting.

127
MCQhard

An analyst notices that a dashboard displays conflicting data from two sources. What is the first step in troubleshooting?

A.Re-run all queries
B.Notify stakeholders of potential error
C.Replace the data source with the one believed to be correct
D.Check data transformation steps and join logic
AnswerD

Examining transformations and joins helps identify discrepancies between sources.

Why this answer

Option D is correct because the first step in troubleshooting conflicting data from multiple sources is to verify the data transformation steps and join logic. Conflicts often arise from incorrect joins (e.g., using INNER JOIN instead of LEFT JOIN), mismatched keys, or data type conversions that alter values. Checking these steps isolates the root cause before any other action, such as re-running queries or notifying stakeholders.

Exam trap

The trap here is that candidates often jump to re-running queries or notifying stakeholders first, mistaking a symptom (conflicting data) for a root cause, instead of methodically inspecting the transformation and join logic where the discrepancy likely originates.

How to eliminate wrong answers

Option A is wrong because re-running all queries blindly repeats the same potentially flawed logic without diagnosing the source of the conflict; it wastes time and may overwrite intermediate results. Option B is wrong because notifying stakeholders prematurely escalates an unverified issue, causing unnecessary alarm and eroding trust before the analyst has confirmed the error. Option C is wrong because replacing the data source without understanding the conflict ignores the possibility that the transformation logic or join conditions are the actual problem, and it may introduce new inconsistencies.

← PreviousPage 2 of 2 · 127 questions total

Ready to test yourself?

Try a timed practice session using only Communicating Data Insights questions.