- A
Use BigQuery column-level security with data masking.
Why wrong: Data masking is applied at query time; the underlying data remains sensitive.
- B
Write custom Dataflow transformations using a Java SDK to redact SSNs.
Why wrong: Custom code is less scalable and harder to maintain than DLP.
- C
Use Cloud DLP to inspect and transform the data, then store the de-identified results in BigQuery.
Cloud DLP can automatically identify and de-identify sensitive data.
- D
Use Cloud Data Catalog to tag sensitive columns and rely on access control.
Why wrong: Tagging does not de-identify the data.
Quick Answer
The correct approach is to use Cloud DLP to inspect and transform the data, then store the de-identified results in BigQuery. This is because Cloud DLP provides native, policy-based inspection and de-identification of sensitive data like Social Security numbers, and it can be integrated directly with both Dataflow and BigQuery to create a fully automated pipeline. On the Google Professional Cloud Security Engineer exam, this scenario tests your understanding of the difference between de-identification at rest versus masking at query time—a common trap is choosing BigQuery data masking, which only hides values during query execution but leaves the underlying sensitive data stored in plain text. Cloud DLP, by contrast, permanently transforms the data before it ever lands in BigQuery, ensuring compliance with healthcare regulations. Remember the memory tip: "Mask hides, DLP transforms"—if the requirement is to remove sensitive data from storage, always pick a transformation tool like Cloud DLP over a runtime masking feature.
PCSE Ensuring data protection Practice Question
This PCSE practice question tests your understanding of ensuring data protection. Read the scenario carefully and evaluate each option against the stated constraints before committing to an answer. After answering, compare your reasoning against the explanation and wrong-answer breakdown below. Once you have made your selection, read the full explanation to reinforce the concept and understand why each distractor is designed to mislead on exam day.
A healthcare organization ingests patient data into Cloud Storage and then processes it with Dataflow. They need to de-identify sensitive fields like Social Security numbers before storing in BigQuery. Which approach should they use?
Answer choices
Why each option matters
Answer the question above first, then reveal the full breakdown to understand why each option is right or wrong.
Correct answer & explanation
Use Cloud DLP to inspect and transform the data, then store the de-identified results in BigQuery.
Option B is correct because Cloud DLP inspection and transformation jobs can be integrated with BigQuery and Dataflow. Option A is wrong because BigQuery data masking only masks at query time, not at rest. Option C is wrong because Dataflow with custom code is more error-prone and harder to maintain. Option D is wrong because Cloud Data Catalog only catalogs but does not transform.
Key principle: Count usable hosts — not total addresses — and remember that the network and broadcast addresses are not available to hosts in standard IPv4 subnets.
Answer analysis
Option-by-option breakdown
For each option: why learners choose it and why it is or isn't the right answer here.
- ✗
Use BigQuery column-level security with data masking.
Why it's wrong here
Data masking is applied at query time; the underlying data remains sensitive.
- ✗
Write custom Dataflow transformations using a Java SDK to redact SSNs.
Why it's wrong here
Custom code is less scalable and harder to maintain than DLP.
- ✓
Use Cloud DLP to inspect and transform the data, then store the de-identified results in BigQuery.
Why this is correct
Cloud DLP can automatically identify and de-identify sensitive data.
Related concept
CIDR notation defines the prefix length.
- ✗
Use Cloud Data Catalog to tag sensitive columns and rely on access control.
Why it's wrong here
Tagging does not de-identify the data.
Common exam traps
Common exam trap: usable hosts are not the same as total addresses
Subnetting questions often tempt you into counting all addresses. In normal IPv4 subnets, the network and broadcast addresses are not usable host addresses.
Detailed technical explanation
How to think about this question
Subnetting questions test whether you can identify the network, broadcast address, usable range, mask and correct subnet. Slow down enough to calculate the block size correctly.
KKey Concepts to Remember
- CIDR notation defines the prefix length.
- Block size helps identify subnet boundaries.
- Network and broadcast addresses are not usable hosts in normal IPv4 subnets.
- The required host count determines the smallest suitable subnet.
TExam Day Tips
- Write the block size before choosing the subnet.
- Check whether the question asks for hosts, subnets or a specific address range.
- Do not confuse /24, /25, /26 and /27 host counts.
Key takeaway
Count usable hosts — not total addresses — and remember that the network and broadcast addresses are not available to hosts in standard IPv4 subnets.
Real-world example
How this comes up in practice
A media company stores terabytes of video archives that are accessed once a year for audit purposes. Moving these objects to a cold storage tier (Azure Archive, S3 Glacier, or Google Nearline) costs a fraction of hot storage. Questions like this test whether you understand storage tiers, access frequency tradeoffs, and retrieval latency requirements.
What to study next
Got this wrong? Here's your next step.
Review block sizes, usable host formulas (2^n − 2), and how to find network and broadcast addresses for /24 through /30. Then practise related PCSE subnetting questions on CIDR, address ranges, and subnet selection.
- →
Ensuring data protection — study guide chapter
Learn the concepts, then practise the questions
- →
Ensuring data protection practice questions
Targeted practice on this topic area only
- →
All PCSE questions
500 questions across all exam domains
- →
Google Professional Cloud Security Engineer study guide
Full concept coverage aligned to exam objectives
- →
PCSE practice test guide
How to use practice tests most effectively before exam day
Related practice questions
Related PCSE practice-question pages
Use these pages to review the topic behind this question. This is how one missed question becomes focused revision.
Configuring network security practice questions
Practise PCSE questions linked to Configuring network security.
Configuring access within a cloud solution environment practice questions
Practise PCSE questions linked to Configuring access within a cloud solution environment.
Ensuring data protection practice questions
Practise PCSE questions linked to Ensuring data protection.
Managing operations in a cloud solution environment practice questions
Practise PCSE questions linked to Managing operations in a cloud solution environment.
Supporting compliance requirements practice questions
Practise PCSE questions linked to Supporting compliance requirements.
PCSE fundamentals practice questions
Practise PCSE questions linked to PCSE fundamentals.
PCSE scenario practice questions
Practise PCSE questions linked to PCSE scenario.
PCSE troubleshooting practice questions
Practise PCSE questions linked to PCSE troubleshooting.
Practice this exam
Start a free PCSE practice session
Short sessions build daily habit. Longer sessions build exam-day stamina. Try a timed session to simulate real conditions.
FAQ
Questions learners often ask
What does this PCSE question test?
Ensuring data protection — This question tests Ensuring data protection — CIDR notation defines the prefix length..
What is the correct answer to this question?
The correct answer is: Use Cloud DLP to inspect and transform the data, then store the de-identified results in BigQuery. — Option B is correct because Cloud DLP inspection and transformation jobs can be integrated with BigQuery and Dataflow. Option A is wrong because BigQuery data masking only masks at query time, not at rest. Option C is wrong because Dataflow with custom code is more error-prone and harder to maintain. Option D is wrong because Cloud Data Catalog only catalogs but does not transform.
What should I do if I get this PCSE question wrong?
Review block sizes, usable host formulas (2^n − 2), and how to find network and broadcast addresses for /24 through /30. Then practise related PCSE subnetting questions on CIDR, address ranges, and subnet selection.
What is the key concept behind this question?
CIDR notation defines the prefix length.
About these practice questions
Courseiva creates original exam-style practice questions with explanations and wrong-answer analysis. It does not publish real exam questions, exam dumps, or protected exam content. Learn why practice questions differ from exam dumps →
Same concept, more angles
1 more ways this is tested on PCSE
These questions test the same concept from different angles. Work through them to make sure you can recognise it however the exam phrases it.
Variation 1. A healthcare organization stores Protected Health Information (PHI) in Cloud Storage. They need to de-identify data before sharing it with researchers. Which service should they use?
medium- A.Cloud Key Management Service (KMS)
- B.VPC Service Controls
- C.Cloud IAM
- ✓ D.Cloud Data Loss Prevention (DLP)
Why D: Cloud Data Loss Prevention (DLP) is the correct service because it is specifically designed to inspect, classify, and de-identify sensitive data such as Protected Health Information (PHI). It uses built-in infoTypes (e.g., US_INDIVIDUAL_HEALTHCARE_NPI) and de-identification techniques like masking, tokenization, and redaction to transform PHI into a de-identified dataset before sharing with researchers, ensuring compliance with HIPAA.
Keep practising
More PCSE practice questions
- Match each IAM role to its typical use case.
- Match each encryption scope to its description.
- Match each CVE or security concept to its description.
- Match each Google Cloud logging/monitoring term to its definition.
- Drag and drop the steps to rotate a customer-managed encryption key (CMEK) in Cloud KMS in the correct order.
- Drag and drop the steps to configure a Cloud NAT for private VM instances in the correct order.
Last reviewed: Jun 24, 2026
This PCSE practice question is part of Courseiva's free Google Cloud certification practice question bank. Courseiva provides original exam-style practice questions with explanations, topic-based practice, mock exams, readiness tracking, and study analytics to help learners prepare for the PCSE exam.
Question Discussion
Share a tip, memory trick, or ask about the reasoning behind this question. Do not post real exam questions, leaked content, braindumps, or copyrighted exam material. Comments are moderated and may be removed without notice.
Sign in to join the discussion.