Courseiva
Knowledge + Practice
CertificationsVendorsCareer RoadmapsLabs & ToolsStudy GuidesGlossaryPractice Questions
C
Courseiva

Free IT certification practice questions with explained answers for CCNA, CompTIA, AWS, Azure, Google Cloud, and more.

Certification Practice Questions

CCNA practice questionsSecurity+ SY0-701 practice questionsAWS SAA-C03 practice questionsAZ-104 practice questionsAZ-900 practice questionsCLF-C02 practice questionsA+ Core 1 practice questionsGoogle Cloud ACE practice questionsCySA+ CS0-003 practice questionsNetwork+ N10-009 practice questions
View all certifications →

Product

CertificationsCertification PathsExam TopicsPractice TestsExam Dumps vs Practice TestsStudy HubComparisons

Company

AboutContactEditorial PolicyQuestion Writing PolicyTrust Center

Legal

Privacy PolicyTerms of Service

Courseiva is a free IT certification practice platform offering original exam-style practice questions, detailed explanations, topic-based practice, mock exams, readiness tracking, and study analytics for Cisco, CompTIA, Microsoft, AWS, and other technology certifications.

© 2026 Courseiva. Courseiva is operated by JTNetSolutions Ltd. All rights reserved.

Courseiva is an independent certification practice platform and is not affiliated with, endorsed by, or sponsored by Cisco, Microsoft, AWS, CompTIA, Google, ISC2, ISACA, or any other certification vendor. Vendor names and certification marks are used only to identify the exams learners are preparing for.

← Ensuring solution quality practice sets

PDE Ensuring solution quality • Complete Question Bank

PDE Ensuring solution quality — All Questions With Answers

Complete PDE Ensuring solution quality question bank — all 0 questions with answers and detailed explanations.

45
Questions
Free
No signup
Certifications/PDE/Practice Test/Ensuring solution quality/All Questions
Question 1easymultiple choice
Read the full Ensuring solution quality explanation →

A data pipeline ingests streaming data from Pub/Sub into BigQuery via Dataflow. Recently, the pipeline has been failing with 'deadline exceeded' errors. What is the most likely cause?

Question 2mediummultiple choice
Read the full Ensuring solution quality explanation →

A team is designing a data lake on Google Cloud using Cloud Storage and BigQuery. They need to ensure that sensitive data (e.g., PII) is encrypted at rest and have the ability to audit access. Which approach meets these requirements?

Question 3hardmultiple choice
Read the full Ensuring solution quality explanation →

A company runs a batch processing job on Dataproc that uses Apache Spark to process 500 GB of data daily. The job completes successfully but takes 4 hours. The team wants to reduce the runtime to under 2 hours without increasing cost. What should they do?

Question 4mediummulti select
Read the full Ensuring solution quality explanation →

Which TWO actions are recommended to improve the reliability of a Cloud Dataflow streaming pipeline that processes event data from Pub/Sub?

Question 5hardmultiple choice
Read the full Ensuring solution quality explanation →

A data analyst runs a complex SQL query in BigQuery that joins multiple large tables and receives the above error. Which action is most likely to resolve the issue?

Exhibit

Refer to the exhibit.

BigQuery job error log:
{
  "jobId": "job_12345",
  "errorResult": {
    "reason": "resourcesExceeded",
    "message": "Resources exceeded during query execution: The query could not be executed in the allocated memory. High memory usage caused the query to fail."
  },
  "statistics": {
    "query": {
      "statementType": "SELECT",
      "totalSlotMs": 12000000,
      "totalBytesProcessed": 5000000000000
    }
  }
}
Question 6mediummultiple choice
Read the full Ensuring solution quality explanation →

A company runs a real-time anomaly detection system on Google Cloud. Streaming data from IoT devices is ingested via Pub/Sub, processed by Dataflow (Apache Beam), and results are written to Bigtable for low-latency serving. Recently, the system has been experiencing increased latency and occasional data loss. The Dataflow pipeline shows high system lag and backlog in Pub/Sub. The Bigtable cluster has 3 nodes and is reporting high CPU utilization (over 90%). The team suspects the issue is with the pipeline configuration. They have already verified that there are no errors in the pipeline code and no network issues. Which action should they take to resolve the issue?

Question 7mediumdrag order
Read the full Ensuring solution quality explanation →

Drag and drop the steps to create a Cloud Composer environment for Apache Airflow into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 8mediumdrag order
Read the full Ensuring solution quality explanation →

Drag and drop the steps to migrate an on-premises MySQL database to Cloud SQL using Database Migration Service into the correct order.

Drag steps to the numbered slots on the right, or tap a step then tap a slot.

Steps
Order
1Step 1
2Step 2
3Step 3
4Step 4
5Step 5
Question 9mediummatching
Read the full Ensuring solution quality explanation →

Match each machine learning term to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Model trained on labeled data

Model trained on unlabeled data

Agent learns by interacting with environment

Model performs well on training data but poorly on new data

Question 10mediummatching
Read the full Ensuring solution quality explanation →

Match each data encryption concept to its description.

Drag a concept onto its matching description — or click a concept then click the description.

Concepts
Matches

Customer-supplied encryption key

Customer-managed encryption key via Cloud KMS

CSEK: keys provided by customer; CMEK: keys managed in Cloud KMS

Data encrypted while moving across networks

Question 11easymultiple choice
Read the full Ensuring solution quality explanation →

A data pipeline processes streaming data with Dataflow. The team notices occasional data duplication in BigQuery. What is the best approach to ensure exactly-once processing?

Question 12mediummultiple choice
Read the full NAT/PAT explanation →

A company is deploying a large-scale streaming application on Google Kubernetes Engine. They need to ensure the application can handle sudden traffic spikes without dropping data. Which architectural pattern is most appropriate?

Question 13hardmultiple choice
Read the full Ensuring solution quality explanation →

A data science team uses AI Platform Training with hyperparameter tuning. They observe that some trials fail due to transient errors. To improve solution quality and reduce costs, what should they do?

Question 14easymultiple choice
Read the full Ensuring solution quality explanation →

A company runs batch jobs on Dataproc. They need to ensure that if a job fails, it automatically retries with exponential backoff. What is the recommended approach?

Question 15easymultiple choice
Read the full Ensuring solution quality explanation →

A team developed a microservice that writes logs to stdout. They want to centralize logs for analysis. Which GCP service should they use to automatically collect and store logs?

Question 16mediummultiple choice
Read the full Ensuring solution quality explanation →

A data platform uses Cloud Spanner for transactional data. They are experiencing high latency during write-heavy periods. To maintain solution quality, what configuration change is most effective?

Question 17easymultiple choice
Read the full Ensuring solution quality explanation →

A company uses Cloud Functions to process events from Cloud Storage. They notice that occasionally functions are not triggered. What should they check first to ensure solution quality?

Question 18hardmultiple choice
Read the full Ensuring solution quality explanation →

A financial services company uses Dataflow pipelines with late data handling. They need to ensure that all late-arriving data is processed correctly but also want to control costs. What is the best configuration?

Question 19easymultiple choice
Read the full Ensuring solution quality explanation →

A team is deploying a model on AI Platform Prediction. They want to monitor for data drift to maintain model quality. Which service should they use?

Question 20mediummulti select
Read the full Ensuring solution quality explanation →

A data engineer needs to monitor the performance of BigQuery queries to identify opportunities for optimization. Which TWO metrics should they focus on? (Choose two.)

Question 21hardmulti select
Read the full Ensuring solution quality explanation →

A company uses Cloud Build to deploy containerized applications. They want to ensure build and deployment quality. Which THREE steps should they include in their CI/CD pipeline? (Choose three.)

Question 22mediummulti select
Read the full Ensuring solution quality explanation →

A company wants to ensure high availability for their Cloud SQL instance. Which TWO actions are most appropriate? (Choose two.)

Question 23mediummultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A team configured a Cloud Monitoring alerting policy as shown. They recently started receiving false positive alerts. What is the most likely cause?

Exhibit

{
  "alertName": "High Error Rate",
  "condition": {
    "conditionThreshold": {
      "filter": "metric.type=\"logging.googleapis.com/user/my-app-error-count\"",
      "aggregations": [
        {
          "alignmentPeriod": "60s",
          "perSeriesAligner": "ALIGN_RATE"
        }
      ],
      "duration": "60s",
      "comparison": "COMPARISON_GT",
      "threshold": 10
    }
  },
  "documentation": {},
  "notificationChannels": []
}
Question 24hardmultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A team received this error when running a query. Which optimization should they apply first?

Exhibit

bq query --use_legacy_sql=false 'SELECT COUNT(*) FROM mydataset.mytable WHERE date >= "2024-01-01" AND date < "2024-02-01"' returned: 'Error: Query exceeded resource limits. For best practices, see https://cloud.google.com/bigquery/docs/best-practices-performance-compute'
Question 25easymultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A subscriber is unable to pull messages from the topic. What is the most likely cause?

Exhibit

gcloud pubsub topics get-iam-policy my-topic
Bindings:
- members:
  - serviceAccount:sa@project.iam.gserviceaccount.com
  role: roles/pubsub.subscriber
Question 26easymultiple choice
Read the full Ensuring solution quality explanation →

A company uses Cloud Monitoring to track application latency. They notice a spike in latency every 30 minutes. What is the best initial step to diagnose the issue?

Question 27easymultiple choice
Read the full Ensuring solution quality explanation →

A team deploys a new version of a Cloud Function. After deployment, error rates increase significantly. What is the most efficient way to diagnose the cause?

Question 28easymultiple choice
Read the full Ensuring solution quality explanation →

A data pipeline using Dataflow processes streaming data. Late-arriving events are currently being dropped. How should the team modify the pipeline to ensure late data is processed correctly?

Question 29mediummultiple choice
Read the full Ensuring solution quality explanation →

A company uses GKE to run microservices. They want to ensure the application restarts automatically if it becomes unresponsive. Which probes should they configure in their pod spec?

Question 30mediummultiple choice
Read the full Ensuring solution quality explanation →

After migrating a production Cloud SQL for PostgreSQL database to a larger machine type, the team notices slower queries. What is the best step to identify the cause?

Question 31mediummultiple choice
Read the full Ensuring solution quality explanation →

A company uses BigQuery for analytics. They need to ensure data quality by preventing duplicate records from being inserted. Which approach is most effective?

Question 32hardmultiple choice
Read the full Ensuring solution quality explanation →

A company uses Cloud Spanner for a global transactional application. During peak hours, commit latency increases by over 50%. Which configuration issue is the most likely root cause?

Question 33hardmultiple choice
Read the full Ensuring solution quality explanation →

A team deploys a Cloud Run service that processes user-uploaded files. Some requests time out after 60 minutes. They need to handle large files reliably without losing tasks. What is the best solution?

Question 34hardmultiple choice
Read the full Ensuring solution quality explanation →

A data engineering team uses Cloud Composer (Airflow) for workflow orchestration. They notice DAG runs frequently fail, and the error indicates insufficient Airflow workers. The team wants to ensure reliable execution. Which approach best addresses the issue?

Question 35easymulti select
Read the full Ensuring solution quality explanation →

A company uses Cloud Logging to monitor application errors. They want to set up real-time notifications for critical errors. Which two actions are essential? (Choose two.)

Question 36mediummulti select
Read the full Ensuring solution quality explanation →

A team runs a production application on Compute Engine. They want to ensure high availability and quality. Which three best practices should they implement? (Choose three.)

Question 37hardmulti select
Read the full Ensuring solution quality explanation →

A company uses Cloud Dataproc for ephemeral clusters to run batch jobs. They want to ensure job reliability and data quality. Which two configuration options should they use? (Choose two.)

Question 38mediummultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A team uses this Cloud Build configuration to deploy a service to Cloud Run. The deployment step fails with a 'Permission denied' error. What is the most likely cause?

Network Topology
args: ['run'image'region'platform'Refer to the exhibit.```yamlsteps:- name: 'gcr.io/cloud-builders/docker'args: ['build', '-t', 'gcr.io/$PROJECT_ID/my-image', '.']args: ['push', 'gcr.io/$PROJECT_ID/my-image']- name: 'gcr.io/cloud-builders/gcloud'```
Question 39hardmultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A BigQuery dataset is shared with the group 'analysts@example.com' using the IAM policy shown. A user who is a member of this group reports that they cannot run queries on the dataset, though they can see the tables. What is the most likely reason?

Exhibit

Refer to the exhibit.

```json
{
  "bindings": [
    {
      "role": "roles/bigquery.dataViewer",
      "members": [
        "group:analysts@example.com"
      ]
    }
  ]
}
```
Question 40hardmultiple choice
Read the full Ensuring solution quality explanation →

A company runs a data pipeline that ingests clickstream events from multiple websites into Cloud Pub/Sub, then processed by Dataflow to generate user sessions, and written to BigQuery for analytics. The pipeline runs 24/7. Recently, the team noticed that some sessions are incomplete due to missing events, and data quality checks reveal that about 2% of sessions have gaps of more than 30 minutes. The pipeline uses fixed 30-minute windows for sessionization, with allowed lateness set to 10 minutes. They have Cloud Monitoring dashboards tracking system throughput and pipeline lag but do not have custom metrics tracking per-element delays or watermark progress. The team suspects two possible causes: (a) the Pub/Sub subscription accumulates backlog and some messages are delivered after the window end; (b) the Dataflow job has insufficient workers causing checkpoint failures. The team needs to determine the root cause and improve data quality. What is the best first course of action?

Question 41easymultiple choice
Read the full Ensuring solution quality explanation →

A media company runs a batch data pipeline on Cloud Dataflow that ingests log files from Cloud Storage, transforms them, and writes results to BigQuery for analytics. The pipeline runs daily and has been stable for months. Recently, the source log format changed: a new optional field was added to some records. The pipeline started failing with ParseErrors for rows that contain the new field. The error logs show that the Dataflow job uses a hardcoded JSON schema that does not include the new field. The Dataflow pipeline logs are written to Stackdriver Logging, but no alerts are configured. The team wants to ensure that future schema changes do not break the pipeline and that failures are detected promptly. The team has limited experience with streaming and wants to keep the batch approach. Which course of action should the team take to improve solution quality?

Question 42hardmultiple choice
Read the full Ensuring solution quality explanation →

A financial services company operates a real-time fraud detection pipeline using Apache Beam running on Google Cloud Dataflow. The pipeline reads transactions from Pub/Sub, enriches them with customer data from Bigtable, runs a machine learning model with side inputs from a Redis cluster, and writes results to BigQuery for downstream reporting. The data must be processed with exactly-once semantics to avoid duplicate fraud alerts or missing transactions. The pipeline currently uses a global window with 5-minute accumulation, but the team is experiencing high latency and occasional duplicates when the model side input is updated (triggered every 15 minutes via a WatchTransform). Additionally, the pipeline has a dead letter queue that outputs failed records to a separate Pub/Sub topic, but these records are never reprocessed. The team needs to ensure high reliability and data quality. Which course of action should the team take to improve solution quality?

Question 43easymulti select
Read the full Ensuring solution quality explanation →

A company is developing a streaming Dataflow pipeline to process real-time sensor data. To ensure data quality, the team wants to detect malformed records and late data. Which two practices should they implement? (Choose two.)

Question 44mediummultiple choice
Read the full Ensuring solution quality explanation →

Refer to the exhibit. A Dataflow pipeline is failing intermittently with the shown error. Which step should the team take to ensure data quality and prevent such errors?

Exhibit

Refer to the exhibit.
{
  "insertId": "abc123",
  "jsonPayload": {
    "message": "Error processing element: expected integer at field 'temperature', got string 'hot'",
    "workerId": "worker-5",
    "step": "ParseAndValidate"
  },
  "resource": {
    "type": "dataflow_step",
    "labels": {
      "job_id": "job-1234",
      "step_id": "s2"
    }
  }
}
Question 45hardmultiple choice
Read the full Ensuring solution quality explanation →

A company runs a large Dataflow pipeline that aggregates user activity data from Pub/Sub into BigQuery every 10 minutes using fixed windows. Recently, the daily summary reports have shown 5-10% lower user engagement for certain segments compared to historical trends. The pipeline is completing successfully with no errors in Cloud Monitoring, and the Dataflow job dashboard shows all steps in green. There are no alarms. The team suspects data is being dropped or missed. They have verified that the Pub/Sub topic is receiving data correctly. After reviewing the pipeline code, they find that the pipeline uses a global window with a default 10-minute trigger, and writes results to a single BigQuery table partitioned by date. They also use exactly-once processing mode. Which of the following is the most likely cause and the best course of action to diagnose and fix the data quality issue?

Practice tests

Scored 10-question sessions with instant feedback and explanations.

PDE Practice Test 1 — 10 Questions→PDE Practice Test 2 — 10 Questions→PDE Practice Test 3 — 10 Questions→PDE Practice Test 4 — 10 Questions→PDE Practice Test 5 — 10 Questions→PDE Practice Exam 1 — 20 Questions→PDE Practice Exam 2 — 20 Questions→PDE Practice Exam 3 — 20 Questions→PDE Practice Exam 4 — 20 Questions→Free PDE Practice Test 1 — 30 Questions→Free PDE Practice Test 2 — 30 Questions→Free PDE Practice Test 3 — 30 Questions→PDE Practice Questions 1 — 50 Questions→PDE Practice Questions 2 — 50 Questions→PDE Exam Simulation 1 — 100 Questions→

Practice by domain

Each domain maps to a weighted exam section. Focus on the domain where you are weakest.

Designing data processing systemsBuilding and operationalizing data processing systemsOperationalizing machine learning modelsEnsuring solution quality

Practice by scenario

Filter questions by type — troubleshooting, exhibit, drag-and-drop, PBQ, ACLs, OSPF, and more.

Browse scenarios→

Continue studying

All Ensuring solution quality setsAll Ensuring solution quality questionsPDE Practice Hub