PDE Building and operationalizing data processing systems • Set 2
Practice Test 2: 15 questions with explanations.
A company is building a real-time streaming pipeline using Pub/Sub and Dataflow to process clickstream data. The pipeline aggregates metrics in 10-second fixed windows and writes the results to BigQuery. During peak traffic, some windows produce duplicate rows in BigQuery. What is the most likely cause?