PDE Designing Data Processing Systems • Set 7
PDE Designing Data Processing Systems Practice Test 7 — 15 questions with explanations. Free, no signup.
A data pipeline ingests streaming events into Pub/Sub and needs to join them with a slowly updating reference table (few thousand rows) from a Cloud Storage CSV file. The pipeline runs on Dataflow with Apache Beam. Which approach is most cost-effective and operationally simple?