PDE Maintaining and Automating Data Workloads • Set 2
PDE Maintaining and Automating Data Workloads Practice Test 2 — 15 questions with explanations. Free, no signup.
A data engineer is building a batch pipeline that runs daily using Cloud Composer. The pipeline has three tasks: extract data from Cloud Storage, transform data using Dataflow, and load the transformed data into BigQuery. The engineer wants to ensure that the Dataflow job only starts after the extraction task completes successfully, and the load task only starts after the Dataflow job finishes. How should the engineer define the task dependencies in the Airflow DAG?