PDE Building and operationalizing data processing systems • Set 5
Practice Test 5: 15 questions with explanations.
You are optimizing a Dataflow pipeline that performs a group-by-key transformation on a large dataset. The pipeline has high latency because of data skew: a few hot keys have far more values than the rest. Which TWO actions can help mitigate the skew? (Choose two.)