You are troubleshooting a slow-running pipeline in Azure Data Factory. The pipeline copies data from an on-premises SQL Server to Azure Synapse Analytics using a self-hosted integration runtime. The copy activity is using the 'Auto' copy method. You notice that network bandwidth is limited. Which configuration change would most likely improve performance?
Trap 1: Increase the Data Integration Units (DIU) for the copy activity
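DIUs apply only to copies that run on the Azure integration runtime. With a self-hosted integration runtime, throughput depends on the resources and number of the runtime nodes, so raising the DIU setting has no effect here.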
Trap 2: Change the copy method to 'Bulk insert'
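Bulk insert is a sink-side load option for Synapse and is generally slower than PolyBase for large loads. It does nothing to reduce the data sent over the constrained network link.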
Trap 3: Set the Fault Tolerance option to skip incompatible rows
Fault tolerance controls how incompatible rows are handled. It does not increase copy throughput.
- A
Enable staging using Azure Blob Storage and use PolyBase to load into Synapse
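With staged copy, the self-hosted integration runtime first writes the data to Azure Blob Storage and can compress it beforehand, which reduces the volume sent over the constrained network link. PolyBase then loads the staged data into Synapse in parallel.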
- B
Increase the Data Integration Units (DIU) for the copy activity
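Why wrong: DIUs apply only to copies that run on the Azure integration runtime. With a self-hosted integration runtime, throughput depends on the resources and number of the runtime nodes, so raising the DIU setting has no effect here.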
- C
Change the copy method to 'Bulk insert'
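Why wrong: Bulk insert is a sink-side load option for Synapse and is generally slower than PolyBase for large loads. It does nothing to reduce the data sent over the constrained network link.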
- D
Set the Fault Tolerance option to skip incompatible rows
Why wrong: Fault tolerance controls how incompatible rows are handled. It does not increase copy throughput.
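For reference, the staged-copy configuration described in option A can be sketched as the `typeProperties` fragment of a copy activity. The Python below only builds and prints that JSON. The activity name, the linked service name `StagingBlobStorage`, and the path `staging-container/adf` are hypothetical placeholders, and the fragment omits the `inputs`, `outputs`, and integration runtime references that a real pipeline definition needs. Treat it as a minimal sketch under those assumptions, not a complete pipeline.

```python
import json


def build_staged_copy_activity(staging_linked_service, staging_path,
                               enable_compression=True):
    """Return a partial copy activity definition with staged copy enabled.

    All names passed in are illustrative; datasets and runtime references
    are intentionally left out.
    """
    return {
        "name": "CopyOnPremSqlToSynapse",  # hypothetical activity name
        "type": "Copy",
        "typeProperties": {
            "source": {"type": "SqlServerSource"},
            # Load into Synapse with PolyBase instead of bulk insert.
            "sink": {"type": "SqlDWSink", "allowPolyBase": True},
            # Route the data through interim Blob Storage.
            "enableStaging": True,
            "stagingSettings": {
                "linkedServiceName": {
                    "referenceName": staging_linked_service,
                    "type": "LinkedServiceReference",
                },
                "path": staging_path,
                # Compress before upload to reduce traffic on the slow link.
                "enableCompression": enable_compression,
            },
        },
    }


activity = build_staged_copy_activity("StagingBlobStorage",
                                      "staging-container/adf")
print(json.dumps(activity, indent=2))
```

Setting `enableCompression` is the part that targets the bandwidth constraint directly. `allowPolyBase` addresses the load into Synapse once the data is in Blob Storage.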