MLS-C01 Data Engineering • Set 1
MLS-C01 Data Engineering Practice Test 1 — 15 questions with explanations. Free, no signup.
A data science team uses Amazon SageMaker to train models on a large dataset stored in S3. The dataset is 500 GB in CSV format and is updated daily. The team wants to optimize data loading for training jobs to reduce I/O wait time. Which data ingestion strategy is MOST effective?