MLS-C01 Data Engineering • 10 Questions
10 MLS-C01 Data Engineering practice questions with answers and explanations. Free, no signup.
A data science team uses Amazon SageMaker to train models on a large dataset stored in S3. The dataset is 500 GB in CSV format and is updated daily. The team wants to optimize data loading for training jobs to reduce I/O wait time. Which data ingestion strategy is MOST effective?