1
Data Preparation for Machine Learning
medium
A company uses SageMaker Processing jobs to clean customer transaction data. The processing script runs on a single ml.m5.large instance and takes 30 minutes to process 50 GB of data in CSV format. To reduce processing time, the company wants to process 200 GB of data within 1 hour. Which combination of changes should the company make?