A data scientist is using Amazon SageMaker to train a custom image classification model using a PyTorch script. The training job runs successfully but the model accuracy is lower than expected. The scientist wants to debug the training process by inspecting gradients and layer outputs. Which SageMaker feature should be used to capture this internal state during training?
SageMaker Debugger provides real-time monitoring of training metrics and internal state like gradients.
Why this answer
SageMaker Debugger captures internal model state such as gradients and tensors during training, enabling analysis and debugging. SageMaker Profiler (B) focuses on system performance, not model internals. SageMaker Experiments (C) tracks trials and metrics but does not capture internal state.
SageMaker Model Monitor (D) detects data drift after deployment.