A company deploys a machine learning model to Vertex AI for real-time predictions. After deployment, they notice that prediction latency spikes during peak traffic hours. Which approach should they take to reduce latency without sacrificing accuracy?
Select one:
Google Cloud often tests the misconception that reducing features or using batch prediction is the p...