PMLE Scaling prototypes into ML models • Complete Question Bank
Complete PMLE Scaling prototypes into ML models question bank — all 0 questions with answers and detailed explanations.
Refer to the exhibit. ``` Model accuracy: 0.92 Training data: 10,000 records Online prediction latency: 95th percentile = 450ms QPS: 50 After moving to production: - New data from users: 100,000 records/day - Data distribution shift detected (new features emerge) - Prediction latency increases to 95th percentile = 1200ms - QPS drops to 30 ```
Drag steps to the numbered slots on the right, or tap a step then tap a slot.
Drag steps to the numbered slots on the right, or tap a step then tap a slot.
Drag a concept onto its matching description — or click a concept then click the description.
Area Under the ROC Curve
Mean Squared Error
Tensor Processing Unit
Support Vector Machine
Principal Component Analysis
Drag a concept onto its matching description — or click a concept then click the description.
Game-theoretic approach to explain feature contributions
Local surrogate model to explain individual predictions
Ranking features by their impact on model output
Shows marginal effect of a feature on predictions
Measures decrease in performance when feature is shuffled
Refer to the exhibit.
{
"name": "projects/my-project/locations/us-central1/endpoints/123456",
"displayName": "my_endpoint",
"deployedModels": [
{
"id": "123",
"model": "projects/my-project/locations/us-central1/models/456",
"displayName": "model_v1",
"automaticResources": {
"minReplicaCount": 1,
"maxReplicaCount": 10
},
"dedicatedResources": null,
"enableAccessLogging": true
}
]
}gcloud ai endpoints deploy-model \
--endpoint=projects/my-project/locations/us-central1/endpoints/456 \
--model=projects/my-project/locations/us-central1/models/789 \
--display-name=my-deployment \
--machine-type=n1-standard-4 \
--min-replica-count=0 \
--max-replica-count=10 \
Error: (gcloud.beta.ai.endpoints.deploy-model) INVALID_ARGUMENT: min_replica_count must be at least 1 for online prediction.