Knowledge + Practice

Salesforce AI Associate AI Associate (AI Associate) — Questions 76–150

506 questions total · 7pages · All types, answers revealed

Take a mock exam Exam hub

Page 2 of 7

76

MCQeasy

Refer to the exhibit. A developer runs a SOQL query. What does the output indicate?

A.The query returned 10 records in total.

B.The query is still processing.

C.The output is incomplete.

D.The query failed.

AnswerA

totalSize shows the number of records returned, and done=true means the query finished.

Why this answer

The SOQL query output shows '10 records returned' with no error or partial result indicator, confirming that the query completed successfully and returned exactly 10 records. In Salesforce SOQL, the query result includes a 'totalSize' field that reflects the total number of records matching the query criteria, and here it matches the number of records displayed, indicating a complete and successful retrieval.

Exam trap

Salesforce often tests the misconception that a small result set might be incomplete or that the query is still running, but the presence of a record count matching the displayed records and no error or pagination indicator confirms a complete and successful query.

How to eliminate wrong answers

Option B is wrong because SOQL queries are synchronous and either complete or fail; there is no 'still processing' state in the output—if processing were ongoing, the query would not return a result set. Option C is wrong because the output explicitly states '10 records returned' and shows all records, with no truncation or 'more records available' indicator; SOQL uses query locators for large result sets, but here the count matches the displayed records, so the output is complete. Option D is wrong because a failed query would return an error message or exception, not a list of records; the presence of a result set with a record count confirms success.

Full explanation →

77

Multi-Selecteasy

Which TWO of the following are valid methods to improve data quality in Data Cloud before training an AI model?

Select 2 answers

A.Increase data stream ingestion frequency

B.Use a Data Transform to filter out invalid records

C.Implement data retention policies

D.Use Calculated Insights to detect anomalies

E.Enable data profiling on data streams

AnswersB, E

Directly removes poor-quality data.

Why this answer

Options B and D are correct. Data profiling helps understand data quality issues, and data transforms can filter invalid records. Option A is wrong because calculated insights detect anomalies but don't directly improve quality.

Option C is wrong because retention policies manage data lifecycle, not quality. Option E is wrong because increasing frequency does not fix existing quality issues.

Full explanation →

78

MCQmedium

A company uses Salesforce Einstein to build an AI model that predicts customer churn. The model is trained on historical data from the past two years. During testing, the model shows significantly higher accuracy for male customers compared to female customers. What is the most ethical course of action?

A.Deploy the model but add a disclaimer that it may be less accurate for female customers.

B.Deploy the model as is, because it still meets the overall accuracy threshold.

C.Investigate the cause of the disparity, retrain the model with more representative data, and re-evaluate fairness.

D.Manually adjust the model's output to ensure equal churn predictions across genders.

AnswerC

This addresses bias and upholds fairness principles.

Why this answer

Option B is correct because investigating and retraining the model to reduce bias is the ethical approach. Option A is wrong because ignoring the disparity could lead to unfair treatment. Option C is wrong because immediately deploying without correction is irresponsible.

Option D is wrong because manually adjusting predictions introduces new biases.

Full explanation →

79

MCQhard

An AI system for hiring is found to have a disparate impact on a protected class. The company is legally required to...

A.Conduct a bias audit and take corrective action.

B.Obtain consent from applicants.

C.Publish the algorithm.

D.Discontinue use of the system.

AnswerA

This is typically legally required.

Why this answer

Option B is correct because under many regulations, disparate impact triggers a requirement for bias audit and corrective action. Option A is wrong because discontinuation is not immediately required; mitigation is possible. Option C is wrong because obtaining consent does not address the impact.

Option D is wrong because publishing the algorithm is not typically a legal requirement.

Full explanation →

80

Multi-Selectmedium

Which THREE are key principles of Salesforce's AI Ethics framework? (Choose three.)

Select 3 answers

A.Be Inclusive

B.Be Fast Over Accurate

C.Be Accountable

D.Be Transparent

E.Be Profit-Maximizing

AnswersA, C, D

Inclusivity ensures fairness for all users.

Why this answer

Options A, C, and D are correct. Option B is wrong because 'maximum profit' is not an ethical principle. Option E is wrong because 'speed over accuracy' contradicts responsible AI.

Full explanation →

81

MCQmedium

A retail company implements an AI chatbot to recommend products. After launch, they notice the chatbot frequently suggests expensive items to budget-conscious customers. Which AI bias is most likely occurring?

A.Confirmation bias

B.Anchoring bias

C.Sample bias (biased training data)

D.Overconfidence bias

AnswerC

If the training data overrepresents high-spending customers, the model may learn to recommend expensive products to all users.

Why this answer

The chatbot's frequent suggestion of expensive items to budget-conscious customers indicates that the training data was biased toward high-cost products, leading the model to learn and replicate that preference. This is a classic case of sample bias (biased training data), where the dataset does not accurately represent the target user population, causing systematic errors in the model's recommendations.

Exam trap

Salesforce often tests the distinction between human cognitive biases (like anchoring or confirmation bias) and data-driven biases (like sample bias), so the trap here is that candidates confuse a human reasoning flaw with a machine learning training data issue.

How to eliminate wrong answers

Option A is wrong because confirmation bias refers to a human tendency to favor information that confirms preexisting beliefs, not a data-driven AI model's output skew. Option B is wrong because anchoring bias is a cognitive heuristic where humans rely too heavily on the first piece of information encountered, not a bias originating from training data distribution. Option D is wrong because overconfidence bias relates to a model or human assigning excessive certainty to predictions, not to a systematic skew in recommendation outputs due to imbalanced training data.

Full explanation →

82

MCQhard

A financial institution uses AI for loan approvals. They notice the model is denying loans to women more often. After retraining with balanced data, the disparity persists. What is the next best step?

A.Increase model regularization

B.Remove the gender feature

C.Reduce model complexity

D.Use adversarial debiasing technique

AnswerD

Adversarial debiasing explicitly reduces bias in learned representations.

Why this answer

Option B is correct because adversarial debiasing directly reduces bias in model representations. Option A is wrong because removing the gender feature may not eliminate bias if other features correlate. Option C is wrong because regularization helps control overfitting, not bias.

Option D is wrong because reducing complexity may not address bias.

Full explanation →

83

MCQmedium

A large enterprise needs to integrate data from Salesforce CRM, an external ERP, and marketing automation to train an AI model for cross-sell recommendations. Which data storage strategy is most aligned with Salesforce's AI capabilities?

A.Use only Salesforce CRM data and ignore external sources

B.Store each source separately in Data Cloud and train models on each

C.Export all data to an external data lake and build a custom model

D.Use Salesforce Data Cloud to unify the datasets

AnswerD

Data Cloud provides harmonization, governance, and native Einstein integration.

Why this answer

Salesforce Data Cloud is designed to unify data from multiple sources into a single platform for AI and analytics. Exporting to a data lake adds complexity, using only Salesforce objects limits data scope, and storing flat files lacks governance.

Full explanation →

84

MCQeasy

A marketing team wants to use Einstein Engagement Scoring to prioritize leads. What is the primary input for this AI feature?

A.Lead interaction history with emails and web activity.

B.Historical conversion data from closed opportunities.

C.Lead demographic information like industry and company size.

D.Social media posts and mentions of the company.

AnswerA

Engagement is measured by interactions.

Why this answer

Einstein Engagement Scoring analyzes lead interactions (email opens, clicks, web visits) to calculate engagement scores. Option A is correct. Option B is wrong because demographic data is not the primary input.

Option C is wrong because historical conversion data is used for predictive scoring, not engagement. Option D is wrong because social media data is not a direct input.

Full explanation →

85

MCQeasy

A sales manager wants to implement Einstein Automated Contacts to automatically create contacts from email interactions. The admin enables the feature and assigns the permission set. However, no contacts are being created automatically. What is the most likely reason?

A.The users have not logged any emails to Salesforce.

B.The admin did not set up the contact creation criteria in the Einstein Automated Contacts settings.

C.The feature requires at least 500 emails to be processed before it starts creating contacts.

D.The users need to install a plugin for their email client.

AnswerB

Rules must be configured; default is no creation.

Why this answer

Einstein Automated Contacts requires the admin to explicitly define the criteria that trigger automatic contact creation from email interactions. Simply enabling the feature and assigning the permission set does not configure the rules for when and how contacts should be created. Without these criteria, the system has no instructions to act upon, so no contacts are generated.

Exam trap

Salesforce often tests the misconception that enabling a feature and assigning permissions is sufficient for it to work, when in reality, configuration of specific rules or criteria is required to trigger the intended behavior.

How to eliminate wrong answers

Option A is wrong because the feature processes emails that are already logged to Salesforce; if users have not logged any emails, there would be no data to process, but the question states no contacts are being created, implying the feature is not triggering even when emails are present. Option C is wrong because Einstein Automated Contacts does not require a minimum number of emails to be processed before it starts creating contacts; it works on individual email interactions once criteria are set. Option D is wrong because Einstein Automated Contacts is a native Salesforce feature that works with the standard email integration (e.g., Salesforce for Outlook or Gmail) and does not require a separate plugin installation.

Full explanation →

86

MCQmedium

A dataset contains a 'date' column. Which feature engineering technique would best capture both long-term trends and seasonal patterns?

A.Extract year, month, day as separate features.

B.Use only the day of week.

C.Create cyclic features (sin/cos of month, day).

D.Drop the date column.

AnswerC

Cyclic encoding preserves the periodic nature of time.

Why this answer

Option C is correct because cyclic features using sine and cosine transformations preserve the circular nature of temporal data (e.g., month 12 and month 1 are adjacent, not far apart). This allows a model to learn both long-term trends (via the year component) and seasonal patterns (via the cyclic encoding of month and day) without imposing a false linear ordering. In contrast, simple numeric extraction treats time as linear, which can misrepresent seasonal cycles.

Exam trap

Salesforce often tests whether candidates recognize that simple numeric extraction (e.g., month as 1–12) fails to model cyclical continuity, leading them to mistakenly choose Option A over the correct cyclic encoding.

How to eliminate wrong answers

Option A is wrong because extracting year, month, and day as separate numeric features introduces a linear ordering that fails to capture the cyclical relationship between months (e.g., December and January are treated as far apart). Option B is wrong because using only the day of week ignores long-term trends and seasonal patterns across months or years, capturing only weekly periodicity. Option D is wrong because dropping the date column discards all temporal information, making it impossible for the model to learn any time-based patterns.

Full explanation →

87

Multi-Selecthard

A sales operations admin wants to use Einstein Opportunity Scoring. Which two steps are required to activate Einstein Opportunity Scoring? (Select two answers.)

Select 2 answers

A.Enable Einstein in the Einstein Setup menu.

B.Ensure that at least 100 opportunities have been won in the last 18 months.

C.Create a custom scoring model specific to the business.

D.Deactivate any existing scoring rules.

E.Assign the Einstein Opportunity Scoring permission set to users.

AnswersA, B

Einstein must be turned on at the org level.

Why this answer

Options B and C are correct: you need historical data (100 won opps in 18 months) and enable Einstein. Option A is incorrect because the permission set is needed for user access but not for activation. Option D is optional, not required.

Option E is incorrect because you don't need to deactivate other scoring rules.

Full explanation →

88

MCQhard

A company deploys an AI chatbot for customer service. After training on historical chats, the chatbot frequently gives incorrect answers to minority language queries. What is the likely cause?

A.Underfitting

B.Data imbalance

C.Lack of compute resources

D.Overfitting

AnswerB

Data imbalance causes the model to perform poorly on underrepresented groups, such as minority language queries.

Why this answer

Option C is correct because data imbalance means the training data had few examples of minority language queries, leading to poor performance. Option A is wrong because overfitting would cause poor generalization on all new data, not specifically minority languages. Option B is wrong because underfitting would cause poor performance on all data.

Option D is wrong because compute power does not cause this specific bias.

Full explanation →

89

MCQeasy

A company is developing a chatbot for customer service. They want to ensure the bot does not generate offensive responses. Which practice should they implement?

A.Use unsupervised learning to allow the bot to learn naturally from conversations.

B.Remove all toxicity detection to avoid false positives.

C.Train the model only on customer service transcripts without review.

D.Deploy a content filter and human-in-the-loop moderation for sensitive interactions.

AnswerD

Content filters and human oversight prevent harmful outputs.

Why this answer

Option D is correct because deploying a content filter combined with human-in-the-loop moderation provides both automated detection of offensive language and human oversight for ambiguous or sensitive interactions. This layered approach ensures that the chatbot can block clearly toxic outputs in real time while allowing human reviewers to handle edge cases, reducing the risk of generating offensive responses.

Exam trap

Salesforce often tests the misconception that unsupervised learning or removing safeguards is acceptable for safety, when in fact explicit content filtering and human oversight are required to prevent offensive outputs.

How to eliminate wrong answers

Option A is wrong because unsupervised learning on raw conversations would expose the model to toxic language without guidance, likely causing it to learn and reproduce offensive patterns. Option B is wrong because removing toxicity detection eliminates the primary safeguard against offensive outputs, directly contradicting the goal of preventing harmful responses. Option C is wrong because training solely on customer service transcripts without review can still include subtle biases or inappropriate language, and the model may overfit to narrow patterns without generalizing safely.

Full explanation →

90

MCQmedium

A data engineer is troubleshooting a predictive model that stopped updating. The data flow from Data Cloud shows 'Data Transform Failed' with error: 'Field Amount cannot be null'. What is the most likely cause?

A.The data transform includes a filter that removes records with null Amount.

B.The source object has a validation rule.

C.The data flow schedule is incorrect.

D.The target field in the model requires a non-null value but source data has nulls.

AnswerD

This directly matches the error: the transform requires non-null input.

Why this answer

The error 'Field Amount cannot be null' indicates that the target field in the predictive model is configured to require a non-null value. When the data flow attempts to write records with null Amount values into that field, the transform fails. This is a common schema constraint mismatch where the source data contains nulls that violate the target field's nullability requirement.

Exam trap

Salesforce often tests the distinction between source-side constraints (validation rules) and target-side constraints (field nullability in the model schema), leading candidates to incorrectly choose Option B when the error actually originates from the target field requirement.

How to eliminate wrong answers

Option A is wrong because a filter that removes records with null Amount would prevent nulls from reaching the target, not cause a 'cannot be null' error. Option B is wrong because validation rules apply at the source object level during record creation or update, not during a data flow transform that reads data. Option C is wrong because an incorrect schedule would cause the data flow to run at the wrong time or not at all, not produce a specific transform error about a null field.

Full explanation →

91

MCQhard

A retail company uses Einstein Next Best Action with customer data from Data Cloud. The recommendations are not personalized. The admin checks the data quality dashboard and finds that the 'Customer_Profile' object has 40% records with missing 'PreferredChannel' field. What is the best course of action?

A.Remove the field from the model.

B.Impute the missing values using the mode of the field.

C.Increase the data refresh frequency.

D.Train the model with only records that have non-null PreferredChannel.

AnswerB

Imputation is a standard data cleaning technique that maintains dataset size and field utility.

Why this answer

Option B is correct because imputing missing values using the mode (most frequent value) of the 'PreferredChannel' field is a standard data preprocessing technique that preserves the dataset size and statistical distribution. In Einstein Next Best Action, missing categorical data can degrade model personalization, and mode imputation is a simple, effective way to handle this without losing records or altering the model structure.

Exam trap

The trap here is that candidates might think removing the field or filtering out incomplete records is simpler, but Salesforce often tests the understanding that imputation is a standard, non-destructive method to handle missing data in AI models, especially when the missing rate is high.

How to eliminate wrong answers

Option A is wrong because removing the field entirely discards potentially valuable signal from the 'PreferredChannel' feature, which could reduce model accuracy and personalization. Option C is wrong because increasing data refresh frequency does not address the root cause of missing data; it only updates the data more often without fixing the quality issue. Option D is wrong because training the model only on records with non-null 'PreferredChannel' reduces the training dataset size by 40%, which can lead to biased or less robust models and loss of valuable customer information.

Full explanation →

92

MCQeasy

A company has been using Einstein Lead Scoring for six months. Recently, the lead score confidence has dropped from 85% to 60%. The admin reviews the model and finds that many leads have missing data in custom fields used by the model. The admin also notices that field history tracking is not enabled on the Lead object. The lead volume is adequate with over 10,000 leads. What should the admin do to improve the model's confidence?

A.Disable and re-enable Einstein Lead Scoring.

B.Enable field history tracking on the Lead object and retrain the model.

C.Increase the lead volume to at least 50,000 records.

D.Manually update all leads with missing data to have complete records.

AnswerB

Field history tracking provides the necessary historical data for scoring accuracy.

Why this answer

Option B is correct because field history tracking is required for Einstein Lead Scoring to capture changes over time, and retraining after enabling it will incorporate historical data. Option A is incorrect because manually updating all leads is impractical and doesn't address the root cause. Option C is incorrect because the lead volume is already adequate.

Option D is incorrect because disabling and re-enabling will reset the model but not fix the missing field history.

Full explanation →

93

MCQmedium

Refer to the exhibit. A Salesforce admin sees this error when trying to enable Einstein Lead Scoring. What should the admin do to resolve the issue?

A.Enable Einstein features in the org

B.Map lead fields to Einstein fields

C.Add more lead records with associated activities until reaching at least 100

D.Grant the admin the 'Manage Einstein' permission

AnswerC

The model needs 100 leads with activities to train.

Why this answer

Option C is correct because Einstein Lead Scoring requires a minimum of 100 lead records with associated activities (e.g., emails, events, tasks) to generate a predictive model. The error indicates insufficient data, so adding more leads with activities meets the threshold for model training.

Exam trap

Salesforce often tests the minimum data requirement (100 leads with activities) as a common pitfall, where candidates mistakenly focus on permissions or feature toggles instead of the data prerequisite.

How to eliminate wrong answers

Option A is wrong because the error is not about enabling Einstein features globally; the admin already attempted to enable scoring, implying features are enabled. Option B is wrong because lead field mapping is not required for Einstein Lead Scoring; the feature uses standard lead fields automatically. Option D is wrong because the 'Manage Einstein' permission is not a prerequisite for enabling scoring; the admin likely already has necessary permissions if they can access the setup page.

Full explanation →

94

MCQeasy

A service manager wants to automatically categorize incoming support cases based on the customer's description. Which Einstein feature should be used?

A.Einstein Reply Recommendations

B.Einstein Case Classification

C.Einstein Bots

D.Einstein Article Recommendations

AnswerB

Case Classification automatically categorizes cases.

Why this answer

Einstein Case Classification uses machine learning to automatically categorize incoming support cases based on the customer's description, assigning them to predefined case fields (e.g., type, priority, product). This directly matches the requirement to automatically categorize cases from text, making it the correct choice.

Exam trap

Salesforce often tests the distinction between 'categorization' and 'recommendation' features, so the trap here is confusing Einstein Case Classification (which assigns labels to the case) with Einstein Article Recommendations (which suggests content to the user).

How to eliminate wrong answers

Option A is wrong because Einstein Reply Recommendations suggests pre-written email responses based on context, not categorization of cases. Option C is wrong because Einstein Bots automate conversational flows and deflect cases, but they do not perform classification of case records. Option D is wrong because Einstein Article Recommendations suggests knowledge articles to agents or customers, not categorizing the case itself.

Full explanation →

95

MCQmedium

A company uses Einstein GPT to answer customer inquiries. To improve response relevance, the admin wants to restrict the AI's knowledge to only the company's product catalog and knowledge articles. Which approach should the admin use?

A.Use grounding data in the prompt template to provide relevant context

B.Apply zero-shot learning without any examples

C.Fine-tune the model on the company's data

D.Use prompt engineering to instruct the model to only use company data

AnswerA

Grounding by including specific data sources in the prompt helps the model rely on that information and reduces hallucinations.

Why this answer

Correct: Ground the prompt with specific data sources. Option A: Fine-tuning adjusts model weights, not real-time data. Option B: Prompt engineering alone doesn't restrict knowledge without grounding.

Option D: Zero-shot learning uses no examples; doesn't limit knowledge.

Full explanation →

96

Multi-Selecthard

Which THREE factors should be considered when evaluating the quality of a dataset for an AI model?

Select 3 answers

A.Total number of records available for training.

B.Presence of outliers that may skew the model.

C.Number of distinct labels in the outcome field.

D.Percentage of missing values in key fields.

E.Number of duplicate records in the dataset.

AnswersB, D, E

Outliers can distort the model's understanding.

Why this answer

Option B is correct because outliers can disproportionately influence model training, especially in algorithms like linear regression or k-means clustering, leading to biased predictions. Evaluating the presence and impact of outliers is critical for ensuring the model generalizes well to unseen data.

Exam trap

Salesforce often tests the misconception that dataset size (option A) is a primary quality metric, whereas the exam emphasizes that completeness, consistency, and absence of bias (e.g., missing values, duplicates, outliers) are more critical for model reliability.

Full explanation →

97

Multi-Selectmedium

A healthcare provider uses AI to predict patient outcomes. Which THREE measures should be implemented to ensure ethical AI use?

Select 3 answers

A.Establish a human-in-the-loop for critical decisions

B.Implement strict data anonymization and consent protocols

C.Use a black-box model for better accuracy

D.Optimize model for maximum profit

E.Regularly test for bias across demographic groups

AnswersA, B, E

Human oversight ensures accountability and safety.

Why this answer

Options A, C, and D are correct: human oversight (A), data privacy (C), and fairness (D). Option B is wrong because black-box models lack transparency. Option E is wrong as profit maximization is not ethical goal.

Full explanation →

98

MCQmedium

An admin is configuring Einstein Vision and wants to train a model to identify product defects from images. The admin has uploaded 500 images of defective products and 500 images of non-defective products. However, the model training fails with an error about data quality. What is the most likely cause?

A.The images are in JPEG format

B.The dataset has only one label per category

C.The images are larger than 10 MB each

D.The dataset does not have enough images

AnswerB

Einstein requires at least 2 unique labels per category to avoid overfitting.

Why this answer

The error about data quality in Einstein Vision typically occurs when the dataset has only one label per category. For binary classification (defective vs. non-defective), each category must contain at least two distinct labels to allow the model to learn meaningful patterns. With only one label per category, the model cannot differentiate between variations within a class, leading to a data quality error.

Exam trap

Salesforce often tests the misconception that more images automatically solve training failures, when the real issue is insufficient label diversity within categories.

How to eliminate wrong answers

Option A is wrong because JPEG is a supported image format in Einstein Vision, and the format itself does not cause data quality errors. Option C is wrong because while image size limits exist, Einstein Vision accepts images up to 10 MB, and the error message specifically mentions data quality, not file size. Option D is wrong because 500 images per category meets the minimum requirement (typically 10-50 images per label), so insufficient quantity is not the issue here.

Full explanation →

99

Multi-Selectmedium

Which TWO actions are recommended when preparing data for an Einstein Prediction Builder model?

Select 2 answers

A.Ensure the data set contains at least 500 records with the outcome field populated.

B.Include all available fields on the object, even if unrelated.

C.Use external data sources and upload CSV files without any preprocessing.

D.Select fields that are logically related to the prediction outcome.

E.Include data from the last two days only for the most current trends.

AnswersA, D

Minimum sample size is required for model training.

Why this answer

Option A is correct because Einstein Prediction Builder requires a minimum of 500 records with the outcome field populated to ensure statistical significance and reliable model training. Fewer records can lead to overfitting or insufficient pattern recognition, making the model less accurate.

Exam trap

Salesforce often tests the misconception that more data (all fields) or recent data only is always better, but the trap here is that Einstein Prediction Builder requires a minimum record threshold and logically relevant features to avoid noise and ensure model validity.

Full explanation →

100

MCQhard

You are a data scientist at a retail company. The company uses Einstein Discovery to analyze customer purchase patterns. The model is built on a dataset of 50,000 transactions. The model's R-squared is 0.85, but the predictions for new customers are consistently off by a large margin. The data includes features like 'Customer Age', 'Income', 'Previous Purchases', and 'Product Category'. The model was trained on data from the past two years. However, six months ago, the company launched a new loyalty program that significantly changed purchasing behavior. You suspect the model is not generalizing to new customers. What should you do to validate your hypothesis?

A.Create a holdout set of transactions from the last six months and compare model performance on it vs. older data

B.Exclude new customers from the dataset entirely

C.Increase the training data size to include older transactions

D.Remove the 'Product Category' feature to simplify the model

AnswerA

If performance is worse on recent data, concept drift is confirmed.

Why this answer

Option A is correct because creating a holdout set of transactions from the last six months directly tests whether the model's performance has degraded due to the loyalty program's impact on purchasing behavior. By comparing the R-squared or other metrics on this recent holdout set versus older data, you can quantify the drop in predictive accuracy and confirm that the model fails to generalize to the new data distribution. This approach is a standard method for detecting concept drift in machine learning models, especially when external changes (like a loyalty program) alter the underlying patterns.

Exam trap

Salesforce often tests the misconception that improving model performance (e.g., by adding more data or simplifying features) is the correct response to poor generalization, rather than first validating the hypothesis of concept drift through a time-based holdout evaluation.

How to eliminate wrong answers

Option B is wrong because excluding new customers entirely would remove the very data needed to detect the generalization failure, and it does not validate the hypothesis about model performance on new customers. Option C is wrong because increasing training data with older transactions would only reinforce the model's bias toward pre-loyalty-program patterns, making it even less adaptable to the new behavior. Option D is wrong because removing the 'Product Category' feature simplifies the model but does not address the root cause of concept drift; it may reduce accuracy further and does not test whether the loyalty program caused the shift.

Full explanation →

101

MCQmedium

An organization uses Einstein Recommendation Builder to suggest products. They want to ensure recommendations are fair across demographics. Which action should they take?

A.Use only demographic features

B.Remove all user data

C.Regular bias audits

D.Increase model depth

AnswerC

Bias audits proactively identify and address unfair patterns in recommendations.

Why this answer

Option B is correct because regular bias audits help detect and mitigate unfairness over time. Option A is wrong because using only demographic features could lead to stereotype-based recommendations. Option C is wrong because removing all user data would eliminate personalization.

Option D is wrong because increasing model depth does not address fairness.

Full explanation →

102

MCQmedium

An admin is setting up Einstein Bot for a customer service chat. The bot needs to collect the customer's account number before transferring to a human agent. What should the admin configure?

A.Add a Variable with the account number field

B.Create a Dialog to ask for the account number

C.Define a Rule to validate the account number

D.Use a Set Value action to assign the account number

AnswerB

Dialogs guide the conversation and collect input.

Why this answer

Option B is correct because a Dialog is the component that handles a conversation flow, including collecting information. Option A is incorrect because a Variable stores data but does not collect it. Option C is incorrect because a Rule determines logic, not collection.

Option D is incorrect because a Set Value action sets a variable, but the overall collection happens within a Dialog.

Full explanation →

103

Multi-Selectmedium

A Salesforce admin is reviewing data sources for Einstein Recommendation Builder. Which two data types are required for training? (Choose two.)

Select 2 answers

A.User profile data

B.User-item interactions

C.Sales reports

D.External product prices

E.Item metadata

AnswersB, E

Interaction data (e.g., clicks, purchases) forms the basis of recommendations.

Why this answer

Options B and D are correct. Item metadata (B) and user-item interactions (D) are essential for recommendation models. User profile data (A) is optional; external product prices (C) are not required; sales reports (E) are not a standard input.

Full explanation →

104

MCQeasy

A company uses an AI model to screen job applications. They discover the model is less likely to recommend female candidates. What should the company prioritize first?

A.Remove gender-related features from the model.

B.Re-train the model with only male candidates.

C.Implement a fairness metric to evaluate bias.

D.Increase the model's training data size.

AnswerC

Fairness metrics help quantify bias, which is the first step toward mitigation.

Why this answer

Option D is correct because the first step is to measure the extent of bias using fairness metrics before taking corrective action. Option A is wrong because training only on males would worsen bias. Option B is wrong because removing features may not eliminate proxy bias.

Option C is wrong because simply increasing data may not address the underlying bias.

Full explanation →

105

MCQmedium

Refer to the exhibit. What does the "Status: FAIL" indicate?

A.The model's overall accuracy is low.

B.The data is missing age_group information.

C.The model is not allowed for deployment.

D.The model shows a significant disparity in true positive rates across age groups.

AnswerD

Equal opportunity difference measures TPR disparity.

Why this answer

Option B is correct because equal opportunity difference measures disparity in true positive rates across groups; a result exceeding threshold indicates significant disparity. Option A is wrong the exhibit doesn't state deployment restriction. Option C is wrong overall accuracy is not measured here.

Option D is wrong the data includes age_group.

Full explanation →

106

Multi-Selectmedium

Which THREE factors should be considered when evaluating the fairness of an AI model?

Select 3 answers

A.Disparate impact ratio across groups.

B.Overall accuracy on the test set.

C.Model training time.

D.Equal opportunity difference.

E.Demographic parity in predictions.

AnswersA, D, E

Measures adverse impact ratio.

Why this answer

Option A is correct because the disparate impact ratio measures whether an AI model's predictions disproportionately harm or benefit certain demographic groups, typically by comparing selection rates across groups. A ratio below 0.8 or above 1.25 is often considered evidence of adverse impact, making it a key quantitative fairness metric.

Exam trap

Salesforce often tests the distinction between performance metrics (like accuracy) and fairness metrics, trapping candidates who assume a high-accuracy model is automatically fair.

Full explanation →

107

MCQmedium

An AI system recommends job candidates to recruiters. The system was trained on resumes of past successful hires, most of whom were male. As a result, it consistently ranks female candidates lower. What is the most appropriate mitigation?

A.Re-sample the training data to include more female candidates and use fairness-aware algorithms.

B.Add a post-processing adjustment to increase female candidates' scores.

C.Accept the bias as a reflection of historical data.

D.Remove the gender feature from the model.

AnswerA

Balancing data and using fairness techniques reduce bias.

Why this answer

Option C is correct because ensuring gender balance in training data addresses the root cause. Option A is wrong because removing gender may not eliminate proxy variables like 'years of experience gaps.' Option B is wrong because ignoring the issue perpetuates bias. Option D is wrong because post-processing adjustments may not be sufficient without data changes.

Full explanation →

108

MCQhard

A large e-commerce company uses Salesforce Einstein to recommend products to customers. The AI model is trained on purchase history, browsing behavior, and demographic data including age and gender. Recently, the company received complaints that the model seems to recommend lower-priced items to female customers and higher-priced items to male customers for the same product categories. The data science team confirms the model has a statistically significant difference in recommendation value by gender. The company's ethical AI policy requires fairness, transparency, and human oversight. The compliance team is considering several actions. Which action should the company take first?

A.Adjust the model to increase recommendation prices for female customers

B.Disable the recommendation system until the issue is resolved

C.Conduct a thorough bias audit to identify all sources of disparate impact

D.Immediately remove gender from the training data and retrain the model

AnswerC

A bias audit is essential to understand the full extent and root causes before taking action.

Why this answer

The correct answer is B because conducting a bias audit is the ethical first step to understand the root cause and scope of the bias before taking corrective action. Option A is wrong because removing gender from the model may not eliminate proxy effects. Option C is wrong because disabling the model without analysis loses business value and does not address how to fix it.

Option D is wrong because increasing prices for females would introduce a new bias and is unethical.

Full explanation →

109

MCQhard

A company uses Einstein Bots to handle customer service inquiries. The bot often fails to understand complex requests, leading to escalations. Which improvement strategy is most effective?

A.Train the bot with additional intents and example phrases for complex scenarios.

B.Route all complex requests directly to human agents without bot interaction.

C.Increase the confidence threshold for intent matching to avoid misclassification.

D.Reduce the number of dialogue options to simplify the bot's logic.

AnswerA

More training data improves NLU accuracy.

Why this answer

Option A is correct because training the bot with additional intents and example phrases directly addresses the root cause of the bot's failure: insufficient training data for complex scenarios. By expanding the training corpus, the natural language understanding (NLU) model can better recognize and classify nuanced user inputs, reducing misclassifications and unnecessary escalations.

Exam trap

Salesforce often tests the misconception that increasing confidence thresholds or simplifying logic improves accuracy, when in fact these actions reduce the bot's ability to handle complex inputs, leading to more escalations.

How to eliminate wrong answers

Option B is wrong because routing all complex requests to human agents without bot interaction bypasses the bot entirely, failing to improve its capability and defeating the purpose of using an AI bot to handle escalations. Option C is wrong because increasing the confidence threshold for intent matching would cause the bot to reject more queries, leading to even more false negatives and escalations, not fewer. Option D is wrong because reducing dialogue options simplifies the bot's logic but does not improve its understanding of complex requests; it may actually increase escalations by limiting the bot's ability to handle varied inputs.

Full explanation →

110

MCQhard

A large enterprise uses multiple Salesforce AI services including Einstein Bots, Prediction Builder, and Next Best Action. They want to create a consistent ethical AI policy across all services. Which action is most effective?

A.Limit AI usage to only one Salesforce AI service to simplify oversight

B.Allow each business unit to define its own ethical guidelines for its respective AI service

C.Develop a centralized AI ethics framework that applies to all Salesforce AI services and train staff on its principles

D.Rely on existing legal and regulatory compliance requirements to guide ethical use

AnswerC

A unified framework ensures consistent ethical practices.

Why this answer

Option C (Develop a centralized AI ethics framework that applies to all Salesforce AI services and train staff on its principles) is correct because consistency requires a unified framework. Option A (letting each team set their own policies) leads to inconsistency. Option B (using only one AI service) is impractical.

Option D (relying on legal compliance alone) may not cover all ethical aspects.

Full explanation →

111

MCQmedium

A nonprofit organization uses Salesforce to manage donor relationships. They have implemented Einstein Prediction Builder to predict which donors are likely to upgrade their donation level in the next 90 days. The model was built using a custom object "Donation" with fields like Amount, Frequency, and Campaign. After deployment, the predictions seem random and do not correlate with donor engagement. The admin suspects the model is not trained on enough records. The organization has 500 donors with at least two donations each. What should the admin do to improve the model?

A.Increase the prediction window from 90 to 180 days to capture more upgrade events.

B.Use a different field as the prediction outcome, such as 'donation amount increase'.

C.Ensure that at least 500 records exist where the donor actually upgraded, and retrain the model.

D.Add more fields to the model, such as donor age and geographic location.

AnswerC

Sufficient positive examples are needed.

Why this answer

Option C is correct because Einstein Prediction Builder requires a minimum number of positive outcome records (upgrade events) to train a reliable model. With only 500 donors and likely far fewer upgrades, the model lacks sufficient signal. Ensuring at least 500 actual upgrade records provides the necessary positive examples for the algorithm to learn meaningful patterns, reducing randomness in predictions.

Exam trap

Salesforce often tests the misconception that adding more data fields or changing the prediction window can compensate for a lack of positive training records, when in fact the core requirement is a sufficient number of outcome examples for the model to learn from.

How to eliminate wrong answers

Option A is wrong because increasing the prediction window does not address the root cause of insufficient positive training records; it may dilute the signal by including more non-upgrade events. Option B is wrong because changing the prediction outcome field does not solve the data scarcity issue; the model still needs enough historical upgrade events to learn from. Option D is wrong because adding more fields without sufficient positive records will not improve model accuracy and may introduce noise, as the algorithm still lacks enough examples to identify meaningful correlations.

Full explanation →

112

MCQmedium

A healthcare organization uses AI to prioritize patient appointments. The AI gives lower priority to patients with a specific chronic condition. To ensure ethical AI, what should the organization do?

A.Train the model on more data from patients with that condition

B.Immediately disable the AI system and revert to manual scheduling

C.Accept the model's decisions since they are based on data

D.Conduct a fairness audit and involve medical experts to review the model's decisions

AnswerD

Combining technical and domain expertise ensures ethical oversight.

Why this answer

Option B (Conduct a fairness audit and involve medical experts to review the model's decisions) is correct because it combines technical assessment with domain expertise. Option A (immediately disable the AI) may be too drastic. Option C (accept the model as is) ignores the bias.

Option D (train the model on more data of that condition) might not correct the prioritization logic.

Full explanation →

113

MCQhard

A company is building a text classification model for customer support tickets. They have a dataset of 10,000 tickets. The team decides to use active learning for labeling. Which approach best aligns with active learning principles?

A.Randomly select 2,000 tickets and label them manually.

B.Train a preliminary model and prioritize labeling tickets with low prediction confidence.

C.Use a pre-trained model to label all tickets automatically.

D.Have subject matter experts label all 10,000 tickets.

AnswerB

Active learning focuses on uncertain samples.

Why this answer

Active learning iteratively selects the most informative unlabeled data points for labeling, typically those with low prediction confidence from a preliminary model. This minimizes labeling effort while maximizing model performance, which is the core principle of active learning.

Exam trap

Salesforce often tests the distinction between active learning and passive learning (random sampling) or semi-supervised learning, and the trap here is assuming that any automated labeling (like using a pre-trained model) qualifies as active learning, when in fact active learning requires iterative human feedback based on model uncertainty.

How to eliminate wrong answers

Option A is wrong because random selection ignores model uncertainty, wasting labeling effort on data that may not improve the model. Option C is wrong because using a pre-trained model to auto-label all tickets bypasses the human-in-the-loop feedback essential for active learning and may propagate errors. Option D is wrong because labeling all 10,000 tickets defeats the purpose of active learning, which is to reduce labeling cost by focusing only on informative samples.

Full explanation →

114

MCQeasy

A marketer wants to use Einstein Segment Creation to build a segment for a campaign. Which data source can be used?

A.Standard report snapshots.

B.Data Cloud unified profile data.

C.External web analytics.

D.Einstein Activity Capture data.

AnswerB

Unified profiles contain the data needed for segmentation.

Why this answer

Einstein Segment Creation works with Data Cloud unified profiles.

Full explanation →

115

MCQhard

Refer to the exhibit. What is the most likely cause of this error?

A.The source data is missing required fields.

B.The data transform has a recursive formula.

C.The target field expects an integer but source provides null.

D.A division formula in the data transform is dividing by a field that contains zero values.

AnswerD

Directly matches the ArithmeticException: / by zero.

Why this answer

The error shown in the exhibit is a division-by-zero runtime error, which occurs when a formula in a data transform attempts to divide a value by a field that contains zero. In Pega, data transforms execute field-level calculations, and if a divisor field holds a zero, the system throws a 'Divide by zero' exception. Option D correctly identifies this as the most likely cause because the error message explicitly indicates a division operation failed due to a zero divisor.

Exam trap

Salesforce often tests the distinction between null values and zero values in arithmetic operations, so candidates mistakenly choose Option C (null) when the actual error is caused by a zero divisor, not a missing value.

How to eliminate wrong answers

Option A is wrong because missing required fields typically produce validation or commit errors, not a division-by-zero runtime error. Option B is wrong because a recursive formula would cause a stack overflow or infinite loop error, not a division-by-zero exception. Option C is wrong because a target field expecting an integer but receiving null would result in a 'null value' or 'type mismatch' error, not a division-by-zero error.

Full explanation →

116

MCQmedium

A company deploys an Einstein AI model that recommends products to customers. To ensure transparency, what should the company include in the customer-facing interface?

A.A statement that each recommendation is personally selected by a human expert.

B.A simple label 'AI-generated' without further explanation.

C.A complete list of all features used by the model.

D.A brief note that recommendations are generated by AI, along with a way to get more information about why a product was suggested.

AnswerD

Transparency means users know AI is involved and can inquire further.

Why this answer

Option B is correct because informing customers that AI is used and explaining how recommendations are generated aligns with transparency. Option A is wrong as it gives too much detail. Option C is wrong because not all recommendations need to be explained per se, but the system's behavior should be transparent.

Option D is wrong because a simple 'AI-generated' note is insufficient.

Full explanation →

117

MCQmedium

A company uses an AI model to screen job candidates. They discover the model is rejecting candidates from certain zip codes. What should they do first?

A.Increase model complexity

B.Add more features to the model

C.Remove zip code feature

D.Audit training data for bias

AnswerD

Auditing data helps identify biased patterns and is the recommended first step.

Why this answer

Option B is correct because auditing the training data for bias is the first step to identify and mitigate unfairness. Option A is wrong because increasing model complexity may exacerbate bias. Option C is wrong because removing the zip code feature alone does not address underlying bias in other correlated features.

Option D is wrong because adding more features without bias analysis could introduce more bias.

Full explanation →

118

MCQmedium

An AI system is used to approve loan applications. The model uses income, zip code, and credit score as features. What is a potential ethical concern?

A.Zip code may act as a proxy for race, leading to discrimination

B.The model should be a black box to avoid bias

C.Credit scores are rarely accurate

D.Income is not a reliable predictor of repayment

AnswerA

Using zip code can indirectly discriminate based on race or ethnicity.

Why this answer

Option A is correct because using zip code as a feature can introduce proxy discrimination. Zip codes are strongly correlated with race and socioeconomic status due to historical redlining and residential segregation. When the model learns patterns from zip code, it may inadvertently deny loans to applicants from certain racial or ethnic groups, violating fair lending laws and ethical AI principles.

Exam trap

Salesforce often tests the misconception that bias is only introduced by explicitly using protected attributes, when in fact proxy features like zip code can cause discrimination even if race or gender is not directly used.

How to eliminate wrong answers

Option B is wrong because making the model a black box does not avoid bias; in fact, black-box models obscure how decisions are made, making it harder to detect and mitigate bias. Option C is wrong because credit scores, while not perfect, are statistically validated predictors of repayment behavior and are widely used in the financial industry; the ethical concern is not about their accuracy but about how they are combined with other features. Option D is wrong because income is a strong predictor of repayment ability; the ethical issue is not its reliability but the potential for discrimination when combined with proxy features like zip code.

Full explanation →

119

MCQeasy

A company wants to use Einstein OCR to extract text from uploaded documents. To protect customer privacy, what should they ensure before processing documents containing personal data?

A.Ask customers to manually blur sensitive information before uploading.

B.Use HTTPS to securely upload documents.

C.Obtain consent from customers and rely on the model to ignore sensitive data.

D.Enable the Einstein Trust Layer to mask sensitive data and comply with data protection policies.

AnswerD

The Trust Layer is designed for privacy and security.

Why this answer

Option A is correct because the Einstein Trust Layer provides data masking and privacy controls. Option B is wrong because anonymization by users is unreliable. Option C is wrong as encryption protects data in transit but does not address data use.

Option D is wrong because consent alone does not protect privacy if data is exposed.

Full explanation →

120

MCQhard

A hospital uses an AI model to predict patient deterioration. The model was trained on data from a single hospital with a predominantly white patient population. When deployed at a hospital serving a diverse population, the model underperforms for minority groups. What is the most effective way to address this ethical issue?

A.Create separate models for each demographic group to ensure accuracy.

B.Continuously monitor model performance across demographic groups and report disparities.

C.Retrain the model using a more diverse dataset that represents the target population.

D.Adjust the decision threshold for minority groups to improve sensitivity.

AnswerC

Diverse training data improves fairness and performance.

Why this answer

Option D is correct because retraining with diverse data from the target population addresses the root cause. Option A is wrong as it only monitors without improvement. Option B is wrong because adjusting thresholds may not fix the underlying model bias.

Option C is wrong because creating separate models for each group could be logistically complex and stigmatizing.

Full explanation →

121

MCQhard

A global company uses Salesforce Einstein Discovery to predict customer churn. They have a dataset with fields: Customer_Since__c (date), Last_Interaction_Date__c (date), Support_Cases__c (number), Product_Usage__c (percentage), Region__c (picklist), and Churned__c (boolean target). The model was trained and deployed, but predictions show bias against customers in the "EMEA" region. The data scientist notices that in the training data, 80% of EMEA customers are labeled as churned, while only 20% of other regions. Additionally, the Product_Usage__c field has many missing values for EMEA customers. The company wants to retrain the model to reduce bias. What is the best course of action?

A.Oversample EMEA churned customers and undersample non-churned from other regions

B.Increase the sample size of EMEA customers by adding synthetic data

C.Remove the Region__c field from the model and retrain

D.Preprocess the data to impute missing Product_Usage__c values using region-specific averages, and then rebalance the dataset using stratified sampling

AnswerD

Region-specific imputation preserves regional characteristics, and stratified sampling ensures each region is proportionally represented in training, reducing bias.

Why this answer

Option D is correct because it addresses both the missing data (impute using region-specific averages to preserve regional patterns) and the class imbalance (stratified sampling ensures balanced representation across regions during training). Option A removes Region, losing valuable information; Option B uses synthetic data which may introduce artificial patterns; Option C only rebalances but does not fix missing data which could still bias the model.

Full explanation →

122

Multi-Selecthard

Which THREE strategies can help mitigate bias in an AI model? (Choose three.)

Select 3 answers

A.Remove protected attributes from training data

B.Focus training on majority group data for accuracy

C.Randomize a portion of model outputs

D.Use diverse and representative training data

E.Apply fairness metrics during model evaluation

AnswersA, D, E

Removing attributes like race/gender can prevent direct discrimination.

Why this answer

Option A is correct because removing protected attributes (e.g., race, gender) from training data reduces the risk of the model directly learning correlations with these sensitive features. This is a common pre-processing technique to prevent direct discrimination, though it may not eliminate indirect bias if correlated proxy features remain.

Exam trap

Salesforce often tests the misconception that simply removing protected attributes or randomizing outputs is sufficient to eliminate bias, when in fact bias can persist through proxies and requires comprehensive fairness evaluation and diverse data.

Full explanation →

123

Multi-Selecthard

Which TWO are best practices for mitigating bias in AI models?

Select 2 answers

A.Using complex deep learning models.

B.Using balanced training datasets.

C.Removing all sensitive attributes.

D.Training models on the most recent data only.

E.Applying disparity analysis.

AnswersB, E

Balanced datasets help reduce bias from imbalanced representation.

Why this answer

Options A and C are correct. Using balanced training datasets helps reduce representation bias, and applying disparity analysis identifies where bias exists. Option B is wrong because removing sensitive attributes alone may not eliminate proxy variables.

Option D is wrong because using only recent data may still contain bias. Option E is wrong because complex deep learning models can be harder to audit and may amplify bias.

Full explanation →

124

Multi-Selecteasy

Which TWO are common data quality issues that can negatively impact AI model performance?

Select 2 answers

A.Missing values in critical fields

B.Low model accuracy during validation

C.Insufficient storage space for data

D.Inconsistent data governance policies

E.Duplicate records in the dataset

AnswersA, E

Missing data is a common quality issue.

Why this answer

Missing values in critical fields (Option A) are a common data quality issue because many AI models, particularly those relying on statistical or gradient-based optimization, cannot handle null or NaN inputs without imputation or removal. If missing values are not addressed, the model may learn biased patterns or fail to converge, leading to degraded predictive performance.

Exam trap

Salesforce often tests the distinction between data quality issues (problems with the data itself) and model performance issues or infrastructure constraints, so candidates mistakenly select options like low accuracy or insufficient storage as data quality problems.

Full explanation →

125

Multi-Selectmedium

Which THREE are key ethical considerations for AI according to Salesforce?

Select 3 answers

A.Accountability

B.Profitability

C.Transparency

D.Privacy

E.Speed

AnswersA, C, D

Organizations must take responsibility for AI outcomes.

Why this answer

Options A, C, and D are correct. Privacy, transparency, and accountability are foundational ethical principles for AI. Option B is wrong because profitability is a business goal, not an ethical consideration.

Option E is wrong because speed is a performance attribute.

Full explanation →

126

MCQmedium

A data scientist needs to prepare data for Einstein Discovery. The dataset includes a field 'Customer_Status__c' with values 'Active', 'Inactive', and 'Churned'. How should this field be treated?

A.Create separate boolean fields for each value to improve model accuracy.

B.Remove the field because text fields cannot be used in Einstein Discovery.

C.Keep as a text field and let Einstein Discovery handle it as a categorical predictor.

D.Convert to numeric values 1, 2, 3 to preserve order.

AnswerC

Einstein Discovery automatically treats text fields as categorical predictors.

Why this answer

Option C is correct because Einstein Discovery natively supports text fields as categorical predictors, automatically encoding them for model training. The platform handles string values like 'Active', 'Inactive', and 'Churned' without requiring manual transformation, preserving the semantic meaning and cardinality of the data.

Exam trap

The trap here is that candidates assume text fields must be converted to numbers or one-hot encoded for machine learning, but Einstein Discovery abstracts this preprocessing, and manual conversion can introduce ordinal bias or unnecessary complexity.

How to eliminate wrong answers

Option A is wrong because creating separate boolean fields for each value (one-hot encoding) is unnecessary and can introduce multicollinearity or increase feature dimensionality without benefit, as Einstein Discovery's internal preprocessing already handles categorical encoding optimally. Option B is wrong because text fields are fully supported in Einstein Discovery as categorical predictors; the platform does not require numeric-only inputs and can process string values directly. Option D is wrong because converting to numeric values 1, 2, 3 implies an ordinal relationship that does not exist among 'Active', 'Inactive', and 'Churned', which would mislead the model into treating the categories as ordered, degrading prediction accuracy.

Full explanation →

127

MCQeasy

A retail company wants to use Einstein AI to personalize marketing offers. They plan to include customer purchase history and demographic data. What is the essential first step to ensure ethical use of customer data?

A.Only use data from customers who have not opted out of marketing.

B.Obtain explicit consent from customers for using their data in AI personalization.

C.Proceed with available data since it was collected for business purposes.

D.Anonymize all customer data before feeding it into the model.

AnswerB

Consent is foundational for ethical data use.

Why this answer

Option A is correct because obtaining explicit consent aligns with privacy ethics and regulations like GDPR. Option B is wrong as anonymization alone does not replace consent. Option C is wrong because it bypasses transparency.

Option D is wrong because consent is required even if data is already collected.

Full explanation →

128

MCQeasy

A team is building a pipeline to train a model daily. The source data arrives in CSV files but needs to be converted to Parquet for efficiency. Which pipeline step should perform this conversion?

A.Feature engineering step

B.Model deployment step

C.Data validation step

D.Data ingestion step

AnswerD

Ingestion can transform data into a more efficient format.

Why this answer

Option D is correct because the data ingestion step is responsible for bringing raw data into the pipeline, including format conversions like CSV to Parquet. Converting to Parquet at ingestion improves storage efficiency and query performance for downstream processing, as Parquet uses columnar storage and compression.

Exam trap

Salesforce often tests the distinction between data ingestion (raw data handling) and data validation (quality checks), leading candidates to confuse format conversion with validation steps.

How to eliminate wrong answers

Option A is wrong because feature engineering transforms existing data into features for model training, not raw format conversion. Option B is wrong because model deployment serves the trained model for inference, not data preprocessing. Option C is wrong because data validation checks data quality and schema compliance, but does not perform format conversion.

Full explanation →

129

MCQhard

A company wants to deploy an Einstein AI model that uses sensitive customer data. Which practice should they follow to comply with data privacy regulations?

A.Store all sensitive data in an external data lake and connect via APIs.

B.Obtain explicit consent from data subjects before using their data in AI models.

C.Limit the data used for training to only essential fields.

D.Use Einstein Trust Layer features to mask personally identifiable information (PII) in the model.

AnswerD

Trust Layer masks PII so the model does not see raw sensitive data.

Why this answer

Option D is correct because the Einstein Trust Layer provides built-in capabilities to automatically mask or redact personally identifiable information (PII) before data is sent to the underlying AI model, ensuring compliance with data privacy regulations like GDPR and CCPA without requiring manual data handling. This feature operates at the platform level, intercepting data in transit and applying masking rules based on predefined patterns, so sensitive customer data is never exposed to the model or stored in its training logs.

Exam trap

Salesforce often tests the distinction between procedural compliance steps (like obtaining consent) and technical enforcement mechanisms (like the Einstein Trust Layer), leading candidates to choose Option B because it sounds correct in a general privacy context, but the question specifically asks about deploying the model, where a platform-native feature is the correct answer.

How to eliminate wrong answers

Option A is wrong because storing sensitive data in an external data lake and connecting via APIs does not inherently address privacy compliance; it merely shifts the storage location and still requires proper governance, encryption, and consent mechanisms to meet regulations. Option B is wrong because while obtaining explicit consent is a fundamental privacy practice, it is a procedural step, not a technical feature of the Einstein AI platform, and the question asks for a practice to follow when deploying the model, implying a built-in technical solution. Option C is wrong because limiting data to essential fields reduces exposure but does not guarantee compliance; sensitive fields may still be included, and without masking or anonymization, the model could inadvertently memorize and leak PII, violating privacy laws.

Full explanation →

130

Multi-Selecthard

Which THREE of the following are effective strategies to mitigate bias in AI models?

Select 3 answers

A.Ensuring training data is diverse and representative

B.Conducting regular bias audits on model outcomes

C.Reducing the complexity of the model

D.Using only historical data without modification

E.Involving a diverse team in model development

AnswersA, B, E

Diverse data reduces the risk of bias.

Why this answer

Options B, C, and E are correct: Diverse training data, regular bias audits, and inclusive design teams help mitigate bias. Option A (Using only historical data) can perpetuate bias. Option D (Reducing model complexity) may not address bias directly.

Full explanation →

131

Multi-Selecthard

Which THREE actions are recommended when preparing data for Einstein Next Best Action? (Choose 3)

Select 3 answers

A.Provide data on which actions were offered and whether they were accepted

B.Include at least 10 different action types per strategy

C.Record rejections (actions not taken) as negative examples

D.Use only historical data from the last 30 days

E.Retrain the model weekly with fresh interaction data

AnswersA, C, E

This is essential for reinforcement learning.

Why this answer

Option A is correct because Einstein Next Best Action (NBA) requires historical interaction data showing which actions were offered and whether they were accepted to train the predictive model. This feedback loop enables the AI to learn which actions are most effective for specific customer contexts, directly improving recommendation accuracy.

Exam trap

Salesforce often tests the misconception that more action types or recent data alone improve model performance, when in fact the key requirements are balanced positive/negative examples, sufficient historical depth, and regular retraining with fresh interaction data.

Full explanation →

132

MCQhard

A global retail company deploys an AI-powered chatbot for customer service. The chatbot uses natural language processing to understand and respond to customer inquiries. After deployment, the company notices that the chatbot consistently provides less accurate and less helpful responses to customers from non-English-speaking regions, particularly those using dialects or slang. The company's data science team trained the model primarily on English-language customer service transcripts from the US and UK. The AI Ethics team has raised concerns about fairness and potential bias. The company wants to address this issue while maintaining overall performance and minimizing cost. Which action should the company take first?

A.Implement a fairness constraint in the model's loss function to penalize disparities across language groups.

B.Conduct a fairness audit using diverse test cases from multiple languages and dialects to quantify the disparity.

C.Disable the chatbot for non-English languages and redirect those customers to human agents.

D.Collect more training data from all regions and retrain the model from scratch.

AnswerB

An audit with diverse test cases will identify the specific gaps, allowing targeted and cost-effective improvements.

Why this answer

Option B is correct because the first step in addressing potential bias in an AI system is to measure and quantify the disparity. Conducting a fairness audit with diverse test cases from multiple languages and dialects provides the data science team with a clear, empirical baseline of the model's performance gaps. This diagnostic step is essential before any remediation (like retraining or adding constraints) to ensure that subsequent actions are targeted and effective, avoiding wasted resources or unintended consequences.

Exam trap

Salesforce often tests the principle that measurement and diagnosis must precede intervention; the trap here is that candidates may jump to a technical fix (like a fairness constraint) or a drastic operational change (like disabling the chatbot) without first conducting the essential diagnostic step of a fairness audit.

How to eliminate wrong answers

Option A is wrong because implementing a fairness constraint in the loss function is a technical intervention that should only be applied after the specific disparities have been identified and understood; applying it blindly can degrade overall model performance or introduce new biases without addressing the root cause. Option C is wrong because disabling the chatbot for non-English languages is a reactive, non-technical workaround that reduces service availability and customer satisfaction, failing to leverage the AI's potential and contradicting the goal of maintaining overall performance. Option D is wrong because collecting more training data from all regions and retraining from scratch is a costly, time-consuming approach that should be guided by the results of a fairness audit; without first quantifying the disparity, the new data may not address the specific failure modes, and the retraining may not be necessary if the issue can be fixed with targeted fine-tuning or data augmentation.

Full explanation →

133

MCQeasy

A sales manager wants to automatically prioritize leads based on their likelihood to convert. Which Einstein feature should be used?

A.Einstein Prediction Builder

B.Einstein Relationship Health

C.Einstein Lead Scoring

D.Einstein Activity Capture

AnswerC

Einstein Lead Scoring prioritizes leads based on conversion likelihood.

Why this answer

Einstein Lead Scoring is the correct feature because it uses predictive models to automatically rank leads based on their likelihood to convert, enabling the sales manager to prioritize follow-up efforts. It analyzes historical lead data and engagement patterns to assign a score, directly addressing the requirement for automated prioritization without manual rules.

Exam trap

Salesforce often tests the distinction between Einstein Lead Scoring and Einstein Prediction Builder, trapping candidates who think any predictive model builder can be used for lead scoring, when in fact Lead Scoring is a purpose-built feature for that exact use case.

How to eliminate wrong answers

Option A is wrong because Einstein Prediction Builder is a no-code tool for creating custom predictive models on any object or field, not specifically designed for lead scoring or conversion likelihood. Option B is wrong because Einstein Relationship Health measures the strength of existing account relationships, not the conversion potential of new leads. Option D is wrong because Einstein Activity Capture automatically logs emails and events to Salesforce records, but it does not perform any predictive scoring or prioritization of leads.

Full explanation →

134

Multi-Selectmedium

A data analyst is troubleshooting Einstein Article Recommendations that are not showing up on the site. Which TWO checks should be performed first? (Choose 2)

Select 2 answers

A.Ensure at least 100 articles are in the knowledge base

B.Confirm that article authors have the correct profile permissions

C.Check that article view events are being captured in the data

D.Increase the recommendation frequency from daily to hourly

E.Verify that the recommendation model is published and active

AnswersC, E

Without view data, the model has no basis to recommend.

Why this answer

Option C is correct because Einstein Article Recommendations rely on user interaction data, specifically article view events, to generate personalized recommendations. If these events are not being captured, the model has no input to learn from, and recommendations will not appear. Checking event capture is a fundamental first step in troubleshooting data pipeline issues.

Exam trap

Salesforce often tests the misconception that increasing data volume or frequency (Options A and D) will fix recommendation issues, when in fact the core problem is usually missing event data or an inactive model.

Full explanation →

135

Multi-Selectmedium

A sales manager wants to use Einstein Opportunity Scoring to prioritize deals. Which two requirements must be met?

Select 2 answers

A.There must be at least 50 closed-won opportunities in the last 365 days.

B.The user must have the 'Manage Einstein' permission.

C.Field history tracking must be enabled for Opportunity.

D.There must be at least 500 open opportunities.

E.Einstein must be enabled in Setup.

AnswersA, E

The scoring model trains on historical closed-won opportunities.

Why this answer

Option B is correct because Einstein must be enabled at the org level. Option D is correct because Einstein Opportunity Scoring requires at least 50 closed-won opportunities in the last 365 days to train the model. Option A is incorrect because the 'Manage Einstein' permission is not required for users; it is an admin permission.

Option C is incorrect because the requirement is for closed-won opportunities, not open ones. Option E is incorrect because field history tracking is not a prerequisite for Opportunity Scoring.

Full explanation →

136

MCQeasy

You are a Salesforce admin at a nonprofit organization. The organization uses Einstein Engagement Scoring to prioritize donors for outreach. The model is based on donation history and event attendance. Recently, the model stopped generating new scores for recently added donors. You check the data source and see that the model's data includes the 'Contact' and 'Opportunity' objects. The data refresh is scheduled daily. The model status is 'Active'. What should you investigate first to resolve the issue?

A.Check if the model has reached its scoring capacity and needs retraining

B.Add the 'Lead' object to the data source

C.Increase the data refresh frequency to hourly

D.Check if the model was deactivated automatically

AnswerA

Engagement Scoring models have a limit on scored records; after reaching it, new records are not scored until retraining.

Why this answer

Option A is correct because Einstein Engagement Scoring models have a maximum scoring capacity (e.g., 2 million scored records per model). When new donors are added but the model stops generating scores, the most likely cause is that the model has reached this capacity and requires retraining to incorporate new records. Retraining resets the scoring queue and allows the model to score newly added donors.

Exam trap

The trap here is that candidates assume the issue is data freshness or object configuration, but Cisco tests the specific behavior that Einstein models have a scoring capacity limit that requires retraining, not just data refresh or object inclusion.

How to eliminate wrong answers

Option B is wrong because the model is already based on 'Contact' and 'Opportunity' objects, which are the correct objects for donor scoring; adding the 'Lead' object is irrelevant since leads are not donors and would not resolve the scoring stoppage. Option C is wrong because increasing the data refresh frequency addresses data latency, not the model's inability to score new records due to capacity limits; the model is already refreshing daily and the issue is scoring capacity, not data freshness. Option D is wrong because the model status is explicitly stated as 'Active', so deactivation is not the cause; automatic deactivation would change the status to 'Inactive' or 'Error', which is not the case here.

Full explanation →

137

MCQmedium

A company uses Salesforce and wants to provide automated chat responses for common customer inquiries. Which Einstein feature should be configured?

A.Einstein Sentiment

B.Einstein Case Routing

C.Einstein Bots

D.Einstein Recommender

AnswerC

Einstein Bots provide automated chat responses.

Why this answer

Einstein Bots is the correct feature because it enables automated chat responses for common customer inquiries using natural language processing (NLP) and predefined dialog flows. It integrates directly with Salesforce Chat to handle routine questions without human intervention, reducing response times and agent workload.

Exam trap

Salesforce often tests the distinction between features that analyze data (Sentiment, Recommender) versus those that automate actions (Bots), leading candidates to confuse Einstein Sentiment or Recommender as capable of providing chat responses when they are purely analytical or recommendation tools.

How to eliminate wrong answers

Option A is wrong because Einstein Sentiment analyzes the emotional tone of text (e.g., positive, negative, neutral) in conversations or social posts, not for providing automated chat responses. Option B is wrong because Einstein Case Routing uses machine learning to assign cases to the best agent based on skills and availability, not to generate chat replies. Option D is wrong because Einstein Recommender suggests next-best actions or products to agents or customers based on historical data, not for automating chat responses.

Full explanation →

138

Multi-Selecteasy

Which THREE types of data sources are commonly integrated into Salesforce Data Cloud for AI use cases?

Select 3 answers

A.Third-party demographic data

B.Web and mobile app engagement data

C.CRM transaction records

D.Model training logs

E.Data transformation scripts

AnswersA, B, C

External data enhances AI models.

Why this answer

Option A is correct because Salesforce Data Cloud can ingest third-party demographic data from external sources (e.g., data enrichment providers) to enrich customer profiles. This data, when combined with first-party data, enables AI models to generate more accurate predictions and segmentations. Data Cloud’s Data Streams and Data Lake objects support structured ingestion of such external datasets.

Exam trap

Salesforce often tests the distinction between data sources (raw inputs) and data processing artifacts (logs, scripts), leading candidates to mistakenly select model training logs or transformation scripts as valid data sources.

Full explanation →

139

MCQmedium

A company wants to use Einstein to predict which customers are likely to churn in the next 30 days. Which type of prediction should be created in Einstein Prediction Builder?

A.Numeric prediction

B.Multi-class classification

C.Regression

D.Binary classification

AnswerD

Binary classification predicts one of two outcomes, like churn or not churn.

Why this answer

Einstein Prediction Builder is designed for binary outcomes, such as whether a customer will churn (yes/no) within a specified time frame. A binary classification model predicts one of two possible labels, making it the correct choice for this churn prediction use case.

Exam trap

Salesforce often tests the distinction between regression (numeric prediction) and classification (categorical prediction), and candidates mistakenly choose 'Numeric prediction' or 'Regression' because churn prediction involves a time-based numeric threshold (30 days), but the output is still a binary label, not a number.

How to eliminate wrong answers

Option A is wrong because numeric prediction is used for forecasting a continuous numerical value (e.g., revenue amount), not a categorical outcome like churn. Option B is wrong because multi-class classification predicts among three or more categories (e.g., product type), whereas churn is a binary yes/no outcome. Option C is wrong because regression is a type of numeric prediction for continuous values, not suitable for a binary classification task.

Full explanation →

140

MCQeasy

A nonprofit organization wants to use Einstein Bots to handle inquiries on their website. They are concerned that the bot may give incorrect or insensitive responses. Which feature should they prioritize to maintain trustworthy AI?

A.Use a larger training dataset from generic internet sources.

B.Allow the bot to generate responses only from pre-authored articles.

C.Implement a fallback to human agent for uncertain queries.

D.Disable sentiment analysis to avoid misinterpretation.

AnswerC

Fallback improves trust by human handling.

Why this answer

Option C is correct because implementing a fallback to a human agent for uncertain queries directly addresses the risk of incorrect or insensitive responses by ensuring that when the bot cannot confidently answer, the conversation is escalated to a human. This aligns with the principle of maintaining trustworthy AI, as it prevents the bot from generating potentially harmful or inaccurate responses and provides a safety net for complex or sensitive inquiries.

Exam trap

The trap here is that candidates may confuse 'trustworthy AI' with 'restricting the bot's knowledge' (Option B) or 'disabling features' (Option D), rather than recognizing that a fallback mechanism is the standard industry practice for handling uncertainty and maintaining safety in conversational AI systems.

How to eliminate wrong answers

Option A is wrong because using a larger training dataset from generic internet sources does not guarantee accuracy or sensitivity; it may introduce noise, bias, or irrelevant information, and does not address the specific concern about incorrect or insensitive responses. Option B is wrong because allowing the bot to generate responses only from pre-authored articles restricts the bot's ability to handle dynamic or nuanced inquiries, and does not provide a mechanism for handling queries that fall outside the pre-authored content, potentially leading to unhelpful or inappropriate responses. Option D is wrong because disabling sentiment analysis removes the bot's ability to detect and appropriately respond to user emotions or tone, which could actually increase the risk of insensitive responses rather than reduce it.

Full explanation →

141

Multi-Selecthard

Data quality is critical for AI model performance. Which three data quality dimensions should be monitored? (Choose three.)

Select 3 answers

A.Completeness

B.Consistency

C.Uniqueness

D.Timeliness

E.Volume

AnswersA, B, D

Ensures no missing values that could bias the model.

Why this answer

Completeness, timeliness, and consistency are fundamental data quality dimensions. Volume is not a quality dimension; uniqueness is related to consistency but not always required.

Full explanation →

142

MCQeasy

A company wants to use Einstein Prediction Builder to predict customer churn. They have a dataset with 10,000 records and 50 features. What is the primary consideration for model accuracy?

A.The dataset size is too small for reliable predictions.

B.All features must be numerical and normalized.

C.The dataset must be balanced between churned and non-churned customers.

D.The model needs to be retrained daily.

AnswerC

Balancing prevents bias towards majority class.

Why this answer

Option C is correct because Einstein Prediction Builder uses automated machine learning (AutoML) to train models, and class imbalance is a critical factor that directly impacts model accuracy. If the dataset is highly skewed (e.g., 95% non-churned, 5% churned), the model may achieve high accuracy by simply predicting the majority class, but it will fail to identify actual churners. Einstein Prediction Builder includes built-in handling for imbalanced data, but the user must ensure the dataset is reasonably balanced or use techniques like oversampling to improve predictive performance.

Exam trap

Salesforce often tests the misconception that dataset size is the primary driver of accuracy, but the trap here is that class balance is more critical than raw record count for classification models in Einstein Prediction Builder.

How to eliminate wrong answers

Option A is wrong because 10,000 records is generally sufficient for a binary classification task like churn prediction, especially with 50 features; Einstein Prediction Builder can work with datasets as small as a few thousand records. Option B is wrong because Einstein Prediction Builder automatically handles feature encoding and normalization; it accepts categorical, numerical, and text features without requiring manual preprocessing. Option D is wrong because retraining frequency depends on business needs and data drift, not on model accuracy; Einstein Prediction Builder supports scheduled retraining but does not require daily retraining as a primary consideration for accuracy.

Full explanation →

143

MCQeasy

An administrator is configuring a Salesforce AI model that uses historical sales data. The data includes fields like 'Amount', 'Close_Date', and 'Lead_Source'. What is the primary purpose of data preprocessing in this context?

A.To generate visualizations for business stakeholders

B.To increase the storage capacity of the database

C.To enforce data access permissions for different user roles

D.To clean and transform data into a format suitable for model training

AnswerD

Preprocessing ensures data quality and format.

Why this answer

Data preprocessing is essential for AI models because raw historical sales data often contains missing values, inconsistent formats, and noise. Cleaning (e.g., handling nulls in 'Amount') and transforming (e.g., encoding 'Lead_Source' into numerical features) ensure the model can learn patterns effectively, directly impacting training accuracy and convergence.

Exam trap

Salesforce often tests the distinction between data preprocessing and other data management tasks; the trap here is that candidates confuse preprocessing with reporting (visualizations) or security (permissions), when the core goal is to prepare data for model ingestion.

How to eliminate wrong answers

Option A is wrong because generating visualizations is a downstream analytics task, not the primary purpose of preprocessing for model training. Option B is wrong because preprocessing does not increase storage capacity; it may reduce data size through cleaning but does not affect database storage limits. Option C is wrong because enforcing data access permissions is a security and governance concern, handled by Salesforce's sharing and permission settings, not by data preprocessing steps.

Full explanation →

144

MCQhard

A data scientist notices that an Einstein Discovery model predicts a low probability of conversion for all leads in a new campaign, even though the campaign targets high-value accounts. Which initial diagnostic step should be taken?

A.Retrain the model with the new campaign data included

B.Check the Einstein model recipe for incorrect filters

C.Compare the feature distributions of the training and campaign data

D.Increase the prediction confidence threshold

AnswerC

Distribution mismatch often explains low predictions; if features differ, the model may not apply.

Why this answer

Option C is correct because the most likely cause of a model predicting low conversion for all leads in a new campaign is a shift in feature distributions between the training data and the campaign data (covariate shift). Checking these distributions is the standard initial diagnostic step to identify if the model is encountering data it was not trained on, which would invalidate its predictions. This aligns with best practices for model monitoring and data validation in Einstein Discovery.

Exam trap

Salesforce often tests the misconception that retraining is the immediate fix for poor model performance, but the trap here is that candidates overlook the fundamental diagnostic step of checking for data drift before taking any corrective action.

How to eliminate wrong answers

Option A is wrong because retraining the model with new campaign data is a premature action; the root cause (data drift) must be diagnosed first, and retraining without investigation could mask the issue or introduce bias. Option B is wrong because checking the Einstein model recipe for incorrect filters addresses configuration errors, but the scenario describes a systematic prediction pattern across all leads, which is more indicative of data distribution shift than a filter misconfiguration. Option D is wrong because increasing the prediction confidence threshold does not fix the underlying cause of low probabilities; it only changes the cutoff for classification, leaving the flawed predictions unchanged.

Full explanation →

145

MCQhard

Refer to the exhibit. The data pipeline is failing. What is the most likely cause?

A.Network timeout.

B.Missing required field in source data.

C.Insufficient memory.

D.Schema mismatch between source and target.

AnswerD

The field 'account_id' is expected in the schema but is not found, indicating a schema mismatch.

Why this answer

Option D is correct because a schema mismatch between source and target is the most common cause of pipeline failures in data integration workflows. When the source data structure (e.g., column names, data types, or nested fields) does not match the target schema, the pipeline cannot map or transform the data correctly, leading to errors during ingestion or transformation stages. This is especially relevant in tools like Apache NiFi, AWS Glue, or Azure Data Factory, where schema validation is enforced at runtime.

Exam trap

Salesforce often tests the misconception that pipeline failures are always due to network or resource issues, but the trap here is that schema mismatch is a subtle, configuration-level error that is frequently overlooked in favor of more obvious causes like timeouts or memory limits.

How to eliminate wrong answers

Option A is wrong because a network timeout would typically produce a connection error or retry failure, not a schema-related failure, and most pipelines have retry mechanisms to handle transient network issues. Option B is wrong because a missing required field in source data would cause a validation error or null constraint violation, but the question describes a pipeline failure that is more likely due to structural incompatibility rather than missing data. Option C is wrong because insufficient memory would manifest as an out-of-memory error or performance degradation, not a schema mismatch error, and modern pipelines are designed to handle memory constraints gracefully.

Full explanation →

146

Multi-Selecteasy

When preparing data for Einstein Next Best Action, which two aspects must be considered for compliance with data privacy regulations? (Choose two.)

Select 2 answers

A.Data compression

B.Color coding fields in the dataset

C.Indexing speed for real-time recommendations

D.Consent management

E.Data masking of personally identifiable information (PII)

AnswersD, E

Obtaining and tracking consent is a fundamental privacy requirement.

Why this answer

Options A and D are correct. Consent management (A) ensures legal basis; data masking of PII (D) protects sensitive data. Data compression (B) and color coding (C) are not privacy measures; indexing speed (E) is performance related.

Full explanation →

147

MCQhard

Refer to the exhibit. Which ethical principle is most at risk with this AI model configuration?

A.Accountability

B.Fairness

C.Privacy

D.Transparency

AnswerD

Explainability is disabled, so the model acts as a black box.

Why this answer

The configuration has explainability disabled, meaning the model's decisions cannot be interpreted. This violates transparency. Option B (Transparency) is correct.

Option A (Fairness) is partially addressed by the fairness check, though not fully. Option C (Accountability) is at risk because human review is not required, but the most direct risk is transparency. Option D (Privacy) is not directly affected.

Full explanation →

148

MCQhard

A data scientist is evaluating a custom Einstein model for a lead scoring use case. The model's precision is 0.9, recall is 0.5. What is the most important improvement priority?

A.Increase recall to reduce false negatives

B.Increase precision to reduce false positives

C.Optimize for an F1 score of 0.7

D.Improve overall accuracy above 80%

AnswerA

Recall is low (0.5), meaning half of actual leads are missed. This should be improved.

Why this answer

With a precision of 0.9 and recall of 0.5, the model is highly selective but misses many actual leads (high false negatives). In lead scoring, false negatives mean lost sales opportunities, which is typically more costly than false positives. Therefore, increasing recall to capture more true positives is the most important improvement priority.

Exam trap

Salesforce often tests the trade-off between precision and recall in imbalanced classification scenarios, where candidates mistakenly focus on improving precision or accuracy without recognizing that low recall (high false negatives) is the critical business problem in lead scoring.

How to eliminate wrong answers

Option B is wrong because increasing precision would further reduce false positives, but the model already has high precision (0.9); the bigger issue is the low recall (0.5) causing many missed leads. Option C is wrong because optimizing for an F1 score of 0.7 is a metric goal, not a direct improvement priority; the F1 score is a harmonic mean of precision and recall, and simply targeting a number does not address the underlying imbalance. Option D is wrong because overall accuracy can be misleading in imbalanced datasets; a model could achieve high accuracy by always predicting the majority class, but that would not improve lead capture for the minority class (actual leads).

Full explanation →

149

MCQmedium

An admin wants to use Einstein Reply Recommendations in Service Cloud. Which ethical consideration is most important to implement before enabling the feature?

A.Reduce the cost of agent training.

B.Maximize the number of recommendations.

C.Increase the speed of case resolution.

D.Ensure customer data is anonymized.

AnswerD

Anonymization protects privacy and reduces risk of PII exposure in AI outputs.

Why this answer

Customer data privacy is paramount; anonymizing data ensures recommendations do not expose sensitive information.

Full explanation →

150

Multi-Selecthard

A company is developing an AI system to assist with hiring. Which TWO practices are essential for ethical AI deployment?

Select 2 answers

A.Optimize the system for speed to reduce waiting times

B.Remove all demographic features to ensure fairness

C.Conduct regular bias audits on model predictions

D.Maximize accuracy on historical hiring data

E.Obtain informed consent from applicants if their data is used

AnswersC, E

Audits help detect and mitigate discriminatory outcomes.

Why this answer

Option C is correct because regular bias audits are a core ethical practice for AI systems, especially in hiring. These audits involve systematically testing the model's predictions across demographic groups to detect and mitigate unintended discrimination, ensuring compliance with fairness standards like the EEOC's Uniform Guidelines on Employee Selection Procedures.

Exam trap

Salesforce often tests the misconception that removing demographic features (Option B) is sufficient to eliminate bias, when in fact proxy variables and model behavior must be actively monitored through audits (Option C).

Full explanation →

Page 2 of 7

All pages

Practice AI Associate by domain

Target a specific domain to shore up weak areas.

AI Fundamentals AI Capabilities in CRM Ethical Considerations of AI Data for AI

See all domains with question counts →