CCNA Sfai Ai Fundamentals Questions

75 of 84 questions · Page 1/2 · Sfai Ai Fundamentals topic · Answers revealed

1
Multi-Selecthard

A financial services firm wants to use AI to detect fraudulent transactions. They have a dataset with 1% fraudulent and 99% legitimate transactions. Which THREE actions should they take to address class imbalance? (Select THREE.)

Select 3 answers
A.Use a larger dataset even if it is more imbalanced
B.Use accuracy as the primary evaluation metric
C.Use F1 score as the evaluation metric
D.Assign higher class weights to the minority class during training
E.Oversample the minority class (fraud) or undersample the majority class
AnswersC, D, E

F1 score balances precision and recall and is robust to imbalance.

Why this answer

Class imbalance can be addressed by resampling (oversampling minority, undersampling majority), using class weights, or using metrics like F1 that consider precision and recall.

2
MCQhard

A model trained on CRM data predicts customer lifetime value (CLV) with high accuracy, but when deployed, predictions are significantly off for new customer segments. What is the most likely cause?

A.Feature engineering was insufficient for the original segments
B.The model is overfitting to the training data
C.The model architecture is too simple
D.The training data was not representative of the new segments
AnswerD

Unrepresentative training data leads to poor generalization for unseen segments.

Why this answer

If the training data was not representative of the new segments, the model will not generalize, leading to poor out-of-sample performance.

3
Multi-Selecthard

A company is developing a sentiment analysis model for customer reviews. The team wants to ensure the model is fair and does not exhibit bias. Which TWO actions are MOST effective? (Choose two.)

Select 2 answers
A.Use a more complex model to capture nuanced patterns
B.Audit the training data for imbalanced representation across demographic groups
C.Use only historical data as-is without modification
D.Ensure the training data includes reviews from a diverse range of customers
E.Remove all negative reviews from the training data to avoid negative bias
AnswersB, D

Identifying and correcting imbalances helps reduce bias.

Why this answer

Auditing training data for bias and ensuring diverse representation helps mitigate bias. Using a complex model doesn't address bias; removing all negative examples would bias the model. Using historical data as-is may perpetuate existing biases.

4
Multi-Selectmedium

A retail company wants to use AI to predict next month's sales for each product category. They have five years of monthly sales data. Which THREE factors are most critical for the accuracy of the predictive model? (Select THREE.)

Select 3 answers
A.Choosing the most popular AI algorithm on the market
B.Including seasonality and trend components from historical data
C.Using a deep learning model instead of a simpler model
D.Ensuring the training data is clean and free of missing values
E.Incorporating external factors like holiday calendars and promotions
AnswersB, D, E

Critical for capturing recurring patterns.

Why this answer

Historical patterns (seasonality), external factors (holidays), and data quality are key for time series forecasting. Model complexity and algorithm brand are less critical.

5
MCQeasy

A marketing team wants to recommend products to customers based on their past purchases and browsing behavior. Which type of AI is most appropriate?

A.Generative AI
B.Reinforcement learning
C.Predictive AI
D.Unsupervised learning
AnswerC

Correct: predictive AI uses historical patterns to forecast future behavior, e.g., product recommendations.

Why this answer

Product recommendation is a predictive AI task that uses historical data to predict which products a customer is likely to purchase.

6
MCQeasy

Which statement best describes 'inference' in the context of machine learning?

A.The process of training a model on labeled data
B.The process of collecting and preparing data
C.The process of using a trained model to make predictions on new data
D.The process of evaluating a model's accuracy
AnswerC

Inference is applying the model to new, unseen data.

Why this answer

Inference is the process of using a trained model to make predictions on new data. Training is the learning phase, and evaluation is assessing performance. Data collection is separate.

7
Multi-Selectmedium

A company is building a sentiment analysis model for customer reviews. They want to measure its performance. Which TWO metrics are most appropriate for evaluating a classification model?

Select 2 answers
A.F1 score
B.R-squared
C.Mean squared error
D.Recall
E.Precision
AnswersD, E

Measures coverage of actual positives.

Why this answer

Precision and recall are key classification metrics. Accuracy can be misleading if classes are imbalanced. F1 is composite; but the question asks for two metrics, precision and recall are fundamental.

8
MCQmedium

A company wants to use AI to automatically extract key information (e.g., invoice number, date, total amount) from scanned PDF invoices. Which AI capability should they use?

A.Generative AI for text generation
B.Predictive analytics
C.Computer vision with optical character recognition (OCR)
D.Sentiment analysis
AnswerC

Correct: OCR converts scanned images to text, then NLP extracts fields.

Why this answer

Document scanning combined with OCR and NLP entity extraction can parse structured fields from documents. This falls under computer vision (OCR) and NLP.

9
MCQeasy

A data analyst is preparing data for a machine learning model. They notice that many records have missing values for the 'industry' field. What is the best first step?

A.Delete all records with missing values
B.Replace missing values with a placeholder or impute them
C.Ignore the missing values and train the model as is
D.Use a model that automatically handles missing data
AnswerB

Correct. Imputation or placeholder ensures data completeness.

Why this answer

Data quality is critical. Missing values must be handled (imputed or removed) to avoid garbage-in-garbage-out.

10
Multi-Selectmedium

A company wants to use AI to analyze customer feedback from surveys and social media. Which TWO capabilities are most relevant?

Select 2 answers
A.Speech recognition
B.Sentiment analysis
C.Image recognition
D.Text classification
E.Predictive lead scoring
AnswersB, D

Sentiment analysis measures emotional tone, key for understanding customer feelings.

Why this answer

Sentiment analysis detects overall sentiment (positive/negative), and text classification can categorize feedback by topic. Entity extraction might be useful but is less central for broad feedback analysis.

11
MCQmedium

A data scientist is training a model to predict churn. The model achieves 99% accuracy on training data but only 60% on test data. Which issue is most likely occurring?

A.Concept drift
B.Overfitting
C.Data leakage
D.Underfitting
AnswerB

Correct. Large gap between training and test performance indicates overfitting.

Why this answer

Overfitting: the model learns training data patterns too well, including noise, failing to generalize to new data.

12
Multi-Selectmedium

A sales team uses an AI model to prioritize leads. The model's predictions are not improving despite adding more data. Which THREE factors could explain this? (Choose three.)

Select 3 answers
A.The new data is noisy or of low quality
B.The model is underfitting and lacks capacity
C.The model is already overfitting to the training data
D.The features used are not predictive of lead conversion
E.The team has not added enough features to the model
AnswersA, B, D

Garbage in, garbage out; low-quality data hinders improvement.

Why this answer

Poor data quality, irrelevant features, and model underfitting can all cause lack of improvement. Adding more data helps only if it's high-quality; more features can help if relevant, but not necessarily. Overfitting would show good training performance.

13
MCQeasy

Which type of machine learning is used to predict customer churn based on historical labeled data?

A.Reinforcement learning
B.Unsupervised learning
C.Supervised learning
D.Self-supervised learning
AnswerC

Supervised learning trains on labeled examples to predict outcomes, making it ideal for churn prediction.

Why this answer

Supervised learning uses labeled data (historical churn outcomes) to train a model to predict future churn.

14
MCQhard

A data scientist trains a lead scoring model that achieves 99% accuracy on training data but only 65% accuracy on a held-out test set. What is the most likely issue?

A.Underfitting due to insufficient model complexity
B.Overfitting due to model complexity or insufficient regularization
C.Label noise in the training data
D.Data leakage from the test set into training
AnswerB

Overfitting explains the large gap between high training accuracy and low test accuracy.

Why this answer

Overfitting occurs when the model memorizes training data noise instead of learning generalizable patterns, leading to poor test performance.

15
Multi-Selecthard

A company uses AI to generate personalized email content for marketing campaigns. They notice the AI occasionally produces factually incorrect statements. Which THREE actions should they take to mitigate this?

Select 3 answers
A.Implement retrieval-augmented generation (RAG) to ground outputs in verified data
B.Reduce the model's temperature parameter to lower randomness
C.Include a human review step before emails are sent
D.Train the model on more unlabeled emails from the internet
E.Remove all safety guardrails to allow more creative content
AnswersA, B, C

RAG provides factual context, reducing hallucinations.

Why this answer

Grounding the model with retrieval, adding a human review step, and adjusting temperature can reduce hallucinations. Training on more data alone may not fix the issue.

16
Multi-Selectmedium

A CRM administrator is planning to implement predictive AI for lead scoring. Which TWO actions should be taken to ensure data quality?

Select 2 answers
A.Ensure lead source values are standardized (e.g., 'Web' not 'website' or 'web form')
B.Remove duplicate leads and leads older than 5 years with no activity
C.Use as many fields as possible to increase model complexity
D.Manually label all historical leads again to remove bias
E.Use a smaller dataset to train faster
AnswersA, B

Consistent categorical values prevent the model from learning spurious patterns.

Why this answer

Removing outdated or duplicate records and ensuring consistent formatting across fields are key data quality steps. Standardizing lead source values also ensures consistent labels.

17
MCQmedium

A sales operations manager wants to use AI to predict which leads are most likely to convert. The CRM has historical data on past leads, including whether they were won or lost, along with demographic and behavioral attributes. Which machine learning type should be used?

A.Generative AI
B.Supervised learning
C.Unsupervised learning
D.Reinforcement learning
AnswerB

Supervised learning trains on labeled data (outcomes) to predict new outcomes, ideal for lead scoring.

Why this answer

Supervised learning uses labeled historical data (won/lost outcomes) to predict future outcomes, making it the best fit for lead scoring.

18
MCQhard

A company uses generative AI to create personalized email content for each customer. They notice that occasionally the AI produces content that is factually incorrect. What is this phenomenon called?

A.Hallucination
B.Overfitting
C.Concept drift
D.Bias
AnswerA

Correct. Hallucination is the generation of false information by an AI.

Why this answer

Hallucination occurs when generative models produce plausible-sounding but incorrect information.

19
MCQeasy

A marketing manager wants to predict which customers are most likely to respond to a new email campaign. Which type of machine learning is most appropriate?

A.Generative AI
B.Supervised learning
C.Unsupervised learning
D.Reinforcement learning
AnswerB

Correct. Uses labeled data (responded/did not respond) to train a model.

Why this answer

Supervised learning uses labeled historical data (e.g., past response data) to predict a specific outcome (respond vs. not respond).

20
MCQmedium

A customer service team wants to automatically route incoming emails to the appropriate department based on content. Which NLP capability is essential for this task?

A.Sentiment analysis
B.Text classification
C.Entity extraction
D.Intent detection
AnswerB

Text classification assigns a category to the email, enabling routing.

Why this answer

Text classification categorizes text into predefined classes, enabling automated routing.

21
MCQmedium

A sales operations team wants to automatically categorize incoming support cases into predefined categories (e.g., Billing, Technical, General). The team has thousands of historical cases with correct category labels. Which AI approach should they use?

A.Topic modeling (unsupervised)
B.Text classification
C.Sentiment analysis
D.Named entity recognition
AnswerB

Correct: text classification maps text to predefined categories using labeled training data.

Why this answer

Text classification is a supervised NLP task that assigns predefined labels to text. Historical labeled cases provide the training data for a classifier.

22
MCQmedium

A company wants to generate personalized marketing email content for each customer, including product recommendations and tailored copy. Which AI approach is BEST?

A.Use an unsupervised learning model to cluster customers and send generic emails per cluster
B.Use predictive AI to score customers and generative AI to create email copy based on the scores
C.Use only predictive AI to send the same email to all customers with top products
D.Use only generative AI to create emails without any customer data
AnswerB

Predictive AI identifies what to recommend, and generative AI writes the personalized copy.

Why this answer

Generative AI can create original content, and predictive AI can recommend products. The combination of both can generate personalized copy with dynamic product suggestions. Using only predictive AI yields recommendations but not content; generative AI alone can create text but lacks recommendation logic.

23
MCQmedium

A retailer wants to recommend products to customers based on their purchase history and browsing behavior. Which AI approach is most suitable?

A.Supervised learning to predict purchase probability for each product
B.Reinforcement learning with real-time rewards
C.Natural language generation to create product descriptions
D.Computer vision to analyze product images
AnswerA

Supervised learning can be trained on historical purchases to predict which products a customer is likely to buy.

Why this answer

Product recommendations are typically handled by supervised or unsupervised learning, but the most common approach in CRM is collaborative filtering or similar supervised/unsupervised models. However, given options, supervised learning on historical interactions is typical.

24
Multi-Selecthard

A company wants to use AI to automatically extract key information (e.g., invoice number, date, total amount) from scanned invoices. Which THREE technologies should be combined?

Select 3 answers
A.Generative AI for text generation
B.Optical Character Recognition (OCR)
C.Sentiment analysis
D.Natural Language Processing (NLP) – entity extraction
E.Computer vision
AnswersB, D, E

Converts image text into machine-readable text.

Why this answer

Computer vision reads the document, OCR converts image to text, and NLP extracts entities. Generative AI is not needed.

25
MCQeasy

What is the term for when an AI model produces confident but incorrect information, often in generative AI?

A.Bias
B.Overfitting
C.Hallucination
D.Underfitting
AnswerC

Hallucination describes the generation of false information presented as fact.

Why this answer

Hallucination refers to when a generative AI model fabricates information that seems plausible but is false.

26
MCQmedium

A data scientist notices that a churn prediction model has high variance: small changes in training data cause large changes in predictions. Which technique is BEST to address this?

A.Apply regularization techniques
B.Reduce the amount of training data
C.Add more features to the model
D.Increase the number of training epochs
AnswerA

Regularization penalizes large coefficients, reducing model complexity and variance.

Why this answer

High variance indicates overfitting. Regularization (e.g., L1/L2) reduces model complexity and variance. Adding more features would increase variance, increasing training data can help but is not always feasible, and reducing training data would worsen the problem.

27
Multi-Selecthard

A company is deploying an AI chatbot for customer service. Which THREE ethical considerations should be addressed? (Select 3)

Select 3 answers
A.Protecting customer data privacy during model training and inference
B.Using the most complex deep learning architecture available
C.Mitigating bias in training data that could lead to unfair responses
D.Ensuring the chatbot explains when a customer is speaking to AI
E.Maximizing the number of training epochs
AnswersA, C, D

Data privacy is a core ethical and legal requirement.

Why this answer

Transparency, bias mitigation, and data privacy are key ethical areas for AI systems.

28
MCQmedium

A company uses an AI model to predict customer churn. The model's predictions are used to automatically assign discounts to high-risk customers. A customer complains about receiving a discount offer they did not request. Which ethical concern is most relevant?

A.Bias in training data
B.Model overfitting
C.Data quality
D.Lack of transparency of AI decisions to customers
AnswerD

Correct: customers should be informed that the discount was AI-generated and why.

Why this answer

Transparency about AI decisions is key. Customers should be informed when an AI system is making decisions that affect them.

29
MCQmedium

A company uses an AI model to classify customer support cases into categories (billing, technical, general). The model performs well on training data but poorly on new cases. Which issue is MOST likely occurring?

A.Underfitting
B.Overfitting
C.Data leakage
D.Bias in training data
AnswerB

Overfitting leads to high accuracy on training data but low accuracy on new data because the model is too complex.

Why this answer

Overfitting means the model memorized the training data and fails to generalize to new, unseen data. Underfitting would show poor performance on both training and test data. Data leakage occurs when future information leaks into training.

Bias in data is a different issue.

30
MCQeasy

Which type of AI is designed to perform only a specific task, such as playing chess or recommending products?

A.Narrow AI
B.Super AI
C.General AI
D.Strong AI
AnswerA

Correct: narrow AI is task-specific, like chess or product recommendations.

Why this answer

Narrow AI (weak AI) is specialized for one task and cannot generalize beyond its training.

31
MCQeasy

A sales operations manager wants to predict which leads are most likely to convert to deals. The CRM has historical data on thousands of leads with outcomes (converted or not). Which type of machine learning should they use?

A.Unsupervised learning
B.Supervised learning
C.Deep learning
D.Reinforcement learning
AnswerB

Supervised learning trains on labeled historical data (features + outcome) to predict future outcomes like lead conversion.

Why this answer

Supervised learning uses labeled data (past leads with known outcomes) to predict future lead conversion. Unsupervised learning finds patterns without labels, reinforcement learning learns from rewards, and deep learning is a subset of supervised/unsupervised but not the most specific answer here.

32
MCQhard

A generative AI chatbot sometimes produces factually incorrect responses about a company's products. What is this phenomenon called?

A.Data leakage
B.Overfitting
C.Hallucination
D.Bias
AnswerC

Hallucination describes when an AI model produces confident but false information.

Why this answer

Hallucination is when an AI generates incorrect or nonsensical information.

33
Multi-Selectmedium

Which THREE factors are most important for ensuring the accuracy of an AI model in a CRM context? (Select 3)

Select 3 answers
A.Relevant features selected for the prediction task
B.Large number of model parameters
C.Representative training data
D.Higher inference speed
E.Clean and complete data
AnswersA, C, E

Feature selection directly affects model performance.

Why this answer

Data quality, representative training data, and relevant features directly impact model accuracy.

34
MCQmedium

A customer service chatbot misinterprets user requests and often provides irrelevant answers. The development team wants to improve the chatbot's understanding of user intent. Which NLP component should they focus on?

A.Text generation
B.Sentiment analysis
C.Named entity recognition
D.Intent detection
AnswerD

Correct: intent detection determines the user's goal, enabling the correct response.

Why this answer

Intent detection is the NLP task that identifies what the user wants to do. Improving it directly addresses misinterpretation of requests.

35
MCQmedium

A retailer wants to use computer vision to automatically identify products from images uploaded by customers for a return process. Which computer vision task is required?

A.Object detection
B.Image classification
C.Product recognition
D.Text extraction (OCR)
AnswerC

Correct. Product recognition identifies specific products from images.

Why this answer

Product recognition is a specific computer vision task that identifies objects (products) in images.

36
MCQmedium

A company wants to build a customer service chatbot that answers questions about their internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?

A.Use a larger foundation model with a longer context window and paste all documents into each prompt
B.Train a custom model from scratch on the policy documents each month
C.Fine-tune a base LLM on the policy documents monthly
D.Use Retrieval-Augmented Generation (RAG) with the policy documents indexed in a vector store
AnswerD

RAG retrieves relevant document chunks at query time, ensuring the chatbot always answers from the latest uploaded documents without any model retraining.

Why this answer

RAG (Retrieval-Augmented Generation) allows the LLM to retrieve relevant document sections at inference time, so knowledge stays current without retraining. The other options either require expensive retraining for each update or lack document grounding.

37
MCQhard

A company uses an NLP model to detect intent in customer messages. The model works well for English but fails for Spanish messages. What is the most likely cause?

A.The model is overfitted to English
B.The inference threshold is too high
C.The model uses the wrong algorithm
D.The training data is predominantly English
AnswerD

Models perform poorly on languages unseen in training data.

Why this answer

If the training data lacks Spanish examples, the model cannot learn Spanish patterns.

38
MCQeasy

A sales team notices that their lead scoring model assigns high scores to leads that rarely convert. The model was trained on data from the past 5 years. What is the most likely cause?

A.The features used are irrelevant to lead conversion
B.The model is overfitting to noise in the training data
C.The model is underfitting and needs more features
D.There is concept drift because buying behaviors have changed
AnswerD

Correct. Changes in market conditions or buyer behavior cause old data to become less predictive.

Why this answer

Concept drift occurs when the relationship between features and labels changes over time. Old data may no longer represent current sales patterns.

39
MCQmedium

A company wants to automatically extract key information like order numbers and dates from customer emails. Which NLP technique should be used?

A.Entity extraction
B.Sentiment analysis
C.Text classification
D.Text summarization
AnswerA

Entity extraction identifies structured information like order numbers and dates from unstructured text.

Why this answer

Entity extraction (named entity recognition) identifies specific pieces of information such as dates, order numbers, and names from text.

40
MCQmedium

A customer support center wants to automatically route incoming cases to the appropriate department based on the issue description. Which NLP task is most relevant?

A.Intent detection
B.Entity extraction
C.Sentiment analysis
D.Text classification
AnswerD

Correct. Classifies the case into predefined categories for routing.

Why this answer

Text classification assigns a category (e.g., billing, technical) to a text, enabling automated routing.

41
MCQhard

A predictive model for lead scoring shows high precision but low recall. Which business impact is most likely?

A.Many high-quality leads are not followed up
B.The model is overfitted to training data
C.Sales team spends time on many leads that never convert
D.The model has high bias
AnswerA

Correct: low recall means the model misses a large fraction of true hot leads.

Why this answer

High precision means most leads flagged as hot are indeed hot, but low recall means many actual hot leads are missed, causing lost opportunities.

42
MCQeasy

What is the primary difference between narrow AI and general AI?

A.Narrow AI is task-specific; general AI can perform any intellectual task a human can
B.Narrow AI can perform any intellectual task; general AI is limited to one task
C.Narrow AI uses machine learning; general AI uses deep learning
D.General AI is already deployed in CRM systems; narrow AI is experimental
AnswerA

Narrow AI excels at one domain; general AI would be versatile but remains theoretical.

Why this answer

Narrow AI is designed for specific tasks, while general AI (AGI) would have human-like broad capabilities, which does not yet exist.

43
Multi-Selecthard

A data scientist is building a churn prediction model. The dataset has 95% non-churn and 5% churn. Which THREE actions should the data scientist take to address the class imbalance?

Select 3 answers
A.Train the model on the original dataset without changes
B.Use accuracy as the primary metric
C.Undersample the non-churn class
D.Use precision, recall, or F1 score instead of accuracy
E.Oversample the churn class
AnswersC, D, E

Reduces majority class size to balance.

Why this answer

Oversampling minority class, undersampling majority class, and using appropriate metrics (like precision/recall) are common approaches. Using accuracy is not recommended. Training on original data without adjustment will produce a biased model.

44
Multi-Selectmedium

A data scientist trains a model to predict customer churn. The model performs well on training data but poorly on test data. Which TWO issues are most likely?

Select 2 answers
A.Overfitting
B.Insufficient training data
C.High bias
D.Underfitting
E.Data leakage
AnswersA, E

Model fits noise in training data, leading to poor generalization.

Why this answer

Overfitting means the model memorized training data and fails to generalize. Data leakage inflates training performance but not test performance.

45
Multi-Selecthard

A data scientist is building a churn prediction model. What THREE factors are most critical for model success?

Select 3 answers
A.High number of model parameters
B.Relevant features correlated with churn
C.Representative dataset reflecting all customer segments
D.Quality labeled historical data
E.Large dataset size
AnswersB, C, D

Good features drive model performance.

Why this answer

Quality labeled historical data ensures the model learns from accurate patterns. Relevant features (e.g., usage frequency, support tickets) improve predictions. A representative dataset avoids bias and ensures generalization.

46
MCQmedium

A company’s AI model recommends products to customers. The team wants to measure how often the recommended products are actually purchased. Which metric is most appropriate?

A.F1 score
B.Precision
C.Recall
D.Accuracy
AnswerB

Correct. Precision = TP/(TP+FP), focusing on the correctness of positive predictions.

Why this answer

Precision measures the proportion of recommended items that are relevant (purchased). For product recommendations, precision is key.

47
MCQeasy

A customer support team wants to automatically categorize incoming cases into predefined categories such as Billing, Technical, or Account. Which NLP task is most appropriate?

A.Intent detection
B.Entity extraction
C.Sentiment analysis
D.Text classification
AnswerD

Text classification assigns predefined labels (Billing, Technical, Account) to text, exactly matching the requirement.

Why this answer

Text classification assigns predefined labels to text, making it the right choice for categorizing support cases.

48
MCQmedium

A sales director wants to implement lead scoring but has no historical data on which leads converted. What approach can the team use to start?

A.Use unsupervised learning to cluster leads into segments and score based on cluster characteristics
B.Train a supervised model using assumptions as labels
C.Skip lead scoring until enough conversion data is collected
D.Use reinforcement learning to learn scoring from sales team feedback
AnswerA

Correct. Clustering reveals patterns; leads in high-value clusters get higher scores.

Why this answer

Unsupervised learning can cluster leads based on similarities, providing initial scores without labeled outcomes.

49
MCQeasy

A customer service team wants to automatically detect the intent of incoming chat messages (e.g., complaint, inquiry, purchase). Which AI technique is BEST suited for this task?

A.Generative AI
B.Reinforcement learning
C.Natural language processing (NLP)
D.Computer vision
AnswerC

NLP is designed to interpret and classify text, making it ideal for intent detection from chat messages.

Why this answer

Natural language processing (NLP) deals with understanding human language, including intent detection. Computer vision is for images, reinforcement learning for decision-making, and generative AI for content creation.

50
MCQeasy

What is the primary difference between narrow AI and general AI?

A.Narrow AI is rule-based, while general AI uses machine learning
B.Narrow AI requires labeled data, while general AI can learn without data
C.Narrow AI is designed for a specific task, while general AI can understand and reason across a wide range of tasks
D.Narrow AI can perform any intellectual task, while general AI is limited to one task
AnswerC

Narrow AI is specialized; general AI has broad cognitive abilities.

Why this answer

Narrow AI excels at specific tasks, while general AI would possess human-like cognitive abilities across many domains. General AI does not yet exist. The other options confuse definitions.

51
Multi-Selectmedium

A company wants to use AI to reduce customer churn. Which TWO approaches are most appropriate? (Select 2)

Select 2 answers
A.Use predictive AI to score churn risk based on historical data
B.Use generative AI to create personalized retention offers
C.Use computer vision to analyze customer photos
D.Use reinforcement learning to train a chatbot
E.Use sentiment analysis on customer support interactions to detect dissatisfaction
AnswersA, E

Predictive models can estimate churn probability from historical patterns.

Why this answer

Predictive AI can forecast churn likelihood, and sentiment analysis can identify dissatisfied customers early.

52
MCQeasy

Which type of machine learning is used when a model is trained on historical sales data that includes both input features and the known outcome (e.g., closed won/lost) to predict whether a new lead will convert?

A.Semi-supervised learning
B.Supervised learning
C.Reinforcement learning
D.Unsupervised learning
AnswerB

Correct: supervised learning uses labeled outcomes to train a predictive model.

Why this answer

Supervised learning uses labeled training data where the correct output is provided. Lead scoring with historical outcomes is a classic supervised learning task.

53
MCQmedium

A company wants to generate personalized product recommendations for each customer based on their purchase history and browsing behavior. Which approach is MOST appropriate?

A.Train a supervised learning model on historical purchase data to predict the next product a customer will buy
B.Use a generative AI model to create new product descriptions for each customer
C.Deploy a reinforcement learning agent that explores different recommendations in real-time
D.Use an unsupervised learning algorithm to cluster customers and recommend popular items in each cluster
AnswerA

Supervised learning can use features like past purchases and browsing to predict the next purchase for each customer.

Why this answer

Supervised learning can predict what a customer might buy next based on labeled data (past purchases). Unsupervised learning can cluster customers but doesn't directly generate recommendations. Reinforcement learning is for dynamic environments, and generative AI creates content, not recommendations.

54
Multi-Selectmedium

A company is deploying an AI model to automatically classify customer emails into categories (Complaint, Inquiry, Feedback). They have 10,000 labeled emails. Which TWO actions are essential to ensure the model's accuracy? (Select TWO.)

Select 2 answers
A.Train the model until it achieves 100% accuracy on the training set
B.Split the data into training and test sets
C.Use only the most recent 1,000 emails for training to reflect current trends
D.Remove all emails that contain spelling errors
E.Ensure all three categories are adequately represented in the training data
AnswersB, E

Essential to evaluate model performance on unseen data.

Why this answer

Splitting data into train/test sets allows evaluation of generalization. Ensuring balanced representation of categories prevents bias towards majority classes.

55
MCQhard

A data scientist evaluates a churn prediction model. On the test set, the model achieves 99% accuracy, but the business reports that the model rarely flags actual churners. Which metric should the data scientist focus on to improve the model?

A.Recall
B.F1 score
C.Accuracy
D.Precision
AnswerA

Recall measures the fraction of actual churners correctly identified; improving recall directly addresses the business issue.

Why this answer

When classes are imbalanced (few churners), accuracy can be misleading. Recall measures how many actual churners are identified, which is the key business concern.

56
MCQhard

A company uses computer vision to scan receipts for expense reporting. The model performs well on high-resolution scans but poorly on blurry photos. Which improvement is most effective?

A.Decrease the learning rate
B.Add blurry images to the training data
C.Use a larger batch size
D.Increase the model's number of layers
AnswerB

Training on blurry examples teaches the model to handle that variation.

Why this answer

Augmenting training data with blurry images helps the model learn to handle various quality levels.

57
MCQhard

A financial services firm uses an AI model to approve loan applications. They discover that the model denies loans at a higher rate for a certain demographic group, even when financial indicators are similar. What is the primary ethical concern?

A.Bias in training data
B.Low model accuracy
C.Model overfitting
D.Hallucination
AnswerA

Correct. Disparate impact suggests historical biases in data leading to unfair model outcomes.

Why this answer

Bias in training data leads to unfair outcomes for certain groups. This is a fairness and bias issue.

58
MCQmedium

A data scientist trains a model to predict customer churn. The model achieves 98% accuracy on training data but only 72% on test data. What issue is most likely occurring?

A.Data leakage
B.Underfitting
C.Overfitting
D.Bias in the training data
AnswerC

The large gap between training and test accuracy indicates overfitting.

Why this answer

Overfitting occurs when a model learns training data too well, including noise, and fails to generalize to new data.

59
MCQeasy

Which of the following is an ethical concern when using AI to make decisions about customers?

A.AI models always improve over time without intervention
B.Bias in training data may lead to unfair treatment of certain customer groups
C.AI models require no human oversight
D.Deep learning is too complex for business use
AnswerB

Biased data can cause AI to make discriminatory decisions, a major ethical issue.

Why this answer

Bias in training data can lead to unfair or discriminatory outcomes, a key ethical concern in AI.

60
MCQmedium

A sales manager wants to predict which deals are likely to close this quarter. The CRM has rich historical data on won/lost opportunities, deal amount, and sales stage. Which AI approach is best suited for this task?

A.Predictive AI for opportunity scoring
B.Generative AI to create new sales content
C.Unsupervised learning to cluster opportunities
D.Reinforcement learning for sales strategy
AnswerA

Predictive AI can use historical data to predict the likelihood of a deal closing.

Why this answer

Predictive AI uses historical data to forecast outcomes, making it ideal for opportunity scoring.

61
MCQhard

A company uses an NLP model to detect customer intent from chat messages. The model correctly identifies 'billing question' 90% of the time for actual billing questions, but also flags many non-billing messages as billing (false positives). Which metric should the team prioritize to reduce false alarms?

A.Accuracy
B.F1 score
C.Precision
D.Recall
AnswerC

Correct. Precision = TP/(TP+FP); higher precision means fewer false alarms.

Why this answer

Precision focuses on the proportion of positive identifications that are correct; improving precision reduces false positives.

62
MCQmedium

An e-commerce company uses AI to provide product recommendations. The model suggests popular items but fails to personalize for individual users. Which type of learning could improve personalization?

A.Generative AI
B.Unsupervised learning
C.Reinforcement learning
D.Supervised learning
AnswerD

Supervised learning can train on user-item interactions to predict personalized recommendations.

Why this answer

Supervised learning can use user purchase history as labels to predict what a specific user might buy, enabling personalization.

63
MCQeasy

What does the term 'hallucination' refer to in the context of generative AI?

A.The model only works when the user prompts it with specific keywords
B.The model has a bug in the training algorithm
C.The model generates creative but accurate content
D.The model produces outputs that are not grounded in the training data or reality
AnswerD

Correct: hallucination refers to fabricated or incorrect information.

Why this answer

Hallucination is when a generative AI model produces content that is factually incorrect or nonsensical but presented as if it were true.

64
MCQmedium

A CRM team wants to predict the expected revenue from each opportunity. The data includes opportunity amount, close date, stage, and historical win rates. Which type of AI is best suited?

A.Unsupervised learning
B.Predictive AI
C.Generative AI
D.Reinforcement learning
AnswerB

Correct. Predictive AI forecasts numeric outcomes using regression models.

Why this answer

Predictive AI uses historical data to forecast outcomes. Regression (a type of predictive AI) predicts a continuous value like revenue.

65
MCQhard

A data scientist trains a churn prediction model on CRM data that includes customer tenure, support ticket count, and last purchase date. The model achieves 95% accuracy on training data but only 60% on a holdout validation set. What is the most likely issue?

A.The validation set contains data leakage
B.The model needs more features
C.The model is overfitting the training data
D.The model is underfitting the training data
AnswerC

Correct: high training accuracy but much lower validation accuracy indicates overfitting.

Why this answer

Large gap between training accuracy and validation accuracy is a classic sign of overfitting. The model memorized training data and fails to generalize.

66
Multi-Selecteasy

A bank is implementing an AI system to approve small business loans. Which TWO ethical considerations should be addressed?

Select 2 answers
A.Ensure training data does not contain historical biases against certain demographics
B.Use as many data points as possible to improve predictions
C.Replace all human loan officers with AI
D.Provide explanations to customers when AI denies a loan
E.Maximize model accuracy at all costs
AnswersA, D

Biased data can lead to discriminatory loan decisions.

Why this answer

Bias in training data can lead to unfair denials; transparency ensures customers understand AI decisions. Accuracy is important but not primarily an ethical issue.

67
MCQeasy

A marketing team wants to use AI to predict which leads are most likely to convert. The CRM contains historical lead data with conversion outcomes. Which type of machine learning should be used?

A.Unsupervised learning
B.Generative AI
C.Reinforcement learning
D.Supervised learning
AnswerD

Supervised learning uses labeled historical data to train a model to predict conversion probability.

Why this answer

Supervised learning uses labeled data (historical outcomes) to predict future outcomes, making it ideal for lead scoring.

68
MCQhard

A data scientist notices that a sentiment analysis model performs well on general product reviews but fails to correctly classify negative sentiment in industry-specific jargon (e.g., 'the API is flaky'). The most likely cause is:

A.The training data lacked examples of industry-specific language
B.The model is overfitted to training data
C.The model architecture is too simple
D.Sentiment analysis cannot handle jargon
AnswerA

Correct: if training data does not include domain terms, the model cannot learn them.

Why this answer

The model was likely trained on general data and does not generalize to domain-specific language. This is a domain adaptation issue, which can be due to training data not being representative.

69
Multi-Selectmedium

Which TWO statements correctly describe predictive AI compared to generative AI?

Select 2 answers
A.Generative AI is best suited for lead scoring
B.Predictive AI is used for forecasting numeric outcomes like churn probability
C.Generative AI produces new content such as text, images, or code
D.Both predictive and generative AI rely on the same algorithms
E.Predictive AI can generate realistic customer service responses
AnswersB, C

Predictive AI excels at regression and classification tasks.

Why this answer

Predictive AI forecasts outcomes (churn, CLV) using historical data, while generative AI creates new content like text or images.

70
Multi-Selectmedium

A company wants to use AI to automatically route customer support emails to the appropriate department (billing, technical, sales). Which THREE AI capabilities are needed? (Choose three.)

Select 3 answers
A.Intent detection
B.Entity extraction
C.Sentiment analysis
D.Text classification
E.Generative AI
AnswersA, B, D

Identifies the customer's intent, which maps to a department.

Why this answer

Text classification categorizes emails, intent detection identifies the purpose, and entity extraction can identify account numbers or products to aid routing. Sentiment analysis is about emotion, not routing. Generative AI creates responses, not needed for routing.

71
MCQmedium

A model predicts customer churn with 95% accuracy, but most customers who actually churn are not flagged by the model. Which metric should the team improve?

A.Precision
B.Recall
C.Accuracy
D.F1 score
AnswerB

Recall measures the proportion of actual churners correctly predicted. Low recall indicates many churners are missed.

Why this answer

Recall (true positive rate) measures how many actual churners are correctly identified. High accuracy with low recall indicates the model is biased toward the majority class (non-churners). Precision focuses on how many flagged churners actually churn, F1 is a balance, and accuracy is already high.

72
MCQmedium

A model predicts customer lifetime value with high precision but low recall on high-value customers. What is the business impact?

A.Many false negatives for high-value customers
B.The model is overfitted
C.Many false positives for high-value customers
D.High accuracy on low-value customers
AnswerA

Low recall means the model fails to identify many actual high-value customers.

Why this answer

Low recall means many high-value customers are missed, leading to lost opportunities to retain them.

73
MCQhard

A predictive model for opportunity scoring shows high precision but low recall. Which business impact is most likely?

A.The model is overfitting to the training data
B.The sales team wastes time on many low-quality opportunities
C.The model overestimates the likelihood of conversion for most opportunities
D.The sales team misses some convertible opportunities
AnswerD

Low recall means many actual winners are not flagged, causing missed opportunities.

Why this answer

High precision means few false positives (won deals predicted correctly), but low recall means many actual wins are missed. The sales team will likely miss out on some convertible opportunities.

74
MCQeasy

What type of AI is designed to perform a specific task, such as playing chess or recommending products?

A.Super AI
B.Reactive AI
C.Narrow AI
D.General AI
AnswerC

Narrow AI is specialized for specific tasks like recommendations.

Why this answer

Narrow AI, also called weak AI, is specialized for one task.

75
MCQmedium

A company wants to use AI to automatically extract invoice numbers, dates, and totals from scanned invoices. Which AI capability is MOST relevant?

A.Intent detection
B.Generative AI to create invoices
C.Sentiment analysis
D.Computer vision with entity extraction
AnswerD

Computer vision recognizes text in images, and entity extraction pulls out specific fields.

Why this answer

Computer vision can process images of documents, and entity extraction (often part of NLP) identifies specific data points. Sentiment analysis is for emotions, intent detection for purpose, and generative AI for content creation.

Page 1 of 2 · 84 questions totalNext →

Ready to test yourself?

Try a timed practice session using only Sfai Ai Fundamentals questions.