A company wants to use AI to automatically transcribe customer service calls into text. Which AWS service is most suitable?
Transcribe is designed for speech-to-text conversion.
Why this answer
Amazon Transcribe is the correct choice because it is a fully managed automatic speech recognition (ASR) service designed specifically to convert speech into text. It can handle real-time streaming or batch processing of audio files, making it ideal for transcribing customer service calls into searchable text.
Exam trap
The trap here is that candidates often confuse Amazon Transcribe (speech-to-text) with Amazon Polly (text-to-speech) or assume Amazon Comprehend can process audio directly, when in fact Comprehend only works on text input.
How to eliminate wrong answers
Option B is wrong because Amazon Comprehend is a natural language processing (NLP) service used for extracting insights like sentiment, entities, and key phrases from text, not for transcribing audio. Option C is wrong because Amazon Polly is a text-to-speech (TTS) service that converts text into lifelike speech, the opposite of the required speech-to-text functionality. Option D is wrong because Amazon Rekognition is a computer vision service for analyzing images and videos, such as object detection and facial recognition, and has no capability to process audio or transcribe speech.