Which TWO factors are most important when selecting a foundation model in Amazon Bedrock for a text summarization task with strict latency requirements?
Low latency is critical for real-time summarization.
Why this answer
Options A and B are correct. Response latency directly impacts user experience, so a model with low latency is essential. Output quality/token ensures the summaries are accurate and concise.
Option C is wrong because fine-tuning increases cost and latency. Option D is wrong because model size affects latency but latency itself is the direct factor. Option E is wrong because input token limit is relevant but not as critical as latency and quality for this use case.