AIF-C01 Fundamentals of Generative AI • Set 6
AIF-C01 Fundamentals of Generative AI Practice Test 6: 15 questions with explanations.
A team is developing a real-time code completion feature using an LLM deployed on Amazon SageMaker. They observe high latency under load. Which optimization technique should they prioritize?