You are designing a solution that uses Azure OpenAI Service to generate product descriptions based on product attributes (name, category, features). The solution must: - Use a GPT-4 model deployed in the West US region. - Implement content filtering to block inappropriate content. - Handle up to 100 requests per second. - Minimize latency. - Use managed identity for authentication. What should you include in the design?
Meets all requirements with minimal complexity.
Why this answer
Option B is correct because deploying a single GPT-4 model with sufficient capacity (e.g., 100K TPM) ensures the solution can handle up to 100 requests per second while minimizing latency by avoiding cross-region calls. Enabling content filtering directly on the Azure OpenAI deployment blocks inappropriate content without additional services, and using a system-assigned managed identity provides secure, keyless authentication that aligns with Azure best practices.
Exam trap
The trap here is that candidates often overcomplicate the solution by adding unnecessary redundancy (multiple models or regions) or extra services (Azure AI Content Safety), when the built-in capabilities of Azure OpenAI—content filtering, managed identity, and sufficient TPM—directly satisfy all requirements with minimal latency.
How to eliminate wrong answers
Option A is wrong because deploying multiple GPT-4 models across different regions would increase latency due to cross-region network hops and does not address the requirement to minimize latency, as the model must be in West US. Option C is wrong because using API key authentication stored in Azure Key Vault violates the requirement to use managed identity for authentication, and deploying two GPT-4 models for load balancing is unnecessary when a single model with sufficient TPM capacity can handle 100 requests per second. Option D is wrong because Azure AI Content Safety is an additional service that adds latency and complexity; Azure OpenAI’s built-in content filtering already meets the requirement to block inappropriate content without needing an extra component.