A company wants to build a customer service chatbot that answers questions about its internal policy documents. The documents are updated monthly, and the team cannot afford to retrain a model each time. Which approach is MOST appropriate?
Retrieval-augmented generation (RAG), which retrieves relevant document chunks at query time and so avoids retraining.
Why this answer
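RAG lets the LLM retrieve relevant document sections at inference time, so its answers reflect the current documents without retraining. When the policies change each month, only the document store needs to be refreshed; the model weights stay untouched.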
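The retrieval step can be illustrated with a minimal sketch. It is not a production design: the policy snippets, the function names (`tokenize`, `cosine`, `retrieve`, `build_prompt`), and the bag-of-words cosine scoring are all assumptions made for illustration. A real system would more likely use embedding vectors and a vector index, and would send the resulting prompt to an LLM.

```python
import math
import re
from collections import Counter

# Hypothetical policy chunks; in practice these come from the real documents.
POLICY_CHUNKS = [
    "Refunds are issued within 14 days of a return request.",
    "Employees accrue 1.5 vacation days per month of service.",
    "Passwords must be rotated every 90 days.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=1):
    """Return the k chunks most similar to the query, best first."""
    query_vec = Counter(tokenize(query))
    ranked = sorted(
        chunks,
        key=lambda chunk: cosine(query_vec, Counter(tokenize(chunk))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, chunks, k=1):
    """Assemble the text that would be sent to the LLM (the call itself is omitted)."""
    context = "\n".join(retrieve(query, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

Under these assumptions, a monthly update amounts to changing the chunk list, for example `retrieve(question, POLICY_CHUNKS + [new_chunk])`. Nothing about the model changes, which is the property the question is testing.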