A legal firm wants to automate contract analysis to extract key clauses and risks. They have 10,000 contracts in PDF format. The solution must handle varying layouts and be cost-effective. Which approach is BEST?
Document AI handles layout parsing, and the structured text is then fed to a GenAI model with a well-designed prompt. This combination is scalable and cost-effective.
Why this answer
Using Document AI to parse PDFs into text, then Vertex AI with a structured prompt for clause extraction combines robust document understanding with flexible GenAI. Fine-tuning on 100 contracts is insufficient for layout variation. Agent Builder is overkill.
Direct Gemini API on raw PDFs loses document structure.