A team is implementing a RAG system for a legal document Q&A. They need to chunk documents effectively. Which THREE chunking strategies should they consider to improve retrieval accuracy for legal texts that contain hierarchical sections (clauses, sub-clauses, definitions)?
Allows retrieval of granular chunks while maintaining broader context.
Why this answer
Semantic chunking splits at natural boundaries (e.g., paragraphs, sections), preserving meaning. Hierarchical chunking indexes with parent-child relationships for context. Fixed-size chunking is simple but may break sentences or clauses.
Overlapping chunks can help but is not a primary strategy for accuracy; sliding window is a specific technique.