RAG Workbench

Explore each step of the RAG pipeline through interactive visualizations

Text Splitting

Visualize how documents are split into meaningful chunks while preserving semantic coherence and natural boundaries.

Fixed Character

Simple uniform segmentation based on predetermined character length. Best for prototyping and resource-constrained environments.
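As a rough illustration of fixed-character splitting, the sketch below cuts text into windows of a set length. The function name `split_fixed` and the `overlap` parameter are assumptions made for this example; this is not the workbench's own implementation.

```python
def split_fixed(text: str, chunk_size: int, overlap: int = 0) -> list[str]:
    """Cut text into windows of at most chunk_size characters.

    Consecutive windows share `overlap` characters (hypothetical parameter).
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once this window reaches the end of the text, so no
        # trailing chunk consists only of already-covered overlap.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks


if __name__ == "__main__":
    print(split_fixed("abcdefghij", chunk_size=4, overlap=1))
```

Because the cut points ignore word and sentence boundaries, chunks can start or end mid-word. That is the trade-off for its simplicity.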

Recursive Character

Multi-tier algorithm that splits on progressively finer separators to preserve natural language boundaries. Recommended when semantic coherence matters.
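The recursive idea can be sketched as follows: try the coarsest separator first, and fall back to finer ones only for pieces that are still too long. This is a simplified illustration, not the LangChain implementation. The name `split_recursive` and the default separator list are assumptions, and unlike the behavior the note below describes, this version never exceeds the limit.

```python
def split_recursive(text: str, chunk_size: int,
                    separators: tuple[str, ...] = ("\n\n", "\n", " ", "")) -> list[str]:
    """Split text on the coarsest separator that works, recursing on long pieces."""
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators or separators[0] == "":
        # Last resort: hard cut at chunk_size characters.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

    sep, finer = separators[0], separators[1:]
    chunks: list[str] = []
    current = ""
    for piece in text.split(sep):
        if not piece:
            continue
        candidate = piece if not current else current + sep + piece
        if len(candidate) <= chunk_size:
            current = candidate  # keep merging small pieces
            continue
        if current:
            chunks.append(current)
            current = ""
        if len(piece) <= chunk_size:
            current = piece
        else:
            # Piece is still too long: retry with finer separators.
            chunks.extend(split_recursive(piece, chunk_size, finer))
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = "Alpha beta gamma.\n\nDelta epsilon zeta eta theta.\n\nIota."
    for chunk in split_recursive(sample, chunk_size=20):
        print(repr(chunk))
```

In the sample, the first paragraph fits in one chunk, while the second is too long and is split again at spaces.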

Parent-Child

Dual-tier architecture using fine-grained chunks for matching while maintaining parent documents for rich context.
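A minimal sketch of the parent-child pattern follows. Small child chunks are used for matching, and each one carries the index of the larger parent chunk that is returned as context. All names here (`Child`, `build_index`, `retrieve_parent`) are hypothetical, and the word-overlap `score` function is only a stand-in for embedding similarity.

```python
from dataclasses import dataclass


@dataclass
class Child:
    text: str
    parent_id: int  # index of the parent chunk this child was cut from


def fixed_chunks(text: str, size: int) -> list[str]:
    return [text[i:i + size] for i in range(0, len(text), size)]


def build_index(document: str, parent_size: int, child_size: int):
    """Split into large parents, then split each parent into small children."""
    parents = fixed_chunks(document, parent_size)
    children = [
        Child(text=piece, parent_id=pid)
        for pid, parent in enumerate(parents)
        for piece in fixed_chunks(parent, child_size)
    ]
    return parents, children


def score(query: str, text: str) -> int:
    # Toy relevance: number of shared lowercase words (placeholder for
    # vector similarity in a real retriever).
    return len(set(query.lower().split()) & set(text.lower().split()))


def retrieve_parent(query: str, parents: list[str], children: list[Child]) -> str:
    """Match against children, but return the enclosing parent chunk."""
    best = max(children, key=lambda c: score(query, c.text))  # assumes non-empty index
    return parents[best.parent_id]


if __name__ == "__main__":
    doc = "aaaa bbbb cccc dddd eeee ffff gggg hhhh "
    parents, children = build_index(doc, parent_size=20, child_size=10)
    print(repr(retrieve_parent("gggg", parents, children)))
```

The query matches a 10-character child, but the caller receives the full 20-character parent, which is the point of the two tiers.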

Note: Some chunks may exceed the configured chunk size. When the splitter merges adjacent segments, the combined length can surpass the set value.
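One way to spot over-limit chunks after splitting is a simple length check, sketched below. The helper name `find_oversized` is an assumption for illustration and is not part of any library.

```python
def find_oversized(chunks: list[str], chunk_size: int) -> list[tuple[int, int]]:
    """Return (index, length) for every chunk longer than chunk_size."""
    return [(i, len(c)) for i, c in enumerate(chunks) if len(c) > chunk_size]


if __name__ == "__main__":
    print(find_oversized(["abc", "abcdef", "ab"], chunk_size=4))
```

Flagged chunks can then be split again or accepted as-is, depending on how strict the downstream size limit is.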
