Semantic vs Naive Chunking Analysis
This research tool demonstrates the impact of different chunking strategies on RAG system performance. Compare semantic chunking (similarity-based) against naive chunking (fixed-size) using comprehensive RAGAS metrics and statistical analysis.
OpenAI API Key
Your API key is stored locally in your browser and never sent to our servers.
Don't have an API key? You can still use the application with simulated results.
Get an API key from OpenAI.
Document Input
or drag and drop
PDF, TXT, or MD files up to 4.0 MB
0 characters • 0 words
Or try a sample document:
Experiment Configuration
Semantic Chunking Parameters
Higher values (0.7-0.9) create focused chunks better for Q&A. Lower values (0.5-0.7) preserve more context.
Maximum tokens per semantic chunk
Minimum tokens per semantic chunk
Naive Chunking Parameters
Fixed size for naive chunks (tokens)
Token overlap between consecutive chunks