Documentation Index
Fetch the complete documentation index at: https://docs.upsonic.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Character splitter splits text using a single, specified separator. Ideal for documents with clear and consistent delimiters. Uses a direct “Split and Merge” process for efficiency and positional integrity. Splitter Class:CharacterChunker
Config Class: CharacterChunkingConfig
Dependencies
No additional dependencies required. Uses standard library.Examples
Parameters
| Parameter | Type | Description | Default | Source |
|---|---|---|---|---|
chunk_size | int | Target size of each chunk | 1024 | Base |
chunk_overlap | int | Overlapping units between chunks | 200 | Base |
min_chunk_size | int | None | Minimum size for a chunk | None | Base |
length_function | Callable[[str], int] | Function to measure text length | len | Base |
strip_whitespace | bool | Strip leading/trailing whitespace | False | Base |
separator | str | Single separator string or regex | "\n\n" | Specific |
is_separator_regex | bool | Treat separator as regex | False | Specific |
keep_separator | bool | Keep separator in chunks | True | Specific |

