latest--MarkdownHeaderTextSplitter
shared| Type | object |
|---|---|
| Schema URL | https://catalog.lintel.tools/schemas/schemastore/eidolon-resource/_shared/latest--MarkdownHeaderTextSplitter.json |
| Parent schema | eidolon-resource |
Type:
object
Properties
implementation
const: "MarkdownHeaderTextSplitter"
required
headers_to_split_on
array[]
required
Headers we want to track, e.g., #, ##, etc.
chunk_size
integer
Maximum size of chunks to return
chunk_overlap
integer
Overlap in characters between chunks
keep_separator
boolean
Whether to keep the separator in the chunks
strip_whitespace
boolean
If True, strips whitespace from the start and end of every document
return_each_line
boolean
Return each line w/ associated headers