Type object
Schema URL https://catalog.lintel.tools/schemas/schemastore/eidolon-resource/_shared/latest--MarkdownHeaderTextSplitter.json
Parent schema eidolon-resource
Type: object

Properties

implementation const: "MarkdownHeaderTextSplitter" required
Constant: "MarkdownHeaderTextSplitter"
headers_to_split_on array[] required

Headers we want to track, e.g., #, ##, etc.

chunk_size integer

Maximum size of chunks to return

Default: 4000
chunk_overlap integer

Overlap in characters between chunks

Default: 200
keep_separator boolean

Whether to keep the separator in the chunks

Default: false
strip_whitespace boolean

If True, strips whitespace from the start and end of every document

Default: true
return_each_line boolean

Return each line w/ associated headers

Default: false