Type object
Schema URL https://catalog.lintel.tools/schemas/schemastore/eidolon-resource/_shared/latest--HTMLHeaderTextSplitter.json
Parent schema eidolon-resource

Properties

implementation const: "HTMLHeaderTextSplitter" required

Must be the constant "HTMLHeaderTextSplitter".
headers_to_split_on array[] required

Header tags to split on and track, e.g. h1, h2

chunk_size integer

Maximum size of chunks to return

Default: 4000
chunk_overlap integer

Overlap in characters between chunks

Default: 200
keep_separator boolean

Whether to keep the separator in the chunks

Default: false
strip_whitespace boolean

If true, strips whitespace from the start and end of every document

Default: true
return_each_element boolean

Return each element with its associated headers

Default: false
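
As an illustration of how the properties above fit together, the following is a minimal Python sketch that builds a configuration object with the two required keys and the listed defaults. The helper name build_splitter_config is hypothetical, and the [tag, label] shape of each headers_to_split_on entry is an assumption: the schema only states array[].

```python
import json

# Defaults copied from the property listing above.
DEFAULTS = {
    "chunk_size": 4000,
    "chunk_overlap": 200,
    "keep_separator": False,
    "strip_whitespace": True,
    "return_each_element": False,
}


def build_splitter_config(headers_to_split_on, **overrides):
    """Hypothetical helper: return a config dict with required keys and defaults."""
    if headers_to_split_on is None:
        raise ValueError("headers_to_split_on is required")
    for key, value in overrides.items():
        if key not in DEFAULTS:
            raise ValueError(f"unknown property: {key}")
        # Exact type match, so a bool is not accepted where an integer is expected.
        if type(value) is not type(DEFAULTS[key]):
            raise TypeError(f"{key} must be of type {type(DEFAULTS[key]).__name__}")
    config = {
        "implementation": "HTMLHeaderTextSplitter",
        # Assumption: each entry is a [tag, label] pair.
        "headers_to_split_on": [list(entry) for entry in headers_to_split_on],
    }
    config.update(DEFAULTS)
    config.update(overrides)
    return config


config = build_splitter_config(
    [("h1", "Header 1"), ("h2", "Header 2")],
    chunk_size=1000,
)
print(json.dumps(config, indent=2))
```

Only chunk_size is overridden here; the remaining optional properties keep their documented defaults. Whether the consuming tool accepts this exact header shape should be checked against the schema at the URL above.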