EveryVoice TTS Toolkit Text Configuration 0.3 (SchemaStore) JSON Schema

Schema URL

https://catalog.lintel.tools/schemas/schemastore/everyvoice-tts-toolkit-text-configuration/versions/0.3.json

← Back to EveryVoice TTS Toolkit Text Configuration

Type: object

Properties

symbols object

2 nested properties

silence string[]

The symbol(s) used to indicate silence.

Default:

[
  "<SIL>"
]

punctuation

EveryVoice will combine punctuation and normalize it into a set of five permissible types of punctuation to help tractable training.

All of: Punctuation object

to_replace Record<string, string>

Default:

{}

cleaners string[]

g2p_engines Record<string, string>

User defined or external G2P engines. See https://github.com/EveryVoiceTTS/everyvoice_g2p_template_plugin to implement your own G2P.

Default:

{}

Examples: "{"fr": "everyvoice_plugin_g2p4example.g2p"}"

Definitions

Punctuation object

exclamations string[]

Exclamation punctuation symbols used in your datasets. Replaces these symbols with internally.

Default:

[
  "!",
  "¡"
]

question_symbols string[]

Question/interrogative punctuation symbols used in your datasets. Replaces these symbols with internally.

Default:

[
  "?",
  "¿"
]

quotemarks string[]

Quotemark punctuation symbols used in your datasets. Replaces these symbols with internally.

Default:

[
  "\"",
  "'",
  "“",
  "”",
  "«",
  "»"
]

parentheses string[]

Punctuation symbols indicating parentheses, brackets, or braces. Replaces these symbols with internally.

Default:

[
  "(",
  ")",
  "[",
  "]",
  "{",
  "}"
]

periods string[]

Punctuation symbols indicating a 'period' used in your datasets. Replaces these symbols with internally.

Default:

[
  "."
]

colons string[]

Punctuation symbols indicating a 'colon' used in your datasets. Replaces these symbols with internally.

Default:

[
  ":"
]

semi_colons string[]

Punctuation symbols indicating a 'semi-colon' used in your datasets. Replaces these symbols with internally.

Default:

[
  ";"
]

hyphens string[]

Punctuation symbols indicating a 'hyphen' used in your datasets. * is a hyphen by default since unidecode decodes middle-dot punctuation as an asterisk. Replaces these symbols with internally.

Default:

[
  "-",
  "—",
  "*"
]

commas string[]

Punctuation symbols indicating a 'comma' used in your datasets. Replaces these symbols with internally.

Default:

[
  ","
]

ellipses string[]

Punctuation symbols indicating ellipses used in your datasets. Replaces these symbols with internally.

Default:

[
  "…"
]

Symbols object

silence string[]

The symbol(s) used to indicate silence.

Default:

[
  "<SIL>"
]

punctuation

EveryVoice will combine punctuation and normalize it into a set of five permissible types of punctuation to help tractable training.

All of: Punctuation object