Schema URL

Type: object

Properties

symbols object
silence string[]

The symbol(s) used to indicate silence.

Default:
[
  "<SIL>"
]
punctuation

EveryVoice combines punctuation and normalizes it into a set of five permissible types of punctuation to keep training tractable.

All of: Punctuation object
to_replace Record<string, string>
Default:
{}
cleaners string[]
g2p_engines Record<string, string>

User-defined or external G2P engines. See https://github.com/EveryVoiceTTS/everyvoice_g2p_template_plugin to implement your own G2P.

Default:
{}
Examples: {"fr": "everyvoice_plugin_g2p4example.g2p"}
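
The example value above looks like a dotted path to a callable. As a minimal sketch of how such a string could be resolved, assuming the form `module.function`, the snippet below imports the module and looks up the attribute. The helper name `load_g2p_engine` is hypothetical and this is not EveryVoice's own loader; a standard-library function stands in for a real G2P plugin so the sketch runs without extra packages.

```python
import importlib
from typing import Callable


def load_g2p_engine(dotted_path: str) -> Callable:
    """Import `package.module.function` and return the function object.

    Hypothetical helper: assumes the config value is a dotted path whose
    last component names a callable inside the module before it.
    """
    module_name, _, attr_name = dotted_path.rpartition(".")
    if not module_name:
        raise ValueError(f"expected 'module.function', got {dotted_path!r}")
    module = importlib.import_module(module_name)
    return getattr(module, attr_name)


# Stand-in entry: `builtins.list` splits a word into characters and plays
# the role of a G2P function here. A real config would point at a plugin.
g2p_engines = {"xx": "builtins.list"}
engines = {lang: load_g2p_engine(path) for lang, path in g2p_engines.items()}
print(engines["xx"]("abc"))  # ['a', 'b', 'c']
```

Resolving the path at load time means a misspelled module or function fails early with an ImportError or AttributeError, before any text is processed.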

Definitions

Punctuation object
exclamations string[]

Exclamation punctuation symbols used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  "!",
  "¡"
]
question_symbols string[]

Question/interrogative punctuation symbols used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  "?",
  "¿"
]
quotemarks string[]

Quotation mark symbols used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  "\"",
  "'",
  "“",
  "”",
  "«",
  "»"
]
parentheses string[]

Punctuation symbols indicating parentheses, brackets, or braces. EveryVoice replaces these symbols with an internal token.

Default:
[
  "(",
  ")",
  "[",
  "]",
  "{",
  "}"
]
periods string[]

Punctuation symbols indicating a 'period' used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  "."
]
colons string[]

Punctuation symbols indicating a 'colon' used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  ":"
]
semi_colons string[]

Punctuation symbols indicating a 'semi-colon' used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  ";"
]
hyphens string[]

Punctuation symbols indicating a 'hyphen' used in your datasets. The asterisk (*) is treated as a hyphen by default because unidecode decodes middle-dot punctuation as an asterisk. EveryVoice replaces these symbols with an internal token.

Default:
[
  "-",
  "—",
  "*"
]
commas string[]

Punctuation symbols indicating a 'comma' used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  ","
]
ellipses string[]

Punctuation symbols indicating ellipses used in your datasets. EveryVoice replaces these symbols with an internal token.

Default:
[
  "…"
]
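
The category lists above each collapse several symbols into one internal token. The sketch below illustrates that idea with the documented defaults. The token names it produces (for example `<QUESTION_SYMBOLS>`) are hypothetical placeholders derived from the property names, not EveryVoice's actual internal tokens, and the functions are illustrative rather than part of the library.

```python
from __future__ import annotations

# Default symbol lists copied from the Punctuation object above.
PUNCTUATION_DEFAULTS = {
    "exclamations": ["!", "¡"],
    "question_symbols": ["?", "¿"],
    "quotemarks": ['"', "'", "“", "”", "«", "»"],
    "parentheses": ["(", ")", "[", "]", "{", "}"],
    "periods": ["."],
    "colons": [":"],
    "semi_colons": [";"],
    "hyphens": ["-", "—", "*"],
    "commas": [","],
    "ellipses": ["…"],
}


def internal_token(category: str) -> str:
    """Hypothetical placeholder; the real token names are not given here."""
    return f"<{category.upper()}>"


def build_replacement_table(punctuation: dict[str, list[str]]) -> dict[str, str]:
    """Map every configured symbol to the placeholder token of its category."""
    table = {}
    for category, symbols in punctuation.items():
        for symbol in symbols:
            table[symbol] = internal_token(category)
    return table


def normalize(text: str, table: dict[str, str]) -> list[str]:
    """Split text into characters, swapping configured punctuation for tokens."""
    return [table.get(char, char) for char in text]


table = build_replacement_table(PUNCTUATION_DEFAULTS)
print(normalize("¿Hi?", table))
# ['<QUESTION_SYMBOLS>', 'H', 'i', '<QUESTION_SYMBOLS>']
```

Both "¿" and "?" end up as the same token, which is the point of grouping symbols by category: the model sees one symbol per punctuation type regardless of how the dataset spells it.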
Symbols object
silence string[]

The symbol(s) used to indicate silence.

Default:
[
  "<SIL>"
]
punctuation

EveryVoice combines punctuation and normalizes it into a set of five permissible types of punctuation to keep training tractable.

All of: Punctuation object
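
To show how the properties nest, here is a sketch that assembles the documented defaults into one object and prints it as JSON. The variable name `text_config` is an assumption for illustration. `cleaners` is left out because this page gives no default for it.

```python
import json

# Key names and default values are taken from the property lists above.
text_config = {
    "symbols": {
        "silence": ["<SIL>"],
        "punctuation": {
            "exclamations": ["!", "¡"],
            "question_symbols": ["?", "¿"],
            "quotemarks": ['"', "'", "“", "”", "«", "»"],
            "parentheses": ["(", ")", "[", "]", "{", "}"],
            "periods": ["."],
            "colons": [":"],
            "semi_colons": [";"],
            "hyphens": ["-", "—", "*"],
            "commas": [","],
            "ellipses": ["…"],
        },
    },
    "to_replace": {},
    "g2p_engines": {},
}

# ensure_ascii=False keeps symbols such as "¿" readable in the output.
print(json.dumps(text_config, ensure_ascii=False, indent=2))
```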