Synthesis markup is a set of special tags inserted into text to control the speech synthesis process. The tags tell the synthesizer how to pronounce the text correctly, where to insert pauses, how to change intonation, where to place emphasis, and more.
Synthesis markup helps make synthesized speech sound more natural and intelligible. This is especially useful where accuracy and naturalness of speech matter most, for example in voice assistants.

| Description | Tags | Usage Example |
|---|---|---|
| Place stress in a word. Useful when the correct pronunciation of a word needs to be made explicit. | + | I'm +going to work. |
| Insert a pause between sentences. | sil<[t]>, where t is the pause duration in milliseconds. | This is the first sentence. sil<[500]> This is the second sentence. |
| Insert a context-dependent pause. This tag lets you adjust the pause duration to fit the meaning. | <[small]>. Acceptable values: tiny, small, medium, large, huge. | Tomorrow <[medium]> we are waiting for you in our salon. |
| Emphasize a word. | <[accented]> or **accented** | - And <[more]> you will get a discount on the first order.<br>- We are **glad** to see you. |
| Use phonetic pronunciation. This lets you correct the pronunciation of complex or non-standard words. | [[<phonemes>]] | [[zʲ ɪ m a]] has arrived. |
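
When marked-up text is assembled programmatically, it can help to generate the tags from small helper functions instead of typing them by hand. The sketch below is only an illustration of the tag syntax listed in the table: the function names (`pause_ms`, `pause`, `emphasize`, `phonetic`) are hypothetical and not part of any official SDK, and it assumes that tags are separated from the surrounding words by single spaces.

```python
# Hypothetical helpers that build the tags from the table above.
# None of these names come from an official library.

# Values accepted by the context-dependent pause tag.
PAUSE_SIZES = ("tiny", "small", "medium", "large", "huge")


def pause_ms(duration_ms: int) -> str:
    """Fixed-length pause: sil<[t]>, where t is in milliseconds."""
    if duration_ms < 0:
        raise ValueError("pause duration must be non-negative")
    return f"sil<[{duration_ms}]>"


def pause(size: str) -> str:
    """Context-dependent pause: <[size]>."""
    if size not in PAUSE_SIZES:
        raise ValueError(f"unknown pause size: {size!r}")
    return f"<[{size}]>"


def emphasize(word: str) -> str:
    """Emphasized word, using the **word** form."""
    return f"**{word}**"


def phonetic(*phonemes: str) -> str:
    """Phonetic pronunciation: [[p1 p2 ...]], phonemes separated by spaces."""
    return "[[" + " ".join(phonemes) + "]]"


# Assemble a marked-up string from plain text and generated tags.
text = " ".join([
    "This is the first sentence.",
    pause_ms(500),
    "This is the second sentence.",
])
print(text)
# This is the first sentence. sil<[500]> This is the second sentence.
```

The `emphasize` helper uses the `**word**` form rather than `<[word]>` as a design choice of this sketch: it keeps an emphasized word such as "small" from being confused with a pause value. Validating pause sizes in code catches a misspelled value before the text is sent for synthesis.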