You can use Speech Synthesis Markup Language (SSML) with text to speech to specify how the speech is pronounced. For example, you can use SSML with phonemes and a custom lexicon to improve pronunciation. You can also use SSML to define how a word or mathematical expression is pronounced. Refer to the sections below for details about how to use SSML elements to improve pronunciation. For more information about SSML syntax, see SSML document structure and events.

The phoneme element is used for phonetic pronunciation in SSML documents. Always provide human-readable speech as a fallback. Phonetic alphabets are composed of phones, which are made up of letters, numbers, or characters, sometimes in combination. Each phone describes a unique sound of speech. This is in contrast to the Latin alphabet, where any letter might represent multiple spoken sounds. Consider the different en-US pronunciations of the letter "c" in the words "candy" and "cease" or the different pronunciations of the letter combination "th" in the words "thing" and "those."

For a list of locales that support phonemes, see footnotes in the language support table.

Usage of the phoneme element's attributes is described below.

alphabet: The phonetic alphabet to use when you synthesize the pronunciation of the string in the ph attribute. The string that specifies the alphabet must be in lowercase letters. The alphabets referenced in this section are ipa and sapi. The alphabet applies only to the phoneme in the element.

ph: A string containing phones that specify the pronunciation of the word in the phoneme element. If the specified string contains unrecognized phones, text to speech rejects the entire SSML document and produces none of the speech output specified in the document. For ipa, to stress one syllable by placing the stress symbol before that syllable, you need to mark all syllables of the word; otherwise, the syllable before the stress symbol is stressed. For sapi, to stress one syllable, you need to place the stress symbol after that syllable, whether or not all syllables of the word are marked.

The supported values for attributes of the phoneme element were described previously. In the first two examples, the values of ph="tə.ˈmeɪ.toʊ" or ph="təmeɪˈtoʊ" are specified to stress the syllable meɪ.

You can define how single entities (such as a company, a medical term, or an emoji) are read in SSML by using the phoneme and sub elements. To define how multiple entities are read, create an XML structured custom lexicon file. Then you upload the custom lexicon XML file and reference it with the SSML lexicon element.

For a list of locales that support custom lexicon, see footnotes in the language support table.

The lexicon element is not supported by the Long Audio API. For long-form text to speech, use the batch synthesis API (Preview) instead.

Usage of the lexicon element's attributes is described below.

uri: The URI of the publicly accessible custom lexicon XML file with either the. Using Azure Blob Storage is recommended but not required.

For more information about the custom lexicon file, see Pronunciation Lexicon Specification (PLS) Version 1.0.
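As a rough illustration of the phoneme element described above, the following Python sketch builds a small SSML document and parses it back to confirm that it is well formed. The helper name `phoneme_ssml`, the word "tomato", and the voice name `en-US-JennyNeural` are assumptions chosen for the example, not values taken from this article; only the `ph` string comes from the text. The sketch checks the lowercase rule for the alphabet locally, but it cannot tell whether the service would recognize the phones.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

# Standard W3C SSML namespace.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def phoneme_ssml(word, ph, alphabet="ipa",
                 voice="en-US-JennyNeural", lang="en-US"):
    """Return an SSML string that wraps `word` in a phoneme element.

    `word` stays inside the element as the human-readable fallback.
    The voice name is an assumed placeholder; substitute your own.
    """
    # The alphabet string must be specified in lowercase letters.
    if alphabet != alphabet.lower():
        raise ValueError("alphabet must be specified in lowercase letters")
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice)}>'
        f'<phoneme alphabet={quoteattr(alphabet)} ph={quoteattr(ph)}>'
        f'{escape(word)}'
        f'</phoneme></voice></speak>'
    )


# Every syllable is marked, so the stress symbol before "meɪ" applies to it.
ssml = phoneme_ssml("tomato", "tə.ˈmeɪ.toʊ")

# Parsing the result is a cheap local check that the markup is well formed.
root = ET.fromstring(ssml)
phoneme = root.find(f".//{{{SSML_NS}}}phoneme")
print(phoneme.get("alphabet"), phoneme.get("ph"), phoneme.text)
```

Building the string with `quoteattr` and `escape` keeps special characters in the word or the phone string from breaking the markup. Because the service rejects the whole document when it meets unrecognized phones, a local parse like this only catches structural mistakes.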