Text-to-Speech SSML support RRS feed

  • Question

  • The text-to-speech HTML interface accepts the input as an SSML document.  I was trying to use the following features of SSML without any luck:

    • <say-as interpret-as="ordinal">3</say-as>   - should say "third"
    • <phoneme> - to render speech by its phonetic pronunciation. For example I get the wrong pronunciation of 'record'.  I want to force it to use the correct pronunciation.  (ie record player vs record an song).

    Use SSML to Control Synthesized Speech

    Most of the things I tired were from the old Microsoft Speech SDK documentation.  Is there any guidance on what is supported, what is not supported and any plans on adding support things such as <say-as> or <phoneme>?

    Thursday, May 12, 2016 7:10 AM


  • Hi,

    Our latest Speech API SDKs for Cognitive Services can be found here:



    Wednesday, November 9, 2016 7:31 PM