locked
What is Voice Key RRS feed

All replies

  • Hi Bob

    Microsoft Cognitive Services includes the following Speech features

    Speech recognition

    Convert spoken audio to text. The API can be directed to turn on and recognize audio coming from the microphone in real-time, recognize audio coming from a different real-time audio source, or to recognize audio from within a file. In all cases, real-time streaming is available, so as the audio is being sent to the server, partial recognition results are also being returned.

    Speech intent recognition

    Convert spoken audio to intent. Similar to Speech Recognition, Speech Intent Recognition -in addition to returning recognized text from audio input- returns structured information about the incoming speech so that apps can easily parse the intent of the speaker, and subsequently drive further action.

    Text To Speech

    With this APIs developers can easily convert text to spoken audio. When applications need to “talk” back to their users, this API can be used to convert text that is generated by the app into audio that can be played back to the user.

    See the tech in action on our demo page or learn more about the API with our documentation. Get access to samples on the cognitive services github page

    https://github.com/Microsoft/ProjectOxford-ClientSDK/tree/master/Speech


    Tuesday, April 5, 2016 6:52 AM
    Moderator
  • Is there a reason you've avoided answering the question?
    Tuesday, April 5, 2016 9:45 AM
  • Bob Voice key activation is simply the ability to use speech to interact with the application, service or program which you develop. Keys are phrases or sentences which prompt a interactive response. So for example Hey cortana is a key which triggers cortana Hope that helps.
    Tuesday, April 5, 2016 10:29 PM
    Moderator
  • Yes, thank you, that helps. Isn't voice key the same as a wakeup word/phrase? What is the support for voice keys? IIRC, there is no API that raises an event if a voice key is heard. In lieu of that feature, my workaround has been to monitor STT streams and search for the voice key. What we really need is an on-device/non-internet API that raises an event when the voice key is heard. The quintessential implementation being Amazon Alexa.
    • Edited by Bob Salita Wednesday, April 6, 2016 9:10 AM
    Wednesday, April 6, 2016 9:09 AM
  • There seems to be some confusion in this thread. Please note that "Voice Key" is under the heading "Windows 10 Speech API," because this is a feature of Windows 10, not the Microsoft Cognitive Services APIs. The good news is that Windows 10 Speech APIs support all Windows-10 based devices including IoThardware, phones, tablets, and PCs.

    Links to the Windows 10 Speech API documentation.


    https://developer.microsoft.com/en-us/windows/speech


    https://msdn.microsoft.com/library/windows.media.speechrecognition.aspx

    Tuesday, April 12, 2016 5:04 AM
    Moderator
  • OK, it's a Windows 10 feature. I don't see it mentioned anywhere. It's not part of the Windows 10 Speech API documentation. So back to the OP -- what is it? Is it a feature, implementation pattern, speech technology jargon? The reason I'm asking is because some of us are wanting a wakeup word feature, like an Amazon Echo's "Alexa". I was hoping that "voice key activation" could somehow contribute to the implementing of a wakeup word feature. It sure sounds synonymous with "wakeup".



    • Edited by Bob Salita Tuesday, April 12, 2016 9:19 AM
    Tuesday, April 12, 2016 9:15 AM
  • Hi Bob

    Sorry, I think I may understand what you mean, apologies over the confusion, we actually call this Cortana Voice Commands within Windows 10 UWP 

    In addition to using voice commands within Cortana to access system features, you can also extend Cortana with features and functionality from your app (as a background task) using voice commands that specify an action or command to execute.

    When an app handles a voice command in the background, it does not take focus. Instead, it returns all feedback and results through the Cortana canvas and the Cortana voice so you add this feature into your Windows 10 App.

    https://msdn.microsoft.com/en-us/windows/uwp/input-and-devices/cortana-interactions

    There are various samples at 

    https://github.com/Microsoft/Windows-universal-samples/blob/master/Samples/CortanaVoiceCommand/README.md

    My Colleague has a number of example on his blog but this is a good example using Windows 10 UWP on a Raspberry Pi to turn on a light with a voice command see https://mtaulty.com/2015/08/03/m_15888/



    Tuesday, May 10, 2016 5:45 PM
    Moderator