By using Watson Speech Recognition (SR) plugin to UniMRCP Server, IVR platforms can utilize IBM Watson Speech to Text API via the industry-standard Media Resource Control Protocol (MRCP) version 1 and 2.
IBM Watson Speech to Text API performs speech transcription powered by machine learning and supporting the following main features.
Automatically transcribe audio in real-time. Rapidly identify and transcribe what is being discussed, even from lower quality audio, across a variety of audio formats and programming interfaces.
Customize your model to improve accuracy for language and content you care most about, such as product names, sensitive subjects or names of individuals. Recognizes different speakers in your audio Spot specified keywords in real-time with high accuracy and confidence.
Transcribe audio for various use cases ranging from real-time transcription for audio from a microphone, to analyzing 1000s of audio recording from your call center to provide meaningful analytics.
The speech recognition API currently supports 7 languages.
By using Watson Speech Synthesis (SS) plugin to UniMRCP Server, IVR platforms can utilize IBM Watson Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP) version 1 and 2.
IBM Watson Text to Speech API performs text to speech conversion supporting the following main features.
Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.
The text to speech API supports a variety of languages.
By supporting SSML 1.0, the API allows changing certain characteristics of generated voice output like speaking rate, volume and pronunciation.
The text to speech API supports a variety of different male and female voices.