Cloud Polly turns any text into lifelike speech, so you can create media content such as audiobooks, podcasts, and voice content, build applications that talk, and develop entirely new categories of speech-enabled products. Cloud Polly’s Text-to-Speech (TTS) service uses the deep learning technologies of leading cloud service providers, including Amazon Web Services, Microsoft Azure, Google Cloud Platform, and IBM Cloud, to synthesize natural-sounding human speech. With over 630 lifelike voices across more than 70 languages, you can build speech-enabled applications that work in many different countries.
In addition to Standard TTS voices, Cloud Polly offers Neural Text-to-Speech (NTTS) voices, which improve speech quality through a new machine learning approach. Depending on the cloud vendor, most of Cloud Polly’s Neural TTS voices also support distinct speaking styles that let you match the delivery style of the speaker to the application. For example, a Newscaster reading style is tailored to news narration use cases, and a Conversational speaking style is ideal for two-way communication such as telephony applications.
Use SSML tags to add voice effects such as adjusting pitch, volume, speed, and emphasis, or beeping out words or phrases, to name a few. The full list of tags supported by each voice is shown in the demo when you select that voice.
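
To give a feel for what such markup looks like, here is a minimal Python sketch that assembles an SSML string using the standard W3C `<speak>`, `<prosody>`, and `<emphasis>` elements. The helper names (`prosody`, `emphasis`, `speak`) are hypothetical and are not part of any Cloud Polly API. The attribute values each voice accepts depend on the underlying cloud vendor, so treat the values below as assumptions and check them against the demo. Vendor-specific effects such as beep-outs are not shown.

```python
from xml.sax.saxutils import escape, quoteattr
import xml.etree.ElementTree as ET


def prosody(text, pitch=None, rate=None, volume=None):
    """Wrap text in a <prosody> element; only the attributes given are emitted."""
    attrs = {"pitch": pitch, "rate": rate, "volume": volume}
    attr_str = "".join(
        f" {name}={quoteattr(value)}"
        for name, value in attrs.items()
        if value is not None
    )
    # escape() protects characters such as & and < inside the spoken text.
    return f"<prosody{attr_str}>{escape(text)}</prosody>"


def emphasis(text, level="moderate"):
    """Wrap text in an <emphasis> element with the given level."""
    return f"<emphasis level={quoteattr(level)}>{escape(text)}</emphasis>"


def speak(*fragments):
    """Join already-built SSML fragments inside a single <speak> root."""
    return "<speak>" + " ".join(fragments) + "</speak>"


ssml = speak(
    prosody("Welcome back.", pitch="+10%", rate="slow", volume="loud"),
    emphasis("Terms & conditions apply.", level="strong"),
)

# Parsing the result confirms that the markup is well-formed XML.
ET.fromstring(ssml)
print(ssml)
```

The resulting string can be pasted into any TTS input that accepts SSML. Escaping the text before wrapping it matters because a stray `&` or `<` would otherwise make the document invalid and is likely to be rejected by the synthesizer.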