Tuesday, January 2, 2018

Google’s new AI system can articulate like humans @ Tacotron 2



‘Tacotron 2’ delivers speech that matches human voice

 

In a major step towards its “AI first” dream, Google has developed a text-to-speech artificial intelligence (AI) system whose human-like articulation can be mistaken for a real speaker.

The tech giant’s text-to-speech system, called “Tacotron 2”, generates computer speech that almost matches the human voice, technology news website Inc.com reported.
At the Google I/O 2017 developer conference, the company’s CEO Sundar Pichai announced that the internet giant was shifting its focus from mobile-first to “AI first” and launched several products and features, including Google Lens, Smart Reply for Gmail and Google Assistant for iPhone.

According to a paper published on arXiv.org, the system first generates a spectrogram from the text, a visual representation of how the speech should sound.

That spectrogram is then passed to Google’s WaveNet algorithm, which turns it into audio, bringing AI closer than ever to mimicking human speech. The system can easily learn different voices and even generates artificial breaths. “Our model achieves a mean opinion score (MOS) of 4.53 comparable to a MOS of 4.58 for professionally recorded speech,” the researchers were quoted as saying.
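
For readers unfamiliar with the term, the sketch below shows roughly what a spectrogram is: a grid of frequency strengths measured over short, overlapping slices of a sound. It is not Tacotron 2's code, and it goes in the opposite direction (from audio to spectrogram) purely for illustration. The function name `magnitude_spectrogram`, the frame sizes and the 1000 Hz test tone are all assumptions made for this example. Real systems would normally use an optimised library rather than this plain-Python transform.

```python
import cmath
import math


def magnitude_spectrogram(waveform, frame_length=64, hop_length=32):
    """Return a list of rows, one per time slice, each holding the
    strength of every frequency bin in that slice (hypothetical helper)."""
    # Hann window: fades each slice in and out to reduce edge artifacts.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_length)
              for n in range(frame_length)]
    num_bins = frame_length // 2 + 1
    spectrogram = []
    for start in range(0, len(waveform) - frame_length + 1, hop_length):
        frame = [waveform[start + n] * window[n] for n in range(frame_length)]
        row = []
        for k in range(num_bins):
            # Discrete Fourier transform of the slice at frequency bin k.
            total = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / frame_length)
                for n in range(frame_length)
            )
            row.append(abs(total))
        spectrogram.append(row)
    return spectrogram


# Illustrative input: a pure 1000 Hz tone sampled at 8000 Hz (assumed values).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(1024)]

spec = magnitude_spectrogram(tone)

# Find the frequency bin that carries the most energy across all slices.
energy_per_bin = [sum(row[k] for row in spec) for k in range(len(spec[0]))]
peak_bin = energy_per_bin.index(max(energy_per_bin))
peak_hz = peak_bin * sample_rate / 64  # 64 is the default frame_length

print(len(spec), "time slices x", len(spec[0]), "frequency bins")
print("strongest frequency:", peak_hz, "Hz")
```

Plotted as an image, with time on one axis and frequency on the other, such a grid is the kind of "picture of sound" the article refers to.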

Source | The Hindu | 2nd January 2018

Regards

Pralhad Jadhav  

Senior Manager @ Knowledge Repository  
Khaitan & Co 



Twitter Handle | @Pralhad161978
