Google’s new AI system can articulate like humans @ Tacotron 2
‘Tacotron 2’ delivers speech that matches human voice
In a major step towards its “AI first” dream,
Google has developed a text-to-speech artificial intelligence (AI) system that
will confuse you with its human-like articulation.
The tech giant’s text-to-speech system called
“Tacotron 2” delivers an AI-generated computer speech that almost matches with
the voice of humans, technology news website Inc.com reported.
At Google I/O 2017 developers conference, the
company’s CEO Sundar Pichai announced that the internet giant was shifting its
focus from mobile-first to “AI first” and launched several products and
features, including Google Lens, Smart Reply for Gmail and Google Assistant for
iPhone.
According to a paper published in arXiv.org,
the system first creates a spectrogram of the text, a visual representation of
how the speech should sound.
That image is put through Google’s WaveNet
algorithm, which uses the image and brings AI closer than ever to mimicking
human speech. It can easily learn different voices and even generates
artificial breaths. “Our model achieves a mean opinion score (MOS) of 4.53
comparable to a MOS of 4.58 for professionally recorded speech,” the
researchers were quoted as saying.
Source | The Hindu | 2nd January 2018
Regards
Pralhad
Jadhav
Senior Manager @
Knowledge Repository
Khaitan &
Co
Twitter Handle | @Pralhad161978
No comments:
Post a Comment