Google is offering Cloud Text-to-Speech in beta via the Google Cloud Platform at https://cloud.google.com/text-to-speech/, supporting natural-sounding WaveNet voices powered by DeepMind AI. WaveNet, instead of stringing together syllables, "analyzes the waveforms from a huge database of human speech and re-creates them at a rate of 24,000 samples per second," per https://www.theverge.com/2018/3/27/1716 ... nd-wavenet. There are 32 voices in 12 languages and variants, including English, French, Spanish, German, Dutch, Brazilian Portuguese, Turkish, Japanese, and Swedish. The supported voices are listed at https://cloud.google.com/text-to-speech/docs/voices. The generated speech can be saved as .mp3 files, among other formats.
The free tier of usage through the GCP API is up to 1 million characters per month, which sounds to me like more than enough for personal use. I think this is going to be fun to play with.
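For anyone who wants to poke at the API from a script, here is a minimal sketch of what a request might look like. It assumes the REST endpoint `https://texttospeech.googleapis.com/v1/text:synthesize` and a voice name such as `en-US-Wavenet-D`; neither is stated in this post, so check them against the voices list linked above. The helper names `build_request` and `save_audio` are made up for illustration. The sketch only builds the JSON body and decodes a response; actually sending the request needs an API key or other credentials, which are left out here.

```python
import base64
import json

# Assumed REST endpoint for the synthesize call (not given in the post).
SYNTHESIZE_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_request(text, language_code="en-US", voice_name="en-US-Wavenet-D"):
    """Return the JSON body for a synthesize request (hypothetical helper).

    The default voice name is an assumption; pick one from the voices list.
    """
    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": {"audioEncoding": "MP3"},
    }


def save_audio(response_body, path):
    """Decode the base64 'audioContent' field of a response and write it to path.

    Returns the number of bytes written.
    """
    audio = base64.b64decode(response_body["audioContent"])
    with open(path, "wb") as f:
        f.write(audio)
    return len(audio)


if __name__ == "__main__":
    # Print the body that would be POSTed to SYNTHESIZE_URL.
    print(json.dumps(build_request("Hello from Cloud Text-to-Speech"), indent=2))
```

The response is expected to carry the audio as a base64 string, so `save_audio(response, "hello.mp3")` would write a playable file once a real response is in hand.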
new realistic TTS service from Google
-
- Orange Belt
- Posts: 228
- Joined: Sun Feb 26, 2017 4:01 pm
- Languages: English (native); strong reading skills - Russian, Spanish, French, Italian, German, Serbo-Croatian, Macedonian, Bulgarian, Slovene, Farsi; fair reading skills - Polish, Czech, Dutch, Esperanto, Portuguese; beginner/rusty - Swedish, Norwegian, Danish
- x 590
- rdearman
- Site Admin
- Posts: 7255
- Joined: Thu May 14, 2015 4:18 pm
- Location: United Kingdom
- Languages: English (N)
- Language Log: viewtopic.php?f=15&t=1836
- x 23259
- Contact:
Re: new realistic TTS service from Google
Very timely post. I was using AWS Polly for something I needed. I'll see if this gives me more natural speech.
EDIT: WaveNet support seems to be implemented only for English. It isn't available for French.
1 x
Read 150 books in 2024
My YouTube Channel
The Autodidactic Podcast
My Author's Newsletter
I post on this forum with mobile devices, so excuse short msgs and typos.
-
- Orange Belt
- Posts: 228
- Joined: Sun Feb 26, 2017 4:01 pm
- Languages: English (native); strong reading skills - Russian, Spanish, French, Italian, German, Serbo-Croatian, Macedonian, Bulgarian, Slovene, Farsi; fair reading skills - Polish, Czech, Dutch, Esperanto, Portuguese; beginner/rusty - Swedish, Norwegian, Danish
- x 590
Re: new realistic TTS service from Google
That's too bad - I must have misread the list of supported languages. Thanks for the correction. The list does say that it includes both standard and WaveNet voices. I hope they expand the WaveNet support soon.
I've been pretty happy with AWS Polly so far, though.
0 x
-
- White Belt
- Posts: 14
- Joined: Wed May 03, 2017 8:40 pm
- Location: CH
- Languages: PL (N), EN (C2), FR (C2), ES (C1); Studying DE, RU, euPT, ελ
- x 23
Re: new realistic TTS service from Google
https://cloud.google.com/text-to-speech/docs/voices
WaveNet is now available in many languages, and it is very impressive imho. Quality varies, but Polish is uncanny valley and much better than older commercial engines.
Pricing is reasonable, with 1 million characters free each month and $16 per million characters above the limit. The standard TTS is also very good and much cheaper.
A modified Awesome TTS addon (https://ankiweb.net/shared/info/814349176) already makes use of this functionality. Whoever made the addon estimates that the free limit is enough for 23 hours of audio every month (with some limits on the size of an individual query).
This could be great for languages with few audiobooks - EuPortuguese, I'm looking at you!
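To put the numbers above in perspective, here is a rough back-of-the-envelope sketch. It takes the figures quoted in this post at face value (1 million free characters per month, $16 per million WaveNet characters after that, and the addon author's estimate of 23 hours of audio per million characters) and assumes billing is simply linear per character, which may not match the actual price list. The function names are made up for illustration.

```python
FREE_CHARS_PER_MONTH = 1_000_000   # free tier quoted in the post
WAVENET_USD_PER_MILLION = 16.0     # WaveNet rate quoted in the post
HOURS_PER_FREE_TIER = 23           # addon author's estimate for 1M characters


def monthly_cost_usd(chars_used, usd_per_million=WAVENET_USD_PER_MILLION):
    """Estimated monthly bill, assuming only characters above the free tier are charged."""
    billable = max(0, chars_used - FREE_CHARS_PER_MONTH)
    return billable * usd_per_million / 1_000_000


def estimated_hours(chars_used):
    """Rough audio duration, scaling the 23-hours-per-million estimate linearly."""
    return chars_used * HOURS_PER_FREE_TIER / FREE_CHARS_PER_MONTH


if __name__ == "__main__":
    for chars in (500_000, 1_000_000, 2_500_000):
        print(chars, "chars:", monthly_cost_usd(chars), "USD,",
              estimated_hours(chars), "hours")
```

Under these assumptions, 2.5 million characters in a month would cost about $24 and come to roughly 57 hours of audio, so a whole audiobook's worth of text stays fairly cheap.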
3 x
- tussentaal
- White Belt
- Posts: 23
- Joined: Sun Feb 14, 2016 8:21 pm
- x 22
Re: new realistic TTS service from Google
I find Nuance/Vocalizer voices more natural-sounding.
0 x
ge hebt ne vriend...