Thursday, January 4, 2018

AI now has a human style voice.

Home / Technology / Google Develops Voice AI That Is Indistinguishable From Humans | Tacotron 2

Google Develops Voice AI That Is Indistinguishable From Humans | Tacotron 2

in Technology January 3, 2018

[Estimated read time: 3 minutes]

Google develops Tecotron 2 that makes machine generated speech sound less robotic and more like a human.
They used neural networks trained on text transcripts and speech examples.
The system synthesizes speech with WaveNet-level audio quality and Tacotron-level prosody.

Research on generating natural speech from a given text (text-to-speech synthesis, TTS) has been going on for decades. In a last few couple of years, there has been impressive progress.

You are familiar with Google voice service, it’s available in both male and female voices. The robotic voice is a staple in our culture, like Microsoft’s Cortana or Apple’s Siri. As the years have gone by Google’s AI voice has started to sound less robotic and more like a human. And now, it is almost indistinguishable from humans.

Google engineers incorporated ideas from past work like WaveNet and Tacotron, and enhanced the techniques to end up with new system, Tecotron 2. In order to achieve human-like speech, they used neural networks trained on only text transcripts and speech examples, rather than using any complicated linguistic and acoustic features as input.

Model Architecture

The system contains two main components –

A recurrent sequence-to-sequence feature prediction network optimized for TTS to map sequence of letters to a sequence of features, encoding the audio.
An improved version of WaveNet that produces time-domain waveform samples based on the predicted spectrogram frames.

Tacotron 2’s model architecture

The sequence-to-sequence model features an 80 dimensional audio spectrogram (with frames measured every 12.5 milliseconds) that captures words, speed, volume and intonation. These features are eventually converted into 16-bit samples at 24 kHz waveform using an enhanced-WaveNet version.

The resulting system synthesizes speech with WaveNet-level audio quality and Tacotron-level prosody. It can be trained on data without relying on any complicated feature engineering, and accomplishes state-of-the-art sound quality very close to that of natural human voice.

Unlike other core artificial intelligence research the company does, this technology is immediately useful to Google. For instance, first appeared in 2016, WaveNet is now used in Google Assistant. Tacotron 2 would be a more powerful addition to the service.

Reference: arXiv | 1712.05884

Audio Samples

Below, we have attached some samples. Each sentence is generated by artificial intelligence program and the other is a human. Can you figure out which one is AI?

“That girl did a video about Star Wars lipstick.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

“George Washington was the first President of the United States.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

“She earned a doctorate in sociology at Columbia University.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

In an evaluation, Google asked humans to rate the naturalness of the speech. The model achieved a Mean Opinion Score (MOS) of 4.53 comparable to 4.58 MOS for professionally recorded speech.

More Samples: Google.Github.io

Additional Capabilities of Tacotron 2

It can pronounce complex and out-of-the-context words.

“Basilar membrane and otolaryngology are not auto-correlations.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

It takes care of spelling errors.

“This is really awesome!”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

It learns stress and intonation (capitalizing words changes the overall intonation).

“The buses aren’t the problem, they actually provide a solution.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

“The buses aren’t the PROBLEM, they actually provide a SOLUTION.”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

It is good at tongue twisters.

“Peter Piper picked a peck of pickled peppers. How many pickled peppers did Peter Piper pick?”

Audio Player

00:00

00:00

Use Up/Down Arrow keys to increase or decrease volume.

Limitations

The sample sounds great, but there are still a few problems to be solved. The system faces issues while pronouncing complicated words like “merlot” and “decorum”. In extreme cases, it randomly creates strange noises.

Read: Google’s Artificial Intelligence Creates an AI That Beats Human Code

For now, the system can’t generate audio in realtime and generated speech can’t be controlled, like directing it to sound sad or happy. Furthermore, it is only trained to mimic a female voice; to speak like another female or like a male, developers would need to train the system again.

Wednesday, January 3, 2018

The CO2 level is higher this year! Apologize to the children,

Daily CO2

December 30, 2017: 407.35 ppm

December 30, 2016: 404.92 ppm

November CO2
November 2017:  405.14 ppm

November 2016:  403.53 ppm

October Temperature

4th Warmest October since 1880: 2017 & 2003

Coolest October since 1880: 1908 & 1912

Earth's CO2 Home Page

Atmospheric CO2

November 2017

405.14parts per million (ppm)

Mauna Loa Observatory, Hawaii (NOAA-ESRL)

Preliminary data released December 5, 2017

CO2.Earth is live!!

CO2.Earth has Launched!

November 13, 2013

CO2.Earth is now live. I am proud that it is one of the very first websites on the internet with a .earth domain. The first .earth site to launch—democracy.earth—happened last week. This week, CO2.Earth is the site that's rolling out, just before .earth domains open for public registration on December 19, 2015.

Also, just in time for the international climate summit in Paris, CO2.Earth takes over global redistribution of CO2 data from CO2Now.org.

CO2.Earth is here to track the atmospheric CO2 trend along with you. Any time you want an update for earth's planetary vital signs, CO2.Earth points to the latest numbers.

Michael McGee
Producer, CO2.Earth
Vancouver Island, Canada

P.S. Please note that some articles and the set up of CO2 web widgets are still being completed.

Media

Interlnk via PR Newsire Popular citizen sustainability site relaunching on new .earth domain

CO2.Earth via PR Web Global public gest new site to track atmospheric CO2

CO2.Earth Backgrounder

CO2.Earth Media Releases + Media Room

Keeling Curve Monthly

Atmospheric CO2

CO2 Data

NOAA-ESRL Trends in atmospheric CO2

Scripps UCSD Keeling Curve + Scripps CO2 Program

CO2.earth (reposted data) Daily CO2 | Weekly CO2 | Monthly CO2 | Annual CO2

CO2.earth Track The Trend

Show the Trend

Show.earth Add a 'KC Monthly' CO2 widget to your site or blog

Global Warming Update

October Global Temperature Change*

Rankings: October 1880 - October 2017
Comparisons with 20th Century Global Average Surface Temperature
(Temperatures are not compared with a pre-industrial baseline)

Rank	Year	Change in Temperature*
Warmest October	2015	+1.0°C +1.8°F
4th Warmest October	2017, 2003 (tie)	+0.73°C +1.31°F
Coolest October	1908, 1912 (tie)	-0.52°C -0.94°F
		Data retrieved: December 5, 2017

*Surface temperature changes relative to 20th Century global average (1901 - 2000)
Source data NOAA-NCEI State of the Climate: Global Analysis [Web + data download]

The combined average temperature over global land and ocean surfaces for October 2017 was 0.73°C (1.31°F) above the 20th century average of 14.0°C (57.1°F). This value tied with 2003 as the fourth highest October temperature on record since global records began in 1880, behind 2015 (+1.0°C / +1.8°F), 2014 (+0.79°C / +1.42°F), and 2016 (+0.74°C / +1.33°F). The 10 warmest Octobers on record have all occurred during the 21st century, specifically since 2003. October 2017 also marks the 41st consecutive October and the 394th consecutive month with temperatures at least nominally above the 20th century average. [NOAA global analysis accessed December 5, 2017].

"The science is sobering—the global temperature in 2012 was among the hottest since records began in 1880. Make no mistake: without concerted action, the very future of our planet is in peril."

~ Christine Lagarde, Managing Director
International Monetary Fund
[video][text]

NOAA's global analysis: "2016 became the warmest year in NOAA's 137-year series. Remarkably, this is the third consecutive year a new global annual temperature record has been set. The average global temperature across land and ocean surface areas for 2016 was 0.94°C (1.69°F) above the 20th century average of 13.9°C (57.0°F), surpassing the previous record warmth of 2015 by 0.04°C (0.07°F). The global temperatures in 2016 were majorly influenced by strong El Niño conditions that prevailed at the beginning of the year.

This marks the fifth time in the 21st century a new record high annual temperature has been set (along with 2005, 2010, 2014, and 2015) and also marks the 40th consecutive year (since 1977) that the annual temperature has been above the 20th century average. To date, all 16 years of the 21st century rank among the seventeen warmest on record (1998 is currently the eighth warmest.) The five warmest years have all occurred since 2010.

Overall, the global annual temperature has increased at an average rate of 0.07°C (0.13°F) per decade since 1880 and at an average rate of 0.17°C (0.31°F) per decade since 1970." [NOAA global analysis for 2016accessed March 6, 2017].

"Globally-averaged temperatures in 2015 shattered the previous mark set in 2014 by 0.23 degrees Fahrenheit (0.13 Celsius). Only once before, in 1998, has the new record been greater than the old record by this much."

~ NASA Goddard Institute for Space Studies [NASA post of January 20, 2016]

Before the end of 2015, scientists projected that average global temperature increase for 2015 will exceed 1°C above pre-industrial levels. The years 1850-1990 are used as the pre-industrial baseline by the MET Office and Climate Research Unit at the University of East Anglia in the UK. The MET Office released this statement in November 2015:

"This year marks an important first but that doesn't necessarily mean every year from now on will be a degree or more above pre-industrial levels, as natural variability will still play a role in determining the temperature in any given year. As the world continues to warm in the coming decades, however, we will see more and more years passing the 1 degree marker - eventually it will become the norm."

~ Peter Stott
Head of Climate Monitoring and Attribution (MET Office)

CO2 Past. CO2 Present. CO2 Future.

Subscribe to: Posts (Atom)