Michelle Burbick
Michelle Burbick is the Associate Editor and a blogger for No Jitter, UBM Tech's online community for news and analysis...
Read Full Bio >>

Michelle Burbick | August 14, 2017 |


Google Boosts Cloud Speech API

Google Boosts Cloud Speech API New features and additional language support extends capabilities and reach of company's speech conversation technology.

New features and additional language support extends capabilities and reach of company's speech conversation technology.

Google has been busy upgrading its Cloud Speech API to better meet a growing imperative for converting speech to text at cloud scale for improved user experiences.

Google previously used the speech-to-text technology exclusively in Google Now and other of its applications, as well as for search. But last spring the company opened up its speech technology to the masses, beginning with alpha testing of the Cloud Speech API in April 2016 and moving to GA this past April.

"Through the cloud, we're democratizing speech for businesses, from small startups to large enterprises," said Dan Aharon, Google Cloud product manager.

Thousands of businesses already are using the Cloud Speech API, and the number is growing every month, Aharon said. Based on customer feedback Google has been collecting over the past year through various channels, today it announced a few enhancements meant to address common demand.

The enhancements come in three parts. First, Google has added timestamp information for each word in a transcript -- the No. 1 most requested feature, Aharon said. Timestamps on a word-by-word level allow users to jump to the moment in the audio where the text was spoken, or display the relevant text while the audio is playing, he explained.

A couple of businesses already making use of this functionality are Happy Scribe, which uses the API to provide a voice-to-text transcription service, and Voximplant, which uses the API to help other companies build voice and video applications, Google announced. In prepared statements, both companies have commented that this feature allows them to save considerable time in working with speech-to-text transcriptions and pass along that time saved to their business customers.

In addition, Google has extended the length of the audio file supported. Cloud Speech API previously worked with audio files than ran no longer than 80 minutes, but now supports files of up to three hours in length. Further, customers who have even longer files can apply for a quota extension through Cloud Support, for approval on a case-by-case basis.

Lastly, Google has added support for 30 additional languages, including Bengali, Latvian, and Swahili, bringing the total to 119 and representing a billion additional potential users, Aharon said.

A motivating factor for the language extension is to aid in the economic development of more countries, Aharon said. By exposing an additional billion people to the product, Google is hopeful it can help existing customers reach new audiences and expose people in those countries to technology previously unavailable to them, he added.

In a demo of the technology Aharon gave, Cloud Speech API performed in real time to 100% accuracy.

Going forward, Google will have additional speech technology-related news, including market innovations, later this year, Aharon hinted.

Overall, Google is working on "strengthening our cloud portfolio with an eye on bringing AI to customers," Aharon said. He explained that there are two main spaces Google sees for the use of AI through speech recognition: human-computer interaction and speech analytics on human-to-human interactions.

Regarding AI, Google sees a wide range of enterprise readiness, which is often dependent on how regulated an industry is. Highly regulated businesses like those in healthcare and financial services tend to be more conservative or laggards, while startups and those in less regulated spaces are more likely to be first to market with AI-enabled solutions. In particular, Google sees high demand for AI and speech technology in the contact center, being leveraged to improve the customer experience.

Follow Michelle Burbick and No Jitter on Twitter!


March 7, 2018

Video collaboration is experiencing significant change and innovation-how can your enterprise take advantage? In this webinar, leading industry analyst Ira Weinstein will present detailed analysis

February 21, 2018

Business agility has become the strongest driver for enterprises to begin migrating their communications to the cloud-and its a benefit that enterprises are already realizing. To gain this benefit

February 7, 2018

Enterprises are starting to grasp the critical importance of security and compliance in their team collaboration deployments. And once the risks are mitigated, your enterprise can integrate these n

March 12, 2018
An effective E-911 implementation doesn't just happen; it takes a solid strategy. Tune in for tips from IT expert Irwin Lazar, of Nemertes Research.
March 9, 2018
IT consultant Steve Leaden lays out the whys and how-tos of getting the green light for your convergence strategy.
March 7, 2018
In advance of his speech tech tutorial at EC18, communications analyst Jon Arnold explores what voice means in a post-PBX world.
February 28, 2018
Voice engagement isn't about a simple phone call any longer, but rather a conversational experience that crosses from one channel to the next, as Daniel Hong, a VP and research director with Forrester....
February 16, 2018
What trends and technologies should you be up on for your contact center? Sheila McGee-Smith, Contact Center & Customer Experience track chair for Enterprise Connect 2018, gives us the lowdown.
February 9, 2018
Melanie Turek, VP of connected work research at Frost & Sullivan, walks us through key components -- and sticking points -- of customer-oriented digital transformation projects.
February 2, 2018
UC consultant Marty Parker has crunched lots of numbers evaluating UC options; tune in for what he's learned and tips for your own analysis.
January 26, 2018
Don't miss out on the fun! Organizer Alan Quayle shares details of his pre-Enterprise Connect hackathon, TADHack-mini '18, showcasing programmable communications.
December 20, 2017
Kevin Kieller, partner with enableUC, provides advice on how to move forward with your Skype for Business and Teams deployments.
December 20, 2017
Zeus Kerravala, principal analyst with ZK Research, shares his perspective on artificial intelligence and the future of team collaboration.
December 20, 2017
Delanda Coleman, Microsoft senior marketing manager, explains the Teams vision and shares use case examples.
November 30, 2017
With a ruling on the FCC's proposed order to dismantle the Open Internet Order expected this month, communications technology attorney Martha Buyer walks us through what's at stake.
October 23, 2017
Wondering which Office 365 collaboration tool to use when? Get quick pointers from CBT Nuggets instructor Simona Millham.
September 22, 2017
In this podcast, we explore the future of work with Robert Brown, AVP of the Cognizant Center for the Future of Work, who helps us answer the question, "What do we do when machines do everything?"
September 8, 2017
Greg Collins, a technology analyst and strategist with Exact Ventures, delivers a status report on 5G implementation plans and tells enterprises why they shouldn't wait to move ahead on potential use ....
August 25, 2017
Find out what business considerations are driving the SIP trunking market today, and learn a bit about how satisfied enterprises are with their providers. We talk with John Malone, president of The Ea....
August 16, 2017
World Vision U.S. is finding lots of goodness in RingCentral's cloud communications service, but as Randy Boyd, infrastructure architect at the global humanitarian nonprofit, tells us, he and his team....
August 11, 2017
Alicia Gee, director of unified communications at Sutter Physician Services, oversees the technical team supporting a 1,000-agent contact center running on Genesys PureConnect. She catches us up on th....
August 4, 2017
Andrew Prokop, communications evangelist with Arrow Systems Integration, has lately been working on integrating enterprise communications into Internet of Things ecosystems. He shares examples and off....
July 27, 2017
Industry watcher Elka Popova, a Frost & Sullivan program director, shares her perspective on this acquisition, discussing Mitel's market positioning, why the move makes sense, and more.
July 14, 2017
Lantre Barr, founder and CEO of Blacc Spot Media, urges any enterprise that's been on the fence about integrating real-time communications into business workflows to jump off and get started. Tune and....
June 28, 2017
Communications expert Tsahi Levent-Levi, author of the popular blog, keeps a running tally and comprehensive overview of communications platform-as-a-service offerings in his "Choosing a W....
June 9, 2017
If you think telecom expense management applies to nothing more than business phone lines, think again. Hyoun Park, founder and principal investigator with technology advisory Amalgam Insights, tells ....
June 2, 2017
Enterprises strategizing on mobility today, including for internal collaboration, don't have the luxury of learning as they go. Tony Rizzo, enterprise mobility specialist with Blue Hill Research, expl....
May 24, 2017
Mark Winther, head of IDC's global telecom consulting practice, gives us his take on how CPaaS providers evolve beyond the basic building blocks and address maturing enterprise needs.
May 18, 2017
Diane Myers, senior research director at IHS Markit, walks us through her 2017 UC-as-a-service report... and shares what might be to come in 2018.
April 28, 2017
Change isn't easy, but it is necessary. Tune in for advice and perspective from Zeus Kerravala, co-author of a "Digital Transformation for Dummies" special edition.
April 20, 2017
Robin Gareiss, president of Nemertes Research, shares insight gleaned from the firm's 12th annual UCC Total Cost of Operations study.
March 23, 2017
Tim Banting, of Current Analysis, gives us a peek into what the next three years will bring in advance of his Enterprise Connect session exploring the question: Will there be a new model for enterpris....
March 15, 2017
Andrew Prokop, communications evangelist with Arrow Systems Integration, discusses the evolving role of the all-important session border controller.
March 9, 2017
Organizer Alan Quayle gives us the lowdown on programmable communications and all you need to know about participating in this pre-Enterprise Connect hackathon.
March 3, 2017
From protecting against new vulnerabilities to keeping security assessments up to date, security consultant Mark Collier shares tips on how best to protect your UC systems.
February 24, 2017
UC analyst Blair Pleasant sorts through the myriad cloud architectural models underlying UCaaS and CCaaS offerings, and explains why knowing the differences matter.
February 17, 2017
From the most basics of basics to the hidden gotchas, UC consultant Melissa Swartz helps demystify the complex world of SIP trunking.
February 7, 2017
UC&C consultant Kevin Kieller, a partner at enableUC, shares pointers for making the right architectural choices for your Skype for Business deployment.
February 1, 2017
Elka Popova, a Frost & Sullivan program director, shares a status report on the UCaaS market today and offers her perspective on what large enterprises need before committing to UC in the cloud.
January 26, 2017
Andrew Davis, co-founder of Wainhouse Research and chair of the Video track at Enterprise Connect 2017, sorts through the myriad cloud video service options and shares how to tell if your choice is en....
January 23, 2017
Sheila McGee-Smith, Contact Center/Customer Experience track chair for Enterprise Connect 2017, tells us what we need to know about the role cloud software is playing in contact centers today.