• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Our Services
  • Contact Us
  • Newsletter
  • Top Nav Social Icons

Mobile ID World

Mobile ID World

Identification Revolution

  • Mobile ID
    • What Is Mobile ID?
    • Identity Associations
    • Premier Partners
    • FAQ
  • News
  • Solutions
    • Behavioral
    • Facial Recognition
    • Fingerprint Biometrics
    • Iris Biometrics
    • Second Factor
    • Smart Cards
    • Smartphones
    • Vital
    • Voice
    • Wearable Tech
    • Other
  • Applications
    • Access Control
    • Cloud Technology
    • Commerce
    • Enterprise
    • Healthcare
    • Identification
    • Internet of Things
    • Law Enforcement
    • Strong Online Authentication
  • Exclusive
    • Interviews
    • Featured Articles
    • Podcasts
  • Companies
  • Events

Google Unveils Smaller and Faster Speech Recognition Technology

March 18, 2019

Google Unveils Smaller and Faster Speech Recognition Technology

Google has made some significant improvements to the speech recognizer on its mobile phones. The new software outputs every single character in real time and is entirely contained on the mobile device, which means that the dictation system will work offline with zero latency.

Johan Schalkwyk, a Google Fellow with the company’s Speech Team, explained the new system in a recent AI Blog post. According to Schalkwyk, more conventional speech recognition systems convert speech to text using a sequence that involves three separate steps, beginning with an analysis of an audio sample to identify specific sounds. The software then uses those sounds to form words and a language model to complete the sentence.

The drawback is that those traditional systems require a complete input sequence in order to generate a transcription. Google’s team used Recurrent Neural Network transducer (RNN-T) technology to convert audio input to text output on a character-by-character basis, improving speed by outputting each individual letter instead of a longer word or phrase.

The new platform is also smaller than its predecessors, reducing the speech recognizer footprint from 2 GB to 80 MB. At the former size, speech recognizers are too unwieldly to store on a mobile device and therefore require a network connection in order to function. The new dictation system is small enough to embed on a standard smartphone and will be available to customers on or offline.

For now, the new speech recognizer will only be available in American English on Pixel phones, though Google hopes to launch the service for more languages and devices soon. The announcement is the latest RNN breakthrough for the company’s speech recognition team, which achieved human parity back in 2017.  

Source: Google AI Blog

Filed Under: Industry News Tagged With: AI, Artificial Intelligence, Google, mobile tech, neural networks, Recurrent Neural Network transducer, RNN-T, speech recognition, speech-to-text, virtual assistants

Related News & Articles

IDEX Intensifies China Focus with Latest Biometric Cards Partner

Google Launches Cloud Identity Platform for the Enterprise

CO-OP Expands Utility of CardNav App to Cover Credit and Debit Cards

Primary Sidebar

Register For the Next Virtual Identity Summit

Register now!

Tweets

Sponsored Links

facetec logo

FaceTec’s patented, industry-leading 3D Face Authentication software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ make trusted, remote identity verification finally possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for Liveness and 3D Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

FACEPHI is a global leader in Facial Recognition technology and in Mobile Biometrics technologies. With a strong concentration in the financial sector, FacePhi’s product is rapidly becoming a service used by banks all over the world. Its implementation doesn’t just save money, it is also a way to attract clients and build loyalty, while increasing the security of transactions for both the customer and the business. To learn more about FacePhi, visit https://www.facephi.com/en/

Recent Posts

  • MDL, Digital ID Gain Momentum in State Efforts
  • Brazil-based Selfie Onboarding Startup Reports 250% Sales Jump, Global Expansion
  • ‘All Partners Remain Committed’ to Digital Travel ID Project: Transport Canada
  • North Carolina DMV Seeks Political Support for MDL
  • The Road Ahead for Biometrics and Identity Online Summit

Footer

  • About Us
  • Company Directory
  • Advertise With Us
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • Archives
  • CCPA: Do not sell my personal info.

Follow Us

Copyright © 2023 MobileIDWorld