• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar
  • Skip to footer
  • Our Services
  • Contact Us
  • Newsletter
  • Top Nav Social Icons

Mobile ID World

Mobile ID World

Identification Revolution

  • Mobile ID
    • What Is Mobile ID?
    • Identity Associations
    • Premier Partners
    • FAQ
  • News
  • Solutions
    • Behavioral
    • Facial Recognition
    • Fingerprint Biometrics
    • Iris Biometrics
    • Second Factor
    • Smart Cards
    • Smartphones
    • Vital
    • Voice
    • Wearable Tech
    • Other
  • Applications
    • Access Control
    • Cloud Technology
    • Commerce
    • Enterprise
    • Healthcare
    • Identification
    • Internet of Things
    • Law Enforcement
    • Strong Online Authentication
  • Exclusive
    • Interviews
    • Featured Articles
    • Podcasts
  • Companies
  • Events

Amazon Reduces Alexa Footprint with End-to-End Speech Recognition Models

October 22, 2020

Amazon has confirmed that it is using end-to-end models to improve the speech recognition capabilities of the Alexa platform. With an end-to-end model, the entire speech recognition process can be completed on the device itself, from speech input all the way through to output and transcription. That contrasts with previous versions of Alexa, which processed data in the cloud because the models were too big to install on a standalone device.

Amazon Reduces Alexa Footprint with End-to-End Speech Recognition Models

Those earlier iterations of Alexa broke speech down into multiple components, such as acoustics and the actual language, each of which had to be processed with a separate model. The new version, on the other hand, is able to process speech as a single cohesive entity.

“With an end-to-end model, you end up getting away from having these separate pieces and end up with a combined neural network,” said Automatic Speech Recognition Head Shehzad Mevawalla in an interview with VentureBeat. “You’re going from gigabytes down to less than 100MB in size. That allows us to run these things in very constrained spaces.”

Despite the smaller footprint, the new Alexa model still needs to be paired with an on-device accelerator to deliver the expected performance speeds. With that in mind, Amazon has teamed up with MediaTek to develop the AZ1 Neural Edge processor, which has been deployed in the latest versions of Amazon’s various Echo devices.

According to Mevawalla, end-to-end models have also enhanced Alexa’s ability to identify individual speakers. The Natural Turn Taking feature is able to filter Alexa requests from regular background noise, and to use a camera to determine whether the speaker is directing their comments to Alexa or to a person or another device somewhere else in the room. The feature will still function without a camera, but is more accurate in devices that can capture video.

Mevawalla went on to claim that the use of end-to-end models has improved the accuracy of Alexa by as much as 25 percent. However, Natural Turn Taking will only be available in English when it debuts in 2021.

Amazon recently accredited Kudelski IoT Labs to test products with built-in Alexa capabilities. The tech giant is one of several companies working toward on-device speech and voice recognition. Frost & Sullivan has predicted that car manufacturers will prioritize hybrid voice assistants, while NXP has released a new MCU that will support offline voice recognition in IoT devices.

Source: VentureBeat

Filed Under: Industry News Tagged With: AI, AI assistants, Alexa, Amazon, Artificial Intelligence, conversational AI assistants, speech recognition, voice command technology, voice interaction, voice-based AI assistants

Related News & Articles

Robert Mueller Becomes Zwipe CTO

The Ticket Bank Uses Yoti Tech for Secure Authentication

Deep Learning Leads to Death-proof Liveness Detection for Iris Biometrics

Primary Sidebar

Register For the Next Virtual Identity Summit

Register now!

Tweets

Sponsored Links

facetec logo

FaceTec’s patented, industry-leading 3D Face Authentication software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ make trusted, remote identity verification finally possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for Liveness and 3D Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

FACEPHI is a global leader in Facial Recognition technology and in Mobile Biometrics technologies. With a strong concentration in the financial sector, FacePhi’s product is rapidly becoming a service used by banks all over the world. Its implementation doesn’t just save money, it is also a way to attract clients and build loyalty, while increasing the security of transactions for both the customer and the business. To learn more about FacePhi, visit https://www.facephi.com/en/

Recent Posts

  • Transatlantic Digital Traveler Identity Project Gets High-Profile Tech Partner
  • Digital Identity Tech Demo Online Event
  • Mobile ID Comes to Another US Campus
  • New York DMV Developing Mobile Driver’s License
  • Mobile ID World & FindBiometrics Partner With Access Control Executive Brief – Join Us at ISC West 2023

Footer

  • About Us
  • Company Directory
  • Advertise With Us
  • Contact Us
  • Privacy Policy
  • Terms of Use
  • Archives
  • CCPA: Do not sell my personal info.

Follow Us

Copyright © 2023 MobileIDWorld