Microsoft Reaches Human Parity in Speech Recognition Tech

“The pioneering system was developed using Microsoft’s Computational Network Toolkit, an open source deep learning system upon which Microsoft’s speech recognition system was able to train its neural networks.”

“We’ve reached human parity,” Microsoft’s head speech scientist, Xuedong Huang, has proclaimed, referring to the company’s speech recognition technology.

It’s Xuedong Huang’s assessment of the latest testing from Microsoft Artificial Intelligence and Research, which has announced in a new paper that its speech recognition technology now has attained an equal or better word error rate in comparison to human transcriptionists. The technology’s word error rate has dropped from 6.3 percent to 5.9 percent – the lowest such rate ever recorded using Switchboard, an industry standard test.

The pioneering system was developed using Microsoft’s Computational Network Toolkit, an open source deep learning system upon which Microsoft’s speech recognition system was able to train its neural networks. The research team responsible says its goal now is to improve the system’s functionality in real world settings, with background noise and regional accents challenging its performance.

The work could prove critical going forward, given the rising importance of speech recognition technology. It’s widely expected that such technology could provide the primary user interface for connected devices associated with the Internet of Things, and other major tech companies such as Google and Apple are stepping up their own investments in the technology accordingly.

Sponsored Links

FaceTec’s patented, industry-leading 3D Face Authentication software anchors digital identity, creating a chain of trust from user onboarding to ongoing authentication on all modern smart devices and webcams. FaceTec’s 3D FaceMaps™ make trusted, remote identity verification finally possible. As the only technology backed by a persistent spoof bounty program and NIST/iBeta Certified Liveness Detection, FaceTec is the global standard for Liveness and 3D Face Matching with millions of users on six continents in financial services, border security, transportation, blockchain, e-voting, social networks, online dating and more. www.facetec.com

FACEPHI is a global leader in Facial Recognition technology and in Mobile Biometrics technologies. With a strong concentration in the financial sector, FacePhi’s product is rapidly becoming a service used by banks all over the world. Its implementation doesn’t just save money, it is also a way to attract clients and build loyalty, while increasing the security of transactions for both the customer and the business. To learn more about FacePhi, visit https://www.facephi.com/en/

“The pioneering system was developed using Microsoft’s Computational Network Toolkit, an open source deep learning system upon which Microsoft’s speech recognition system was able to train its neural networks.”

Related News & Articles

Footer

Follow Us