Microsoft Speech Recognition Reaches Human Parity (Again)


Microsoft has reached a new milestone in its speech recognition technology: in a new post on the Microsoft Research Blog, head speech scientist Xuedong Huang announced that the system has achieved a word error rate of 5.1 percent.

The new record bests Microsoft’s achievement last autumn, when its speech recognition technology reached a word error rate of 5.9 percent, and it beats the 5.5 percent word error rate that IBM reported this past March.

Commenting on the achievement, Huang notes that while Microsoft had previously considered 5.9 percent to be human parity for speech recognition, “other researchers conducted their own study, employing a more involved multi-transcriber process, which yielded a 5.1 human parity word error rate.” Hence the celebration of Microsoft’s new 5.1 percent record, affording the company its second opportunity to proclaim that it has achieved human parity in its speech recognition technology.

It is a big deal, to be sure. Speech is becoming an increasingly important user interface on smartphones and devices associated with the Internet of Things, and various companies are investing heavily in technological advancement in this area. With its latest achievement, Microsoft can boast not only of field-leading speech recognition but also of the machine learning and cloud computing tools, namely Microsoft Cognitive Toolkit and Azure GPUs, that helped it reduce its word error rate by 12 percent over the past year.

Source: Microsoft Research Blog


