Categories: Technology

Microsoft demonstrates “breakthrough” speech translation technology

Microsoft has posted a remarkable video demonstrating speech-to-speech language translation where spoken English is translated directly into Chinese and read aloud by machine in near real-time.

Microsoft’s Chief Research Officer Rick Rashid demonstrated the “breakthrough” speech translation technology on stage last month at Microsoft Research Asia’s 21t Century Computing event in Tianjin, China. Rashid also shares some background information on the history of automated speech translation in this guest post.

Rashid describes the advances made in speech technology to date, from simple waveform pattern matching first developed almost 60 years ago, to a technique known as hidden Markov developed in the late 1970’s.

The technology demonstrated by Rashid uses a technique called Deep Neural Networks, developed by Microsoft Research and the University of Toronto two years ago. Deep Neural Networks is a technique “patterned after human brain behaviour” and provides the ability to “train more discriminative and better speech recognisers” than older methods.

“We have been able to reduce the word error rate for speech by over 30% compared to previous methods. This means that rather than having one word in 4 or 5 incorrect, now the error rate is one word in 7 or 8. While still far from perfect, this is the most dramatic change in accuracy since the introduction of hidden Markov modeling in 1979, and as we add more data to the training we believe that we will get even better results.”

What’s even more interesting is that the Chinese language translation demonstrated mimics Rashid’s own voice, made possible by recording an hour’s worth of English speech data from Rashid prior to the presentation.

As Rashid quips, Star Trek’s universal translator is a reality now more so than ever.

Albizu Garcia

Albizu Garcia is the Co-Founder and CEO of Gain -- a marketing technology company that automates the social media and content publishing workflow for agencies and social media managers, their clients and anyone working in teams.

View Comments

Recent Posts

DARPA ‘Generative Optogenetics (GO)’ seeks to program biology using light, could aid in ‘extended human spaceflight’

Apart from 'extended human spaceflight' for what other purposes could DARPA GO serve? perspective DARPA…

7 hours ago

Competing in the post-gatekeeper era: How the DMA is rewiring platforms, security, and market access

The Digital Markets Act (DMA) has joined the General Data Protection Regulation (GDPR) as one…

3 days ago

Horasis India Meeting to Spotlight India’s Global Ascent At Singapore Summit This Month

Amid several years of shifting global dynamics, it’s become increasingly clear that we are entering…

4 days ago

AI scams targeting businesses are surging: Here are the top 3 threats your team is likely to face in 2026 (Brains Byte Back Podcast)

Imagine a company interviewing a candidate for a senior IT role. The résumé checks out,…

5 days ago

AI Won’t Scale in Advertising Until Trust Does: How to Identify AI Tools That Deliver Quality Security and Expertise

At the start of the year, data suggested that only about a third of agencies,…

5 days ago

What It Means When Algorithms Say “I”: Toward a Theory of Digital Subjectivity

Picture an AI assistant you have worked with for the past five years. It knows…

5 days ago