MIT Researchers Use AI to Capture Silent Speech

From personal assistant applications to helping people with disabilities speak, voice and speech recognition is one of the most researched areas in AI. Last week, researchers from MIT announced a breakthrough, a deep learning based wearable device that can transcribe words people internally verbalize but do not actually speak out loud.
“Our idea was: Could we have a computing platform that’s more internal, that melds human and machine in some ways and that feels like an internal extension of our own cognition?,” Arnav Kapur, the MIT student who led the development of the new system told MIT News.
The system, called AlterEgo, uses a wearable Bluetooth device that can be paired with a computer or phone, to access the deep learning algorithm.
Using NVIDIA TITAN X GPUs and the cuDNN-accelerated TensorFlow deep learning framework, the researchers trained their model on 31 hours of silently spoken text. The system was designed to identify subvocalized words from neuromuscular signals.
The neural network was tested on 15 people and achieved an accuracy level of 92%, on par with state-of-the-art speech recognition systems which require people to lip-sync their words, the researchers said.

“This allows the user to communicate to their computing devices in natural language without any observable action at all and without explicitly saying anything,” the researchers wrote in their research paper. “Users can silently communicate in natural language and receive aural output, thereby enabling a discreet, bi-directional interface with a computing device, and providing a seamless form of intelligence augmentation,”
The team is collecting more data in the hope of building an application with a much more expansive vocabulary.
Read more >

MIT Researchers Use AI to Capture Silent Speech

Related resources

Tags

About the Authors

MIT Researchers Use AI to Capture Silent Speech

Related resources

Tags

About the Authors

Comments

Related posts

Google Develops ASR System To Help People with Speech Impairments

Experimental AI Powered Hearing Aid Automatically Amplifies Who You Want to Hear

AI Can Interpret and Translate American Sign Language Sentences

AI Researchers Pave the Way For Translating Brain Waves Into Speech

Lip Reading AI More Accurate Than Humans

Related posts

Just Released: NVIDIA Modulus v24.04

New Video Series: OpenUSD for Developers

Generative AI for Digital Humans and New AI-powered NVIDIA RTX Lighting

NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy

Boost Multi-Omics Analysis with GPU-Acceleration and Generative AI