October 13, 2017

Augmented tongue ultrasound for speech therapy

A team of researchers in the GIPSA-Lab (CNRS/Université Grenoble Alpes/Grenoble INP) and at INRIA Grenoble Rhône-Alpes has developed a system that can display the movements of tongues in real time. Captured using an ultrasound probe placed under the jaw, these movements are processed by a machine-learning algorithm that controls an "articulatory talking head." As well as the face and lips, this avatar shows the tongue, palate and teeth, which are usually hidden inside the vocal tract. This "visual biofeedback" system, which ought to be easier to understand and therefore should produce better correction of pronunciation, could be used for speech therapy and for learning foreign languages. This work is published in the October 2017 issue of Speech Communication.

For a person with an articulation disorder, speech therapy partly uses repetition exercises: the practitioner qualitatively analyzes the patient's pronunciations and orally explains, using drawings, how to place articulators, particularly the tongue: something patients are generally unaware of. How effective therapy is depends on how well the patient can integrate what they are told. It is at this stage that "visual biofeedback" systems can help. They let patients see their articulatory movements in real time, and in particular how their tongues move, so that they are aware of these movements and can correct pronunciation problems faster.

For several years, researchers have been using ultrasound to design biofeedback systems. The image of the tongue is obtained by placing under the jaw a probe similar to that used conventionally to look at a heart or fetus. This image is sometimes deemed to be difficult for a patient to use because it is not very good quality and does not provide any information on the location of the palate and teeth. In this new work, the present team of researchers propose to improve this visual feedback by automatically animating an articulatory talking head in real time from ultrasound images. This virtual clone of a real speaker, in development for many years at the GIPSA-Lab, produces a contextualized—and therefore more natural—visualization of articulatory movements.

Credit: CNRS

The strength of this new system lies in a machine learning algorithm that researchers have been working on for several years. This algorithm can (within limits) process articulatory movements that users cannot achieve when they start to use the system. This property is indispensable for the targeted therapeutic applications. The algorithm exploits a probabilistic model based on a large articulatory database acquired from an "expert" speaker capable of pronouncing all of the sounds in one or more languages. This model is automatically adapted to the morphology of each new user, over the course of a short system calibration phase, during which the patient must pronounce a few phrases.

This system, validated in a laboratory for healthy speakers, is now being tested in a simplified version in a clinical trial for patients who have had tongue surgery. The researchers are also developing another version of the system, where the articulatory talking head is automatically animated, not by ultrasounds, but directly by the user's voice.

More information: Thomas Hueber et al. Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2015). DOI: 10.1109/TASLP.2015.2464702

Provided by CNRS

Citation: Augmented tongue ultrasound for speech therapy (2017, October 13) retrieved 12 July 2024 from https://phys.org/news/2017-10-augmented-tongue-ultrasound-speech-therapy.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Baboon vocalizations contain five vowel-like sounds comparable to those of human speech

30 shares

Feedback to editors

Real-life 'stillsuit': Dune-inspired upgrade for spacesuits allow astronauts to recycle urine into water

4 hours ago

New research reveals how galaxies avoid early death

9 hours ago

Oxygen tweaking may be key to accelerator optimization

10 hours ago

A stealth fungus has decimated North American bats, but scientists may be a step closer to treating white-nose syndrome

11 hours ago

Scientific definition of a planet says it must orbit our sun: A new proposal would change that

11 hours ago

Forest carbon storage has declined across much of the Western U.S., likely due to drought and fire

11 hours ago

Study introduces lead-coated nickel catalyst for enhanced hydrogen evolution reaction efficiency

11 hours ago

Q&A: Researcher discusses how gravitational waves hint at dark matter and Big Bang mysteries

12 hours ago

Team develops the first cell-free system in which genetic information and metabolism work together

12 hours ago

Chemists develop robust molecule that gives organic electronic devices a boost

13 hours ago

Load comments (0)

Augmented tongue ultrasound for speech therapy

Real-life 'stillsuit': Dune-inspired upgrade for spacesuits allow astronauts to recycle urine into water

New research reveals how galaxies avoid early death

Oxygen tweaking may be key to accelerator optimization

A stealth fungus has decimated North American bats, but scientists may be a step closer to treating white-nose syndrome

Scientific definition of a planet says it must orbit our sun: A new proposal would change that

Forest carbon storage has declined across much of the Western U.S., likely due to drought and fire

Study introduces lead-coated nickel catalyst for enhanced hydrogen evolution reaction efficiency

Q&A: Researcher discusses how gravitational waves hint at dark matter and Big Bang mysteries

Team develops the first cell-free system in which genetic information and metabolism work together

Chemists develop robust molecule that gives organic electronic devices a boost

Relevant PhysicsForums posts

Help Needed with MCNP Simulation for Brachytherapy Treatment Room

Help with some optimization code for Block Matrices

Is an API Always Necessary for Server-Client Communication?

5 GHz PC WiFi connection Cybersecurity question

I did this POST message configuration damage to my wifi internet, help

Number of Multiplications in the FFT Algorithm

Baboon vocalizations contain five vowel-like sounds comparable to those of human speech

Patients improve speech by watching 3-D tongue images

Ultrasound guides tongue to pronounce 'r' sounds

Speech synthesizer designed to work out mouth movements into words

On the tip of your tongue: Researchers reveal our motor system activates when we hear speech

Babies need free tongue movement to decipher speech sounds

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Augmented tongue ultrasound for speech therapy

Real-life 'stillsuit': Dune-inspired upgrade for spacesuits allow astronauts to recycle urine into water

New research reveals how galaxies avoid early death

Oxygen tweaking may be key to accelerator optimization

A stealth fungus has decimated North American bats, but scientists may be a step closer to treating white-nose syndrome

Scientific definition of a planet says it must orbit our sun: A new proposal would change that

Forest carbon storage has declined across much of the Western U.S., likely due to drought and fire

Study introduces lead-coated nickel catalyst for enhanced hydrogen evolution reaction efficiency

Q&A: Researcher discusses how gravitational waves hint at dark matter and Big Bang mysteries

Team develops the first cell-free system in which genetic information and metabolism work together

Chemists develop robust molecule that gives organic electronic devices a boost

Relevant PhysicsForums posts

Related Stories

Baboon vocalizations contain five vowel-like sounds comparable to those of human speech

Patients improve speech by watching 3-D tongue images

Ultrasound guides tongue to pronounce 'r' sounds

Speech synthesizer designed to work out mouth movements into words

On the tip of your tongue: Researchers reveal our motor system activates when we hear speech

Babies need free tongue movement to decipher speech sounds

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience