Speech production is a complex motor process involving several physiological phenomena, such as the neural and muscular activities that drive our respiratory, laryngeal and articulatory movements. Modeling speech production, and in particular the relationship between articulatory gestures (tongue, lips, jaw, velum) and the acoustic realizations of speech, is a challenging and still evolving research question. From an applied point of view, such models could be embedded into assistive devices that restore oral communication when part of the speech production chain is damaged (articulatory synthesis). They could also help rehabilitate speech sound disorders through biofeedback-based therapy (relying on articulatory inversion). From a more fundamental research perspective, such models can be used to investigate the cognitive mechanisms underlying speech perception and motor control. In this talk, I will present different studies conducted in our group that aim at learning acoustic-articulatory models from real-world data using machine learning (deep learning, but not only). First, I will focus on different attempts to adapt a direct or inverse model, pre-trained on a reference speaker, to any new speaker. Then, I will present recent work on the integration of articulatory priors into the latent space of a variational auto-encoder, with potential application to speech enhancement. Finally, I will describe a recent line of research that studies, through modeling and simulation, how a child learns the acoustic-to-articulatory inverse mapping in a self-supervised manner when repeating auditory-only speech stimuli.
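To make the variational auto-encoder idea concrete, here is a minimal sketch of one way an articulatory prior can be injected into a VAE latent space: instead of regularizing the approximate posterior toward a standard normal, the KL term pulls it toward a Gaussian prior conditioned on articulatory features. This is an illustrative assumption, not the actual model presented in the talk; all dimensions, layer sizes and names (e.g. `ArticulatoryPriorVAE`, `n_articulatory`) are hypothetical.

```python
# Minimal sketch (PyTorch): a VAE whose latent prior is conditioned on
# articulatory features instead of the usual N(0, I). Purely illustrative;
# not the architecture described in the talk.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ArticulatoryPriorVAE(nn.Module):
    def __init__(self, n_acoustic=80, n_articulatory=12, n_latent=16):
        super().__init__()
        # Acoustic encoder: parameters of q(z | x)
        self.enc = nn.Sequential(nn.Linear(n_acoustic, 128), nn.ReLU())
        self.enc_mu = nn.Linear(128, n_latent)
        self.enc_logvar = nn.Linear(128, n_latent)
        # Articulatory prior network: parameters of p(z | a), replacing N(0, I)
        self.prior_mu = nn.Linear(n_articulatory, n_latent)
        self.prior_logvar = nn.Linear(n_articulatory, n_latent)
        # Decoder: p(x | z)
        self.dec = nn.Sequential(nn.Linear(n_latent, 128), nn.ReLU(),
                                 nn.Linear(128, n_acoustic))

    def forward(self, x, a):
        h = self.enc(x)
        mu, logvar = self.enc_mu(h), self.enc_logvar(h)
        # Reparameterization trick: z = mu + sigma * eps
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        x_hat = self.dec(z)
        # Closed-form KL between diagonal Gaussians q(z|x) and p(z|a)
        p_mu, p_logvar = self.prior_mu(a), self.prior_logvar(a)
        kl = 0.5 * (p_logvar - logvar
                    + (logvar.exp() + (mu - p_mu).pow(2)) / p_logvar.exp()
                    - 1).sum(dim=-1)
        recon = F.mse_loss(x_hat, x, reduction="none").sum(dim=-1)
        return (recon + kl).mean()

# Usage: one training step on random stand-in data
model = ArticulatoryPriorVAE()
x = torch.randn(32, 80)   # batch of acoustic frames (e.g. mel-spectrogram)
a = torch.randn(32, 12)   # matching articulatory features (e.g. EMA-style)
loss = model(x, a)
loss.backward()
```

The design choice illustrated here is that the articulatory data shapes the latent space only through the prior, so at test time the decoder can still be driven from acoustics alone, which is what makes such a model attractive for downstream tasks like speech enhancement.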