Modeling the vocal tract transfer function using a 3D digital waveguide mesh

Matthew David Adam Speed, Damian Thomas Murphy, David Martin Howard

Research output: Contribution to journalArticlepeer-review

Abstract

The digital waveguide mesh has been shown to be capable of reproducing the acoustic impulse response of cylindrical vocal tract analogs. This study extends the same methodology to three-dimensional simulation of the acoustic response of graphical models of the vocal tract obtained from magnetic resonance imaging for a group of trained subjects. By such simulation of the vocal tract transfer function and convolution with an appropriate source waveform, basic phonemes are resynthesised and compared with benchmark audio recordings. The technologies and techniques used for simulation are described, alongside the protocol for image capture and the process for collection of benchmark audio. The results of simulation and acoustic recording are then evaluated and compared. The value of three-dimensional simulation in comparison to existing lower-dimensionality equivalents is assessed. It is found that while three- dimensional simulation provides a strong representation of the low frequency vocal tract transfer function, at higher frequencies its performance becomes geometry-dependent. MRI imaging and benchmark audio is provided for future studies and to permit comparison with comparable means of acoustic simulation.
Original languageEnglish
Pages (from-to)453-464
Number of pages12
JournalIEEE Transactions On Audio Speech And Language Processing
Volume22
Issue number2
Early online date11 Dec 2013
DOIs
Publication statusPublished - Feb 2014

Bibliographical note

©2013, IEEE.

Keywords

  • waveguide
  • vocal tract
  • MRI data

Cite this