Abstract
The digital waveguide mesh has been shown to be capable of reproducing the acoustic impulse response of cylindrical vocal tract analogs. This study extends the same methodology to three-dimensional simulation of the acoustic response of graphical models of the vocal tract obtained from magnetic resonance imaging for a group of trained subjects. By such simulation of the vocal tract transfer function and convolution with an appropriate source waveform, basic phonemes are resynthesised and compared with benchmark audio recordings. The technologies and techniques used for simulation are described, alongside the protocol for image capture and the process for collection of benchmark audio. The results of simulation and acoustic recording are then evaluated and compared. The value of three-dimensional simulation in comparison to existing lower-dimensionality equivalents is assessed. It is found that while three- dimensional simulation provides a strong representation of the low frequency vocal tract transfer function, at higher frequencies its performance becomes geometry-dependent. MRI imaging and benchmark audio is provided for future studies and to permit comparison with comparable means of acoustic simulation.
Original language | English |
---|---|
Pages (from-to) | 453-464 |
Number of pages | 12 |
Journal | IEEE Transactions On Audio Speech And Language Processing |
Volume | 22 |
Issue number | 2 |
Early online date | 11 Dec 2013 |
DOIs | |
Publication status | Published - Feb 2014 |
Bibliographical note
©2013, IEEE.Keywords
- waveguide
- vocal tract
- MRI data