Abstract
The APEX system [1] enables vocal tract articulation using a reduced set of user-controllable parameters by means of Principal Component Analysis of X-ray tract data. From these articulatory profiles it is then possible to calculate cross-sectional area function data that can be used as input to a number of articulatory-based speech synthesis algorithms. In this paper, the Kelly-Lochbaum 1-D digital waveguide vocal tract is used, and both the APEX control and the synthesis engine have been implemented and tested in SuperCollider. Accurate formant synthesis and real-time control are demonstrated, although for multi-parameter speech-like articulation a more direct mapping from tract to synthesizer tube sections is needed. SuperCollider provides an excellent framework for the further exploration of this work.
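The step from a cross-sectional area function to a Kelly-Lochbaum waveguide can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the paper's SuperCollider implementation. It assumes the standard pressure-wave convention, in which the junction between adjacent tube sections with areas A_i and A_(i+1) has reflection coefficient k_i = (A_i - A_(i+1)) / (A_i + A_(i+1)). The function names and the example area values are hypothetical.

```python
def reflection_coefficients(areas):
    """Return one reflection coefficient per junction between adjacent sections.

    Assumes the pressure-wave convention: k = (A_i - A_next) / (A_i + A_next).
    `areas` is a sequence of positive cross-sectional areas, glottis to lips.
    """
    if len(areas) < 2:
        raise ValueError("need at least two tube sections")
    if any(a <= 0 for a in areas):
        raise ValueError("cross-sectional areas must be positive")
    return [(a0 - a1) / (a0 + a1) for a0, a1 in zip(areas, areas[1:])]


def scatter(k, forward_in, backward_in):
    """Kelly-Lochbaum scattering of pressure waves at a single junction.

    forward_in:  right-going wave arriving from the section on the glottis side
    backward_in: left-going wave arriving from the section on the lip side
    Returns (forward_out, backward_out) leaving the junction.
    """
    forward_out = (1 + k) * forward_in - k * backward_in
    backward_out = k * forward_in + (1 - k) * backward_in
    return forward_out, backward_out


# Hypothetical four-section area function (arbitrary units).
areas = [1.0, 1.0, 2.0, 0.5]
ks = reflection_coefficients(areas)
```

Equal adjacent areas give k = 0, so the wave passes through unchanged; a widening gives a negative coefficient and a constriction a positive one. In a full waveguide, `scatter` would be applied at every junction on each sample, with unit delays between junctions and separate reflection terms at the glottis and lips.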
Original language | English |
---|---|
Pages | 307-313 |
Number of pages | 7 |
Publication status | Published - 30 Nov 2015 |
Event | 18th Int. Conference on Digital Audio Effects (DAFx-15), Trondheim, Norway. Duration: 30 Nov 2015 → 3 Dec 2015. http://www.ntnu.edu/web/dafx15/ |
Conference
Conference | 18th Int. Conference on Digital Audio Effects (DAFx-15) |
---|---|
Country/Territory | Norway |
City | Trondheim |
Period | 30/11/15 → 3/12/15 |
Internet address | http://www.ntnu.edu/web/dafx15/ |
Keywords
- digital audio
- acoustics
- speech processing
- voice acoustics