Abstract
A two-dimensional physical model of the human vocal tract is described. Such a system promises increased realism and control in the synthesis. of both speech and singing. However, the parameters describing the shape of the vocal tract while in use are not easily obtained, even using medical imaging techniques, so instead a genetic algorithm (GA) is applied to the model to find an appropriate configuration. Realistic sounds are produced by this method. Analysis of these, and the reliability of the technique (convergence properties) is provided.
Original language | English |
---|---|
Pages (from-to) | 1454-1461 |
Number of pages | 8 |
Journal | IEEE Transactions On Audio Speech And Language Processing |
Volume | 14 |
Issue number | 4 |
DOIs | |
Publication status | Published - Jul 2006 |
Bibliographical note
© Copyright 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.Keywords
- 2-MUSI
- digital waveguide mesh
- finite difference
- genetic algorithms (GAs)
- optimization methods
- singing synthesis
- speech synthesis
- voice analysis
- HISTORY
- VOICE