Singing synthesis with an evolved physical model

Crispin Cooper, D.T. Murphy, D. Howard, A. Tyrrell

Research output: Contribution to journalArticlepeer-review

Abstract

A two-dimensional physical model of the human vocal tract is described. Such a system promises increased realism and control in the synthesis. of both speech and singing. However, the parameters describing the shape of the vocal tract while in use are not easily obtained, even using medical imaging techniques, so instead a genetic algorithm (GA) is applied to the model to find an appropriate configuration. Realistic sounds are produced by this method. Analysis of these, and the reliability of the technique (convergence properties) is provided.

Original languageEnglish
Pages (from-to)1454-1461
Number of pages8
JournalIEEE Transactions On Audio Speech And Language Processing
Volume14
Issue number4
DOIs
Publication statusPublished - Jul 2006

Bibliographical note

© Copyright 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Keywords

  • 2-MUSI
  • digital waveguide mesh
  • finite difference
  • genetic algorithms (GAs)
  • optimization methods
  • singing synthesis
  • speech synthesis
  • voice analysis
  • HISTORY
  • VOICE

Cite this