Abstract
This paper investigates the potential of using higher-order Ambisonic features to perform acoustic scene classification. We compare the performance of systems trained using first-order and fourth-order spatial features extracted from the EigenScape database. Using both Gaussian mixture model and convolutional neural network classifiers, we show that features extracted from higher-order Ambisonics can yield increased classification accuracies relative to first-order features. Diffuseness-based features seem to describe scenes particularly well relative to direction-of-arrival based features. With specific feature subsets, however, differences in classification accuracy between first and fourth-order features become negligible.
Original language | English |
---|---|
Title of host publication | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 |
Publisher | IEEE |
Pages | 328-332 |
Number of pages | 5 |
ISBN (Electronic) | 9781728111230 |
DOIs | |
Publication status | Published - 23 Dec 2019 |
Event | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 - New Paltz, United States Duration: 20 Oct 2019 → 23 Oct 2019 |
Publication series
Name | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics |
---|---|
Volume | 2019-October |
ISSN (Print) | 1931-1168 |
ISSN (Electronic) | 1947-1629 |
Conference
Conference | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 |
---|---|
Country/Territory | United States |
City | New Paltz |
Period | 20/10/19 → 23/10/19 |
Bibliographical note
Funding Information:†The research leading to these results has received funding from the European Research Council under the European Union’s H2020 Framework Program through ERC Grant Agreement 637422 EVERYSOUND. The authors also wish to acknowledge CSC-IT Center for Science, Finland, for computational resources.
Funding Information:
∗Funding was provided by a UK Engineering and Physical Sciences Research Council (EPSRC) Doctoral Training Award, an Audio Engineering Society (AES) Educational Foundation grant, and the Department of Electronic Engineering at the University of York.
Publisher Copyright:
© 2019 IEEE.
Keywords
- acoustic scene classification
- ambisonics
- convolutional neural networks
- gaussian mixture models
- spatial audio