Acoustic scene classification using higher-order ambisonic features

Marc C. Green, Sharath Adavanne, Damian Murphy, Tuomas Virtanen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper investigates the potential of using higher-order Ambisonic features to perform acoustic scene classification. We compare the performance of systems trained using first-order and fourth-order spatial features extracted from the EigenScape database. Using both Gaussian mixture model and convolutional neural network classifiers, we show that features extracted from higher-order Ambisonics can yield increased classification accuracies relative to first-order features. Diffuseness-based features seem to describe scenes particularly well relative to direction-of-arrival based features. With specific feature subsets, however, differences in classification accuracy between first and fourth-order features become negligible.

Original languageEnglish
Title of host publication2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019
PublisherIEEE
Pages328-332
Number of pages5
ISBN (Electronic)9781728111230
DOIs
Publication statusPublished - 23 Dec 2019
Event2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 - New Paltz, United States
Duration: 20 Oct 201923 Oct 2019

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Volume2019-October
ISSN (Print)1931-1168
ISSN (Electronic)1947-1629

Conference

Conference2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019
Country/TerritoryUnited States
CityNew Paltz
Period20/10/1923/10/19

Bibliographical note

Funding Information:
†The research leading to these results has received funding from the European Research Council under the European Union’s H2020 Framework Program through ERC Grant Agreement 637422 EVERYSOUND. The authors also wish to acknowledge CSC-IT Center for Science, Finland, for computational resources.

Funding Information:
∗Funding was provided by a UK Engineering and Physical Sciences Research Council (EPSRC) Doctoral Training Award, an Audio Engineering Society (AES) Educational Foundation grant, and the Department of Electronic Engineering at the University of York.

Publisher Copyright:
© 2019 IEEE.

Keywords

  • acoustic scene classification
  • ambisonics
  • convolutional neural networks
  • gaussian mixture models
  • spatial audio

Cite this