Machine learning and air quality modeling

Christoph A. Keller, Mathew J. Evans, J. Nathan Kutz, Steven Pawson

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Air quality models are limited by the computational costs associated with the simulation of the complex chemical and dynamical processes of reactive pollutants in the atmosphere. We discuss here the potential usage of machine learning and reduced-order modeling techniques to mitigate some of these limitations. We first give an overview of three new methods emerging from the field of signal processing - sparse sampling, randomized matrix decompositions and the construction of reduced order models - and discuss them in the context of air quality modeling. In the second part we discuss the substitution of the standard chemical solver of the chemistry model with a random forest regression model trained through machine learning. We find that this approach shows promising initial results for important air pollutants such as ozone (O3), predicting concentrations that deviate less than 10% from the values computed by the traditional model. The here highlighted methods all have the potential to significantly reduce the computational burden of air quality models while maintaining the model's capability to capture all features relevant to air quality. Such lightweight air quality models offer new opportunities for air quality forecasting and to assimilate the rapidly increasing array of air quality observations.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
PublisherIEEE
Pages4570-4576
Number of pages7
Volume2018-January
ISBN (Electronic)9781538627143
DOIs
Publication statusPublished - 12 Jan 2018
Event5th IEEE International Conference on Big Data, Big Data 2017 - Boston, United States
Duration: 11 Dec 201714 Dec 2017

Conference

Conference5th IEEE International Conference on Big Data, Big Data 2017
Country/TerritoryUnited States
CityBoston
Period11/12/1714/12/17

Keywords

  • Air Quality Modeling
  • Big Data Analytics
  • Machine Learning
  • Statistical Sub-sampling

Cite this