Novel feature selection method for genetic programming using metabolomic H-1 NMR data

R A Davis, A J Charlton, S Oehlschlager, J C Wilson

Research output: Contribution to journalArticlepeer-review

Abstract

A novel technique for multivariate data analysis using a two-stage genetic programming (GP) routine for feature selection is described. The method is compared with conventional genetic programming for the classification of genetically modified barley. Metabolic fingerprinting by H-1 NMR spectroscopy was used to analyse the differences between transgenic and null-segregant plants. We show that the method has a number of major advantages over standard genetic programming techniques. By selecting a minimal set of characteristic features in the data, the method provides models that are easier to interpret. Moreover the new method achieves better classification results and convergence is reached significantly faster. (c) 2005 Elsevier B.V All rights reserved.

Original languageEnglish
Pages (from-to)50-59
Number of pages10
JournalChemometrics and Intelligent Laboratory Systems
Volume81
Issue number1
DOIs
Publication statusPublished - 15 Mar 2006

Keywords

  • metabolomics
  • multivariate data analysis
  • genetic programming
  • feature selection
  • NMR
  • MULTIVARIATE-ANALYSIS
  • FUNCTIONAL GENOMICS
  • COMPLEX-MIXTURES
  • PLANT-EXTRACTS
  • SPECTRAL DATA
  • SPECTROSCOPY
  • CLASSIFICATION
  • CHEMOMETRICS
  • ALGORITHM
  • TOOL

Cite this