Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/126165

Title: Efficient feature selection for mass spectrometry based electronic nose applications

Authors: Llobet Brossa, Enrique; Gualdrón, O.; Vinaixa, M.; El-Barbri, N.; Brezmes, J.; Vilanova, X.; Bouchikhi, B.; Gómez, R.; Carrasco, J. A.; Correig, X.
Issue Date: 2007
Publisher: Elsevier
Citation: Chemometrics and Intelligent Laboratory Systems 85: 253-261 (2007)
Abstract: High dimensionality is inherent to MS-based electronic nose applications, where each measurement yields hundreds of variables (m/z fragments), a significant number of them highly correlated or noisy. Feature selection is therefore an unavoidable pre-processing step if robust and parsimonious pattern classification models are to be developed. This article introduces a new feature selection strategy and demonstrates its performance on two MS e-nose databases. Selection is conducted in three steps. The first step removes noisy, non-informative features, and the second removes highly collinear (i.e., redundant) ones. These two steps are computationally inexpensive and dramatically reduce the number of variables (nearly 80% of the initially available features are eliminated after the second step). The third step uses a stochastic variable selection method (simulated annealing) to reduce the number of variables further. For example, applying the method to an Iberian ham database reduced the number of features from 209 to 14. Using the surviving m/z fragments, a fuzzy ARTMAP classifier sorted ham samples by producer and quality (an 11-category classification) with a 97.24% success rate. The whole feature selection process runs in a few minutes on a Pentium IV PC platform. © 2006 Elsevier B.V. All rights reserved.
URI: http://hdl.handle.net/10261/126165
DOI: 10.1016/j.chemolab.2006.07.002
ISSN: 0169-7439
Appears in Collections: (IF) Artículos
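
The three-step selection outlined in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the criteria used, so the Fisher-like discrimination score in step 1, the 0.9 correlation threshold in step 2, the annealing cost function and cooling schedule in step 3, and the leave-one-out 1-nearest-neighbour classifier (a stand-in for fuzzy ARTMAP) are all hypothetical choices, as are the function and parameter names.

```python
import numpy as np

def drop_low_discrimination(X, y, keep_fraction=0.5):
    """Step 1 (assumed criterion): keep the features with the highest ratio
    of between-class to within-class scatter."""
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in np.unique(y):
        Xc = X[y == c]
        between += len(Xc) * (Xc.mean(axis=0) - overall_mean) ** 2
        within += ((Xc - Xc.mean(axis=0)) ** 2).sum(axis=0)
    score = between / (within + 1e-12)
    n_keep = max(1, int(round(keep_fraction * X.shape[1])))
    return np.sort(np.argsort(score)[::-1][:n_keep]), score

def drop_collinear(X, candidates, score, max_abs_corr=0.9):
    """Step 2 (assumed criterion): visit candidates from best to worst score
    and keep one only if it is not highly correlated with any kept feature."""
    corr = np.abs(np.atleast_2d(np.corrcoef(X[:, candidates], rowvar=False)))
    kept = []
    for i in np.argsort(score[candidates])[::-1]:
        if all(corr[i, j] < max_abs_corr for j in kept):
            kept.append(i)
    return np.sort(candidates[kept])

def loo_1nn_error(X, y, mask):
    """Leave-one-out error of a 1-NN classifier (stand-in for fuzzy ARTMAP)."""
    Z = X[:, mask]
    d = ((Z[:, None, :] - Z[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d, np.inf)  # a sample may not be its own neighbour
    return np.mean(y[d.argmin(axis=1)] != y)

def anneal(X, y, features, n_iter=500, t0=0.1, cooling=0.99,
           size_penalty=0.005, seed=0):
    """Step 3: simulated annealing over feature subsets. The cost is the
    classification error plus a small penalty per selected feature."""
    rng = np.random.default_rng(seed)
    Xf = X[:, features]
    cost = lambda m: loo_1nn_error(Xf, y, m) + size_penalty * m.sum()
    mask = np.ones(len(features), dtype=bool)
    current = cost(mask)
    best_mask, best = mask.copy(), current
    t = t0
    for _ in range(n_iter):
        trial = mask.copy()
        k = rng.integers(len(features))
        trial[k] = ~trial[k]  # flip one feature in or out
        if not trial.any():   # never evaluate an empty subset
            continue
        c = cost(trial)
        # Always accept improvements; accept worse moves with Boltzmann probability.
        if c < current or rng.random() < np.exp((current - c) / t):
            mask, current = trial, c
            if c < best:
                best_mask, best = trial.copy(), c
        t *= cooling
    return features[best_mask]

def select_features(X, y):
    """Chain the three steps and return the indices of the surviving columns."""
    candidates, score = drop_low_discrimination(X, y)
    candidates = drop_collinear(X, candidates, score)
    return anneal(X, y, candidates)
```

Here `X` would be a samples-by-fragments intensity matrix and `y` the class labels; `select_features(X, y)` returns the column indices that survive all three steps. The first two steps are plain array operations, which is consistent with the abstract's remark that they are computationally inexpensive, while the annealing loop dominates the run time.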

WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.