Noise detection in the meta-learning level

Noise detection in the meta-learning level

Author Garcia, Luis P. F. Google Scholar
de Carvalho, Andre C. P. L. F. Google Scholar
Lorena, Ana C. Autor UNIFESP Google Scholar
Abstract The presence of noise in real data sets can harm the predictive performance of machine learning algorithms. There are several noise filtering techniques whose goal is to improve the quality of the data in classification tasks. These techniques usually scan the data for noise identification in a preprocessing step. Nonetheless, this is a non-trivial task and some noisy data can remain unidentified, while safe data can also be removed. The bias of each filtering technique influences its performance on a particular data set. Therefore, there is no single technique that can be considered the best for all domains or data distribution and choosing a particular filter is not straightforward. Meta-learning has been largely used in the last years to support the recommendation of the most suitable machine learning algorithm(s) for a new data set. This paper presents a meta-learning recommendation system able to predict the expected performance of noise filters in noisy data identification tasks. For such, a meta-base is created, containing meta-features extracted from several corrupted data sets along with the performance of some noise filters when applied to these data sets. Next, regression models are induced from this meta base to predict the expected performance of the investigated filters in the identification of noisy data. The experimental results show that meta-learning can provide a good recommendation of the most promising filters to be applied to new classification data sets. (C) 2015 Elsevier B.V. All rights reserved.
Keywords Noise identification
Characterization measures
Complexity measures
xmlui.dri2xhtml.METS-1.0.item-coverage Amsterdam
Language English
Date 2016
Published in Neurocomputing. Amsterdam, v. 176, p. 14-25, 2016.
ISSN 0925-2312 (Sherpa/Romeo, impact factor)
Publisher Elsevier Science Bv
Extent 14-25
Access rights Closed access
Type Article
Web of Science ID WOS:000368962800003

Show full item record


File Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)




My Account