Evolutionary model trees for handling continuous classes in machine learning

Barros, Rodrigo C.; Ruiz, Duncan D.; Basgalupp, Marcio P. [UNIFESP]

Date

2011-03-01

Authors

Barros, Rodrigo C.

Ruiz, Duncan D.

Basgalupp, Marcio P.

Type

Article

Abstract

Model trees are a particular case of decision trees employed to solve regression problems. They have the advantage of presenting an interpretable output, helping the end-user to gain more confidence in the prediction and providing the basis for the end-user to have new insight about the data, confirming or rejecting hypotheses previously formed. Moreover, model trees present an acceptable level of predictive performance in comparison to most techniques used for solving regression problems. Since generating the optimal model tree is an NP-complete problem, traditional model tree induction algorithms make use of a greedy top-down divide-and-conquer strategy, which may not converge to the globally optimal solution. In this paper, we propose a novel algorithm based on the use of the evolutionary algorithms paradigm as an alternative heuristic to generate model trees in order to improve the convergence to globally near-optimal solutions. We call our new approach evolutionary model tree induction (E-Motion). We test its predictive performance using public UCI data sets, and we compare the results to traditional greedy regression/model tree induction algorithms, as well as to other evolutionary approaches. Results show that our method presents a good trade-off between predictive performance and model comprehensibility, which may be crucial in many machine learning applications. © 2010 Elsevier Inc. All rights reserved.
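
As a rough illustration of the two ideas in the abstract (a tree whose leaves hold linear models, and an evolutionary rather than greedy search for its structure), the toy sketch below evolves the split threshold of a one-split, single-feature model tree. It is not the E-Motion algorithm, whose operators and fitness function are not described in this abstract. All names (`fit_line`, `build_stump`, `evolve`), the truncation selection, the Gaussian mutation, and the synthetic data are assumptions made for illustration only.

```python
import random

def fit_line(points):
    """Least-squares fit of y = a*x + b to (x, y) pairs; returns (a, b)."""
    n = len(points)
    if n == 0:
        return 0.0, 0.0
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in points)
    if var_x == 0:
        return 0.0, mean_y  # degenerate leaf: predict the mean
    cov = sum((x - mean_x) * (y - mean_y) for x, y in points)
    a = cov / var_x
    return a, mean_y - a * mean_x

def build_stump(threshold, data):
    """One-split model tree: a linear model in each of the two leaves."""
    left = [(x, y) for x, y in data if x <= threshold]
    right = [(x, y) for x, y in data if x > threshold]
    return threshold, fit_line(left), fit_line(right)

def predict(stump, x):
    threshold, left_model, right_model = stump
    a, b = left_model if x <= threshold else right_model
    return a * x + b

def sse(stump, data):
    """Sum of squared errors, used here as the (hypothetical) fitness."""
    return sum((predict(stump, x) - y) ** 2 for x, y in data)

def evolve(data, pop_size=20, generations=40, seed=0):
    """Evolve split thresholds with truncation selection and Gaussian mutation."""
    rng = random.Random(seed)
    lo = min(x for x, _ in data)
    hi = max(x for x, _ in data)
    population = [rng.uniform(lo, hi) for _ in range(pop_size)]

    def error(threshold):
        return sse(build_stump(threshold, data), data)

    for _ in range(generations):
        population.sort(key=error)
        survivors = population[: pop_size // 2]  # keep the better half
        children = [
            min(hi, max(lo, t + rng.gauss(0, 0.1 * (hi - lo))))
            for t in survivors
        ]
        population = survivors + children
    return build_stump(min(population, key=error), data)

# Synthetic piecewise-linear data with a change of regime at x = 5.
xs = [i * 0.25 for i in range(41)]
data = [(x, 2 * x + 1 if x <= 5 else 20 - x) for x in xs]

best = evolve(data)
baseline = (float("inf"), fit_line(data), fit_line(data))  # single global line
```

Comparing `sse(best, data)` with `sse(baseline, data)` shows how much the evolved split improves on one global linear model. A real model tree would have many splits over several attributes, and an evolutionary inducer would typically vary whole tree structures rather than a single threshold.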

Citation

Information Sciences. New York: Elsevier B.V., v. 181, n. 5, p. 954-971, 2011.