Breast cancer detection with logistic regression improved by features constructed by kaizen programming in a hybrid approach

Date
2016Author
de Melo, Vinicius Veloso [UNIFESP]
Type
Trabalho apresentado em eventoIs part of
2016 Ieee Congress On Evolutionary Computation (CEC)DOI
10.1109/CEC.2016.7743773Metadata
Show full item recordAbstract
Breast cancer is known as the second largest cause of cancer deaths among women, but thankfully it can be cured if diagnosed early. There have been many investigations on methods to improve the accuracy of the diagnostic, and Machine Learning (ML) and Evolutionary Computation (EC) tools are among the most successfully employed modern methods. On the other hand, Logistic Regression (LR), a traditional and popular statistical method for classification, is not commonly used by computer scientists as those modern methods usually outperform it. Here we show that LR can achieve results that are similar to those of ML and EC methods and can even outperform them when useful knowledge is discovered in the dataset. In this paper, we employ the recently proposed Kaizen Programming (KP) approach with LR to construct high-quality nonlinear combinations of the original features resulting in new sets of features. Experimental analysis indicates that the new sets provide significantly better predictive accuracy than the original ones. When compared to related work from the literature, it is shown that the proposed approach is competitive and a promising method for automatic feature construction.
Citation
2016 Ieee Congress On Evolutionary Computation (CEC). New york, p. 16-23, 2016.Keywords
Bloat Control MethodsFeature-Selection
Neural-Network
Diagnosis
Prognosis
Classification
System
Sponsorship
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)