Machine learning with incomplete datasets using multi-objective optimization models
AuthorKhorshidi, HA; Kirley, M; Aickelin, U
Source TitleProceedings of the International Joint Conference on Neural Networks
Document TypeConference Paper
CitationsKhorshidi, H. A., Kirley, M. & Aickelin, U. (2020). Machine learning with incomplete datasets using multi-objective optimization models. Proceedings of the International Joint Conference on Neural Networks, IEEE. https://doi.org/10.1109/ijcnn48605.2020.9206742.
Access StatusOpen Access
Machine learning techniques have been developed to learn from complete data. When missing values exist in a dataset, the incomplete data should be preprocessed separately by removing data points with missing values or imputation. In this paper, we propose an online approach to handle missing values while a classification model is learnt. To reach this goal, we develop a multi-objective optimization model with two objective functions for imputation and model selection. We also propose three formulations for imputation objective function. We use an evolutionary algorithm based on NSGA II to find the optimal solutions as the Pareto solutions. We investigate the reliability and robustness of the proposed model using experiments by defining several scenarios in dealing with missing values and classification. We also describe how the proposed model can contribute to medical informatics. We compare the performance of three different formulations via experimental results. The proposed model results get validated by comparing with a comparable literature.
- Click on "Export Reference in RIS Format" and choose "open with... Endnote".
- Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References