|Tytuł||NCA-GA-SVM: A new two-level feature selection method based on neighborhood component analysis and genetic algorithm in hepatocellular carcinoma fatality prognosis|
|Publication Type||Journal Article|
|Autorzy||Książek W, Turza F, Pławiak P|
|Journal||International Journal for Numerical Methods in Biomedical Engineering|
|Słowa kluczowe||Feature selection, Genetic algorithms, Hepatocellular carcinoma, Machine learning, neighborhood component analysis|
Abstract Hepatocellular carcinoma (HCC) is one of the major challenges facing biomedical research. Despite the high lethality, methods to predict mortality for this type of aggressive malignant tumor are insufficient. Machine learning is recognized by many authors as a valuable, yet poorly studied tool in this field. Undoubtedly, searching for new feature selection methods is significant in building an effective machine-learning model. In this study, we propose the novel hybrid model using neighborhood components analysis, genetic algorithm and support vector machine classifier (NCA-GA-SVM). Because SVM works with default parameters characterized by low classification results, we decided to use GA for the proper optimization and feature selection. As reported in the available literature, NCA and GA obtain high classification results. Here, we decided to combine these approaches, building a two-level algorithm for HCC fatality prognosis. We used a well-known dataset collected from 165 patients at Coimbra's Hospital and University Center, Portugal. Our results revealed 96.36\% classification accuracy and 95.52\% F1-score. Additionally, we compared all data for these metrics published so far. We demonstrated that our algorithm achieved the highest accuracy and can be successfully applied for the assessment of hepatocellular carcinoma mortality in the future. Our findings bring methodological value for future HCC studies and emphasize the possibility of using machine-learning techniques to improve the quality of medical decisions.