ORCID

0000-0002-8991-1737 (Pant)

Document Type

Article

Publication Date

2021

DOI

10.3991/ijoe.v17i02.19889

Publication Title

International Journal of Online and Biomedical Engineering (iJOE)

Volume

17

Issue

2

Pages

148-163

Abstract

Breast cancer poses the greatest threat to human life and especially to women's life. Despite the progress made in data mining technology in recent years, the ability to predict and diagnose such fatal diseases based on gene expression data still reveals a limited prediction performance, which may not be surprising since most of the genes in expression data are believed to be irrelevant or redundant. The dimensionality reduction process may be considered as a crucial step to analyze gene expression data, as it can reduce the high dimensionality of the breast cancer datasets, which may result into a better prediction performance of such diseases. The paper suggests a new hybrid approach-based gene selection that combines the filter method and the Ant Colony Optimization algorithm to find the smallest subset of informative genes (genes markers) among 24,481 genes. The proposed approach combines four machine learning algorithms - C5.0 Decision Tree, Support Vector Machines, K-Nearest Neighbors algorithm, and Random Forest Classifier - to classify each of the selected samples (patients) into two classes which have cancer or not. Compared with existing methods in the literature, experimental results indicate that our proposed gene selection approach achieved globally higher classification accuracies with a relatively smaller number of genes.

Rights

The International Journal of Online and Biomedical Engineering (iJOE)is an Open Access journal. All articles are available in PDF format immediately upon publication, free of charge and without any subscription. Any user may download, print, or copy the articles or use them for any other lawful purpose, as long as the licensing terms are respected.

Article is published published under the Creative Commons Attribution Licence (CC-BY 4.0). This means that users may share and adapt the articles published on this website in a reasonable manner, but they must give appropriate credit of the creator and indicate the changes they have made. Furthermore, users must not apply additional restrictions, but must publish the work under the same license (CC-BY 4.0).

Original Publication Citation

Hamim, M., El Moudden, I., Pant, M. D., Moutachaouik, H., & Hain, M. (2021). A hybrid gene selection strategy based on Fisher and Ant Colony Optimization algorithm for breast cancer classification. International Journal of Online and Biomedical Engineering (iJOE), 17(2), 148-163. https://doi.org/10.3991/ijoe.v17i02.19889

Share

COinS