ORCID
0000-0001-7702-2564 (El Moudden)
Document Type
Article
Publication Date
2021
DOI
10.25046/aj060196
Publication Title
Advances in Science, Technology and Engineering Systems Journal
Volume
6
Issue
1
Pages
871-878
Abstract
Despite the significant progress made in data mining technologies in recent years, breast cancer risk prediction and diagnosis at an early stage using DNA microarray technology still a real challenging task. This challenge comes especially from the high-dimensionality in gene expression data, i.e., an enormous number of genes versus a few tens of subjects (samples). To overcome this problem of data imbalance, a gene selection phase becomes a crucial step for gene expression data analysis. This study proposes a new Decision Tree model-based attributes (genes) selection strategy, which incorporates two stages: fisher-score-based filter technique and the gene selection ability of the C5.0 algorithm. Our proposed strategy is assessed using an ensemble of machine learning algorithms to classify each subject (patients). Comparing our approach with recent previous works, the experiment results demonstrate that our new gene selection strategy achieved the highest prediction performance of breast cancer by involving only five genes as predictors among 24481 genes.
Rights
© 2026 Advances in Science, Technology, and Engineering Systems Journal.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License.
Original Publication Citation
Hamim, M., Moudden, I. E., Moutachaouik, H., & Hain, M. (2021). Gene selection for cancer classification: A new hybrid filter-C5.0 approach for breast cancer risk prediction. Advances in Science, Technology and Engineering Systems Journal, 6(1), 871–878. https://doi.org/10.25046/aj060196
Repository Citation
Hamim, M., Moudden, I. E., Moutachaouik, H., & Hain, M. (2021). Gene selection for cancer classification: A new hybrid filter-C5.0 approach for breast cancer risk prediction. Advances in Science, Technology and Engineering Systems Journal, 6(1), 871–878. https://doi.org/10.25046/aj060196