Document Type
Conference Paper
Publication Date
2024
Publication Title
American Society for Engineering Management 2024 International Annual Conference
Pages
10 pp.
Conference Name
American Society for Engineering Management 2024 International Annual Conference, November 6-9, 2024, Virginia Beach, Virginia
Abstract
The advent of Next-Generation Sequencing (NGS) techniques has revolutionized genomic research by enabling the rapid sequencing of DNA and RNA. This data can be used for various applications, including genome sequencing, transcriptome profiling, metagenomics, and epigenetics studies. For this study, DNA classifier dataset was extracted from UCI repository of machine learning databases. This vast amount of genomic data necessitates the development of sophisticated machine learning (ML) models for effective classification and analysis. This study presents a comprehensive comparison of various ML models, including Support Vector Machines (SVM), Random Forests (RF), and Neural Networks (NNs), approaches, in classifying genomic data. We evaluate these models based on accuracy, computational efficiency, and their ability to handle high-dimensional data. The comparison reveals distinct strengths and limitations of each model, with NNs and DL approaches showing superior performance in handling complex patterns in genomic sequences but requiring significant computational resources. Conversely, SVM and RF offer a balance between accuracy and computational demand, making them suitable for applications with limited computational resources. À notable research gap identified is the challenge of integrating diverse genomic data types (e.g., single nucleotide polymorphisms, copy number variations, and gene expression data) in a unified classification model. This gap underscores the need for developing advanced ML models that can effectively leverage the heterogeneous nature of genomic data to improve classification outcomes. This study paves the way for future research aimed at bridging this gap, which is critical for advancing personalized medicine and understanding complex genetic disorders.
Rights
Copyright© 2024. Reprinted with permission of the American Society for Engineering Management. International Annual Conference. All rights reserved.
Included with the kind written permission of the copyright holders.
ORCID
0000-0003-2824-4528 (Alla), 0009-0009-1889-6427 (Bheesetty), 0000-0003-1078-0314 (Mohanty)
Original Publication Citation
Alla, S., Bheesetty, N., Komaragiri, S. G., Chidipudi, P., Mohanty, J., Chintala, S. K., Thomas, J., Vummadi, J., Volikatla, H., & Kamuni, N. (2024). Integrative machine learning approaches for enhanced classification of genomic sequences: A next-generation sequencing perspective [Paper presentation]. American Society for Engineering Management 2024 International Annual Conference, Virginia Beach, Virginia.
Repository Citation
Alla, Sujatha; Bheesetty, Nagesh; Komaragiri, Sai Gireesh; Chidipudi, Prasanthi; Mohanty, Joshit; Chintala, Sathish Kumar; Thomas, Jubin; Vummadi, Jayapal; Volikatla, Hemanth; and Kamuni, Navin, "Integrative Machine Learning Approaches for Enhanced Classification of Genomic Sequences: A Next-Generation Sequencing Perspective" (2024). Engineering Management & Systems Engineering Faculty Publications. 234.
https://digitalcommons.odu.edu/emse_fac_pubs/234
Included in
Biomedical Engineering and Bioengineering Commons, Genetics and Genomics Commons, Operations Research, Systems Engineering and Industrial Engineering Commons, Theory and Algorithms Commons