Date of Award
Summer 8-2025
Document Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Computer Science
Program/Concentration
Computer Science
Committee Director
Jiangwen Sun
Committee Member
Desh Ranjan
Committee Member
Jing He
Committee Member
Yet Nguyen
Abstract
Epigenetic events, such as DNA methylation and histone modifications, arise from a complex interplay among genomic sequence, chromatin-remodeling factors, and environmental cues. These regulatory mechanisms can induce changes in gene expression without altering the underlying DNA sequence, playing critical roles in development, disease, and cellular differentiation. Among these events, DNA methylation is frequently profiled using bisulfite sequencing (e.g., whole-genome bisulfite sequencing [WGBS], reduced representation bisulfite sequencing [RRBS]). However, predictive modeling of epigenetic states—including methylation patterns and regulatory variant effects—remains challenging due to data sparsity, label noise, and limited uncertainty estimation in current deep learning approaches. This dissertation addresses these issues by introducing a suite of data-centric and uncertainty-aware deep learning frameworks. First, I conduct a systematic evaluation to quantify how data quality, sample size, and label noise affect model performance. Second, I propose a transfer learning method to impute sparse methylation profiles to improve data quality. Third, I develop a Monte Carlo dropout–based pipeline that quantifies uncertainty for non-coding genetic variant effect predictions, aiding in the prioritization of potentially regulatory variants in tissue-specific contexts. Collectively, these contributions advance scalable, interpretable, and reliable computational approaches for epigenomic data analysis, paving the way for improved understanding and practical utilization of epigenetic events in developmental and disease settings.
Rights
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
DOI
10.25777/zsma-bk24
ISBN
9798293842377
Recommended Citation
Dodlapati, Sanjeeva R..
"Learning Regulatory DNA-Sequence Code of Epigenetic Events Using Deep Neural Networks"
(2025). Doctor of Philosophy (PhD), Dissertation, Computer Science, Old Dominion University, DOI: 10.25777/zsma-bk24
https://digitalcommons.odu.edu/computerscience_etds/189
ORCID
0000-0002-0198-1264