Document Type
Article
Publication Date
2026
DOI
10.1002/advs.202520333
Publication Title
Advanced Science
Volume
Advance online publication
Pages
21 pp.
Abstract
Despite enabling high-resolution mapping of gene expression within tissues, spatial transcriptomics (ST) still faces challenges in accurately segmenting spatial domains due to complex tissue architecture and limitations of current methods. Most approaches rely on local spatial priors, lack gene-level interpretability, and fall short in capturing structure-discriminative genes or long-range functional relationships, limiting their ability to resolve biologically meaningful architectures. We present Spatially Aware Gene selection and dual-view Embedding fusion (SAGE), a unified and reproducible framework for domain identification in spatial transcriptomics that combines topic-driven gene selection with dual-view embedding fusion to address these gaps. SAGE integrates non-negative matrix factorization (NMF)-based topic modeling with classifier-based importance scoring to identify highly spatially informative genes, and fuses a local expression graph with a topic-driven non-local graph via consensus refinement and contrastive graph representation learning to jointly learn spatial and functional embeddings. Evaluated on 34 real-world datasets, SAGE not only outperforms existing methods in clustering accuracy but also reveals functionally coherent regions and interpretable gene expression patterns. In case studies, SAGE reveals spatial heterogeneity associated with a pre-malignant activation state in human breast cancer. Moreover, in zebrafish melanoma, it refines the tumor-muscle interface into transcriptionally distinct subdomains and uncovers shared vascular signatures between anatomically separate tissues. Together, these results demonstrate that SAGE can be used not only for accurate spatial domain delineation across diverse ST platforms, but also for dissecting microenvironmental niches and long-range tissue interactions underlying disease progression.
Rights
© 2026 The Authors
This is an open access article under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Data Availability
Article states: "Data sharing is not applicable to this article as no new data were createdor analyzed in this study."
Original Publication Citation
He, Y., Xu, Y., Ding, L., Li, H.-D., Li, Y., & Wang, S. (2026). SAGE: Spatially aware gene selection and dual-view embedding fusion for domain identification in spatial transcriptomics. Advance Science. Advance online publication. https://doi.org/10.1002/advs.202520333
Repository Citation
He, Y., Xu, Y., Ding, L., Li, H.-D., Li, Y., & Wang, S. (2026). SAGE: Spatially aware gene selection and dual-view embedding fusion for domain identification in spatial transcriptomics. Advance Science. Advance online publication. https://doi.org/10.1002/advs.202520333
ORCID
0000-0003-0178-1876 (Li)
Supporting Information