Document Type

Article

Publication Date

2026

DOI

10.1002/advs.202520333

Publication Title

Advanced Science

Volume

Advance online publication

Pages

21 pp.

Abstract

Despite enabling high-resolution mapping of gene expression within tissues, spatial transcriptomics (ST) still faces challenges in accurately segmenting spatial domains due to complex tissue architecture and limitations of current methods. Most approaches rely on local spatial priors, lack gene-level interpretability, and fall short in capturing structure-discriminative genes or long-range functional relationships, limiting their ability to resolve biologically meaningful architectures. We present Spatially Aware Gene selection and dual-view Embedding fusion (SAGE), a unified and reproducible framework for domain identification in spatial transcriptomics that combines topic-driven gene selection with dual-view embedding fusion to address these gaps. SAGE integrates non-negative matrix factorization (NMF)-based topic modeling with classifier-based importance scoring to identify highly spatially informative genes, and fuses a local expression graph with a topic-driven non-local graph via consensus refinement and contrastive graph representation learning to jointly learn spatial and functional embeddings. Evaluated on 34 real-world datasets, SAGE not only outperforms existing methods in clustering accuracy but also reveals functionally coherent regions and interpretable gene expression patterns. In case studies, SAGE reveals spatial heterogeneity associated with a pre-malignant activation state in human breast cancer. Moreover, in zebrafish melanoma, it refines the tumor-muscle interface into transcriptionally distinct subdomains and uncovers shared vascular signatures between anatomically separate tissues. Together, these results demonstrate that SAGE can be used not only for accurate spatial domain delineation across diverse ST platforms, but also for dissecting microenvironmental niches and long-range tissue interactions underlying disease progression.

Rights

© 2026 The Authors

This is an open access article under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

Data Availability

Article states: "Data sharing is not applicable to this article as no new data were createdor analyzed in this study."

Original Publication Citation

He, Y., Xu, Y., Ding, L., Li, H.-D., Li, Y., & Wang, S. (2026). SAGE: Spatially aware gene selection and dual-view embedding fusion for domain identification in spatial transcriptomics. Advance Science. Advance online publication. https://doi.org/10.1002/advs.202520333

ORCID

0000-0003-0178-1876 (Li)

advs73676-sup-0001-suppmat.pdf (17253 kB)
Supporting Information

Share

COinS