Document Type
Conference Paper
Publication Date
2025
DOI
10.18653/v1/2025.emnlp-main.670
Publication Title
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Pages
13259-13275
Conference Name
2025 Conference on Empirical Methods in Natural Language Processing, November 4-9, 2025, Suzhou, China
Abstract
Topic modeling is a powerful unsupervised tool for knowledge discovery. However, existing work struggles with generating limited-quality topics that are uninformative and incoherent, which hindering interpretable insights from managing textual data. In this paper, we improve the original variational autoencoder framework by incorporating contextual and graph information to address the above issues. First, the encoder utilizes topic fusion techniques to combine contextual and bag-of-words information well, and meanwhile exploits the constraints of topic alignment and topic sharpening to generate informative topics. Second, we develop a simple word co-occurrence graph information fusion strategy that efficiently increases topic coherence. On three benchmark datasets, our new framework generates more coherent and diverse topics compared to various baselines, and achieves strong performance on both automatic and manual evaluations.
Rights
© 2025 Association for Computational Linguistics.
Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International (CC BY 4.0) License.
Original Publication Citation
Liu, J., Yan, J., Zhu, C., Liu, X., Li, Q., & Rao, Y. (2025). Neural topic modeling via contextual and graph information fusion. In C. Christodoulopoulos, T. Chakraborty, C. Rose & V. Peng (Eds.), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (pp. 13259-13275). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.emnlp-main.670
Repository Citation
Liu, J., Yan, J., Zhu, C., Liu, X., Li, Q., & Rao, Y. (2025). Neural topic modeling via contextual and graph information fusion. In C. Christodoulopoulos, T. Chakraborty, C. Rose & V. Peng (Eds.), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (pp. 13259-13275). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.emnlp-main.670