Document Type
Conference Paper
Publication Date
2026
DOI
10.1145/3786304.3788850
Publication Title
CHIIR '26: Proceedings of the 2026 Conference on Human Information Interaction and Retrieval
Pages
285-294
Conference Name
CHIIR '26: 2026 ACM SIGIR Conference on Human Information Interaction and Retrieval, March 22-26, 2026, Seattle USA
Abstract
Online discussions have become integral to how people exchange ideas, form opinions, and participate in collective deliberation. While sighted users can comfortably engage with online discussions, blind users who are dependent on screen readers are forced to listen to long threads narrated in a single, monotonic voice that lacks prosodic variation, rhythm, or emotion. This robotic auditory experience not only deteriorates the user engagement with the content but also increases cognitive strain, by making it difficult to remain attentive and discern meaning beyond literal words. In an interview study, most blind participants reported that monotonous narration hindered their ability to detect salient information, perceive emotional cues, and comprehend content authors’ intents in discussions. Many described experiencing mental fatigue when listening to ‘flat’, ‘uninspiring’ voices, noting that their attention tended to diminish quickly over time. The participants also indicated that they often tried to ‘add’ prosodic variation or emotional inflection themselves in their minds, but characterized this compensatory effort as mentally taxing and cognitively demanding. To address this issue, we introduce VoxVista, a multi-voice design framework driven by a large language model that leverages a custom voice-preference dataset to assign personalized voice profiles to user posts in discussions, thereby replacing the traditional monotone narration in screen readers with a more expressive, dynamic, and contextually-aware narration. In a study with 20 blind participants, we observed that VoxVista significantly improved user engagement, comprehension, and willingness to continue listening to longer discussions.
Rights
© 2026 Copyright held by the owner/authors.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Original Publication Citation
Prakash, Y., Nayak, A. K., Alyaan, M. S., Jayarathna, S., Lee, H.-N., & Ashok, V. (2026). VoxVista: Enhancing screen reading experience for online user comments. In C. Shah, R. W. White, A. Fourney, C. T. Lopes, & J. Trippas (Eds), CHIIR '26: Proceedings of the 2026 Conference on Human Information Interaction and Retrieval (pp. 285-294). Association for Computing Machinery. https://doi.org/10.1145/3786304.3788850
Repository Citation
Prakash, Y., Nayak, A. K., Alyaan, M. S., Jayarathna, S., Lee, H.-N., & Ashok, V. (2026). VoxVista: Enhancing screen reading experience for online user comments. In C. Shah, R. W. White, A. Fourney, C. T. Lopes, & J. Trippas (Eds), CHIIR '26: Proceedings of the 2026 Conference on Human Information Interaction and Retrieval (pp. 285-294). Association for Computing Machinery. https://doi.org/10.1145/3786304.3788850
ORCID
0000-0001-8593-327X (Prakash), 0009-0000-9992-9706 (Nayak), 0009-0002-7353-4698 (Alyaan), 0000-0002-4879-7309 (Jayarathna), 0000-0002-4772-1265 (Ashok)