Document Type
Article
Publication Date
2025
DOI
10.1109/ACCESS.2025.3600146
Publication Title
IEEE Access
Volume
13
Pages
147002-147033
Abstract
Large Language Models (LLMs) play an increasingly integrated and pivotal role in generating diverse types of texts, such as social media messages, emails, narratives, and technical reports, among other textual communication forms. As AI-generated messaging filters into human communication, a systematic exploration of their effectiveness for mimicking human-like communication of life events is needed. In this study, we employ a zero-shot structured narrative prompt to generate 24,000 life event messages for birth, death, hiring, and firing events using OpenAI's GPT-4. From this dataset, we manually classify 2880 messages and evaluate their validity in conveying these life events through the form of X (formerly Twitter) posts. Human evaluators found that 87.43% of the sampled messages (n = 2880) sufficiently met the intentions of their structured prompts based on their interpretations of the prompt and the corresponding AI-generated message. To automate the identification of valid and invalid messages, we train and validate nine Machine Learning models (ML) on the classified datasets. Leveraging an ensemble of these nine models, we extend our analysis to predict the classifications of the remaining 21,120 untagged messages. Finally, we manually tag 1% of the messages with predicted classifications and attain 90.57% accuracy (sd = 29.3%, n = 212) across the four life event types. The ML models excelled at classifying valid messages as valid, but experienced challenges at simultaneously classifying invalid messages as invalid. Our findings advance the study of LLM capabilities, limitations, and validity while offering practical insights for message generation and natural language processing applications.
Rights
© 2025 The Authors.
Published under the terms of a Creative Commons Attribution 4.0 International (CC BY 4.0) License.
Original Publication Citation
Lynch, C. J., Jensen, E., Gore, R., Zamponi, V., O'Brien, K., Feldhaus, B., Smith, K., Martinez, J., Munro, M. H., Ozkose, T. E., Gundogdu, T. B., Reinhold, A. M., Kavak, H., & Ezell, B. (2025). AI-generated messaging for life events using structured prompts: A comparative study of GPT with human experts and machine learning. IEEE Access, 13, 147002-147033. https://doi.org/10.1109/ACCESS.2025.3600146
ORCID
0000-0002-4830-7488 (Lynch), 0000-0002-5026-4501 (Smith), 0000-0002-2244-8274 (Martinez), 0000-0003-4274-908X (Ezell)
Repository Citation
Lynch, Christopher; Jensen, Erik; Gore, Ross; Zamponi, Virginia; O'Brien, Kevin; Feldhaus, Brandon; Smith, Katherine; Martínez, Joseph; Munro, Madison H.; Ozkose, Timur E.; Gundogdu, Tugce B.; Reinhold, Ann Marie; Kavak, Hamdi; and Ezell, Barry, "AI-Generated Messaging for Life Events Using Structured Prompts: A Comparative Study of GPT With Human Experts and Machine Learning" (2025). VMASC Publications. 142.
https://digitalcommons.odu.edu/vmasc_pubs/142
Included in
Artificial Intelligence and Robotics Commons, Electrical and Computer Engineering Commons
Comments
This work was supported in part by the Office of Enterprise Research and Innovation at Old Dominion University under Grant 300916-010, in part by the Department of Education Modeling and Simulation Program awarded to Old Dominion University’s Virginia Modeling Analysis and Simulation Center under Grant P116S210003, in part by the Commonwealth Cyber Initiative (CCI) Fellowship under Grant H-2Q24-016, in part by the Office of Naval Research under Grant N00014-19-1-2624, and in part by the Air Force Office of Scientific Research through the Minerva Research Initiative under Grant 22RT0286.