Document Type

Conference Paper

Publication Date

2021

Publication Title

CEUR Workshop Proceedings: Proceedings of the Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Intelligence (AAAI 2021)

Volume

2831

Pages

16 (1-8)

Conference Name

Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Intelligence (AAAI 2021), February 9, 2021, Remote, Online

Abstract

Recent increases in the number of retractions of published papers reflect heightened attention and increased scrutiny in the scientific process motivated, in part, by the replication crisis. These trends motivate computational tools for understanding and assessment of the scholarly record. Here, we sketch the landscape of retracted papers in the Retraction Watch database, a collection of 19k records of published scholarly articles that have been retracted for various reasons (e.g., plagiarism, data error). Using metadata as well as features derived from full-text for a subset of retracted papers in the social and behavioral sciences, we develop a random forest classifier to predict retraction in new samples with 73% accuracy and F1-score of 71%. We believe this study to be the first of its kind to demonstrate the utility of machine learning as a tool for the assessment of retracted work.

Rights

© 2021 by the authors.

This paper is published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License.

Bibliographic data published under a CC0 1.0 Universal Public Domain Dedication.

Original Publication Citation

Modukuri, S. A., Rajtmajer, S., Squicciarini, A. C., Wu, J., & Giles, C. L. (2021). Understanding and predicting retractions of published work. CEUR Workshop Proceedings, 2831, 16. https://ceur-ws.org/Vol-2831/paper16.pdf

ORCID

0000-0003-0173-4463 (Wu)

Share

COinS