Document Type
Article
Publication Date
2024
DOI
10.3390/math12111623
Publication Title
Mathematics
Volume
12
Issue
11
Pages
1623 (1-15)
Abstract
Clustered data are a special type of correlated data where units within a cluster are correlated while units between different clusters are independent. The number of units in a cluster can be associated with that cluster’s outcome. This is called the informative cluster size (ICS), which is known to impact clustered data inference. However, when comparing the outcomes from multiple groups of units in clustered data, investigating ICS may not be enough. This is because the number of units belonging to a particular group in a cluster can be associated with the outcome from that group in that cluster, leading to an informative intra-cluster group size or IICGS. This phenomenon of IICGS can exist even in the absence of ICS. Ignoring the existence of IICGS can result in a biased inference for group-based outcome comparisons in clustered data. In this article, we mathematically formulate the concept of IICGS while distinguishing it from ICS and propose a nonparametric bootstrap-based statistical hypothesis-testing mechanism for testing any claim of IICGS in a clustered data setting. Through simulations and real data applications, we demonstrate that our proposed statistical testing method can accurately identify IICGS, with substantial power, in clustered data.
Rights
© 2024 by the authors.
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution 4.0 International (CC BY 4.0) License.
Data Availability
The Students’ Academic Success Data is available through https://archive.ics.uci.edu/dataset/697/predict+students+dropout+and+academic+success. The Rat Pup Data is available from the authors upon reasonable request.
Original Publication Citation
Senevirathne, H. K. W., & Dutta, S. (2024). Testing informativeness of covariate-induced group sizes in clustered data. Mathematics, 12(11), 1-15, Article 1623. https://doi.org/10.3390/math12111623
ORCID
0000-0001-8937-5148 (Senevirathne), 0000-0002-7211-2752 (Dutta)
Repository Citation
Wickrama Senevirathne, Hasika K. and Duttta, Sandipan, "Testing Informativeness of Covariate-Induced Group Sizes in Clustered Data" (2024). Mathematics & Statistics Faculty Publications. 256.
https://digitalcommons.odu.edu/mathstat_fac_pubs/256