Document Type

Article

Publication Date

2024

DOI

10.3390/math12111623

Publication Title

Mathematics

Volume

12

Issue

11

Pages

1623 (1-15)

Abstract

Clustered data are a special type of correlated data where units within a cluster are correlated while units between different clusters are independent. The number of units in a cluster can be associated with that cluster’s outcome. This is called the informative cluster size (ICS), which is known to impact clustered data inference. However, when comparing the outcomes from multiple groups of units in clustered data, investigating ICS may not be enough. This is because the number of units belonging to a particular group in a cluster can be associated with the outcome from that group in that cluster, leading to an informative intra-cluster group size or IICGS. This phenomenon of IICGS can exist even in the absence of ICS. Ignoring the existence of IICGS can result in a biased inference for group-based outcome comparisons in clustered data. In this article, we mathematically formulate the concept of IICGS while distinguishing it from ICS and propose a nonparametric bootstrap-based statistical hypothesis-testing mechanism for testing any claim of IICGS in a clustered data setting. Through simulations and real data applications, we demonstrate that our proposed statistical testing method can accurately identify IICGS, with substantial power, in clustered data.

Rights

© 2024 by the authors.

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution 4.0 International (CC BY 4.0) License.

Data Availability

The Students’ Academic Success Data is available through https://archive.ics.uci.edu/dataset/697/predict+students+dropout+and+academic+success. The Rat Pup Data is available from the authors upon reasonable request.

Original Publication Citation

Senevirathne, H. K. W., & Dutta, S. (2024). Testing informativeness of covariate-induced group sizes in clustered data. Mathematics, 12(11), 1-15, Article 1623. https://doi.org/10.3390/math12111623

ORCID

0000-0001-8937-5148 (Senevirathne), 0000-0002-7211-2752 (Dutta)

Share

COinS