ijsmr logo-pdf 1349088093

Comparison of Methods for Clustered Data Analysis in a Non-Ideal Situation: Results from an Evaluation ofPredictors of Yellow Fever Vaccine Refusal in the Global TravEpiNet(GTEN) Consortium
Pages 215-223
Sowmya R. Rao, Regina C. LaRocque, Emily S.Jentes, Stefan H.F. Hagmann, Edward T. Ryan, PaulineV. Han, DavidG. Kleinbaum and Global TravEpiNet Consortium
DOI:
http://dx.doi.org/10.6000/1929-6029.2014.03.03.1
Published: 05 August 2014


Abstract: Not accounting for clustering in data from multiple centersmight yield biased estimates and their standard errors, potentially leading to incorrect inferences.We fit 15 different models with different correlation structures and with/without adjustment for small clusters, including unadjusted logistic regression, Population-averaged models (Generalized Estimating Equations), Cluster-specific models (linear and non-linear with random intercept)and Survey data analysis methodsto study the association of variables with the probability of declining yellow fever vaccine among patients seeking pre-travel health consultations at 18 US practices in the Global TravEpiNet Consortium from1 January, 2009,to6 June, 2012. Results varied by the method chosen. Generally, when the odds ratio estimates were similar, adjusting for clustering and the small number of clinics increased the standard errors. We chose the random intercept model with the Morel, Bokossa and Neerchal (MBN) adjustment to be the most preferable method for the GTEN dataset since this was one of the more conservative models that accounted for clustering, small sample sizes and also the random effect due to site.Investigators should not ignore clusteringand consider the appropriate adjustments necessary for their studies.

Keywords: Clustering, cluster size, cluster imbalance, data analysis.
Download Full Article
Submit to FacebookSubmit to TwitterSubmit to LinkedIn