Effects of ignoring clustered data structure in confirmatory factor analysis of ordered polytomous items: a simulation study based on PANSS

Jan Stochl, Peter B Jones, Jesus Perez, Golam M Khandaker, Jan R Böhnke, Tim J Croudace

Research output: Contribution to journalArticlepeer-review


Statistical theory indicates that hierarchical clustering by interviewers or raters needs to be considered to avoid incorrect inferences when performing any analyses including regression, factor analysis (FA) or item response theory (IRT) modelling of binary or ordinal data. We use simulated Positive and Negative Syndrome Scale (PANSS) data to show the consequences (in terms of bias, variance and mean square error) of using an analysis ignoring clustering on confirmatory factor analysis (CFA) estimates. Our investigation includes the performance of different estimators, such as maximum likelihood, weighted least squares and Markov Chain Monte Carlo (MCMC). Our simulation results suggest that ignoring clustering may lead to serious bias of the estimated factor loadings, item thresholds, and corresponding standard errors in CFAs for ordinal item response data typical of that commonly encountered in psychiatric research. In addition, fit indices tend to show a poor fit for the hypothesized structural model. MCMC estimation may be more robust against clustering than maximum likelihood and weighted least squares approaches but further investigation of these issues is warranted in future simulation studies of other datasets. Copyright Copyright © 2015 John Wiley & Sons, Ltd.

Original languageEnglish
JournalInternational Journal of Methods in Psychiatric Research
Early online date20 Jun 2015
Publication statusPublished - 2015

Bibliographical note

Copyright © 2015 John Wiley & Sons, Ltd.

Cite this