Applied Psychological Measurement, Ahead of Print.
Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit indices (e.g., Yen’s Q1, McKinley and Mill’s G2, Orlando and Thissen’s S-X2 and S-G2) are the most widely used statistics to assess item-level fit. However, the role of total scores with complete data used in S-X2 and S-G2 is different from that with incomplete data. As a result, S-X2 and S-G2 cannot handle incomplete data directly. To this end, we propose several modified versions of S-X2 and S-G2 to evaluate item-level fit when response data are incomplete, named as Mimpute-X2 and Mimpute-G2, of which the subscript “impute” denotes different imputation methods. Instead of using observed total scores for grouping, the new indices rely on imputed total scores by either a single imputation method or three multiple imputation methods (i.e., two-way with normally distributed errors, corrected item-mean substitution with normally distributed errors and response function imputation). The new indices are equivalent to S-X2 and S-G2 when response data are complete. Their performances are evaluated and compared via simulation studies; the manipulated factors include test length, sources of misfit, misfit proportion, and missing proportion. The results from simulation studies are consistent with those of Orlando and Thissen (2000, 2003), and different indices are recommended under different conditions.