Research Examples

Key paper describing the IRW

  • Domingue B, Braginsky M, Caffrey-Maffei L, Gilbert JB, Kanopka K, Kapoor R, Lee H, Liu Y, Nadela S, Pan G, Zhang L, Zhang S, Frank MC. (2025). An introduction to the Item Response Warehouse (IRW): A resource for enhancing data usage in psychometrics. Behavior Research Methods.

Psychometrics research that uses data from the IRW (including non-public portions of the repository)

  • Ahmed I, Bertling M, Zhang L, Ho A, Loyalka P, Xue H, Rozelle S, Domingue B. (2024). Heterogeneity of item-treatment interactions masks complexity and generalizability in randomized controlled trials. Journal of Research on Educational Effectiveness.

  • Domingue B, Kanopka K, Kapoor R, Pohl S, Chalmers P, Rahal C, Rhemtulla M. (2024). The InterModel Vigorish as a lens for understanding (and quantifying) the value of item response models for dichotomously coded items. Psychometrika.

  • Domingue B, Kanopka K, Stenhaug B, Sulik MJ, Beverly T, Brinkhuis M, Circi R, Faul J, Liao D, McCandliss B, Obradović J, Piech C, Porter T, Soland J, Weeks J, Wise S, Yeatman J. (2022). Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings. Journal of Educational and Behavioral Statistics, 47(5), 576-602.

  • Gilbert J, Himmelsbach Z, Soland J, Joshi M, Domingue B. (2025). Estimating Heterogeneous Treatment Effects with Item-Level Outcome Data: Insights from Item Response Theory. Journal of Policy Analysis and Management.

  • Gilbert J, Domingue B, Kim J. (2025). Estimating Causal Effects on Psychological Networks Using Item Response Theory. Psychological Methods.

  • Gilbert J. (2025). How Measurement Affects Causal Inference: Attenuation Bias Is (Usually) More Important Than Outcome Scoring Weights. Methodology.

  • Gilbert J, Young W, Himmelsbach Z, Ulitzsch E, Domingue B. (2025). Conditional Dependencies Between Response Time and Item Discrimination: An Item-Level Meta-Analysis. PsyArXiv.

  • Gilbert J, Himmelsbach Z, Miratrix L, Ho AD, Domingue B. (2025). Item-Level Heterogeneity in Value Added Models: Implications for Reliability, Cross-Study Comparability, and Effect Sizes. EdWorkingPaper.

  • Gilbert J, Soland J, Domingue B. (2025). The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper.

  • Ma WA, Liu Y, Kanopka K, Ma W, Domingue B. (2025). A comparison of the predictive performance of continuous and class-based latent trait models. PsyArXiv.

  • Nalbandyan R, Gilbert JB, Franco VR, Domingue BW. (2024). Signposts on the Path from Nominal to Ordinal Scales. PsyArXiv. doi: 10.31234/osf.io/zbv8f.

  • Zhang L, Liu Y, Molenaar D, Domingue B. (2025). Realistic Simulation of Item Difficulties. PsyArXiv. doi: 10.31234/osf.io/jbhxy.