Matthias von Davier is a psychometrician, academic, inventor, and author. He is the executive director of the TIMSS & PIRLS International Study Center at the Lynch School of Education and Human Development and the J. Donald Monan, S.J., University Professor in Education at Boston College.[1]

Matthias von Davier
Occupation(s): Psychometrician, academic, inventor, and author
Awards: ETS Scientist Award, Educational Testing Service (2006)
Bradley Hanson Award for Contributions to Educational Measurement, National Council on Measurement in Education (2012)
Award for Significant Contribution to Educational Measurement and Research Methodology, American Educational Research Association (AERA) (2017)
Academic background
Education: Masters in Psychology
Dr. rer. nat.
Alma mater: Kiel University
Academic work
Institutions: TIMSS & PIRLS International Study Center, Lynch School of Education and Human Development, Boston College

von Davier's research focuses on developing advanced psychometric models and methodologies for analyzing complex educational and survey data. He has authored or co-authored more than 130 research articles, chapters, and research reports, along with six books, including Advancing Human Assessment, part of the series Methodology of Educational Measurement and Assessment, of which he is a co-editor.[2] He has received several awards, including the 2006 ETS Research Scientist award,[3] the 2012 National Council on Measurement in Education (NCME) Brad Hanson Award for Contributions to Educational Measurement,[4] and the AERA Division-D 2017 Award for Significant Contribution to Measurement and Research Methodology for his book Handbook of International Large-Scale Assessment.[5]

von Davier has been a Fellow of the American Educational Research Association (AERA) since 2021[6] and an elected member of the National Academy of Education since 2022.[7] He has served as the editor of two leading scientific journals, the British Journal of Mathematical and Statistical Psychology and Psychometrika,[8] and was one of the two founding editors of the Springer journal Large-Scale Assessments in Education, which is a joint publication of the IEA and ETS.[9] He has also been an invited or keynote speaker at the Anne Anastasi lecture at Fordham University,[10] the 9th IEA International Research Conference in Dubai,[11] the Cross Straights Conference on Educational Measurement in Nanchang, China, the International Meeting of the Psychometric Society, the University of Connecticut,[12] Ludwig Maximilian University of Munich,[13] and the Organisation for Economic Co-operation and Development.[14]

Education and Early Career

von Davier obtained a master's degree in psychology with honors from the Faculty of Mathematics and Science (Mathematisch-Naturwissenschaftliche Fakultät) at Kiel University (CAU) in 1993. He then completed a doctoral degree (Dr. rer. nat.) in psychology at the same faculty in 1996.[15]

von Davier's career began as an Assistant Research Scientist at the Institute for Science Education (IPN) at Kiel University. He was then awarded a Postdoctoral Fellowship at Educational Testing Service (ETS) in Princeton, NJ, where he developed item fit measures for complex IRT models. He served as a Research Scientist in the Center for Global Assessment at ETS, Princeton, from November 2000 to April 2004.[16]

Career

In 2004, von Davier became a Senior Research Scientist at the Center for Global Assessment in Princeton, where he led initiatives focused on evaluating outcomes-based models. In subsequent roles within the Educational Testing Service (ETS), he served as a Senior Research Scientist in 2007 while also acting as Technical Director for the National Assessment of Educational Progress (NAEP) Task Order Component and managing the Virtual Research Laboratory at the ETS/IEA Research Institute. Among other professional appointments, he became Principal Research Scientist in May 2007.[17]

von Davier was appointed Director of Research at the Center for Global Assessment in June 2011, overseeing international survey assessment research and co-leading the ETS Research Initiative. From September 2013, he served as co-director of the Center for Global Assessment, and from October 2014 he concurrently held the position of Senior Research Director. In January 2017, he assumed the position of Distinguished Research Scientist at the National Board of Medical Examiners (NBME) in Philadelphia.[18] He has been the executive director of the TIMSS & PIRLS International Study Center in the Lynch School of Education and Human Development at Boston College since September 2020, alongside his role as the J. Donald Monan, S.J., University Professor in Education at the same institution.[19]

Methodological Research

von Davier's areas of study include item response theory (IRT), latent class analysis, and diagnostic classification models, with a broader emphasis on classification and mixture distribution models, computational statistics, person-fit, item-fit, model checking, and hierarchical model extensions for categorical data analysis.[20]

von Davier's quantitative methodological research on psychometric methods has resulted in several patents.[21][22][23][24]

Contributions to Psychometric Theory

von Davier's work in psychometrics has centered on model development, model fit, and estimation methods, including parallel computation and estimation of latent variable models in complex data collection designs. Key examples include his contributions to extensions of the Rasch model, such as conditional maximum likelihood estimation of various polytomous Rasch models, extensions of mixture distribution Rasch models, and polytomous HYBRID models.[25] He has worked on fit assessment in latent variable models, encompassing person, item, and model fit assessment. Among other contributions, the General Diagnostic Model[26] is considered a flexible diagnostic classification model for both binary and polytomous data, as well as for binary and polytomous ordinal attributes. His work also includes the Parallel-E Parallel-M algorithm.[27] A fundamental result on intelligence testing practices using discontinue rules was derived by von Davier and collaborators.[28]

He has also developed models that integrate information on achievement, non-response, and process data, including extensions of the speed-accuracy model.[29][30] Additionally, his research has examined the use of artificial intelligence in automated item generation and automated scoring.[31][32][33]

Applied Research

von Davier's applied research has focused on utilizing psychometric methods in international large-scale assessment. In his roles at ETS and Boston College, he led the psychometric work on transitioning PIAAC 2012, PISA 2015, TIMSS 2019, and PIRLS 2021 from a paper-based to a computer-based trendline, using mode effect models with data from studies designed to align results from paper-based and computer-based assessments.[34]

Another line of his research has concerned the more general issue of linking in large-scale educational assessments.[35][36]

A third line of von Davier's research has addressed response styles and the correction of survey response bias in self-reports. The applications range from mixture models for personality data[37][38] to the pitfalls of attempts to correct response bias by anchoring vignettes.[39] More recently, his research has focused on the use of process data in assessment to improve achievement estimation and contextualize assessment results.[29][30][40]

Publications

von Davier has authored or co-authored over 150 publications in peer-reviewed journals, edited books, monographs, and research report series. His h-index is 52. He has co-edited several books on topics ranging from latent variable models in psychometrics to international large-scale assessments and natural language processing (NLP) in assessment.[41]

von Davier's first book, Multivariate and Mixture Distribution Rasch Models: Extensions and Applications, explored advanced applications and extensions of the Rasch model across various disciplines, including education, psychology, and health sciences. Allan S. Cohen commented in the Journal of the American Statistical Association, "This book, published in honor of the retirement of Jürgen Rost, is an edited volume of 22 invited chapters written by eminent researchers in the field of item response theory (IRT)."[42] His next book, The Role of International Large-Scale Assessments: Perspectives from Technology, Economy, and Educational Research, published in 2012, discussed the significance of large-scale international assessments as catalysts for change in understanding the role of human capital distribution, impacting policy, education, and research. In 2013, he co-edited the Handbook of International Large-Scale Assessment: Background, Technical Issues, and Methods of Data Analysis with Leslie Rutkowski and David Rutkowski, which explored the methodology, technical details, and policy implications of International Large-Scale Assessments (ILSA) in education. Terry Ackerman remarked, "This book is an excellent resource and guide to international large-scale assessments or ILSAs. The three editors have done an excellent job identifying a group of prominent scholars whose expertise ranges from international testing and behavioral statistics to educational policy."[43]

Alongside Randy E. Bennett, von Davier published Advancing Human Assessment: The Methodological, Psychological and Policy Contributions of ETS in 2017, detailing the advancements in human assessment made by ETS, covering measurement and statistics, education policy, psychology, and the development of widely used educational surveys and methodologies. Building upon this exploration of assessment methodologies, he co-edited Advancing Natural Language Processing in Educational Assessment in 2023 with Victoria Yaneva, which looked into the implementation, benefits, and challenges of using NLP in educational testing and assessment. In addition, his book, Handbook of Diagnostic Classification Models: Models and Model Extensions, Applications, Software Packages, provided an overview of diagnostic classification models (DCMs), discussing their development, application, and advantages in offering detailed evaluations of test taker performance across multiple skill domains compared to traditional assessment models. Yu Bao reviewed the book and stated, "The Handbook of Diagnostic Classification Models serves as a reference book that consists of a comprehensive collection of the majority of research topics and a summary of the influential publications within recent decades."[44]

In his highly cited studies, von Davier described practices researchers can use for analyzing and reporting data from large-scale international assessments, addressing common issues and statistical complexities to ensure unbiased results.[45] He emphasized the importance of correctly using plausible values in large-scale survey data analysis to avoid biased estimates and underscored the need to follow established procedures and guidelines.[46] Additionally, he presented a diagnostic model for multidimensional skill profiles using maximum likelihood techniques, demonstrated its application with simulated and real data,[47] and introduced general diagnostic models (GDMs) for estimating skill profiles, suitable for polytomous data and missing responses, with a focus on TOEFL Internet-based testing (iBT) field test data.[48] In related research, he showed that the G-DINA and LCDM approaches to diagnostic modeling are special cases of the GDM.[49] Some of his later research has focused on large language models, recurrent neural networks, and other so-called AI methods and how they can be used in automated item generation, automated scoring, and other applications in large-scale educational assessment.[31][32][33]

Awards and honors

  • 2006 – ETS Scientist Award, ETS[3]
  • 2012 – Bradley Hanson Award for Contributions to Educational Measurement, NCME[4]
  • 2017 – Award for Significant Contribution to Educational Measurement and Research Methodology, AERA[5]

Bibliography

Books

  • Multivariate and Mixture Distribution Rasch Models: Extensions and Applications (2007) ISBN 978-0387329161
  • The Role of International Large-Scale Assessments: Perspectives from Technology, Economy, and Educational Research (2012) ISBN 978-9400797116
  • Handbook of International Large-Scale Assessment: Background, Technical Issues, and Methods of Data Analysis (2013) ISBN 978-1439895122
  • Advancing Human Assessment: The Methodological, Psychological and Policy Contributions of ETS (2017) ISBN 978-3319586878
  • Handbook of Diagnostic Classification Models: Models and Model Extensions, Applications, Software Packages (2019) ISBN 978-3030055837
  • Advancing Natural Language Processing in Educational Assessment (2023) ISBN 978-1032244525

References

  1. ^ "Our Staff - Matthias von Davier". timssandpirls.bc.edu.
  2. ^ "Methodology of Educational Measurement and Assessment". Springer.
  3. ^ a b "Presenters, Moderators, and Discussants' Biographies" (PDF).
  4. ^ a b "Awards - NCME". www.ncme.org.
  5. ^ a b "Awards". www.aera.net.
  6. ^ "2021 AERA Fellows". www.aera.net.
  7. ^ "Matthias von Davier Newly Elected Member of the National Academy of Education | IEA.nl". www.iea.nl.
  8. ^ "New Executive Editor for Psychometrika". Psychometric Society. July 24, 2023.
  9. ^ "Large-scale Assessments in Education". SpringerOpen.
  10. ^ "Anastasi Lecture 2022 | Fordham". www.fordham.edu.
  11. ^ "Conference Program | IEA.nl". www.iea.nl.
  12. ^ Yelie, Yuan (April 17, 2023). "Interdisciplinary Seminar: Matthias von Davier, Boston College | Department of Statistics".
  13. ^ "Guest lecture: Dr. Matthias von Davier - Munich Center of the Learning Sciences - LMU Munich". www.en.mcls.uni-muenchen.de.
  14. ^ "PIAAC Methodological Seminar" (PDF).
  15. ^ "Bio". September 2, 2016.
  16. ^ "Matthias Von Davier | National Education Policy Center". nepc.colorado.edu.
  17. ^ "NEPS > Project Overview > Advisory Experts > Matthias von Davier". www.neps-data.de.
  18. ^ "Our Staff - Matthias von Davier". timssandpirls.bc.edu.
  19. ^ "Meet Matthias von Davier - Lynch School of Education and Human Development". Boston College.
  20. ^ "Matthias von Davier - Lynch School of Education and Human Development". Boston College.
  21. ^ "Parallel computing for data analysis using generalized latent variable models".
  22. ^ "Systems and methods for evaluating multilingual text sequences".
  23. ^ "Mixture general diagnostic model".
  24. ^ "System and Method for Large Scale Survey Analysis". www.ets.org.
  25. ^ von Davier, Matthias; Rost, Jürgen (June 11, 1995). Fischer, Gerhard H.; Molenaar, Ivo W. (eds.). Rasch Models: Foundations, Recent Developments, and Applications. Springer. pp. 371–379. doi:10.1007/978-1-4612-4230-7_20 – via Springer Link.
  26. ^ von Davier, Matthias (February 11, 2014). "The DINA model as a constrained general diagnostic model: Two variants of a model equivalency". British Journal of Mathematical and Statistical Psychology. 67 (1): 49–71. doi:10.1111/bmsp.12003. PMID 23297749 – via CrossRef.
  27. ^ von Davier, Matthias (December 11, 2016). "High-Performance Psychometrics: The Parallel-E Parallel-M Algorithm for Generalized Latent Variable Models". ETS Research Report Series. 2016 (2): 1–11. doi:10.1002/ets2.12120 – via CrossRef.
  28. ^ von Davier, Matthias; Cho, Youngmi; Pan, Tianshu (March 2019). "Effects of Discontinue Rules on Psychometric Properties of Test Scores". Psychometrika. 84 (1): 147–163. doi:10.1007/s11336-018-09652-3. PMID 30607661.
  29. ^ a b Ulitzsch, Esther; von Davier, Matthias; Pohl, Steffi (November 11, 2020). "A hierarchical latent response model for inferences about examinee engagement in terms of guessing and item-level non-response". British Journal of Mathematical and Statistical Psychology. 73 (S1): 83–112. doi:10.1111/bmsp.12188 – via CrossRef.
  30. ^ a b Leng, Dihao; Bezirhan, Ummugul; Khorramdel, Lale; Fishbein, Bethany; von Davier, Matthias (April 24, 2024). "Examining Gender Differences in TIMSS 2019 Using a Multiple-Group Hierarchical Speed-Accuracy-Revisits Model". Educational Measurement: Issues and Practice. doi:10.1111/emip.12606.
  31. ^ a b von Davier, Matthias (December 1, 2018). "Automated Item Generation with Recurrent Neural Networks". Psychometrika. 83 (4): 847–857. doi:10.1007/s11336-018-9608-y. PMID 29532403 – via Springer Link.
  32. ^ a b Bezirhan, Ummugul; von Davier, Matthias (January 2023). "Automated reading passage generation with OpenAI's large language model". Computers and Education: Artificial Intelligence. 5. arXiv:2304.04616. doi:10.1016/j.caeai.2023.100161.
  33. ^ a b Jung, Ji Yoon; Tyack, Lillian; von Davier, Matthias (April 8, 2024). "Combining machine translation and automated scoring in international large-scale assessments". Large-scale Assessments in Education. 12 (1): 10. doi:10.1186/s40536-024-00199-7.
  34. ^ von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen (December 11, 2019). "Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities". Journal of Educational and Behavioral Statistics. 44 (6): 671–705. doi:10.3102/1076998619881789 – via CrossRef.
  35. ^ von Davier, Matthias; Yamamoto, Kentaro; Shin, Hyo Jeong; Chen, Henry; Khorramdel, Lale; Weeks, Jon; Davis, Scott; Kong, Nan; Kandathil, Mat (July 4, 2019). "Evaluating item response theory linking and model fit for data from PISA 2000–2012". Assessment in Education: Principles, Policy & Practice. 26 (4): 466–488. doi:10.1080/0969594X.2019.1586642 – via CrossRef.
  36. ^ "PISA 2022 Technical Report" (PDF).
  37. ^ "Applying the Mixed Rasch Model to Personality Questionnaires".
  38. ^ von Davier, Matthias; Naemi, Bobby; Roberts, Richard D. (October 11, 2012). "Factorial Versus Typological Models: A Comparison of Methods for Personality Data". Measurement: Interdisciplinary Research & Perspective. 10 (4): 185–208. doi:10.1080/15366367.2012.732798 – via CrossRef.
  39. ^ von Davier, Matthias; Shin, Hyo-Jeong; Khorramdel, Lale; Stankov, Lazar (June 11, 2018). "The Effects of Vignette Scoring on Reliability and Validity of Self-Reports". Applied Psychological Measurement. 42 (4): 291–306. doi:10.1177/0146621617730389. PMC 5978608. PMID 29881126.
  40. ^ Pohl, Steffi; Ulitzsch, Esther; von Davier, Matthias (April 23, 2021). "Reframing rankings in educational assessments". Science. 372 (6540): 338–340. Bibcode:2021Sci...372..338P. doi:10.1126/science.abd3300. PMID 33888624 – via CrossRef.
  41. ^ "Matthias von Davier". scholar.google.com.
  42. ^ Bradstreet, Thomas E.; Cohen, Allan S.; Anderson-Cook, Christine M.; Cook, John R.; Robinson, Timothy J.; Cavanaugh, Joseph; Embrechts, Paul; Oleson, Jacob J. (2008). "Telegraphic Reviews". Journal of the American Statistical Association. 103 (481): 433–436. doi:10.1198/jasa.2008.s227. JSTOR 27640065 – via JSTOR.
  43. ^ Ackerman, Terry (July 3, 2015). "Rutkowski, L., von Davier, M., & Rutkowski, D. (Eds.). (2009). Handbook of International Large-Scale Assessment: Background, Technical Issues, and Methods of Data Analysis. New York, NY: CRC Press". International Journal of Testing. 15 (3): 274–289. doi:10.1080/15305058.2015.1034867 – via CrossRef.
  44. ^ Bao, Yu; Mireles, Nicolas Emundo (October 2, 2023). "Handbook of Diagnostic Classification Models: Models and Model Extensions, Applications, Software Packages, by Matthias von Davier, Young-Sun Lee, New York, United States, Springer, 2019, 656 pp., ISBN: 978-3-030-05583-7". Measurement: Interdisciplinary Research and Perspectives. 21 (4): 282–285. doi:10.1080/15366367.2022.2159686 – via CrossRef.
  45. ^ Rutkowski, Leslie; Gonzalez, Eugenio; Joncas, Marc; von Davier, Matthias (March 11, 2010). "International Large-Scale Assessment Data: Issues in Secondary Analysis and Reporting". Educational Researcher. 39 (2): 142–151. doi:10.3102/0013189X10363170 – via CrossRef.
  46. ^ von Davier, Matthias; Gonzalez, Eugenio; Mislevy, Robert (January 30, 2009). "What are plausible values and why are they useful?" (PDF). ETS Research Report Series.
  47. ^ von Davier, Matthias (November 11, 2008). "A general diagnostic model applied to language testing data". British Journal of Mathematical and Statistical Psychology. 61 (2): 287–307. doi:10.1348/000711007X193957. PMID 17535481 – via CrossRef.
  48. ^ von Davier, Matthias (December 11, 2005). "A General Diagnostic Model Applied to Language Testing Data". ETS Research Report Series. 2005 (2): i–35. doi:10.1002/j.2333-8504.2005.tb01993.x – via CrossRef.
  49. ^ von Davier, Matthias (December 11, 2014). "The Log-Linear Cognitive Diagnostic Model (LCDM) as a Special Case of the General Diagnostic Model (GDM)". ETS Research Report Series. 2014 (2): 1–13. doi:10.1002/ets2.12043 – via CrossRef.