EN
TR
Investigation of Measurement Precision and Test Length in Computerized Adaptive Testing Under Different Conditions / Bilgisayar Ortamında Bireye Uyarlanmış Test Uygulamalarında Ölçme Kesinliğinin ve Test Uzunluğunun Farklı Koşullar Altında İncelenmesi
Abstract
Computerized Adaptive Tests (CAT) are gaining much more attention than ever by the institutions especially the ones attracting students worldwide due to the nature of CAT not allowing the same items to be presented to different individuals taking the test. In this study, it was aimed to investigate of measurement precision and test length in computerized adaptive testing (CAT) under different conditions. The research was implemented as a Monte Carlo simulation study. In line with the purpose of the study, 500 items which response probabilities were modeled with the three parameter logistic (3PL) model were generated. Fixed length (15,20), standard error (SE<.30, SE<.50) termination rules have been used for the study. Additionally, in comparing termination rules, different starting rules (θ=0,-1<θ<1), ability estimation methods (Maksimum Likelihood Estimation (MLE) ,Expected a Posteriori (EAP) and Maximum a Posteriori Probability (MAP)), item selection method (Kullback Leibler Information (KLI) and Maximum Fischer Information (MFI)) have been selected since these are critical in the algorithms of CAT. 25 replications was performed for each condition in the generated data. The results obtained from study were evaluated by using RMSE, bias and fidelity values criterions. R software was used for data generation and analyses. As a result of the study, it was seen that choosing the test starting rule as θ=0 or -1<θ<1 did not cause a significant difference in terms of measurement precision and test length. It was concluded that the termination rule, in which RMSE and bias values were lower than the other conditions, was the 0.30 SE termination rule. When the EAP ability estimation method was used, lower RMSE and bias values were obtained compared to the MLE. It was concluded that the KLI item selection method had lower RMSE and bias values compared to the MFI.
Keywords
Kaynakça
- Babcock, B. & Weiss, D. J. (2009). Termination criteria in computerized adaptive tests: variable-length cats are not biased. Paper presented at The 2009 Conference on Computerized Adaptive Testing, Minnesota, USA.https://www.researchgate.net/publication/262674764_Termination_Criteria_in_Computerized_Adaptive_Tests_Do_Variable-Length_CATs_Provide_Efficient_and_Effective_Measurement
- Babcock, B. ve Weiss, D. J. (2012). Termination criteria in computerized adaptive tests: do variable-length CATs provide efficient and effective measurement? Journal of Computerized Adaptive Testing, 1(1), 1–18. https://doi.org/10.7333/1212-0101001
- Baker, F.B. & Kim, S.H. (2004). Item response theory: Parameter estimation techniques. Marcel Bekker Inc.
- Birnbaum, A. (1968). Some latent trait models and their use in inferring an examaninee’s ability. In Lord, F.M. & Novick, M.R. (Eds.) Statistical theories of mental test scores (pp. 397-479) . Addison-Wesley.
- Blais, J.& Raiche, G. (2002). Features of the sampling distribution of the ability estimate in computerized adaptive testing according to two stopping rules. Paper presented at The International Objective Measurement Workshop International Objective Measurement Workshop, New Orleans, USA. https://pubmed.ncbi.nlm.nih.gov/21164229/
- Blais, J. & Raiche, G. (2010). Features of the sampling distribution of the ability estimate in Computerized Adaptive Testing according to two stopping rules, Journal of Applied Measurement, 11(4), 424-31. https://www.researchgate.net/publication/49689146
- Bock, R. D. & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443–459. https://link.springer.com/article/10.1007/BF02293801
- Bock, R. D. & Mislevy, R. J. (1982). Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement, 6(4), 431– 444. https://doi.org/10.1177/014662168200600405
Ayrıntılar
Birincil Dil
Türkçe
Konular
Eğitim Üzerine Çalışmalar
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
28 Şubat 2022
Gönderilme Tarihi
13 Kasım 2021
Kabul Tarihi
26 Ocak 2022
Yayımlandığı Sayı
Yıl 1970 Cilt: 13 Sayı: 1
APA
Balta, E., & Uçar, A. (2022). Bilgisayar Ortamında Bireye Uyarlanmış Test Uygulamalarında Ölçme Kesinliğinin ve Test Uzunluğunun Farklı Koşullar Altında İncelenmesi / Investigation of Measurement Precision and Test Length in Computerized Adaptive Testing Under Different Conditions. e-Uluslararası Eğitim Araştırmaları Dergisi, 13(1), 51-68. https://doi.org/10.19160/e-ijer.1023098
Cited By
The Effects of Different Item Selection Methods on Test Information and Test Efficiency in Computer Adaptive Testing
Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi
https://doi.org/10.21031/epod.1140757Detection of aberrant testing behaviour in unproctored CAT via a verification test
International Journal of Assessment Tools in Education
https://doi.org/10.21449/ijate.1598330