Processing and Visualization of Test-Results Data
Keywords:
data processing; data visualization; computer aided assessment
Abstract
A data processing method is proposed for estimating the measurement error of scaled test scores. Half-test raw scores are scaled to effective full-test raw scores, which are then converted to scaled test scores. The method is shown to estimate the measurement error with high accuracy.
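The half-test procedure can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the linear rescaling by `full_max / half_max`, the `to_scaled` conversion, and the final division by 2 are all assumptions made for illustration. The division by 2 assumes that the two halves are parallel and that the scale conversion is approximately linear near each score, so that the difference between two half-based estimates has twice the standard error of the full-test score.

```python
import math


def half_to_full_raw(half_raw, half_max, full_max):
    """Scale a half-test raw score to an effective full-test raw score."""
    return half_raw * full_max / half_max


def estimate_sem(half_a, half_b, half_max, full_max, to_scaled):
    """Estimate the measurement error of scaled scores from two test halves.

    half_a, half_b: raw scores of the same examinees on the two halves.
    to_scaled: hypothetical raw-to-scaled conversion (an assumption here).
    """
    squared_diffs = []
    for a, b in zip(half_a, half_b):
        scaled_a = to_scaled(half_to_full_raw(a, half_max, full_max))
        scaled_b = to_scaled(half_to_full_raw(b, half_max, full_max))
        squared_diffs.append((scaled_a - scaled_b) ** 2)
    # Root mean square of the differences between the two half-based scores.
    rms_diff = math.sqrt(sum(squared_diffs) / len(squared_diffs))
    # Assumption: parallel halves, locally linear scale -> full-test SEM = RMS / 2.
    return rms_diff / 2


# Illustrative data only: four examinees, halves of 20 points, full test of 40.
half_a = [10, 12, 8, 15]
half_b = [12, 10, 8, 13]
sem = estimate_sem(half_a, half_b, 20, 40, lambda raw: 100 * raw / 40)
```

In practice the estimate would be computed separately for groups of examinees with similar scores, since the measurement error generally depends on the score level.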
The proposed approach to data visualization removes the part of the data that corresponds to the rarest values. For very noisy data, this eliminates the contribution of atypical values and makes scatter plots significantly easier to read.
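One possible reading of the filtering step is sketched below. The abstract does not specify how rarity is measured, so the grid binning, the `bin_size` and `drop_fraction` parameters, and the function name `drop_rarest` are assumptions: points are grouped into square cells, and the least populated cells are discarded until a chosen share of the data has been removed.

```python
from collections import Counter


def drop_rarest(points, bin_size, drop_fraction):
    """Return the points that remain after removing the sparsest grid cells.

    points: list of (x, y) pairs.
    bin_size: side of a square grid cell (assumed rarity measure).
    drop_fraction: maximum share of points that may be removed.
    """
    def cell(point):
        return (int(point[0] // bin_size), int(point[1] // bin_size))

    counts = Counter(cell(p) for p in points)
    budget = int(len(points) * drop_fraction)
    dropped_cells = set()
    removed = 0
    # Visit cells from least to most populated and drop whole cells
    # while the number of removed points stays within the budget.
    for grid_cell, n in sorted(counts.items(), key=lambda item: item[1]):
        if removed + n > budget:
            break
        dropped_cells.add(grid_cell)
        removed += n
    return [p for p in points if cell(p) not in dropped_cells]


# Illustrative data only: a dense cluster and one atypical point.
points = [(1, 1), (1.2, 1.1), (1.4, 1.3), (1.1, 1.5), (9, 9)]
filtered = drop_rarest(points, bin_size=2, drop_fraction=0.25)
```

The filtered list can then be passed to any plotting routine; the isolated point no longer stretches the axes of the scatter plot.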
Published
2019-05-03
How to Cite
Monakhov, V. V., Kozhedub, A. V., Khannanov, N., Korolev, A. A., & Kurashova, S. A. (2019). Processing and Visualization of Test-Results Data. Computer Tools in Education, (5), 24-40. https://doi.org/10.32603/2071-2340-2018-5-24-40
Section
Computers in the teaching process
This work is licensed under a Creative Commons Attribution 4.0 International License.