Item Response Theory and Machine Learning

Overview

This page contains code and data for our IRT analyses.

Coming soon

If you use the following data, please cite:

J.P. Lalor, H. Wu, H. Yu, Building an Evaluation Scale using Item Response Theory, In EMNLP 2016. ACL link

The dataset consists of response patterns collected using the Amazon Mechanical Turk crowdsourcing platform.

Included in the zip file is the data and a README.

Download: zip file

Code used to generate the evaluation scales from the paper was written in R. Included are R files for each of the 5 evaluation scales.

If you use the following data please cite:

J.P. Lalor, H. Wu, T. Munkhdalai, H. Yu. Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study. In EMNLP 2018. ACL link

Questions about the code or data? Contact me at john dot lalor at nd dot edu.