What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think (2021)

First Author: Howcroft D.M.

Attributed to: MaDrIgAL: MultiDimensional Interaction management and Adaptive Learning funded by EPSRC

No abstract provided

Type: Other

Parent Publication: EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings