What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think (2021)
Attributed to: MaDrIgAL: MultiDimensional Interaction management and Adaptive Learning
Funded by: EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://api.elsevier.com/content/abstract/scopus_id/85127439705
Type: Other
Parent Publication: EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings