What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think (2021)

First Author: Howcroft D

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.18653/v1/2021.emnlp-main.703

Publication URI: http://dx.doi.org/10.18653/v1/2021.emnlp-main.703

Type: Conference/Paper/Proceeding/Abstract