AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics (2023)
Attributed to:
REPHRAIN: Research centre on Privacy, Harm Reduction and Adversarial Influence online
funded by
SPF
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2308.14608
Publication URI: https://arxiv.org/abs/2308.14608
Type: Other