AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics (2023)

First Author: Ghafouri V

Attributed to: REPHRAIN: Research centre on Privacy, Harm Reduction and Adversarial Influence online funded by SPF

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2308.14608

Publication URI: https://arxiv.org/abs/2308.14608

Type: Other