How to catch an ai liar: Lie detection in black-box llms by asking unrelated questions (2024)
Attributed to:
Robust and interpretable machine learning for biomedicine and healthcare
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://arxiv.org/abs/2309.15840
Type: Conference/Paper/Proceeding/Abstract