Representation noising effectively prevents harmful fine-tuning on LLMs (2024)
Attributed to:
FAIR: Framework for responsible adoption of Artificial Intelligence in the financial seRvices industry
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://arxiv.org/abs/2405.14577
Type: Conference/Paper/Proceeding/Abstract