📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Group Robust Preference Optimization in Reward-free RLHF (2024)

First Author: S. Ramesh

Attributed to: Robust and Efficient Model-based Reinforcement Learning funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Type: Conference/Paper/Proceeding/Abstract