Reward functions

The NLP Student April 9, 2023

"Deep Reinforcement Learning From Human Preferences" Paper Explained

This paper is the work of a collaboration with Deep Mind and Open AI, improving the field of Deep Reinforcement Learning. The ideas discussed in this paper are also a key component in the training of GPT-4. RL agents need good reward functions to learn complex tasks. However, it…