Papers

Surveys and Reviews

Implementation

Resources

# Reinforcement Learning from Human Feedback by Nathan Lambert

A short introduction to RLHF and post-training focused on language models. 👉 https://rlhfbook.com/

See also