Dominic Rigby

POMO: Policy Optimization with Multiple Optima for Reinforcement Learning

Date read: 16th October 2025

Paper link

Key Points