Dominic Rigby

Benchmarking Population-Based Reinforcement Learning across Robotic Tasks with GPU-Accelerated Simulation

Date read: 25th July 2025

Key Points

Trains a population of policies in parallel to search for the best hyperparameters
Each member's hyperparameters are evolved over the course of training by an evolutionary algorithm
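
A minimal sketch of the evolutionary step in the key points above, assuming a truncation-selection exploit/explore scheme in the style of population-based training. These notes do not record the paper's exact selection rule or perturbation factors, so `Member`, `evolve`, `frac`, and `factors` are hypothetical names and values, and the score is a toy stand-in for an episode return.

```python
import random
from dataclasses import dataclass


@dataclass
class Member:
    """One population member: its hyperparameters and latest evaluation score."""
    hparams: dict
    score: float = 0.0


def evolve(population, rng, frac=0.25, factors=(0.8, 1.2)):
    """One exploit/explore step (hypothetical helper, truncation selection).

    The bottom `frac` of members copy the hyperparameters of a randomly
    chosen top-`frac` member (exploit), then scale each value by a random
    factor (explore). A full implementation would copy the policy weights
    as well; that part is omitted here.
    """
    ranked = sorted(population, key=lambda m: m.score, reverse=True)
    cut = max(1, int(len(ranked) * frac))
    top, bottom = ranked[:cut], ranked[-cut:]
    for loser in bottom:
        winner = rng.choice(top)
        # Build a new dict so the winner's hyperparameters are not mutated.
        loser.hparams = {k: v * rng.choice(factors) for k, v in winner.hparams.items()}
    return population


rng = random.Random(0)
# Sample learning rates log-uniformly between 1e-5 and 1e-2 (assumed range).
population = [Member({"lr": 10 ** rng.uniform(-5, -2)}) for _ in range(8)]
for member in population:
    # Toy score: closer to lr = 1e-3 is better. Real scores come from training.
    member.score = -abs(member.hparams["lr"] - 1e-3)
evolve(population, rng)
```

In a real run, each member would train for a fixed number of steps between calls to `evolve`, so the hyperparameters change during training rather than being fixed per run.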