Benchmarking Population-Based Reinforcement Learning across Robotic Tasks with GPU-Accelerated Simulation
Date read: 25th July 2025
arXiv link
Key Points
- Trains a population of policies in parallel to search for the best-performing hyperparameters
- Each policy's hyperparameters are updated by an evolutionary algorithm
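
To make the two points above concrete, here is a minimal sketch of one evolutionary update over a population of hyperparameter sets. It is a generic population-based-training-style step and not necessarily the exact scheme used in the paper. The function name `evolve_population`, the `frac` and `perturb` parameters, and the toy scoring function are all assumptions made for illustration. In a real setup the scores would come from training each policy in simulation.

```python
import math
import random


def evolve_population(population, scores, rng, frac=0.25, perturb=(0.8, 1.2)):
    """One evolutionary step (hypothetical helper, not the paper's code).

    The bottom `frac` of members copy the hyperparameters of a randomly
    chosen top-`frac` member and perturb each value by a random factor.
    """
    n = len(population)
    k = max(1, int(n * frac))
    # Indices sorted from best score to worst score.
    ranked = sorted(range(n), key=lambda i: scores[i], reverse=True)
    top, bottom = ranked[:k], ranked[-k:]

    # Copy so the caller's population is left untouched.
    new_population = [dict(h) for h in population]
    for i in bottom:
        if i in top:
            continue  # can happen in very small populations; keep the member
        parent = rng.choice(top)
        new_population[i] = {
            name: value * rng.choice(perturb)
            for name, value in population[parent].items()
        }
    return new_population


def toy_score(hparams):
    # Assumed stand-in for a policy's training return: best at lr = 1e-3.
    return -abs(math.log10(hparams["lr"]) + 3.0)


rng = random.Random(0)
population = [{"lr": 10 ** rng.uniform(-5, -1)} for _ in range(8)]
for generation in range(20):
    scores = [toy_score(h) for h in population]
    population = evolve_population(population, scores, rng)
```

In a full population-based method the replaced members would usually also copy the parent's policy weights, so that training continues from a good policy rather than restarting. That part is omitted here to keep the sketch short.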