Dominic Rigby

RL Grokking Recipe- How Can We Enable LLMs to Solve Previously Unsolvable Tasks

Date read: 3rd October

ArXiv link

Key Points