Tomek Korbak

Senior Research Scientist,
UK AISI

I’m a Senior Research Scientist at the UK AI Safety Institute, working with Geoffrey Irving on safety cases for frontier models. Previously, I was a Member of Technical Staff at Anthropic, working on honesty. Before that, I did a PhD at the University of Sussex with Chris Buckley and Anil Seth, focusing on reinforcement learning from human feedback (RLHF), and spent time as a visiting researcher at NYU working with Ethan Perez, Sam Bowman and Kyunghyun Cho. I studied cognitive science, philosophy and physics at the University of Warsaw.

Highlighted papers