AI Researcher & Software Engineer
Researching LLM post-training, reward modelling, and tokenization. Recent work includes Command A, RewardBench 2, Fishing for Magikarp, SCRIPT-BPE, and Unigram Pieces.