I am an AI research scientist at Meta Super Intelligence Lab (MSL), and part of the LLaMa post-training team. My research focus spans various direction including post-training, reasoning, and reward modeling in LLMs, studying representation of DNNs and phenomenon such as space folding, fundamental aspects of RL such as information-theoric principles in offline RL, generalization in transformer architectures for in-context retrieval, and multimodal vlm proactive agents. At Meta, I led human preference alignment at MSL, improving LLaMA’s performance on user alignment, unverifiable capabilities, reasoning, and response quality. I also led research on in-context generative retrieval for Meta's ranking, and vlm proactive multimodal agents for Meta's smart wearables. Before joining Meta, I was a Postdoc at the Institute for Machine Learning, at the Johannes Kepler University of Linz. In my research, I am interested in alignment, post training, reasoning, and large-scale reinforcement learning.
Here is a list of my selected publications.
2021
2020

2019

2018

If you are from an under represented group, and need help with ML research or similar topics, you can book a mentoring session with me.