I'm currently researching embodied and video-based foundation models for world modelling, with a focus on temporal grounding and reasoning in vision-language-action models (VLAs) and world models. Feel free to reach out if you'd like to chat!

Current Media