
Xiao Yu
I am a third year Ph.D. student in Computer Science at Columbia University advised by Zhou Yu. Before joining the Ph.D. program, I was an undergrad also at Columbia University, majoring in Computer Science and minoring in Applied Physics.
🌟 Currently I am interested in improving the environment understanding and planning capabilities of AI agents, especially for browser/computer/phone-use.
Scalable Reinforcement Learning algorithms
World Model training methods such as Dyna
Planning Algorithms such as MCTS
🚀 My most recent work include (in chronological order):
arXiv
Reinforcement World Model Learning for LLM-based Agents
Xiao Yu, Baolin Peng, Ruize Xu, Yelong Shen, Pengcheng He, Suman Nath, Nikhil Singh, Jiangfeng Gao, Zhou Yu
ICLR 2026
(Workshop)
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu