
Xiao Yu
I am a second year Ph.D. student in Computer Science at Columbia University advised by Zhou Yu. Before joining the Ph.D. program, I was an undergrad also at Columbia University, majoring in Computer Science and minoring in Applied Physics.
🌟 Currently I am interested in improving computer-use agents' planning capabilities and their understanding of the digital environment with:
Scalable Reinforcement Learning algorithms
World Model training methods such as Dyna
Planning Algorithms such as MCTS
🚀 My most recent work include:
arXiv
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu