Profile Picture

Xiao Yu

I am a second year Ph.D. student in Computer Science at Columbia University advised by Zhou Yu. Before joining the Ph.D. program, I was an undergrad also at Columbia University, majoring in Computer Science and minoring in Applied Physics.

🌟 Currently I am interested in improving computer-use agents' planning capabilities and their understanding of the digital environment with:

Scalable Reinforcement Learning algorithms

World Model training methods such as Dyna

Planning Algorithms such as MCTS

🚀 My most recent work include:

arXiv

Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents

Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu

ICLR 2025

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

Xiao Yu, Baolin Peng, Vineeth Vajipey, Hao Cheng, Michel Galley, Jianfeng Gao, Zhou Yu

ACL 2025

ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Xiao Yu*, Ruize Xu*, Chengyuan Xue*, Jinzhong Zhang, Xu Ma, Zhou Yu

EMNLP 2024

LIONs: An Empirically Optimized Approach to Align Language Models

Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu

NAACL 2024🏆

Teaching Language Models to Self-Improve through Interactive Demonstrations

Xiao Yu, Baolin Peng, Michel Galley, Jianfeng Gao, Zhou Yu