Research
arXiv
Reinforcement World Model Learning for LLM-based Agents
Xiao Yu, Baolin Peng, Ruize Xu, Yelong Shen, Pengcheng He, Suman Nath, Nikhil Singh, Jianfeng Gao, Zhou Yu
ICLR 2026 (Workshop)
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu
ACL 2023
Controllable Mixed-Initiative Dialogue Generation through Prompting
Maximillian Chen, Xiao Yu, Weiyan Shi, Urvi Awasthi, Zhou Yu
IEEE 2022
Distributed MQTT Brokers at Network Edges: A Study on Message Dissemination
Luoyao Hao, Xiao Yu, Tingrui Zhang, Henning Schulzrinne