AI4GC Lab

Xinchen Xu

xinchenxu@zju.edu.cn

About Me

Hi! I'm Xinchen Xu (许馨宸), a Ph.D. student at Zhejiang University, advised by Shengyu Zhang and Fei Wu.

My research focuses on multimodal large language models and model merging — understanding how models can be composed and aligned to handle complex, cross-modal tasks more efficiently.

Feel free to reach out for collaborations!

Research Interests

Multimodal LLM — Building and understanding large language models that perceive and reason across text, vision, and other modalities.
Model Merging — Exploring how independently trained models can be merged to combine capabilities without expensive retraining.

Selected Papers

EACL2026

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Yuhang Liu, Pengxiang Li, Zishu Wei, Congkai Xie, Xueyu Hu, Xinchen Xu, Shengyu Zhang^✉, Xiaotian Han, Hongxia Yang, Fei Wu

Paper Project74

ACL2025

OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use

Xueyu Hu, Tao Xiong, Biao Yi, Zishu Wei, Ruixuan Xiao, Yurun Chen, Jiasheng Ye, Meiling Tao, Xiangxin Zhou, Ziyu Zhao, Yuhuai Li, Shengze Xu, Shenzhi Wang, Xinchen Xu, Shuofei Qiao, Zhaokai Wang, Kun Kuang, Tieyong Zeng, Liang Wang, Jiwei Li, Yuchen Eleanor Jiang, Wangchunshu Zhou, Guoyin Wang, Keting Yin, Zhou Zhao, Hongxia Yang, Fan Wu, Shengyu Zhang^✉, Fei Wu

Paper Project487