Hao-Xuan Ma   马浩轩

Master Student

LAMDA, Nanjing University

Email: mahx@lamda.nju.edu.cn;
             hunterwrynn@gmail.com;
Google Scholar: Google Scholar Link
Github: https://github.com/Hunter-Wrynn
LinkedIn: https://www.linkedin.com/in/haoxuan-ma-a50bba28a

Biography

I am presently pursuing my graduate studies at LAMDA@Nanjing University, advised by Prof. Hanjia Ye.

I currently focus on Model Reuse and Agentic RL, with additional interests in leveraging LLMs for recommendation systems and quantitative trading.

Selected Publications

* denotes equal contribution. ♦ denotes project leader.
MMR-Bench: A Comprehensive Benchmark for Multimodal LLM Routing
Haoxuan Ma*, Guannan Lai*, Han-Jia Ye
arXiv preprint, 2026
[Paper] [Code]
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
Ruoxi Cheng*, Haoxuan Ma*, Weixin Wang*, Ranjie Duan, Xiaojun Jia
International Conference on Learning Representations (ICLR), 2026
[Paper]
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
Xu Yang, Yingzhe Peng, Haoxuan Ma, Shuo Xu, Chi Zhang, Yucheng Han, Hanwang Zhang
Neural Information Processing Systems (NeurIPS), 2024
[Paper] [Code]

Experience

Awards

Misc