Haoxuan Ma   马浩轩

Master Student

LAMDA, Nanjing University

Email: mahx@lamda.nju.edu.cn;
             hunterwrynn@gmail.com;
Google Scholar: Google Scholar Link
Github: https://github.com/Hunter-Wrynn
LinkedIn: https://www.linkedin.com/in/haoxuan-ma-a50bba28a

Biography

I am presently pursuing my graduate studies at LAMDA@Nanjing University, advised by Prof. Hanjia Ye.

I currently focus on Model Reuse and Agentic RL, with additional interests in leveraging LLMs for recommendation systems and quantitative trading.

Selected Publications

* denotes equal contribution. ♦ denotes project leader.
Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs
Ruoxi Cheng*, Haoxuan Ma*, Shuirong Cao*
Neural Information Processing Systems (NeurIPS Workshop), 2024
[Paper] [Code]
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
Xu Yang, Yingzhe Peng, Haoxuan Ma, Shuo Xu, Chi Zhang, Yucheng Han, Hanwang Zhang
Neural Information Processing Systems (NeurIPS), 2024
[Paper] [Code]
Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance
TeleAI
arXiv preprint, 2025
[Paper]
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
Ruoxi Cheng*, Haoxuan Ma*, Weixin Wang*, Ranjie Duan, Jiexi Liu, Xiaoshuang Jia, Simeng Qin, Xiaochun Cao, Yang Liu, Xiaojun Jia
arXiv preprint, 2025
[Paper]
PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning
YiCheng Xiao*, Yu Chen*, Haoxuan Ma*, Jiale Hong*, Caorui Li, Lingxiang Wu, Kuan Zhu, Haiyun Guo
arXiv preprint, 2025
[Paper]

Experience

Awards