Haoxuan Ma 马浩轩Master StudentLAMDA, Nanjing University
|
![]() |
I am presently pursuing my graduate studies at LAMDA@Nanjing University, advised by Prof. Hanjia Ye.
I currently focus on Model Reuse and Agentic RL, with additional interests in leveraging LLMs for recommendation systems and quantitative trading.
Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs
Ruoxi Cheng*, Haoxuan Ma*, Shuirong Cao* Neural Information Processing Systems (NeurIPS Workshop), 2024 [Paper] [Code] |
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
Xu Yang, Yingzhe Peng, Haoxuan Ma, Shuo Xu, Chi Zhang, Yucheng Han, Hanwang Zhang Neural Information Processing Systems (NeurIPS), 2024 [Paper] [Code] |
Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance
TeleAI arXiv preprint, 2025 [Paper] |
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
Ruoxi Cheng*, Haoxuan Ma*, Weixin Wang*, Ranjie Duan, Jiexi Liu, Xiaoshuang Jia, Simeng Qin, Xiaochun Cao, Yang Liu, Xiaojun Jia arXiv preprint, 2025 [Paper] |
PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning
YiCheng Xiao*, Yu Chen*, Haoxuan Ma*, Jiale Hong*, Caorui Li, Lingxiang Wu, Kuan Zhu, Haiyun Guo arXiv preprint, 2025 [Paper] |