Qiaosheng (Eric) Zhang 张乔生
About me
I received my B.Eng. degree (2015) and Ph.D. degree (2019) in the Department of Information Engineering at The Chinese University of Hong Kong, advised by Prof. Sidharth Jaggi and Prof. Mayank Bakshi. From 2019-2022, I was a research fellow at National University of Singapore, with Prof. Vincent Y. F. Tan. I was also fortunate to visit Prof. Matthieu Bloch at Georgia Tech in 2018. I have been a researcher at the Shanghai AI Lab since 2022. I am an adjunct Ph.D. advisor at the Shanghai Jiao Tong University and Fudan University, and a Ph.D. Advisor at Shanghai Innovation Institute.
Grants and Awards
-
Excellent Young Scientists Fund (Overseas) by NSFC
-
Young Scholars Award, Information Theory Society of Chinese Institute of Electronics (2024)
-
Outstanding Teaching Assistant Award, Department of Information Engineering, CUHK (2019)
Intern Opportunites
I look for self-motivated interns at Shanghai AI Lab. Research topics include but not limited to reinforcement learning, information theory, and large language model. Feel free to send me an email if you are interested.
Research
Research interests
Reinforcement Learning (RL): Online/Offline RL, RL from Human Feedback (RLHF), Multi-agent RL
Large Language Model (LLM): LLM reasoning, RLHF, Multi-LLM-agents collaboration
Information Theory: Covert communication, Information-theoretic security, Identification, Mismatched decoding
Community Detection (a.k.a. clustering): Stochastic Block Model (SBM), Contextual SBM, Hypergraph SBM
Projects
Selected Publications
† denotes students or interns mentored by me.
-
Sample-Efficient Reinforcement Learning from Human Feedback via Information-Directed
Sampling
Han Qi†, Haochen Yang, Qiaosheng Zhang, Zhuoran Yang
Submitted to IEEE Transactions on Information Theory, 2025
-
Online Preference Alignment for Language Models via Count-based Exploration
Chenjia Bai, Yang Zhang, Shuang Qiu, Qiaosheng Zhang, Kang Xu, Xuelong Li
International Conference on Learning Representations (ICLR), 2025 (Spotlight, Top 5.1% ).
-
Multi-LLM-Agents Debate - Performance, Efficiency, and Scaling Challenges
Hangfan Zhang†, Zhiyao Cui, Qiaosheng Zhang, Shuyue Hu
International Conference on Learning Representations (ICLR), Blogpost Track, 2025.
-
Graph Feedback Bandits on Similar Arms: With and Without Graph Structures
Han Qi†, Fei Guo, Li Zhu Qiaosheng Zhang, Xuelong Li
Submitted to IEEE Transactions on Information Theory, 2025
-
Community Detection for Contextual-LSBM: Theoretical Limitations of Misclassification Rate and Efficient Algorithms
Dian Jin†, Yuqian Zhang, Qiaosheng Zhang
Submitted to IEEE International Symposium on Information Theory (ISIT), 2025
-
Optimal Information Security Against Limited-View Adversaries: The Benefits of Causality and Feedback
Mayank Bakshi, Swanand Kadhe, Qiaosheng Zhang, Sidharth Jaggi, Alex Sprintson
IEEE Transactions on Communications, 2025
-
Understanding When and Why Graph Attention Mechanisms Work via Node Classification
Zhongtian Ma†, Qiaosheng Zhang, Bocheng Zhou†, Yexing Zhang†, Zhen Wang
arXiv Preprint, 2024.
-
Matrix Completion with Hypergraphs: Sharp Thresholds and Efficient Algorithms
Zhongtian Ma†, Qiaosheng Zhang, Zhen Wang
Learning on Graphs Conference (LoG), 2024.
-
Community Detection in the Multi-View Stochastic Block Model
Yexin Zhang†, Zhongtian Ma†, Qiaosheng Zhang, Zhen Wang, Xuelong Li
arXiv Preprint, 2024.
-
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li
Artificial Intelligence (AIJ), accepted with mandatory revisions, 2024.
-
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li
International Conference on Machine Learning (ICML), 2024.
-
On the Role of General Function Approximation in Offline Reinforcement Learning
Chenjie Mao†, Qiaosheng Zhang, Zhen Wang, Xuelong Li
International Conference on Learning Representations (ICLR), 2024. (Spotlight, Top 5% )
-
Enhancing Covert Communication in OOK Schemes by Phase Deflection
Xiaopeng Ji, Ruizhi Zhu, Qiaosheng Zhang, Chunguo Li, Daming Cao
IEEE Transactions on Information Forensic and Security, 2024.
-
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang, Xudong Yu, Chenjia Bai, Qiaosheng Zhang, Zhen Wang
SCIENCE CHINA Information Sciences, 2024.
-
Exact Recovery in the General Hypergraph Stochastic Block Model
Qiaosheng Zhang, Vincent Y. F. Tan
IEEE Transactions on Information Theory, 2023.
-
Covert Communication with Mismatched Decoders
Qiaosheng Zhang, Vincent Y. F. Tan
IEEE Transactions on Information Theory, 2023. (accepted in part in IEEE ISIT 2022)
-
Optimal Information Security Against Limited-view Adversaries: Beyond MDS Codes
Qiaosheng Zhang, Swanand Kadhe, Mayank Bakshi, Sidharth Jaggi, Alex Sprintson
IEEE Transactions on Communications, 2023. (accepted in part in IEEE ISIT 2015 and IEEE ITW 2015)
-
Covert Communication Gains from Adversary’s Uncertainty of Phase Angles
Sen Qiao, Daming Cao, Qiaosheng Zhang, Yinfei Xu, Guangjie Liu
IEEE Transactions on Information Forensic and Security, 2023.
-
MC2G: An Efficient Algorithm for Matrix Completion with Social and Item Similarity Graphs
Qiaosheng Zhang#, Geewon Suh#, Changho Suh, Vincent Y. F. Tan (# indicates equal contribution)
IEEE Transactions on Signal Processing, 2022.
-
Covert Communication over Adversarially Jammed Channels
Qiaosheng Zhang, Mayank Bakshi, Sidharth Jaggi
IEEE Transactions on Information Theory, 2021. (accepted in part in IEEE ITW 2018)
-
Covert Identification over Binary-Input Discrete Memoryless Channels
Qiaosheng Zhang, Vincent Y. F. Tan
IEEE Transactions on Information Theory, 2021.
-
Community Detection and Matrix Completion with Social and Item Similarity Graphs
Qiaosheng Zhang, Vincent Y. F. Tan, Changho Suh
IEEE Transactions on Signal Processing, 2021. (accepted in part in IEEE ISIT 2020)
-
Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes
Haiyun He, Qiaosheng Zhang, Vincent Y. F. Tan
IEEE Transactions on Information Theory, 2021. (accepted in part in IEEE ISITA 2020)
-
Covert Communication with Polynomial Computational Complexity
Qiaosheng Zhang, Mayank Bakshi, Sidharth Jaggi
IEEE Transactions on Information Theory, 2020. (accepted in part in IEEE ISIT 2016)
-
Stealthy Communication Over Adversarially Jammed Multipath Networks
Jianhan Song†, Qiaosheng Zhang, Swanand Kadhe, Mayank Bakshi, Sidharth Jaggi
IEEE Transactions on Communications, 2020. (accepted in part in IEEE ISIT 2018)
-
Undetectable Radios: Covert Communication under Spectral Mask Constraints
Qiaosheng Zhang, Matthieu Bloch, Mayank Bakshi, Sidharth Jaggi
IEEE International Symposium on Information Theory (ISIT), 2019
|