Preprints
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs. [arXiv]
Zihan Zhou, Honghao Wei, and Lei Ying.
Scalable and Sample Efficient Distributed Policy Gradient Algorithms in Multi-Agent Networked Systems. [arXiv]
Xin Liu, Honghao Wei, and Lei Ying.
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization. [arXiv] [Code]
Xiyue Peng, Hengquan Guo, Jiawei Zhang, Dongqing Zou, Ziyu Shao, Honghao Wei, Xin Liu.
Published
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning. [PDF]
Honghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu.
NeurIPS, 2024.
Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage. [PDF]
Haobo Zhang, Xiyue Peng, Honghao Wei, Xin Liu.
NeurIPS, 2024.
Optimistic Joint Flow Control and Link Scheduling with Unknown Utility Functions. [PDF]
Xin Liu, Honghao Wei, Lei Ying.
Mobichod, 2024.
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis. [PDF]
Qining Zhang, Honghao Wei, Lei Ying.
RLC, 2024.
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration. [PDF]
Honghao Wei, Xin Liu, and Lei Ying.
AAAI oral, 2024.
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks. [PDF]
Honghao Wei, Xin Liu, Weina Wang, and Lei Ying.
NeurIPS spotlight, 2023 (~3% acceptance).
A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems. [PDF] [Code]
Honghao Wei, Zixian Yang, Xin Liu, Zhiwei (Tony) Qin, Xiaocheng Tang, and Lei Ying.
IEEE Trans. ITS, 2023.
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs. [PDF]
Honghao Wei, Arnob Ghosh, Xingyu Zhou, Lei Ying, and Ness Shroff.
AISTATS, 2023.
Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond. [PDF]
Hengquan Guo, Xin Liu, Honghao Wei, and Lei Ying.
NeurIPS, 2022.
Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation. [PDF]
Honghao Wei, Xin Liu, and Lei Ying.
AISTATS, 2022.
A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes. [PDF]
Honghao Wei, Xin Liu, and Lei Ying.
AAAI, 2022.
On Low-Complexity Quickest Intervention of Mutated Diffusion Processes Through Local Approximation. [PDF]
Qining Zhang, Honghao Wei, Weina Wang, and Lei Ying.
MobiHoc, 2022.
Fork: A forward-looking actor for model-free reinforcement learning. [PDF] [Code]
Honghao Wei, and Lei Ying.
CDC, 2021.
QuickStop: A Markov Optimal Stopping Approach for Quickest Misinformation Detection. [PDF]
Honghao Wei, Xiaohan Kang, Weina Wang, and Lei Ying.
SIGMEETRICS, 2019.
|