Chenjia Bai   白辰甲

I am a Researcher at Shanghai AI Laboratory. Prior to this, I obtained my Ph.D. degree in Computer Science from Harbin Institute of Technology (HIT), advised by Prof. Peng Liu. My research mainly focuses on deep Reinforcement Learning (RL), including offline RL, robust RL, efficient exploration, representation learning, risk-sensitive learning, and multi-agent RL.

I am fortunate to have been collaborated with many fantastic researchers. I was a visiting student at University of Toronto and Vector Institute, working with Prof. Animesh Garg . I was a visiting student at Northwestern University (remotely), working with Prof. Zhaoran Wang . I also used to be an intern at Huawei Noah's Ark Lab (advised by Prof. Jianye Hao), Tencent Robotics X (advised by Dr. Lei Han), and Alibaba. I received my Bachelor's degree and Master's degree in Computer Science from HIT.

Internship chances:
Our group is looking for highly-motivated Interns on board Reinforcement Learning research. We are also interested in RL applications including Robot Arm and Quadruped. Please drop me an email if you are interested in.

Book

新书出版: 🏠 白辰甲,赵英男,郝建业,刘鹏,王震. 《强化学习:前沿算法与应用》. 机械工业出版社. 2023
强化学习 [京东购买链接] | [淘宝购买链接] | [目录页] | [ 媒体报道: 深度强化学习实验室 | RL China | 机器之心 ] | [ 勘误 ]

Publications

  • Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness. [arxiv]
    Xiaoyu Wen, Xudong Yu, Rui Yang, Chenjia Bai*, Zhen Wang
    under review
  • Robust Quadrupedal Locomotion via Risk-Averse Policy Learning. [arxiv, demo]
    Jiyuan Shi, Chenjia Bai*, Haoran He, Lei Han, Dong Wang, Bin Zhao, Xiu Li, Xuelong Li
    under review
  • Privileged Knowledge Distillation for Sim-to-Real Policy Generalization. [arxiv]
    Haoran He, Chenjia Bai, Hang Lai, Lingxiao Wang, Weinan Zhang*
    under review
  • Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning. [arxiv]
    Haoran He, Chenjia Bai*, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li
    Neural Information Processing Systems (NeurIPS), 2023
  • Cross-Domain Policy Adaptation via Value-Guided Data Filtering. [arxiv]
    Kang Xu, Chenjia Bai*, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li
    Neural Information Processing Systems (NeurIPS), 2023
  • On the Value of Myopic Behavior in Policy Reuse. [arxiv]
    Kang Xu, Chenjia Bai*, Shuang Qiu, Haoran He, Bin Zhao, Zhen Wang, Wei Li, Xuelong Li
    IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023 (under review)
  • Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning.
    Changhong Wang, Xudong Yu, Chenjia Bai, Zhen Wang*
    SCIENCE CHINA Information Sciences (2nd round), 2023
  • Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning. [pdf, code]
    Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang*, and Xuelong Li*
    Artificial Intelligence (AIJ), 2023
  • Behavior Contrastive Learning for Unsupervised Skill Discovery. [arxiv]
    Rushuai Yang, Chenjia Bai*, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, and Xuelong Li
    International Conference on Machine Learning (ICML), 2023
  • Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning.
    Xudong Yu, Chenjia Bai*, Hongyi Guo, Changhong Wang*, and Zhen Wang
    Information Sciences, 2023
  • False Correlation Reduction for Offline Reinforcement Learning . [arxiv, pdf]
    Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, and Jing Jiang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
  • RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. [arxiv]
    Rui Yang*, Chenjia Bai*, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han
    Neural Information Processing Systems (NeurIPS), 2022    Spotlight
  • Self-Supervised Imitation for Offline Reinforcement Learning with Hindsight Relabeling. [pdf]
    Xudong Yu, Chenjia Bai, Changhong Wang, Dengxiu Yu, C. L. Philip Chen, Zhen Wang*
    IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2022
  • Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. [arxiv, code]
    Shuang Qiu, Lingxiao Wang, Chenjia Bai, Zhuoran Yang, and Zhaoran Wang
    International Conference on Machine Learning (ICML), 2022    Spotlight
  • Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning. [pdf]
    Chenjia Bai, Ting Xiao, Zhoufan Zhu, Lingxiao Wang, Fan Zhou, Peng Liu, and Zhaoran Wang
    IEEE Transactions on Neural Networks and Learning Systems, 2022
  • Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. [pdf, arxiv]
    Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, and Zhen Wang
    IEEE Transactions on Neural Networks and Learning Systems, 2022
  • Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning. [pdf, arxiv, code]
    Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhihong Deng, Animesh Garg, Peng Liu, and Zhaoran Wang
    International Conference on Learning Representations (ICLR), 2022    Spotlight
  • OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning. [pdf]
    Jinyi Liu, Wang Zhi, Yan Zheng, Jianye Hao, Junjie Ye, Chenjia Bai, Pengyi Li
    NeurIPS Deep RL Workshop, 2021
  • Dynamic Bottleneck for Robust Self-Supervised Exploration. [arxiv, code]
    Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, and Zhaoran Wang
    Neural Information Processing Systems (NeurIPS), 2021
  • Principled Exploration via Optimistic Bootstrapping and Backward Induction . [pdf, arxiv, code]
    Chenjia Bai, Lingxiao Wang, Lei Han, Jianye Hao, Animesh Garg, Peng Liu, and Zhaoran Wang
    International Conference on Machine Learning (ICML), 2021    Spotlight
  • Addressing Hindsight Bias in Multi-Goal Reinforcement Learning. [pdf, code]
    Chenjia Bai, Lingxiao Wang, Yixin Wang, Zhaoran Wang, Rui Zhao, Chenyao Bai and Peng Liu
    IEEE Transactions on Cybernetics, 2021
  • Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning. [arxiv, website]
    Chenjia Bai, Peng Liu, Kaiyu Liu, Lingxiao Wang, Yingnan Zhao, Lei Han, and Zhaoran Wang
    IEEE Transactions on Neural Networks and Learning Systems, 2021 .
  • Generating Attentive Goals for Prioritized Hindsight Reinforcement Learning. [pdf]
    Peng Liu, Chenjia Bai, Yingnan Zhao, Chenyao Bai, Wei Zhao, and Xianglong Tang
    Knowledge-Based Systems (KBS), 2020
  • Obtaining Accurate Estimated Action Values in Categorical Distributional Reinforcement Learning. [pdf]
    Yingnan Zhao, Peng Liu, Chenjia Bai, Wei Zhao, and Xianglong Tang
    Knowledge-Based Systems (KBS), 2020
  • Active Sampling for Deep Q-learning Based on TD-error Adaptive Correction. [pdf]
    Chenjia Bai, Peng Liu, Wei Zhao, and Xianglong Tang
    Journal of Computer Research and Development (in Chinese), 2019.

Talks

Service

  • Senior Program Committee Members (SPC) of AAMAS (2024)
  • Program Committee Members (PC) / Conference Reviewer of NeurIPS (2021 - 2023)
  • Program Committee Members (PC) / Conference Reviewer of ICLR (2021 - 2023)
  • Program Committee Members (PC) / Conference Reviewer of ICML (2022 - 2023)
  • Program Committee Members (PC) / Conference Reviewer of AAAI (2021 - 2023)
  • Journal Reviewer: IEEE Trans. Cybernetics, IEEE Trans. TNNLS, IEEE Trans. TETCI, IEEE Trans. Intelligent Vehicles

© 2023 Chenjia Bai