Chenjia Bai
Chenjia Bai
Home
Book
Publications
Team
Join us
Light
Dark
Automatic
Under-Review
On the Value of Myopic Behavior in Policy Reuse.
IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023 (under review)
We present a framework called Selective Myopic bEhavior Control~(SMEC), which results from the insight that the short-term behaviors of prior policies are sharable across tasks.
Kang Xu
,
Chenjia Bai
✉
,
Shuang Qiu
,
Haoran He
,
Bin Zhao
,
Zhen Wang
,
Wei Li
,
Xuelong Li
PDF
Cite
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.
Journal of Artificial Intelligence Research (under review)
, 2023
We propose the Robust Offline-to-Online (RO2O) algorithm, designed to enhance offline policies through uncertainty and smoothness, and to mitigate the performance drop in online adaptation.
Xiaoyu Wen
,
Xudong Yu
,
Rui Yang
,
Chenjia Bai
✉
,
Zhen Wang
PDF
Cite
Skill Matters: Dynamic Skill Learning for Multi-Agent Cooperative Reinforcement Learning.
Neural Networks (under review)
, 2024
We propose a novel Dynamic Skill Learning (DSL) framework to enable more effective adaptation and collaboration in complex tasks.
Tong Li
,
Chenjia Bai
✉
,
Kang Xu
,
Chen Chu
,
Peican Zhu
,
Zhen Wang
✉
PDF
Cite
Privileged Knowledge Distillation for Sim-to-Real Policy Generalization.
under review
We propose a novel single-stage privileged knowledge distillation method called the Historical Information Bottleneck (HIB) to narrow the sim-to-real gap.
Haoran He
,
Chenjia Bai
,
Hang Lai
,
Lingxiao Wang
,
Weinan Zhang
✉
PDF
Cite
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.
under review
We introduce a novel framework that leverages a unified discrete diffusion to combine generative pre-training on human videos and policy fine-tuning on a small number of action-labeled robot videos.
Haoran He
,
Chenjia Bai
✉
,
Ling Pan
,
Weinan Zhang
,
Bin Zhao
,
Xuelong Li
PDF
Cite
Project
公众号
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment.
under review
We adopt multi-task preferences as a unified condition for both single- and multi-task decision-making, and propose preference representations aligned with preference labels.
Xudong Yu
,
Chenjia Bai
✉
,
Haoran He
,
Changhong Wang
,
Xuelong Li
PDF
Cite
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning.
under review
This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS).
Qiaosheng Zhang
,
Chenjia Bai
,
Shuyu Hu
,
Zhen Wang
✉
,
Xuelong Li
✉
PDF
Cite
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration.
under review
We propose a novel framework for multi-agent collaboration that introduces Reinforced Advantage feedback (ReAd) for efficient self-refinement of plans.
Yang Zhang
,
Shixin Yang
,
Chenjia Bai
✉
,
Fei Wu
,
Xiu Li
,
Xuelong Li
,
Zhen Wang
PDF
Cite
Project
公众号
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models.
under review
we propose a novel world model for MARL that learns decentralized local dynamics for scalability, combined with a centralized representation aggregation from all agents.
Yang Zhang
,
Chenjia Bai
✉
,
Bin Zhao
,
Junchi Yan
,
Xiu Li
,
Xuelong Li
PDF
Cite
Cite
×