Chenjia Bai
Chenjia Bai
Home
Book
Publications
Team
Join us
Light
Dark
Automatic
Conference
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.
In
IEEE International Conference on Robotics and Automation (
ICRA
)
, 2024
Oral
We consider a novel risk-sensitive perspective to enhance the robustness of legged locomotion.
Jiyuan Shi
,
Chenjia Bai
✉
,
Haoran He
,
Lei Han
,
Dong Wang
,
Bin Zhao
,
Mingguo Zhao
,
Xiu Li
,
Xuelong Li
PDF
Cite
Project
Cross-Domain Policy Adaptation by Capturing Representation Mismatch.
In
International Conference on Machine Learning (
ICML
)
, 2024
We consider dynamics adaptation settings where there exists dynamics mismatch between the source domain and the target domain, and one can get access to sufficient source domain data, while can only have limited interactions with the target domain.
Jiafei Lyu
,
Chenjia Bai
,
Jing-Wen Yang
,
Zongqing Lu
,
Xiu Li
PDF
Cite
Code
How Does Goal Relabeling Improve Sample Efficiency?
In
International Conference on Machine Learning (
ICML
)
, 2024
We construct an example to show the information-theoretical improvement in sample efficiency achieved by goal relabeling and develop an RL algorithm called
GOALIVE
.
Sirui Zheng
,
Chenjia Bai
,
Zhuoran Yang
,
Zhaoran Wang
PDF
Cite
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.
In
International Conference on Machine Learning (
ICML
)
, 2024
We propose SAM-E, a novel architecture for robot manipulation by leveraging a vision-foundation model for generalizable scene understanding and sequence imitation for long-term action reasoning.
Junjie Zhang
,
Chenjia Bai
✉
,
Haoran He
,
Zhigang Wang
,
Bin Zhao
,
Xiu Li
,
Xuelong Li
PDF
Cite
Code
Project
公众号
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning.
In
International Conference on Machine Learning (
ICML
)
, 2024
We propose a novel representation-based approach to measure the domain gap, where the representation is learned through a contrastive objective by sampling transitions from different domains.
Xiaoyu Wen
,
Chenjia Bai
✉
,
Kang Xu
,
Xudong Yu
,
Yang Zhang
,
Xuelong Li
,
Zhen Wang
PDF
Cite
Code
Constrained Ensemble Exploration for Unsupervised Skill Discovery.
In
International Conference on Machine Learning (
ICML
)
, 2024
We propose a novel unsupervised RL framework via an ensemble of skills, where each skill performs partition exploration based on the state prototypes.
Chenjia Bai
,
Rushuai Yang
,
Qiaosheng Zhang
,
Kang Xu
,
Yi Chen
,
Ting Xiao
,
Xuelong Li
PDF
Cite
Code
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning .
In
European Conference on Artificial Intelligence (
ECAI
)
, 2024
We propose a novel dynamic policy constraint that restricts the learned policy on the samples generated by the exponentional moving average of previously learned policies for offline RL.
Shirong Liu
,
Chenjia Bai
,
Zixian Guo
,
Hao Zhang
,
Gaurav Sharma
,
Yang Liu
✉
PDF
Cite
Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective.
In
Annual Conference on Robot Learning (
CORL
)
, 2024
Oral
We propose a novel single-stage privileged knowledge distillation method called the Historical Information Bottleneck (HIB) to narrow the sim-to-real gap for legged locomotion.
Haoran He
,
Peilin Wu
,
Chenjia Bai
,
Hang Lai
,
Lingxiao Wang
,
Ling Pan,
,
Xiaolin Hu
,
Weinan Zhang
✉
PDF
Cite
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training.
In
Neural Information Processing Systems (
NeurIPS
)
, 2024
We introduce a novel framework that leverages a unified discrete diffusion to combine generative pre-training on human videos and policy fine-tuning on a small number of action-labeled robot videos.
Haoran He
,
Chenjia Bai
✉
,
Ling Pan
,
Weinan Zhang
,
Bin Zhao
,
Xuelong Li
PDF
Cite
Project
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment.
In
Neural Information Processing Systems (
NeurIPS
)
, 2024
We adopt multi-task preferences as a unified condition for both single- and multi-task decision-making, and propose preference representations aligned with preference labels.
Xudong Yu
,
Chenjia Bai
✉
,
Haoran He
,
Changhong Wang
,
Xuelong Li
PDF
Cite
«
»
Cite
×