MoRE: Mixture of Residual Experts for Humanoid Lifelike Gaits Learning on Complex Terrains

Image credit: Chenjia Bai

Abstract

Humanoid robots have demonstrated robust locomotion capabilities using Reinforcement Learning (RL)-based approaches. However, existing methods are limited in flat terrains with proprioception only, restricting their abilities to traverse challenging terrains with human-like gaits. In this work, we propose a novel framework using a mixture of latent residual experts with multi-discriminators to train an RL policy, which is capable of traversing complex terrains in controllable lifelike gaits with exteroception. Our two-stage training pipeline first teaches the policy to traverse complex terrains using a depth camera, and then enables gait-commanded switching between human-like gait patterns. We also design gait rewards to adjust human-like behaviors like robot base height. Simulation and real-world experiments demonstrate that our framework exhibits exceptional performance in traversing complex terrains, and achieves seamless transitions between multiple human-like gait patterns.

Publication
under review