Document detail
ID

oai:arXiv.org:2403.14864

Topic
Computer Science - Robotics Computer Science - Artificial Inte...
Author
Song, Yunlong Kim, Sangbae Scaramuzza, Davide
Category

Computer Science

Year

2024

listing date

10/23/2024

Keywords
using approach dynamics locomotion learning quadruped
Metrics

Abstract

This work explores the potential of using differentiable simulation for learning quadruped locomotion.

Differentiable simulation promises fast convergence and stable training by computing low-variance first-order gradients using robot dynamics.

However, its usage for legged robots is still limited to simulation.

The main challenge lies in the complex optimization landscape of robotic tasks due to discontinuous dynamics.

This work proposes a new differentiable simulation framework to overcome these challenges.

Our approach combines a high-fidelity, non-differentiable simulator for forward dynamics with a simplified surrogate model for gradient backpropagation.

This approach maintains simulation accuracy by aligning the robot states from the surrogate model with those of the precise, non-differentiable simulator.

Our framework enables learning quadruped walking in simulation in minutes without parallelization.

When augmented with GPU parallelization, our approach allows the quadruped robot to master diverse locomotion skills on challenging terrains in minutes.

We demonstrate that differentiable simulation outperforms a reinforcement learning algorithm (PPO) by achieving significantly better sample efficiency while maintaining its effectiveness in handling large-scale environments.

Our method represents one of the first successful applications of differentiable simulation to real-world quadruped locomotion, offering a compelling alternative to traditional RL methods.

;Comment: 8th Annual Conference on Robot Learning (CoRL)

Song, Yunlong,Kim, Sangbae,Scaramuzza, Davide, 2024, Learning Quadruped Locomotion Using Differentiable Simulation

Document

Open

Share

Source

Articles recommended by ES/IODE AI

Systematic druggable genome-wide Mendelian randomization identifies therapeutic targets for lung cancer
agphd1 subtypes replication hykk squamous cell gene carcinoma causal targets mendelian randomization cancer analysis