Deep Reinforcement Learning for Dynamic Locomotion of Humanoid Robots

Lead Research Organisation: University of Edinburgh

Department Name: Sch of Informatics

Abstract

The research project is focused on using deep reinforcement learning to
solve dynamic locomotion of humanoid robots. In the past, locomotion is
mainly done using conventional analytical approaches, e.g.
Model-Predictive Control, which are limited because they require human
effort and knowledge, and demand high computing power to run online. In
contrast to on-line computation methods, the computation for machine
learning approaches can be outsourced offline. By doing so, faster
online performance for high dimensional control systems, such as
humanoids, can be achieved. Machine learning approaches such as deep
reinforcement learning also requires less human effort to design
compared to analytical approaches.

Given the increasingly more powerful deep RL algorithms, an increasing
number of research works have used deep RL to solve control tasks, as
the recent progress in deep RL algorithms designed for continuous action
domain has brought forward the possibility to apply reinforcement
learning continuous control tasks that involve complicated dynamics.

The project plans to explore the feasibilities of using deep
reinforcement learning to acquire bipedal control policies comparable or
better than analytical approaches while using less human effort. And
eventually design a control framework based on deep RL that is capable
of learning a wide range of balancing and walking strategies with
minimum human intervention, and compare the performance of the designed
control framework with current human labour intensive analytic
engineering approach. The project will also explore other methodologies
related to reinforcement learning, such as imitation learning and
transfer learning, and implement them into the proposed control
framework.

Student:

Chuanyu Yang

Period of Study:

Feb 17 - Nov 20

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

1957059

Research Topic:

Unclassified

Organisations

University of Edinburgh (Lead Research Organisation)

People	ORCID iD
Taku Komura (Primary Supervisor)
Chuanyu Yang (Student)

Publications

Author Name

Title Publication Date Published

10 25 50

Yang C (2017) Emergence of human-comparable balancing behaviours by deep reinforcement learning

Yang C (2018) Learning Whole-Body Motor Skills for Humanoids

Yang C (2020) Learning Natural Locomotion Behaviors for Humanoid Robots Using Human Bias in IEEE Robotics and Automation Letters

Song D (2018) Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/N509644/1			01/10/2016	30/09/2021
1957059	Studentship	EP/N509644/1	01/02/2017	30/11/2020	Chuanyu Yang

Impact Summary


Description	The research results from the paper "Emergence of human-comparable balancing behaviours by deep reinforcement learning" have been reported by a tech reporter on TechExplore website and got public exposure. https://techxplore.com/news/2018-09-human-like-strategies-robots.html
First Year Of Impact	2018

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects