Deep Reinforcement Learning for Walking Robots - MATLAB and Simulink Robotics Arena
Sebastian Castro demonstrates an example of controlling humanoid robot locomotion using deep reinforcement learning, specifically the Deep Deterministic Policy Gradient (DDPG) algorithm. The robot is simulated using Simscape Multibody™, while training the control policy is done using Reinforcement Learning Toolbox™.
In this video, Sebastian outlines the setup, training, and evaluation of reinforcement learning with Simulink® models. First, he introduces how to choose states, actions, and a reward function for the reinforcement learning problem. Then he describes the neural network structure and training algorithm parameters. Finally, he shows some training results and discusses the benefits and drawbacks of reinforcement learning.
You can find the example models used in this video in the MATLAB Central File Exchange: http://bit.ly/2HBxe79
For more information, you can access the following resources:
- Reinforcement Learning Tech Talks: http://bit.ly/2HBzMlS
- Blog and Videos: Walking Robot Modeling and Simulation: http://bit.ly/2GV4vL8
- Paper: Continuous Control with Deep Reinforcement Learning: http://bit.ly/2HAkJsp
- Paper: Emergence of Locomotion Behaviours in Rich Environments: http://bit.ly/2HBuTsO
In this video, Sebastian outlines the setup, training, and evaluation of reinforcement learning with Simulink® models. First, he introduces how to choose states, actions, and a reward function for the reinforcement learning problem. Then he describes the neural network structure and training algorithm parameters. Finally, he shows some training results and discusses the benefits and drawbacks of reinforcement learning.
You can find the example models used in this video in the MATLAB Central File Exchange: http://bit.ly/2HBxe79
For more information, you can access the following resources:
- Reinforcement Learning Tech Talks: http://bit.ly/2HBzMlS
- Blog and Videos: Walking Robot Modeling and Simulation: http://bit.ly/2GV4vL8
- Paper: Continuous Control with Deep Reinforcement Learning: http://bit.ly/2HAkJsp
- Paper: Emergence of Locomotion Behaviours in Rich Environments: http://bit.ly/2HBuTsO
No comments