Generative Adversarial Immitation Learning for Steering an Unmanned Surface Vehicle

Vedeler, Alexandra Skau; Warakagoda, Narada Dilp

dc.contributor.author	Vedeler, Alexandra Skau	en_GB
dc.contributor.author	Warakagoda, Narada Dilp	en_GB
dc.date.accessioned	2021-02-22T08:53:21Z
dc.date.accessioned	2021-03-03T09:15:48Z
dc.date.available	2021-02-22T08:53:21Z
dc.date.available	2021-03-03T09:15:48Z
dc.date.issued	2020-02
dc.identifier.citation	Vedeler AS, Warakagoda ND. Generative Adversarial Immitation Learning for Steering an Unmanned Surface Vehicle. Proceedings of the Northern Lights Deep Learning Workshop. 2020;1	en_GB
dc.identifier.uri	http://hdl.handle.net/20.500.12242/2842
dc.description	Vedeler, Alexandra Skau; Warakagoda, Narada Dilp. Generative Adversarial Immitation Learning for Steering an Unmanned Surface Vehicle. Proceedings of the Northern Lights Deep Learning Workshop 2020 ;Volum 1.	en_GB
dc.description.abstract	The task of obstacle avoidance using maritime vessels, such as Unmanned Surface Vehicles (USV), has traditionally been solved using specialized modules that are designed and optimized separately. However, this approach requires a deep insight into the environment, the vessel, and their complex dynamics. We propose an alternative method using Imitation Learning (IL) through Deep Reinforcement Learning (RL) and Deep Inverse Reinforcement Learning (IRL) and present a system that learns an end-to-end steering model capable of mapping radar-like images directly to steering actions in an obstacle avoidance scenario. The USV used in the work is equipped with a Radar sensor and we studied the problem of generating a single action parameter, heading. We apply an IL algorithm known as generative adversarial imitation learning (GAIL) to develop an end-to-end steering model for a scenario where avoidance of an obstacle is the goal. The performance of the system was studied for different design choices and compared to that of a system that is based on pure RL. The IL system produces results that indicate it is able to grasp the concept of the task and that in many ways are on par with the RL system. We deem this to be promising for future use in tasks that are not as easily described by a reward function.	en_GB
dc.language.iso	en	en_GB
dc.subject	Dyp læring	en_GB
dc.subject	Ubemannede overflatefartøyer (USV)	en_GB
dc.title	Generative Adversarial Immitation Learning for Steering an Unmanned Surface Vehicle	en_GB
dc.type	Article	en_GB
dc.date.updated	2021-02-22T08:53:21Z
dc.identifier.cristinID	1883515
dc.identifier.doi	10.7557/18.5147
dc.source.issn	2703-6928
dc.type.document	Journal article
dc.relation.journal	Proceedings of the Northern Lights Deep Learning Workshop

Files in this item

Name:: 1883515.pdf
Size:: 551.2Kb
Format:: PDF

This item appears in the following Collection(s)

Articles

Show simple item record