Paper
13 May 2019 Navigation and collision avoidance with human augmented supervisory training and fine tuning via reinforcement learning
Christopher J. Maxey, E. Jared Shamwell
Author Affiliations +
Abstract
Robust navigation and orientation under complex conditions is a must for autonomous drones operating in new and varied environments. Creating drones with adequate behaviors can be a challenge from both a training standpoint and a generalization standpoint. Using human expertise data is an option to help bootstrap the learning process; however, using the human data can lead to side consequences that are not immediately intuitive. This study focuses on applying varying levels of human input to an agent to determine how this input affects the agent's performance. The Unreal Engine and the Airsim plugin are used to train a quadcopter agent in an abstract "blocks world" type environment. Six agents in total are trained, with the first five having increasing amounts of human input and the sixth agent having no human input. A variety of metrics are looked at, including total goals achieved and time to achieve some number of goals.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Christopher J. Maxey and E. Jared Shamwell "Navigation and collision avoidance with human augmented supervisory training and fine tuning via reinforcement learning", Proc. SPIE 10982, Micro- and Nanotechnology Sensors, Systems, and Applications XI, 1098228 (13 May 2019); https://doi.org/10.1117/12.2518551
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Collision avoidance

Statistical analysis

LIDAR

Machine learning

Neural networks

Robotics

RELATED CONTENT

Learning class predication distributions with noisy labels
Proceedings of SPIE (February 16 2022)
Assets prediction using machine learning techniques
Proceedings of SPIE (August 23 2022)
Prediction model of mobike demand based on T-GCN
Proceedings of SPIE (August 10 2023)
Video coding based on pre-attentive processing
Proceedings of SPIE (February 25 2005)
Image segmentation by a multilayer neural network
Proceedings of SPIE (December 31 1996)

Back to Top