Paper
27 September 2022 Motion control of unmanned surface vehicle based on improved reinforcement learning proximal policy optimization algorithm
Shuai Wu
Author Affiliations +
Proceedings Volume 12346, 2nd International Conference on Information Technology and Intelligent Control (CITIC 2022); 1234615 (2022) https://doi.org/10.1117/12.2653441
Event: 2nd International Conference on Information Technology and Intelligent Control (CITIC 2022), 2022, Kunming, China
Abstract
In this paper, a novel deep reinforcement learning algorithm Proximal Policy Optimization (PPO) based on F-divergence is proposed to realize the motion control of unmanned surface vehicle. Aiming at the nonlinear and underactuated characteristics of unmanned surface vehicle system, the new reinforcement learning algorithm can overcome the problem that PPO algorithm falls into local optimization in the training process, and improve the diversity of algorithm exploration. Based on the Open AI simulation environment, this paper analyzes the motion law of the unmanned surface vehicle on the water surface, and establishes a three-degree of freedom kinematics and dynamics mathematical model. An improved reinforcement learning algorithm is used to design the motion controller of unmanned surface vehicle, and a compound reward is designed, which effectively improves the learning efficiency of the network. Simulation results shows the effectiveness of the improved Proximal Policy Optimization algorithm in the motion control of unmanned surface vehicle, and verify the superiority of the improved algorithm.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Shuai Wu "Motion control of unmanned surface vehicle based on improved reinforcement learning proximal policy optimization algorithm", Proc. SPIE 12346, 2nd International Conference on Information Technology and Intelligent Control (CITIC 2022), 1234615 (27 September 2022); https://doi.org/10.1117/12.2653441
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Evolutionary algorithms

Optimization (mathematics)

Detection and tracking algorithms

Mathematical modeling

Motion controllers

Neural networks

Complex systems

RELATED CONTENT


Back to Top