Paper
13 May 2019 Centralized and decentralized application of neural networks learning optimized solutions of distributed agents
Joshua A. Shaffer, Huan Xu
Author Affiliations +
Abstract
This paper explores a methodology for training recurrent neural networks in replicating path planning solutions from optimization problems of multi-agent systems. Training data is generated by solving a centralized nonlinear programming problem, from which both centralized (representing all agents) and decentralized (representing individual agents) recurrent neural networks are trained with reinforcement learning to produce an agent’s state path through fixed time-step execution. Path-tracking controllers are formulated for each agent to follow the path generated by the network. The control signal from such a controller should mimic that of the optimized solution. Results for a 10 agent, 2D dynamics problem with synchronized arrival and collision avoidance constraints showcase the ability of this approach to achieve the desired controller execution and resulting state path. Through these results, this work showcases the ability of recurrent neural networks to learn and generalize centralized and synchronous multi-agent optimization solutions, the end of which is a much more computationally fast multi-agent path planner that trends the slower-to-compute optimization solutions.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Joshua A. Shaffer and Huan Xu "Centralized and decentralized application of neural networks learning optimized solutions of distributed agents", Proc. SPIE 10982, Micro- and Nanotechnology Sensors, Systems, and Applications XI, 109822G (13 May 2019); https://doi.org/10.1117/12.2518604
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Control systems

Computer programming

Collision avoidance

Machine learning

Optimization (mathematics)

Computing systems

Back to Top