Multi-agent reinforcement learning (MARL) has transformed research and development in robotics especially for navigation purposes. Using deep learning, multi-agent coordination, reinforcement learning can solve critical tasks such as finding shortest paths or avoiding obstacles with exceptional speed. However, given a LiDAR point cloud structure, performing such tasks directly using MARL, can be computationally expensive and cumbersome due to massive nature of 3D point clouds. In this work, we leverage 2D (MARL), path planning and obstacle avoidance to obtain a low latency navigation in 3D though a correspondence 2D floor plan. We change the environment dynamically with adversarial threats such as fire or leakage -- the RL agents can quickly find a new path given the learned environment. The method shows strong performances in both throughput and latency.
|