Abstract:Autonomous underwater vehicle (AUV) is an important equipment for human to explore and utilize the ocean, and whether it can be intelligent enough to solve the path planning control problem is the basis for AUV to accomplish other complex tasks. Considering the local path planning problem under terminal attitude constraints and combining with the autonomous docking control of AUV, which is a practical use scenario, a docking controller is developed based on the improved Deep Reinforcement Learning (DRL) algorithm, which equips it with the ability of autonomous docking and extends its endurance time. Considering the complex wave disturbance factors in the practical working scenario, a nonlinear disturbance observer (NDO) is used to estimate the external disturbances of each degree of freedom in the three-dimensional motion of the AUV, and scientific observation quantities and reward functions are designed for the DRL agent in combination with measurable state quantities, so as to enable the AUV to accomplish the three-dimensional docking control task in a disturbed environment. The simulation results demonstrate the effectiveness and robustness of the proposed method.