KEYWORDS: Detection and tracking algorithms, Control systems, Control systems design, Evolutionary algorithms, Signal processing, Neural networks, Fuzzy logic, Process control, Device simulation, Computer simulations
Due to the nonlinear and underactuated characteristics of unmanned surface vehicle system and the uncertainty of environmental model, it is hard to establish accurate dynamic model and control law obtained by traditional algorithm which is too complex and has no engineering practice realization. In this paper, based on deep reinforcement learning algorithm of deep deterministic policy gradients, the line of sight algorithm is used firstly to obtains the expected value of heading angle of USV according to the current time position and the expected trajectory of USV. Meanwhile, we adopt the double Gaussian reward function to evaluate the training action, so as to obtain the optimal control action to realize the accurate tracking control. Finally, compared with explicit model predictive controller and linear quadratic regulator, the designed track controller based on DDPG has shorter adjusting time and smaller overshoot than explicit model predictive controller and linear quadratic regulator.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.