Logo
    • English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • English 
    • English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • Login
View Item 
  •   University of Thessaly Institutional Repository
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • View Item
  •   University of Thessaly Institutional Repository
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
Institutional repository
All of DSpace
  • Communities & Collections
  • By Issue Date
  • Authors
  • Titles
  • Subjects

A Deep Reinforcement Learning Motion Control Strategy of a Multi-rotor UAV for Payload Transportation with Minimum Swing

Thumbnail
Author
Panetsos F., Karras G.C., Kyriakopoulos K.J.
Date
2022
Language
en
DOI
10.1109/MED54222.2022.9837220
Keyword
Aircraft control
Deep learning
Learning algorithms
Reinforcement learning
Control strategies
Coupled dynamics
Deterministics
Multirotors
Neural-networks
Policy gradient
Reinforcement learning algorithms
Reinforcement learnings
Suspended loads
Swinging motions
Unmanned aerial vehicles (UAV)
Institute of Electrical and Electronics Engineers Inc.
Metadata display
Abstract
This paper addresses the problem of controlling a multirotor UAV with a cable-suspended load. In order to ensure the safe transportation of the load, the swinging motion, induced by the strongly coupled dynamics, has to be minimized. Specifically, using the Twin Delayed Deep Deterministic Policy Gradient (TD3) Reinforcement Learning algorithm, a policy Neural Network is trained in a model-free manner which navigates the vehicle to the desired waypoints while, simultaneously, compensating for the load oscillations. The learned policy network is incorporated into the cascaded control architecture of the autopilot by replacing the common PID position controller and, thus, communicating directly with the inner attitude one. The performance of the proposed policy is demonstrated through a comparative simulation and experimental study while using an octorotor UAV. © 2022 IEEE.
URI
http://hdl.handle.net/11615/77485
Collections
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19735]

Related items

Showing items related by title, author, creator and subject.

  • Thumbnail

    Εξυπνοι και αλληλεπιδρώμενοι πράκτορες e-learning, smartive e-learning agents - smart and interactive e-learning agents 

    Μόσχος, Λάκης (2011)
  • Thumbnail

    Μηχανική και ενισχυτική μάθηση μέσω του αλγορίθμου Q-learning 

    Μπάτσιος, Ιωάννης (2021)
  • Thumbnail

    Motivating Engineer Students in E-learning Courses with Problem Based Learning and Self-Regulated Learning on the apT2CLE4‘Research Methods’ Environment 

    Paraskeva F., Alexiou A., Bouta H., Mysirlaki S., Sotiropoulos D.J., Souki A.-M. (2019)
    More and more university programs try to establish an understanding of research methodology with relevant courses at undergraduate schools. Engineer students should have adequate academic training and experience to gain ...
htmlmap 

 

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister (MyDspace)
Help Contact
DepositionAboutHelpContact Us
Choose LanguageAll of DSpace
EnglishΕλληνικά
htmlmap