Development of a New Robust Stable Walking Algorithm for a Humanoid Robot Using Deep Reinforcement Learning with Multi-Sensor Data Fusion

dc.contributor.author Cagri Kaymak
dc.contributor.author Aysegul Ucar
dc.contributor.author Cuneyt Guzelis
dc.contributor.author Güzeliş, Cüneyt
dc.contributor.author Kaymak, Çağrı
dc.contributor.author Uçar, Ayşegül
dc.date FEB
dc.date.accessioned 2025-10-06T16:21:28Z
dc.date.issued 2023
dc.description.abstract The difficult task of creating reliable mobility for humanoid robots has been studied for decades. Even though several different walking strategies have been put forth and walking performance has substantially increased stability still needs to catch up to expectations. Applications for Reinforcement Learning (RL) techniques are constrained by low convergence and ineffective training. This paper develops a new robust and efficient framework based on the Robotis-OP2 humanoid robot combined with a typical trajectory-generating controller and Deep Reinforcement Learning (DRL) to overcome these limitations. This framework consists of optimizing the walking trajectory parameters and posture balancing system. Multi-sensors of the robot are used for parameter optimization. Walking parameters are optimized using the Dueling Double Deep Q Network (D3QN) one of the DRL algorithms in the Webots simulator. The hip strategy is adopted for the posture balancing system. Experimental studies are carried out in both simulation and real environments with the proposed framework and Robotis-OP2's walking algorithm. Experimental results show that the robot performs more stable walking with the proposed framework than Robotis-OP2's walking algorithm. It is thought that the proposed framework will be beneficial for researchers studying in the field of humanoid robot locomotion.
dc.description.sponsorship This study was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbered 117E589. Additionally, the GTX Titan X Pascal GPU in this research was donated by NVIDIA Corporation.
dc.description.sponsorship Scientific and Technological Research Council of Turkey (TUBITAK) and NVIDIA.
dc.description.sponsorship Nvidia; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (117E589)
dc.identifier.doi 10.3390/electronics12030568
dc.identifier.issn 2079-9292
dc.identifier.scopus 2-s2.0-85147877640
dc.identifier.uri http://dx.doi.org/10.3390/electronics12030568
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/6899
dc.identifier.uri https://doi.org/10.3390/electronics12030568
dc.language.iso English
dc.publisher MDPI
dc.relation.ispartof Electronics
dc.rights info:eu-repo/semantics/openAccess
dc.source ELECTRONICS
dc.subject humanoid robot, stable walking, parameter optimization, Deep Reinforcement Learning, multi-sensor
dc.subject Stable Walking
dc.subject Deep Reinforcement Learning
dc.subject Parameter Optimization
dc.subject Multi-sensor
dc.subject Humanoid Robot
dc.title Development of a New Robust Stable Walking Algorithm for a Humanoid Robot Using Deep Reinforcement Learning with Multi-Sensor Data Fusion
dc.type Article
dspace.entity.type Publication
gdc.author.id KAYMAK, CAGRI/0000-0001-5343-226X
gdc.author.id ucar, aysegul/0000-0002-5253-3779
gdc.author.scopusid 56024631400
gdc.author.scopusid 55937768800
gdc.author.scopusid 7004549716
gdc.author.wosid ucar, aysegul/P-8443-2015
gdc.author.wosid KAYMAK, CAGRI/W-4261-2018
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Kaymak, Cagri; Ucar, Aysegul] Firat Univ, Engn Fac, Mechatron Engn Dept, TR-23119 Elazig, Turkiye; [Guzelis, Cuneyt] Yasar Univ, Engn Fac, Elect & Engn Dept, TR-35100 Izmir, Turkiye
gdc.description.issue 3
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage 568
gdc.description.volume 12
gdc.description.woscitationindex Science Citation Index Expanded
gdc.identifier.openalex W4317743590
gdc.identifier.wos WOS:000988345100001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 17.0
gdc.oaire.influence 3.4739729E-9
gdc.oaire.isgreen true
gdc.oaire.keywords stable walking
gdc.oaire.keywords humanoid robot
gdc.oaire.keywords Deep Reinforcement Learning
gdc.oaire.keywords parameter optimization
gdc.oaire.keywords multi-sensor
gdc.oaire.popularity 1.4666002E-8
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 2.0694
gdc.openalex.normalizedpercentile 0.86
gdc.opencitations.count 15
gdc.plumx.crossrefcites 12
gdc.plumx.mendeley 33
gdc.plumx.newscount 1
gdc.plumx.scopuscites 17
gdc.scopus.citedcount 18
gdc.virtual.author Güzeliş, Cüneyt
gdc.wos.citedcount 13
person.identifier.orcid ucar- aysegul/0000-0002-5253-3779, KAYMAK- CAGRI/0000-0001-5343-226X
project.funder.name Scientific and Technological Research Council of Turkey (TUBITAK), NVIDIA
publicationissue.issueNumber 3
publicationvolume.volumeNumber 12
relation.isAuthorOfPublication 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
electronics-12-00568-v3.pdf
Size:
16.09 MB
Format:
Adobe Portable Document Format