Development of a New Robust Stable Walking Algorithm for a Humanoid Robot Using Deep Reinforcement Learning with Multi-Sensor Data Fusion

Cagri Kaymak; Aysegul Ucar; Cuneyt Guzelis; Güzeliş, Cüneyt; Kaymak, Çağrı; Uçar, Ayşegül

Development of a New Robust Stable Walking Algorithm for a Humanoid Robot Using Deep Reinforcement Learning with Multi-Sensor Data Fusion

dc.contributor.author	Cagri Kaymak
dc.contributor.author	Aysegul Ucar
dc.contributor.author	Cuneyt Guzelis
dc.contributor.author	Güzeliş, Cüneyt
dc.contributor.author	Kaymak, Çağrı
dc.contributor.author	Uçar, Ayşegül
dc.date	FEB
dc.date.accessioned	2025-10-06T16:21:28Z
dc.date.issued	2023
dc.description.abstract	The difficult task of creating reliable mobility for humanoid robots has been studied for decades. Even though several different walking strategies have been put forth and walking performance has substantially increased stability still needs to catch up to expectations. Applications for Reinforcement Learning (RL) techniques are constrained by low convergence and ineffective training. This paper develops a new robust and efficient framework based on the Robotis-OP2 humanoid robot combined with a typical trajectory-generating controller and Deep Reinforcement Learning (DRL) to overcome these limitations. This framework consists of optimizing the walking trajectory parameters and posture balancing system. Multi-sensors of the robot are used for parameter optimization. Walking parameters are optimized using the Dueling Double Deep Q Network (D3QN) one of the DRL algorithms in the Webots simulator. The hip strategy is adopted for the posture balancing system. Experimental studies are carried out in both simulation and real environments with the proposed framework and Robotis-OP2's walking algorithm. Experimental results show that the robot performs more stable walking with the proposed framework than Robotis-OP2's walking algorithm. It is thought that the proposed framework will be beneficial for researchers studying in the field of humanoid robot locomotion.
dc.description.sponsorship	This study was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbered 117E589. Additionally, the GTX Titan X Pascal GPU in this research was donated by NVIDIA Corporation.
dc.description.sponsorship	Scientific and Technological Research Council of Turkey (TUBITAK) and NVIDIA.
dc.description.sponsorship	Nvidia; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (117E589)
dc.identifier.doi	10.3390/electronics12030568
dc.identifier.issn	2079-9292
dc.identifier.scopus	2-s2.0-85147877640
dc.identifier.uri	http://dx.doi.org/10.3390/electronics12030568
dc.identifier.uri	https://gcris.yasar.edu.tr/handle/123456789/6899
dc.identifier.uri	https://doi.org/10.3390/electronics12030568
dc.language.iso	English
dc.publisher	MDPI
dc.relation.ispartof	Electronics
dc.rights	info:eu-repo/semantics/openAccess
dc.source	ELECTRONICS
dc.subject	humanoid robot, stable walking, parameter optimization, Deep Reinforcement Learning, multi-sensor
dc.subject	Stable Walking
dc.subject	Deep Reinforcement Learning
dc.subject	Parameter Optimization
dc.subject	Multi-sensor
dc.subject	Humanoid Robot
dc.title	Development of a New Robust Stable Walking Algorithm for a Humanoid Robot Using Deep Reinforcement Learning with Multi-Sensor Data Fusion
dc.type	Article
dspace.entity.type	Publication
gdc.author.id	KAYMAK, CAGRI/0000-0001-5343-226X
gdc.author.id	ucar, aysegul/0000-0002-5253-3779
gdc.author.scopusid	56024631400
gdc.author.scopusid	55937768800
gdc.author.scopusid	7004549716
gdc.author.wosid	ucar, aysegul/P-8443-2015
gdc.author.wosid	KAYMAK, CAGRI/W-4261-2018
gdc.bip.impulseclass	C4
gdc.bip.influenceclass	C4
gdc.bip.popularityclass	C4
gdc.coar.type	text::journal::journal article
gdc.collaboration.industrial	false
gdc.description.department
gdc.description.departmenttemp	[Kaymak, Cagri; Ucar, Aysegul] Firat Univ, Engn Fac, Mechatron Engn Dept, TR-23119 Elazig, Turkiye; [Guzelis, Cuneyt] Yasar Univ, Engn Fac, Elect & Engn Dept, TR-35100 Izmir, Turkiye
gdc.description.issue	3
gdc.description.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage	568
gdc.description.volume	12
gdc.description.woscitationindex	Science Citation Index Expanded
gdc.identifier.openalex	W4317743590
gdc.identifier.wos	WOS:000988345100001
gdc.index.type	WoS
gdc.index.type	Scopus
gdc.oaire.accesstype	GOLD
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	17.0
gdc.oaire.influence	3.4739729E-9
gdc.oaire.isgreen	true
gdc.oaire.keywords	stable walking
gdc.oaire.keywords	humanoid robot
gdc.oaire.keywords	Deep Reinforcement Learning
gdc.oaire.keywords	parameter optimization
gdc.oaire.keywords	multi-sensor
gdc.oaire.popularity	1.4666002E-8
gdc.oaire.publicfunded	false
gdc.oaire.sciencefields	0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields	02 engineering and technology
gdc.openalex.collaboration	National
gdc.openalex.fwci	2.0694
gdc.openalex.normalizedpercentile	0.86
gdc.opencitations.count	15
gdc.plumx.crossrefcites	12
gdc.plumx.mendeley	33
gdc.plumx.newscount	1
gdc.plumx.scopuscites	17
gdc.scopus.citedcount	18
gdc.virtual.author	Güzeliş, Cüneyt
gdc.wos.citedcount	13
person.identifier.orcid	ucar- aysegul/0000-0002-5253-3779, KAYMAK- CAGRI/0000-0001-5343-226X
project.funder.name	Scientific and Technological Research Council of Turkey (TUBITAK), NVIDIA
publicationissue.issueNumber	3
publicationvolume.volumeNumber	12
relation.isAuthorOfPublication	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication	ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery	ac5ddece-c76d-476d-ab30-e4d3029dee37

Files

Original bundle

Now showing 1 - 1 of 1

Name:: electronics-12-00568-v3.pdf
Size:: 16.09 MB
Format:: Adobe Portable Document Format

Download

Collections

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu