An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion

Recep Ozalp; Çaǧri Kaymak; Özal Yildirim; Ayşegül Uçar; Yakup Demir; Cüneyt Güzeliş; Ozaln, Recen; Yildirum, Ozal; Guzelis, Cuneyt; Kaymak, Cagri; Ucar, Ayscgul; Demir, Yakup

An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion

dc.contributor.author	Recep Ozalp
dc.contributor.author	Çaǧri Kaymak
dc.contributor.author	Özal Yildirim
dc.contributor.author	Ayşegül Uçar
dc.contributor.author	Yakup Demir
dc.contributor.author	Cüneyt Güzeliş
dc.contributor.author	Ozaln, Recen
dc.contributor.author	Yildirum, Ozal
dc.contributor.author	Guzelis, Cuneyt
dc.contributor.author	Kaymak, Cagri
dc.contributor.author	Ucar, Ayscgul
dc.contributor.author	Demir, Yakup
dc.contributor.editor	P. Koprinkova-Hristova , T. Yildirim , V. Piuri , L. Iliadis , D. Camacho
dc.date.accessioned	2025-10-06T17:51:21Z
dc.date.issued	2019
dc.description.abstract	Deep reinforcement learning (DRL) exhibits a promising approach for controlling humanoid robot locomotion. However only values relating sensors such as IMU gyroscope and GPS are not sufficient robots to learn their locomotion skills. In this article we aim to show the success of vision based DRL. We propose a new vision based deep reinforcement learning algorithm for the locomotion of the Robotis-op2 humanoid robot for the first time. In experimental setup we construct the locomotion of humanoid robot in a specific environment in the Webots software. We use Double Dueling Q Networks (D3QN) and Deep Q Networks (DQN) that are a kind of reinforcement learning algorithm. We present the performance of vision based DRL algorithm on a locomotion experiment. The experimental results show that D3QN is better than DQN in that stable locomotion and fast training and the vision based DRL algorithms will be successfully able to use at the other complex environments and applications. © 2020 Elsevier B.V. All rights reserved.
dc.description.sponsorship	Bulgarian National Science Fund, Bulgarian Section
dc.description.sponsorship	TUBITAK, (117E589); Nvidia; Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK
dc.description.sponsorship	This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.description.sponsorship	ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.identifier.doi	10.1109/INISTA.2019.8778209
dc.identifier.isbn	9781728118628
dc.identifier.scopus	2-s2.0-85070740676
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85070740676&doi=10.1109%2FINISTA.2019.8778209&partnerID=40&md5=db28d19538187e86361bfe93872b4962
dc.identifier.uri	https://gcris.yasar.edu.tr/handle/123456789/9395
dc.identifier.uri	https://doi.org/10.1109/INISTA.2019.8778209
dc.language.iso	English
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications INISTA 2019
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Control, Deep Reinforcement Learning, Humanoid Robots, Locomotion Skills, Anthropomorphic Robots, Body Sensor Networks, Control Engineering, Intelligent Systems, Learning Algorithms, Machine Learning, Reinforcement Learning, Complex Environments, Humanoid Robot, Humanoid Robot Locomotion, Stable Locomotion, Vision Based, Webots Software, Deep Learning
dc.subject	Anthropomorphic robots, Body sensor networks, Control engineering, Intelligent systems, Learning algorithms, Machine learning, Reinforcement learning, Complex environments, Humanoid robot, Humanoid robot locomotion, Stable locomotion, Vision based, Webots software, Deep learning
dc.subject	Locomotion Skills
dc.subject	control
dc.subject	Deep Reinforcement Learning
dc.subject	Humanoid Robots
dc.title	An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion
dc.type	Conference Object
dspace.entity.type	Publication
gdc.author.scopusid	57194274546
gdc.author.scopusid	55293146500
gdc.author.scopusid	7006472523
gdc.author.scopusid	56024631400
gdc.author.scopusid	55937768800
gdc.author.scopusid	7004549716
gdc.bip.impulseclass	C5
gdc.bip.influenceclass	C5
gdc.bip.popularityclass	C4
gdc.coar.type	text::conference output
gdc.collaboration.industrial	false
gdc.description.department
gdc.description.departmenttemp	[Ozaln R.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Kaymak C.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Yildirum O.] Department of Computer Engineering, Muznur University, Tunceli, Turkey; [Ucar A.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Demir Y.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Guzelis C.] Department of Electrical and Electronics Engineering, Yasar University, Izmir, Turkey
gdc.description.endpage	5
gdc.description.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.startpage	1
gdc.identifier.openalex	W2963177351
gdc.index.type	Scopus
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	2.0
gdc.oaire.influence	2.5940006E-9
gdc.oaire.isgreen	false
gdc.oaire.popularity	4.194291E-9
gdc.oaire.publicfunded	false
gdc.oaire.sciencefields	0209 industrial biotechnology
gdc.oaire.sciencefields	02 engineering and technology
gdc.openalex.collaboration	National
gdc.openalex.fwci	0.6847
gdc.openalex.normalizedpercentile	0.67
gdc.opencitations.count	7
gdc.plumx.crossrefcites	1
gdc.plumx.mendeley	29
gdc.plumx.scopuscites	16
gdc.scopus.citedcount	16
gdc.virtual.author	Güzeliş, Cüneyt
person.identifier.scopus-author-id	Ozalp- Recep (57194274546), Kaymak- Çaǧri (56024631400), Yildirim- Özal (55293146500), Uçar- Ayşegül (7004549716), Demir- Yakup (7006472523), Güzeliş- Cüneyt (55937768800)
project.funder.name	Funding text 1: ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
relation.isAuthorOfPublication	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication	ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery	ac5ddece-c76d-476d-ab30-e4d3029dee37

Collections

Scopus İndeksli Yayınlar Koleksiyonu

An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion

Files

Collections