An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion

dc.contributor.author Recep Ozalp
dc.contributor.author Çaǧri Kaymak
dc.contributor.author Özal Yildirim
dc.contributor.author Ayşegül Uçar
dc.contributor.author Yakup Demir
dc.contributor.author Cüneyt Güzeliş
dc.contributor.author Ozaln, Recen
dc.contributor.author Yildirum, Ozal
dc.contributor.author Guzelis, Cuneyt
dc.contributor.author Kaymak, Cagri
dc.contributor.author Ucar, Ayscgul
dc.contributor.author Demir, Yakup
dc.contributor.editor P. Koprinkova-Hristova , T. Yildirim , V. Piuri , L. Iliadis , D. Camacho
dc.date.accessioned 2025-10-06T17:51:21Z
dc.date.issued 2019
dc.description.abstract Deep reinforcement learning (DRL) exhibits a promising approach for controlling humanoid robot locomotion. However only values relating sensors such as IMU gyroscope and GPS are not sufficient robots to learn their locomotion skills. In this article we aim to show the success of vision based DRL. We propose a new vision based deep reinforcement learning algorithm for the locomotion of the Robotis-op2 humanoid robot for the first time. In experimental setup we construct the locomotion of humanoid robot in a specific environment in the Webots software. We use Double Dueling Q Networks (D3QN) and Deep Q Networks (DQN) that are a kind of reinforcement learning algorithm. We present the performance of vision based DRL algorithm on a locomotion experiment. The experimental results show that D3QN is better than DQN in that stable locomotion and fast training and the vision based DRL algorithms will be successfully able to use at the other complex environments and applications. © 2020 Elsevier B.V. All rights reserved.
dc.description.sponsorship Bulgarian National Science Fund, Bulgarian Section
dc.description.sponsorship TUBITAK, (117E589); Nvidia; Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK
dc.description.sponsorship This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.description.sponsorship ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.identifier.doi 10.1109/INISTA.2019.8778209
dc.identifier.isbn 9781728118628
dc.identifier.scopus 2-s2.0-85070740676
dc.identifier.uri https://www.scopus.com/inward/record.uri?eid=2-s2.0-85070740676&doi=10.1109%2FINISTA.2019.8778209&partnerID=40&md5=db28d19538187e86361bfe93872b4962
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/9395
dc.identifier.uri https://doi.org/10.1109/INISTA.2019.8778209
dc.language.iso English
dc.publisher Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof 2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications INISTA 2019
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Control, Deep Reinforcement Learning, Humanoid Robots, Locomotion Skills, Anthropomorphic Robots, Body Sensor Networks, Control Engineering, Intelligent Systems, Learning Algorithms, Machine Learning, Reinforcement Learning, Complex Environments, Humanoid Robot, Humanoid Robot Locomotion, Stable Locomotion, Vision Based, Webots Software, Deep Learning
dc.subject Anthropomorphic robots, Body sensor networks, Control engineering, Intelligent systems, Learning algorithms, Machine learning, Reinforcement learning, Complex environments, Humanoid robot, Humanoid robot locomotion, Stable locomotion, Vision based, Webots software, Deep learning
dc.subject Locomotion Skills
dc.subject control
dc.subject Deep Reinforcement Learning
dc.subject Humanoid Robots
dc.title An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion
dc.type Conference Object
dspace.entity.type Publication
gdc.author.scopusid 57194274546
gdc.author.scopusid 55293146500
gdc.author.scopusid 7006472523
gdc.author.scopusid 56024631400
gdc.author.scopusid 55937768800
gdc.author.scopusid 7004549716
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Ozaln R.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Kaymak C.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Yildirum O.] Department of Computer Engineering, Muznur University, Tunceli, Turkey; [Ucar A.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Demir Y.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Guzelis C.] Department of Electrical and Electronics Engineering, Yasar University, Izmir, Turkey
gdc.description.endpage 5
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.startpage 1
gdc.identifier.openalex W2963177351
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.5940006E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 4.194291E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0209 industrial biotechnology
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 0.6847
gdc.openalex.normalizedpercentile 0.67
gdc.opencitations.count 7
gdc.plumx.crossrefcites 1
gdc.plumx.mendeley 29
gdc.plumx.scopuscites 16
gdc.scopus.citedcount 16
gdc.virtual.author Güzeliş, Cüneyt
person.identifier.scopus-author-id Ozalp- Recep (57194274546), Kaymak- Çaǧri (56024631400), Yildirim- Özal (55293146500), Uçar- Ayşegül (7004549716), Demir- Yakup (7006472523), Güzeliş- Cüneyt (55937768800)
project.funder.name Funding text 1: ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
relation.isAuthorOfPublication 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files