An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion
| dc.contributor.author | Recep Ozalp | |
| dc.contributor.author | Çaǧri Kaymak | |
| dc.contributor.author | Özal Yildirim | |
| dc.contributor.author | Ayşegül Uçar | |
| dc.contributor.author | Yakup Demir | |
| dc.contributor.author | Cüneyt Güzeliş | |
| dc.contributor.author | Ozaln, Recen | |
| dc.contributor.author | Yildirum, Ozal | |
| dc.contributor.author | Guzelis, Cuneyt | |
| dc.contributor.author | Kaymak, Cagri | |
| dc.contributor.author | Ucar, Ayscgul | |
| dc.contributor.author | Demir, Yakup | |
| dc.contributor.editor | P. Koprinkova-Hristova , T. Yildirim , V. Piuri , L. Iliadis , D. Camacho | |
| dc.date.accessioned | 2025-10-06T17:51:21Z | |
| dc.date.issued | 2019 | |
| dc.description.abstract | Deep reinforcement learning (DRL) exhibits a promising approach for controlling humanoid robot locomotion. However only values relating sensors such as IMU gyroscope and GPS are not sufficient robots to learn their locomotion skills. In this article we aim to show the success of vision based DRL. We propose a new vision based deep reinforcement learning algorithm for the locomotion of the Robotis-op2 humanoid robot for the first time. In experimental setup we construct the locomotion of humanoid robot in a specific environment in the Webots software. We use Double Dueling Q Networks (D3QN) and Deep Q Networks (DQN) that are a kind of reinforcement learning algorithm. We present the performance of vision based DRL algorithm on a locomotion experiment. The experimental results show that D3QN is better than DQN in that stable locomotion and fast training and the vision based DRL algorithms will be successfully able to use at the other complex environments and applications. © 2020 Elsevier B.V. All rights reserved. | |
| dc.description.sponsorship | Bulgarian National Science Fund, Bulgarian Section | |
| dc.description.sponsorship | TUBITAK, (117E589); Nvidia; Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK | |
| dc.description.sponsorship | This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation. | |
| dc.description.sponsorship | ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation. | |
| dc.identifier.doi | 10.1109/INISTA.2019.8778209 | |
| dc.identifier.isbn | 9781728118628 | |
| dc.identifier.scopus | 2-s2.0-85070740676 | |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85070740676&doi=10.1109%2FINISTA.2019.8778209&partnerID=40&md5=db28d19538187e86361bfe93872b4962 | |
| dc.identifier.uri | https://gcris.yasar.edu.tr/handle/123456789/9395 | |
| dc.identifier.uri | https://doi.org/10.1109/INISTA.2019.8778209 | |
| dc.language.iso | English | |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | |
| dc.relation.ispartof | 2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications INISTA 2019 | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.subject | Control, Deep Reinforcement Learning, Humanoid Robots, Locomotion Skills, Anthropomorphic Robots, Body Sensor Networks, Control Engineering, Intelligent Systems, Learning Algorithms, Machine Learning, Reinforcement Learning, Complex Environments, Humanoid Robot, Humanoid Robot Locomotion, Stable Locomotion, Vision Based, Webots Software, Deep Learning | |
| dc.subject | Anthropomorphic robots, Body sensor networks, Control engineering, Intelligent systems, Learning algorithms, Machine learning, Reinforcement learning, Complex environments, Humanoid robot, Humanoid robot locomotion, Stable locomotion, Vision based, Webots software, Deep learning | |
| dc.subject | Locomotion Skills | |
| dc.subject | control | |
| dc.subject | Deep Reinforcement Learning | |
| dc.subject | Humanoid Robots | |
| dc.title | An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion | |
| dc.type | Conference Object | |
| dspace.entity.type | Publication | |
| gdc.author.scopusid | 57194274546 | |
| gdc.author.scopusid | 55293146500 | |
| gdc.author.scopusid | 7006472523 | |
| gdc.author.scopusid | 56024631400 | |
| gdc.author.scopusid | 55937768800 | |
| gdc.author.scopusid | 7004549716 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C4 | |
| gdc.coar.type | text::conference output | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | ||
| gdc.description.departmenttemp | [Ozaln R.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Kaymak C.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Yildirum O.] Department of Computer Engineering, Muznur University, Tunceli, Turkey; [Ucar A.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Demir Y.] Department of Mechatronics Engineering, Firat University, Elazig, Turkey; [Guzelis C.] Department of Electrical and Electronics Engineering, Yasar University, Izmir, Turkey | |
| gdc.description.endpage | 5 | |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
| gdc.description.startpage | 1 | |
| gdc.identifier.openalex | W2963177351 | |
| gdc.index.type | Scopus | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 2.0 | |
| gdc.oaire.influence | 2.5940006E-9 | |
| gdc.oaire.isgreen | false | |
| gdc.oaire.popularity | 4.194291E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 0209 industrial biotechnology | |
| gdc.oaire.sciencefields | 02 engineering and technology | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.6847 | |
| gdc.openalex.normalizedpercentile | 0.67 | |
| gdc.opencitations.count | 7 | |
| gdc.plumx.crossrefcites | 1 | |
| gdc.plumx.mendeley | 29 | |
| gdc.plumx.scopuscites | 16 | |
| gdc.scopus.citedcount | 16 | |
| gdc.virtual.author | Güzeliş, Cüneyt | |
| person.identifier.scopus-author-id | Ozalp- Recep (57194274546), Kaymak- Çaǧri (56024631400), Yildirim- Özal (55293146500), Uçar- Ayşegül (7004549716), Demir- Yakup (7006472523), Güzeliş- Cüneyt (55937768800) | |
| project.funder.name | Funding text 1: ACKNOWLEDGMENT This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation. | |
| relation.isAuthorOfPublication | 10f564e3-6c1c-4354-9ce3-b5ac01e39680 | |
| relation.isAuthorOfPublication.latestForDiscovery | 10f564e3-6c1c-4354-9ce3-b5ac01e39680 | |
| relation.isOrgUnitOfPublication | ac5ddece-c76d-476d-ab30-e4d3029dee37 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | ac5ddece-c76d-476d-ab30-e4d3029dee37 |
