Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning

dc.contributor.author Simge Nur Aslan
dc.contributor.author Burak Tasci
dc.contributor.author Aysegul Ucar
dc.contributor.author Cuneyt Guzelis
dc.contributor.editor M Lustrek
dc.contributor.editor E Bashir
dc.coverage.spatial Middlesex Univ Dubai ELECTR NETWORK
dc.date.accessioned 2025-10-06T16:20:49Z
dc.date.issued 2021
dc.description.abstract This paper proposes an algorithm for learning to move the desired object by humanoid robots. In this algorithm the semantic segmentation algorithm and Deep Reinforcement Learning (DRL) algorithms are combined. The semantic segmentation algorithm is used to detect and recognize the object be moved. DRL algorithms are used at the walking and grasping steps. Deep Q Network (DQN) is used to walk towards the target object by means of the previously defined actions at the gate manager and the different head positions of the robot. Deep Deterministic Policy Gradient (DDPG) network is used for grasping by means of the continuous actions. The previously defined commands are finally assigned for the robot to stand up turn left side and move forward together with the object. In the experimental setup the Robotis-Op3 humanoid robot is used. The obtained results show that the proposed algorithm has successfully worked.
dc.identifier.doi 10.3233/AISE210092
dc.identifier.isbn 978-1-64368-187-0, 978-1-64368-186-3
dc.identifier.issn 1875-4163
dc.identifier.uri http://dx.doi.org/10.3233/AISE210092
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/6566
dc.language.iso English
dc.publisher IOS PRESS
dc.relation.ispartof 17th International Conference on Intelligent Environments (IE)
dc.source INTELLIGENT ENVIRONMENTS 2021
dc.subject Humanoid robots, DQN, DDPG, deep semantic segmentation, object manipulation, locomotion
dc.subject NAVIGATION
dc.title Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning
dc.type Conference Object
dspace.entity.type Publication
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.identifier.openalex W3171469064
gdc.index.type WoS
gdc.oaire.accesstype HYBRID
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.4860618E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 2.933143E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 1.9742
gdc.openalex.normalizedpercentile 0.84
gdc.opencitations.count 2
gdc.plumx.mendeley 3
gdc.plumx.scopuscites 3
oaire.citation.endPage 155
oaire.citation.startPage 143
person.identifier.orcid TASCI- BURAK/0000-0002-4490-0946, ucar- aysegul/0000-0002-5253-3779
project.funder.name Scientific and Technological Research Council of Turkey (TUBITAK) [117E589]
publicationvolume.volumeNumber 29
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files