Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments

dc.contributor.author Simge Nur Aslan
dc.contributor.author Ayşegül Uçar
dc.contributor.author Cüneyt Güzeliş
dc.contributor.author Güzeliş, Cüneyt
dc.contributor.author Uçar, Ayşegül
dc.contributor.author Aslan, Simge Nur
dc.date.accessioned 2025-10-06T17:49:23Z
dc.date.issued 2023
dc.description.abstract In this paper a new Deep Wavelet Pyramid Scene Parsing Network (DW-PSPNet) is proposed as an effective combination of Discrete Wavelet Transform (DWT) inception module the channel and spatial attention modules and PSPNet. Improved semantic segmentation via the combination to our best knowledge is not yet reported in the literature. The paper has two main contributions: (1) a new backbone network into PSPNET introduced by a combination of DWT inspection modules and attention mechanisms, (2) a new and improved version of PSPNet base structure. Further three new modifications are introduced. First the drop activation function is used to increase validation and test accuracy of the segmentation. Second a skip connection from the backbone is applied to increase validation and test accuracies by restoring the resolution of feature maps via full utilization of multilevel semantic features. Third Inverse Wavelet Transform (IWT) and convolution layer are applied to obtain the segmented images without information loss. DW-PSPNet was implemented via our own data generated by using a Robotis-Op3 humanoid robot to detect objects in indoor environments and and benchmark data set. Simulation results show higher performance of the proposed network compared with that of previous successful networks in handling semantic segmentation tasks in indoor environments. Moreover extensive experiments on the benchmark Ade20K data set were also conducted. DW-PSPNET achieved an mIoU score of 45.97% on the ADE20K validation set which are new state-of-the-art results. © 2023 Elsevier B.V. All rights reserved.
dc.description.sponsorship This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.description.sponsorship Nvidia; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (117E589)
dc.identifier.doi 10.1007/s12652-022-04231-y
dc.identifier.issn 18685145, 18685137
dc.identifier.issn 1868-5137
dc.identifier.issn 1868-5145
dc.identifier.scopus 2-s2.0-85134345927
dc.identifier.uri https://www.scopus.com/inward/record.uri?eid=2-s2.0-85134345927&doi=10.1007%2Fs12652-022-04231-y&partnerID=40&md5=a0c8ffd600d3fd10a610d24df18742fc
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/8412
dc.identifier.uri https://doi.org/10.1007/s12652-022-04231-y
dc.language.iso English
dc.publisher Springer Science and Business Media Deutschland GmbH
dc.relation.ispartof Journal of Ambient Intelligence and Humanized Computing
dc.rights info:eu-repo/semantics/closedAccess
dc.source Journal of Ambient Intelligence and Humanized Computing
dc.subject Channel And Spatial Attention Mechanisms, Discrete Wavelet Transform, Humanoid Robots, Inception Module, Indoor Image Segmentation, Pspnet, Anthropomorphic Robots, Object Detection, Semantic Segmentation, Semantics, Signal Reconstruction, Attention Mechanisms, Channel And Spatial Attention Mechanism, Discrete-wavelet-transform, Humanoid Robot, Images Segmentations, Inception Module, Indoor Image Segmentation, Pspnet, Spatial Attention, Wavelet Pyramid, Discrete Wavelet Transforms
dc.subject Anthropomorphic robots, Object detection, Semantic Segmentation, Semantics, Signal reconstruction, Attention mechanisms, Channel and spatial attention mechanism, Discrete-wavelet-transform, Humanoid robot, Images segmentations, Inception module, Indoor image segmentation, PSPNet, Spatial attention, Wavelet pyramid, Discrete wavelet transforms
dc.subject Channel and Spatial Attention Mechanisms
dc.subject Discrete Wavelet Transform
dc.subject Humanoid Robots
dc.subject Inception Module
dc.subject Indoor Image Segmentation
dc.subject Pspnet
dc.title Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments
dc.type Article
dspace.entity.type Publication
gdc.author.scopusid 57219265872
gdc.author.scopusid 55937768800
gdc.author.scopusid 7004549716
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Aslan S.N.] Electronics and Automation Department, Vocational School, Istanbul Arel University, Istanbul, 34295, Turkey; [Uçar A.] Mechatronics Engineering Department, Engineering Faculty, Firat University, Elazig, 23119, Turkey; [Güzeliş C.] Electrical and Engineering Department, Engineering Faculty, Yaşar University, Izmir, 35100, Turkey
gdc.description.endpage 12695
gdc.description.issue 9
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage 12673
gdc.description.volume 14
gdc.identifier.openalex W4285088620
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.513421E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Inception Module
gdc.oaire.keywords 000
gdc.oaire.keywords Indoor Image Segmentation
gdc.oaire.keywords PSPNet
gdc.oaire.keywords Humanoid Robots
gdc.oaire.keywords Channel and Spatial Attention Mechanisms
gdc.oaire.keywords Discrete Wavelet Transform
gdc.oaire.keywords 004
gdc.oaire.popularity 2.4517643E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 0.2038
gdc.openalex.normalizedpercentile 0.47
gdc.opencitations.count 0
gdc.plumx.mendeley 5
gdc.plumx.scopuscites 3
gdc.scopus.citedcount 3
gdc.virtual.author Güzeliş, Cüneyt
oaire.citation.endPage 12695
oaire.citation.startPage 12673
person.identifier.scopus-author-id Aslan- Simge Nur (57219265872), Uçar- Ayşegül (7004549716), Güzeliş- Cüneyt (55937768800)
project.funder.name Funding text 1: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
publicationissue.issueNumber 9
publicationvolume.volumeNumber 14
relation.isAuthorOfPublication 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery 10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files