Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments

Simge Nur Aslan; Ayşegül Uçar; Cüneyt Güzeliş; Güzeliş, Cüneyt; Uçar, Ayşegül; Aslan, Simge Nur

Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments

dc.contributor.author	Simge Nur Aslan
dc.contributor.author	Ayşegül Uçar
dc.contributor.author	Cüneyt Güzeliş
dc.contributor.author	Güzeliş, Cüneyt
dc.contributor.author	Uçar, Ayşegül
dc.contributor.author	Aslan, Simge Nur
dc.date.accessioned	2025-10-06T17:49:23Z
dc.date.issued	2023
dc.description.abstract	In this paper a new Deep Wavelet Pyramid Scene Parsing Network (DW-PSPNet) is proposed as an effective combination of Discrete Wavelet Transform (DWT) inception module the channel and spatial attention modules and PSPNet. Improved semantic segmentation via the combination to our best knowledge is not yet reported in the literature. The paper has two main contributions: (1) a new backbone network into PSPNET introduced by a combination of DWT inspection modules and attention mechanisms, (2) a new and improved version of PSPNet base structure. Further three new modifications are introduced. First the drop activation function is used to increase validation and test accuracy of the segmentation. Second a skip connection from the backbone is applied to increase validation and test accuracies by restoring the resolution of feature maps via full utilization of multilevel semantic features. Third Inverse Wavelet Transform (IWT) and convolution layer are applied to obtain the segmented images without information loss. DW-PSPNet was implemented via our own data generated by using a Robotis-Op3 humanoid robot to detect objects in indoor environments and and benchmark data set. Simulation results show higher performance of the proposed network compared with that of previous successful networks in handling semantic segmentation tasks in indoor environments. Moreover extensive experiments on the benchmark Ade20K data set were also conducted. DW-PSPNET achieved an mIoU score of 45.97% on the ADE20K validation set which are new state-of-the-art results. © 2023 Elsevier B.V. All rights reserved.
dc.description.sponsorship	This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
dc.description.sponsorship	Nvidia; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (117E589)
dc.identifier.doi	10.1007/s12652-022-04231-y
dc.identifier.issn	18685145, 18685137
dc.identifier.issn	1868-5137
dc.identifier.issn	1868-5145
dc.identifier.scopus	2-s2.0-85134345927
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85134345927&doi=10.1007%2Fs12652-022-04231-y&partnerID=40&md5=a0c8ffd600d3fd10a610d24df18742fc
dc.identifier.uri	https://gcris.yasar.edu.tr/handle/123456789/8412
dc.identifier.uri	https://doi.org/10.1007/s12652-022-04231-y
dc.language.iso	English
dc.publisher	Springer Science and Business Media Deutschland GmbH
dc.relation.ispartof	Journal of Ambient Intelligence and Humanized Computing
dc.rights	info:eu-repo/semantics/closedAccess
dc.source	Journal of Ambient Intelligence and Humanized Computing
dc.subject	Channel And Spatial Attention Mechanisms, Discrete Wavelet Transform, Humanoid Robots, Inception Module, Indoor Image Segmentation, Pspnet, Anthropomorphic Robots, Object Detection, Semantic Segmentation, Semantics, Signal Reconstruction, Attention Mechanisms, Channel And Spatial Attention Mechanism, Discrete-wavelet-transform, Humanoid Robot, Images Segmentations, Inception Module, Indoor Image Segmentation, Pspnet, Spatial Attention, Wavelet Pyramid, Discrete Wavelet Transforms
dc.subject	Anthropomorphic robots, Object detection, Semantic Segmentation, Semantics, Signal reconstruction, Attention mechanisms, Channel and spatial attention mechanism, Discrete-wavelet-transform, Humanoid robot, Images segmentations, Inception module, Indoor image segmentation, PSPNet, Spatial attention, Wavelet pyramid, Discrete wavelet transforms
dc.subject	Channel and Spatial Attention Mechanisms
dc.subject	Discrete Wavelet Transform
dc.subject	Humanoid Robots
dc.subject	Inception Module
dc.subject	Indoor Image Segmentation
dc.subject	Pspnet
dc.title	Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments
dc.type	Article
dspace.entity.type	Publication
gdc.author.scopusid	57219265872
gdc.author.scopusid	55937768800
gdc.author.scopusid	7004549716
gdc.bip.impulseclass	C5
gdc.bip.influenceclass	C5
gdc.bip.popularityclass	C5
gdc.coar.type	text::journal::journal article
gdc.collaboration.industrial	false
gdc.description.department
gdc.description.departmenttemp	[Aslan S.N.] Electronics and Automation Department, Vocational School, Istanbul Arel University, Istanbul, 34295, Turkey; [Uçar A.] Mechatronics Engineering Department, Engineering Faculty, Firat University, Elazig, 23119, Turkey; [Güzeliş C.] Electrical and Engineering Department, Engineering Faculty, Yaşar University, Izmir, 35100, Turkey
gdc.description.endpage	12695
gdc.description.issue	9
gdc.description.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage	12673
gdc.description.volume	14
gdc.identifier.openalex	W4285088620
gdc.index.type	Scopus
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	0.0
gdc.oaire.influence	2.513421E-9
gdc.oaire.isgreen	true
gdc.oaire.keywords	Inception Module
gdc.oaire.keywords	000
gdc.oaire.keywords	Indoor Image Segmentation
gdc.oaire.keywords	PSPNet
gdc.oaire.keywords	Humanoid Robots
gdc.oaire.keywords	Channel and Spatial Attention Mechanisms
gdc.oaire.keywords	Discrete Wavelet Transform
gdc.oaire.keywords	004
gdc.oaire.popularity	2.4517643E-9
gdc.oaire.publicfunded	false
gdc.oaire.sciencefields	0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields	02 engineering and technology
gdc.openalex.collaboration	National
gdc.openalex.fwci	0.2038
gdc.openalex.normalizedpercentile	0.47
gdc.opencitations.count	0
gdc.plumx.mendeley	5
gdc.plumx.scopuscites	3
gdc.scopus.citedcount	3
gdc.virtual.author	Güzeliş, Cüneyt
oaire.citation.endPage	12695
oaire.citation.startPage	12673
person.identifier.scopus-author-id	Aslan- Simge Nur (57219265872), Uçar- Ayşegül (7004549716), Güzeliş- Cüneyt (55937768800)
project.funder.name	Funding text 1: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation.
publicationissue.issueNumber	9
publicationvolume.volumeNumber	14
relation.isAuthorOfPublication	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isAuthorOfPublication.latestForDiscovery	10f564e3-6c1c-4354-9ce3-b5ac01e39680
relation.isOrgUnitOfPublication	ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery	ac5ddece-c76d-476d-ab30-e4d3029dee37

Collections

Scopus İndeksli Yayınlar Koleksiyonu

Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments

Files

Collections