Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments
| dc.contributor.author | Simge Nur Aslan | |
| dc.contributor.author | Ayşegül Uçar | |
| dc.contributor.author | Cüneyt Güzeliş | |
| dc.contributor.author | Güzeliş, Cüneyt | |
| dc.contributor.author | Uçar, Ayşegül | |
| dc.contributor.author | Aslan, Simge Nur | |
| dc.date.accessioned | 2025-10-06T17:49:23Z | |
| dc.date.issued | 2023 | |
| dc.description.abstract | In this paper a new Deep Wavelet Pyramid Scene Parsing Network (DW-PSPNet) is proposed as an effective combination of Discrete Wavelet Transform (DWT) inception module the channel and spatial attention modules and PSPNet. Improved semantic segmentation via the combination to our best knowledge is not yet reported in the literature. The paper has two main contributions: (1) a new backbone network into PSPNET introduced by a combination of DWT inspection modules and attention mechanisms, (2) a new and improved version of PSPNet base structure. Further three new modifications are introduced. First the drop activation function is used to increase validation and test accuracy of the segmentation. Second a skip connection from the backbone is applied to increase validation and test accuracies by restoring the resolution of feature maps via full utilization of multilevel semantic features. Third Inverse Wavelet Transform (IWT) and convolution layer are applied to obtain the segmented images without information loss. DW-PSPNet was implemented via our own data generated by using a Robotis-Op3 humanoid robot to detect objects in indoor environments and and benchmark data set. Simulation results show higher performance of the proposed network compared with that of previous successful networks in handling semantic segmentation tasks in indoor environments. Moreover extensive experiments on the benchmark Ade20K data set were also conducted. DW-PSPNET achieved an mIoU score of 45.97% on the ADE20K validation set which are new state-of-the-art results. © 2023 Elsevier B.V. All rights reserved. | |
| dc.description.sponsorship | This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition, GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation. | |
| dc.description.sponsorship | Nvidia; Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (117E589) | |
| dc.identifier.doi | 10.1007/s12652-022-04231-y | |
| dc.identifier.issn | 18685145, 18685137 | |
| dc.identifier.issn | 1868-5137 | |
| dc.identifier.issn | 1868-5145 | |
| dc.identifier.scopus | 2-s2.0-85134345927 | |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85134345927&doi=10.1007%2Fs12652-022-04231-y&partnerID=40&md5=a0c8ffd600d3fd10a610d24df18742fc | |
| dc.identifier.uri | https://gcris.yasar.edu.tr/handle/123456789/8412 | |
| dc.identifier.uri | https://doi.org/10.1007/s12652-022-04231-y | |
| dc.language.iso | English | |
| dc.publisher | Springer Science and Business Media Deutschland GmbH | |
| dc.relation.ispartof | Journal of Ambient Intelligence and Humanized Computing | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.source | Journal of Ambient Intelligence and Humanized Computing | |
| dc.subject | Channel And Spatial Attention Mechanisms, Discrete Wavelet Transform, Humanoid Robots, Inception Module, Indoor Image Segmentation, Pspnet, Anthropomorphic Robots, Object Detection, Semantic Segmentation, Semantics, Signal Reconstruction, Attention Mechanisms, Channel And Spatial Attention Mechanism, Discrete-wavelet-transform, Humanoid Robot, Images Segmentations, Inception Module, Indoor Image Segmentation, Pspnet, Spatial Attention, Wavelet Pyramid, Discrete Wavelet Transforms | |
| dc.subject | Anthropomorphic robots, Object detection, Semantic Segmentation, Semantics, Signal reconstruction, Attention mechanisms, Channel and spatial attention mechanism, Discrete-wavelet-transform, Humanoid robot, Images segmentations, Inception module, Indoor image segmentation, PSPNet, Spatial attention, Wavelet pyramid, Discrete wavelet transforms | |
| dc.subject | Channel and Spatial Attention Mechanisms | |
| dc.subject | Discrete Wavelet Transform | |
| dc.subject | Humanoid Robots | |
| dc.subject | Inception Module | |
| dc.subject | Indoor Image Segmentation | |
| dc.subject | Pspnet | |
| dc.title | Development of a deep wavelet pyramid scene parsing semantic segmentation network for scene perception in indoor environments | |
| dc.type | Article | |
| dspace.entity.type | Publication | |
| gdc.author.scopusid | 57219265872 | |
| gdc.author.scopusid | 55937768800 | |
| gdc.author.scopusid | 7004549716 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C5 | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | ||
| gdc.description.departmenttemp | [Aslan S.N.] Electronics and Automation Department, Vocational School, Istanbul Arel University, Istanbul, 34295, Turkey; [Uçar A.] Mechatronics Engineering Department, Engineering Faculty, Firat University, Elazig, 23119, Turkey; [Güzeliş C.] Electrical and Engineering Department, Engineering Faculty, Yaşar University, Izmir, 35100, Turkey | |
| gdc.description.endpage | 12695 | |
| gdc.description.issue | 9 | |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| gdc.description.startpage | 12673 | |
| gdc.description.volume | 14 | |
| gdc.identifier.openalex | W4285088620 | |
| gdc.index.type | Scopus | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 0.0 | |
| gdc.oaire.influence | 2.513421E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | Inception Module | |
| gdc.oaire.keywords | 000 | |
| gdc.oaire.keywords | Indoor Image Segmentation | |
| gdc.oaire.keywords | PSPNet | |
| gdc.oaire.keywords | Humanoid Robots | |
| gdc.oaire.keywords | Channel and Spatial Attention Mechanisms | |
| gdc.oaire.keywords | Discrete Wavelet Transform | |
| gdc.oaire.keywords | 004 | |
| gdc.oaire.popularity | 2.4517643E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 0202 electrical engineering, electronic engineering, information engineering | |
| gdc.oaire.sciencefields | 02 engineering and technology | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.2038 | |
| gdc.openalex.normalizedpercentile | 0.47 | |
| gdc.opencitations.count | 0 | |
| gdc.plumx.mendeley | 5 | |
| gdc.plumx.scopuscites | 3 | |
| gdc.scopus.citedcount | 3 | |
| gdc.virtual.author | Güzeliş, Cüneyt | |
| oaire.citation.endPage | 12695 | |
| oaire.citation.startPage | 12673 | |
| person.identifier.scopus-author-id | Aslan- Simge Nur (57219265872), Uçar- Ayşegül (7004549716), Güzeliş- Cüneyt (55937768800) | |
| project.funder.name | Funding text 1: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation., Funding text 2: This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) grant numbers 117E589. In addition GTX Titan X Pascal GPU in this research was donated by the NVIDIA Corporation. | |
| publicationissue.issueNumber | 9 | |
| publicationvolume.volumeNumber | 14 | |
| relation.isAuthorOfPublication | 10f564e3-6c1c-4354-9ce3-b5ac01e39680 | |
| relation.isAuthorOfPublication.latestForDiscovery | 10f564e3-6c1c-4354-9ce3-b5ac01e39680 | |
| relation.isOrgUnitOfPublication | ac5ddece-c76d-476d-ab30-e4d3029dee37 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | ac5ddece-c76d-476d-ab30-e4d3029dee37 |
