Evaluation of Convolutional Networks for Event Camera Face Pose Alignment

dc.contributor.author Arman Savran
dc.contributor.author Burhan Burak Oral
dc.contributor.author Alptuğ Çakıcı
dc.contributor.author Oral, Burhan Burak
dc.contributor.author Çakıcı, Alptuğ
dc.contributor.author Savran, Arman
dc.date.accessioned 2025-10-22T16:04:53Z
dc.date.issued 2025
dc.description.abstract Event camera offers substantial advantages over conventional video cameras with their efficiency extremely high temporal resolutions low latency and high dynamic range. These benefits have led to applications in various vision domains. Recently they have been applied in facial recognition tasks as well. However while significant advantages of event cameras in some facial processing tasks have been demonstrated the initial stage in almost any task i.e. face alignment is not at par with the conventional cameras. This study investigates the use of face alignment convolutional networks regarding both performance and complexity for event camera processing. Our aim is event camera face pose alignment that can be used as an efficient preprocessor for facial tasks. Therefore we comparatively evaluate simple convolutional coordinate regression with a hybrid of coordinate and heatmap regression known as pixel-in-pixel regression. Our experimental results reveal the superior performance of the hybrid method. However we also show that if there is a computation bottleneck simple convolutional coordinate regression is preferable for their low resource requirements though at the expense of some performance loss.
dc.identifier.citation [1] G. Gallego et al. “Event-Based Vision: A Survey” IEEE Trans. Pattern Anal. Mach. Intell. vol. 44 no. 1 pp. 154–180 Jan. 2022 doi: 10.1109/TPAMI.2020.3008413.[2] G. Tan Y. Wang H. Han Y. Cao F. Wu and Z.-J. Zha “Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) New Orleans LA USA: IEEE Jun. 2022 pp. 20062–20071. doi: 10.1109/CVPR52688.2022.01946.[3] G. Moreira A. Graca B. Silva P. Martins and J. Batista “Neuromorphic Event-based Face Identity Recognition” in 2022 26th International Conference on Pattern Recognition (ICPR) Montreal QC Canada: IEEE Aug. 2022 pp. 922–929. doi: 10.1109/ICPR56361.2022.9956236.[4] A. Savran “Fully Convolutional Event-camera Voice Activity Detection Based on Event Intensity” in 2023 Innovations in Intelligent Systems and Applications Conference (ASYU) Sivas Turkiye: IEEE Oct. 2023 pp. 1–6. doi: 10.1109/ASYU58738.2023.10296754.[5] A. Savran “Multi-timescale boosting for efficient and improved event camera face pose alignment” Computer Vision and Image Understanding vol. 236 p. 103817 Nov. 2023 doi: 10.1016/j.cviu.2023.103817.[6] A. Savran and C. Bartolozzi “Face Pose Alignment with Event Cameras” Sensors vol. 20 no. 24 p. 7079 Dec. 2020 doi: 10.3390/s20247079.[7] Z.-H. Feng J. Kittler M. Awais and X.-J. Wu “Rectified Wing Loss for Efficient and Robust Facial Landmark Localisation with Convolutional Neural Networks” Int J Comput Vis vol. 128 no. 8–9 pp. 2126–2145 Sep. 2020 doi: 10.1007/s11263-019- 01275-0.[8] H. Jin S. Liao and L. Shao “Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild” Int J Comput Vis vol. 129 no. 12 pp. 3174– 3194 Dec. 2021 doi: 10.1007/s11263-021-01521-4.[9] B. Browatzki and C. Wallraven “3FabRec: Fast Few- Shot Face Alignment by Reconstruction” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Seattle WA USA: IEEE Jun. 2020 pp. 6109–6119. doi: 10.1109/CVPR42600.2020.00615.[10] Y. Sun X. Wang and X. Tang “Deep Convolutional Network Cascade for Facial Point Detection” in 2013 IEEE Conference on Computer Vision and Pattern Recognition Portland OR USA: IEEE Jun. 2013 pp. 3476–3483. doi: 10.1109/CVPR.2013.446.[11] Y. Wu T. Hassner K. Kim G. Medioni and P. Natarajan “Facial Landmark Detection with Tweaked Convolutional Neural Networks” IEEE Trans. Pattern Anal. Mach. Intell. vol. 40 no. 12 pp. 3067–3074 Dec. 2018 doi: 10.1109/TPAMI.2017.2787130.[12] S. Honari P. Molchanov S. Tyree P. Vincent C. Pal and J. Kautz “Improving Landmark Localization with Semi-Supervised Learning” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Salt Lake City UT USA: IEEE Jun. 2018 pp. 1546–1555. doi: 10.1109/CVPR.2018.00167.[13] A. Kumar et al. “LUVLi Face Alignment: Estimating Landmarks’ Location Uncertainty and Visibility Likelihood” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Seattle WA USA: IEEE Jun. 2020 pp. 8233–8243. doi: 10.1109/CVPR42600.2020.00826.[14] X. Dong and Y. Yang “Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection” in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) Seoul Korea (South): IEEE Oct. 2019 pp. 783–792. doi: 10.1109/ICCV.2019.00087.[15] X. Wang L. Bo and L. Fuxin “Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression” arXiv:1904.07399 [cs] May 2020 Accessed: Mar. 27 2022. [Online]. Available: http://arxiv.org/abs/1904.07399[16] M. Cannici M. Ciccone A. Romanoni and M. Matteucci “Asynchronous Convolutional Networks for Object Detection in Neuromorphic Cameras” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Long Beach CA USA: IEEE Jun. 2019 pp. 1656–1665. doi: 10.1109/CVPRW.2019.00209.[17] F. Paredes-Valles and G. C. H. E. de Croon “Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Nashville TN USA: IEEE Jun. 2021 pp. 3445–3454. doi: 10.1109/CVPR46437.2021.00345.[18] D. Gehrig A. Loquercio K. Derpanis and D. Scaramuzza “End-to-End Learning of Representations for Asynchronous Event-Based Data” in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) Seoul Korea (South): IEEE Oct. 2019 pp. 5632–5642. doi: 10.1109/ICCV.2019.00573.[19] E. Perot P. de Tournemire D. Nitti J. Masci and A. Sironi “Learning to Detect Objects with a 1 Megapixel Event Camera.” arXiv Dec. 09 2020. Accessed: Apr. 24 2024. [Online]. Available: http://arxiv.org/abs/2009.13436[20] A. Kugele T. Pfeil M. Pfeiffer and E. Chicca “How Many Events Make an Object? Improving Single-frame Object Detection on the 1 Mpx Dataset” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Vancouver BC Canada: IEEE Jun. 2023 pp. 3913– 3922. doi: 10.1109/CVPRW59228.2023.00406.[21] C. Boretti P. Bich F. Pareschi L. Prono R. Rovatti and G. Setti “PEDRo: an Event-based Dataset for Person Detection in Robotics” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Vancouver BC Canada: IEEE Jun. 2023 pp. 4065–4070. doi: 10.1109/CVPRW59228.2023.00426.[22] G. Goyal F. Di Pietro N. Carissimi A. Glover and C. Bartolozzi “MoveEnet: Online High-Frequency Human Pose Estimation with an Event Camera” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Vancouver BC Canada: IEEE Jun. 2023 pp. 4024– 4033. doi: 10.1109/CVPRW59228.2023.00420.[23] P. R. Gantier Cadena Y. Qian C. Wang and M. Yang “Sparse-E2VID: A Sparse Convolutional Model for Event-Based Video Reconstruction Trained with Real Event Noise” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Vancouver BC Canada: IEEE Jun. 2023 pp. 4150–4158. doi: 10.1109/CVPRW59228.2023.00437.[24] L. Berlincioni et al. “Neuromorphic Event-based Facial Expression Recognition.” arXiv Apr. 13 2023. Accessed: Apr. 24 2024. [Online]. Available: http://arxiv.org/abs/2304.06351[25] H. Bulzomi M. Schweiker A. Gruel and J. Martinet “End-to-end Neuromorphic Lip Reading” in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Vancouver BC Canada: IEEE Jun. 2023 pp. 4101– 4108. doi: 10.1109/CVPRW59228.2023.00431.[26] A. Savran R. Tavarone B. Higy L. Badino and C. Bartolozzi “Energy and Computation Efficient Audio- Visual Voice Activity Detection Driven by Event- Cameras” in 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018) Xi’an: IEEE May 2018 pp. 333–340. doi: 10.1109/FG.2018.00055.[27] A. Savran \"Temporal Convolutional Networks for Efficient Voice Activity Detection with Event Camera\" Journal of Intelligent Systems: Theory and Applications vol. 7 no. 2 pp. 102–115 Sep. 2024 doi: 10.38016/jista.1400047.[28] A. Savran “Comparison of Timing Strategies for Face Pose Alignment with Event Camera” in 2023 8th International Conference on Computer Science and Engineering (UBMK) Burdur Turkiye: IEEE Sep. 2023 pp. 97–101. doi: 10.1109/UBMK59864.2023.10286582.[29] K. He X. Zhang S. Ren and J. Sun “Deep Residual Learning for Image Recognition.” arXiv Dec. 10 2015. Accessed: Jan. 09 2024. [Online]. Available: http://arxiv.org/abs/1512.03385[30] S. Xie R. Girshick P. Dollar Z. Tu and K. He “Aggregated Residual Transformations for Deep Neural Networks” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Honolulu HI: IEEE Jul. 2017 pp. 5987–5995. doi: 10.1109/CVPR.2017.634.[31] J. Hu L. Shen and G. Sun \"Squeeze-and-Excitation Networks\" 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Salt Lake City UT USA 2018 pp. 7132-7141 doi: 10.1109/CVPR.2018.00745.[32] A. G. Howard et al. “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.” arXiv Apr. 16 2017. Accessed: Apr. 29 2024. [Online]. Available: http://arxiv.org/abs/1704.04861[33] M. Sandler A. Howard M. Zhu A. Zhmoginov and L.-C. Chen “MobileNetV2: Inverted Residuals and Linear Bottlenecks” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Salt Lake City UT: IEEE Jun. 2018 pp. 4510–4520. doi: 10.1109/CVPR.2018.00474.
dc.identifier.doi 10.21541/apjess.1417068
dc.identifier.issn 2822-2385
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/10406
dc.identifier.uri https://search.trdizin.gov.tr/en/yayin/detay/1315376
dc.language.iso İngilizce
dc.relation.ispartof Academic Platform Journal of Engineering and Smart Systems
dc.rights info:eu-repo/semantics/openAccess
dc.source Academic Platform journal of engineering and smart systems (Online)
dc.subject Bilgisayar Bilimleri, Yapay Zeka
dc.subject Görüntüleme Bilimi Ve Fotoğraf Teknolojisi
dc.title Evaluation of Convolutional Networks for Event Camera Face Pose Alignment
dc.type Article
dc.type Article
dspace.entity.type Publication
gdc.author.id 0000-0001-5142-6384
gdc.author.id 0000-0002-9635-078X
gdc.author.id 0009-0001-1251-8793
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Savran, Arman; Çakıcı, Alptuğ; Oral, Burhan Burak] Yaşar Üniversitesi, Bilgisayar Mühendisliği Bölümü, İzmir, Türkiye
gdc.description.endpage 30
gdc.description.issue 2
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage 22
gdc.description.volume 13
gdc.identifier.openalex W4410892419
gdc.identifier.trdizinid 1315376
gdc.index.type TR-Dizin
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.3811355E-9
gdc.oaire.isgreen false
gdc.oaire.keywords Deep Learning
gdc.oaire.keywords Yapay Görme
gdc.oaire.keywords Derin Öğrenme
gdc.oaire.keywords Machine Vision
gdc.oaire.keywords Event Camera;Face Pose Alignment;Convolutional Neural Network;Coordinate Regression;Heatmap Regression
gdc.oaire.popularity 2.5970819E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.1
gdc.opencitations.count 0
gdc.virtual.author Oral, Burhan Burak
gdc.virtual.author Savran, Arman
oaire.citation.endPage 30
oaire.citation.startPage 22
publicationissue.issueNumber 2
publicationvolume.volumeNumber 13
relation.isAuthorOfPublication a254752c-7f31-47e1-85d1-a102460f0546
relation.isAuthorOfPublication ec3245ee-803e-4537-8ade-40b369fad1c3
relation.isAuthorOfPublication.latestForDiscovery a254752c-7f31-47e1-85d1-a102460f0546
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files