Predicting cost of defects for segmented products and customers using ensemble learning

dc.contributor.author Gorkem Sariyer
dc.contributor.author Sachin Kumar Kumar Mangla
dc.contributor.author Yigit Kazancoglu
dc.contributor.author Lei Xu
dc.contributor.author Ceren Ocal Tasar
dc.contributor.author Tasar, Ceren Ocal
dc.contributor.author Kumar Mangla, Sachin
dc.contributor.author Kazancoglu, Yigit
dc.contributor.author Sariyer, Gorkem
dc.contributor.author Xu, Lei
dc.contributor.author Mangla, Sachin Kumar
dc.contributor.author Ocal Tasar, Ceren
dc.date.accessioned 2025-10-06T17:49:55Z
dc.date.issued 2022
dc.description.abstract Due to technological advances Big Data Analytics (BDA) has become increasingly important over the last few years. This has led companies to evolve BDA capabilities (BDAC) to manage operations and make better decisions. In this study we propose a model Clustering Based Classifier Ensemble Method for Cost of Defect Prediction (CBCEM-CoD) incorporating clustering classification prediction and learning techniques of BDA for quality management in the manufacturing industry. CBCEM-CoD (1) is fact-driven as it is based on a fundamental problem of the manufacturing industry (2) integrates different BDA techniques in a specific way when an output of one technique is used as an input of another and (3) extracts insights from real-world big data and directly offers many implications for practice. In the first stage of the CBCEM-CoD k-means and agglomerative clustering techniques are used comparatively for segmenting customers and products. CoD values of each product and customer segment are predicted using ensemble learning techniques in the second stage. The model is tested using a case data set from the kitchenware industry. As a result 53 and 720 different types of customers and products in the train data set are segmented in optimal numbers of 4 and 20 clusters. Around 89% accuracy is obtained for CoD predictions in the test data set. These results have substantial business value since they inform managers how to prioritize their focus on specific products and customer types to reduce the cost of a defect. We also highlight the importance of developing BDAC in dynamically changing environments to create a competitive advantage. © 2022 Elsevier B.V. All rights reserved.
dc.description.sponsorship SAFEA High-End Foreign Experts Project, (G2022202001L); National Natural Science Foundation of China, NSFC, (71672125, 72172148); National Natural Science Foundation of China, NSFC; Civil Aviation University of China, CAUC, (3122022PT08); Civil Aviation University of China, CAUC
dc.description.sponsorship This research was supported in part by National Natural Science Foundation of China (Grant No. 72172148, 71672125), NSFC special supporting funding of CAUC (Grant No. 3122022PT08), SAFEA High-End Foreign Experts Project (Grant No. G2022202001L) for authors Prof. Sachin Kumar Mangla and Prof. Lei Xu.
dc.description.sponsorship National Natural Science Foundation of China [72172148, 71672125]; NSFC [3122022PT08]; SAFEA High-End Foreign Experts Project [G2022202001L]
dc.identifier.doi 10.1016/j.cie.2022.108502
dc.identifier.issn 03608352
dc.identifier.issn 0360-8352
dc.identifier.issn 1879-0550
dc.identifier.scopus 2-s2.0-85135313404
dc.identifier.uri https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135313404&doi=10.1016%2Fj.cie.2022.108502&partnerID=40&md5=e2eaa71908e1eb9df24fe0b794915cc2
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/8664
dc.identifier.uri https://doi.org/10.1016/j.cie.2022.108502
dc.language.iso English
dc.publisher Elsevier Ltd
dc.relation.ispartof Computers & Industrial Engineering
dc.rights info:eu-repo/semantics/closedAccess
dc.source Computers and Industrial Engineering
dc.subject Big Data Analytics, Cost Of Defect, Decision Making, Ensemble Learning, Segmentation, Big Data, Cluster Analysis, Competition, Data Analytics, Defects, Forecasting, K-means Clustering, Learning Systems, Manufacture, Quality Management, Sales, Statistical Tests, Big Data Analytic, Classifier Ensemble Methods, Clusterings, Cost Of Defect, Data Analytics, Data Set, Decisions Makings, Defect Prediction, Ensemble Learning, Segmentation, Decision Making
dc.subject Big data, Cluster analysis, Competition, Data Analytics, Defects, Forecasting, K-means clustering, Learning systems, Manufacture, Quality management, Sales, Statistical tests, Big data analytic, Classifier ensemble methods, Clusterings, Cost of defect, Data analytics, Data set, Decisions makings, Defect prediction, Ensemble learning, Segmentation, Decision making
dc.subject Ensemble Learning
dc.subject Segmentation
dc.subject Big Data Analytics
dc.subject Cost of Defect
dc.subject Decision Making
dc.title Predicting cost of defects for segmented products and customers using ensemble learning
dc.type Article
dspace.entity.type Publication
gdc.author.id Kazancoglu, Yigit/0000-0001-9199-671X
gdc.author.scopusid 15848066400
gdc.author.scopusid 55735821600
gdc.author.scopusid 57189867008
gdc.author.scopusid 57113894500
gdc.author.scopusid 57205023626
gdc.author.wosid Taşar, Ceren/AAA-4770-2019
gdc.author.wosid Kazancoglu, Yigit/E-7705-2015
gdc.author.wosid Mangla, Sachin/B-7605-2017
gdc.author.wosid sariyer, gorkem/AAA-1524-2019
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Sariyer, Gorkem] Yasar Univ, Dept Business Adm, Izmir, Turkey; [Mangla, Sachin Kumar] OP Jindal Global Univ, Res Ctr Digital Circular Econ Sustainbale Dev Goal, Jindal Global Business Sch, Sonipat, Haryana, India; [Kazancoglu, Yigit] Yasar Univ, Dept Logist Management, Izmir, Turkey; [Xu, Lei] Civil Aviat Univ China, Econ & Management Coll, Tianjin 300300, Peoples R China; [Tasar, Ceren Ocal] Yasar Univ, Dept Comp Engn, Izmir, Turkey
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage 108502
gdc.description.volume 171
gdc.description.woscitationindex Science Citation Index Expanded
gdc.identifier.openalex W4287844696
gdc.identifier.wos WOS:000862698400011
gdc.index.type Scopus
gdc.index.type WoS
gdc.oaire.diamondjournal false
gdc.oaire.impulse 17.0
gdc.oaire.influence 3.1489875E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 1.5130862E-8
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0502 economics and business
gdc.oaire.sciencefields 05 social sciences
gdc.oaire.sciencefields 0211 other engineering and technologies
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.fwci 3.7095
gdc.openalex.normalizedpercentile 0.93
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 15
gdc.plumx.crossrefcites 6
gdc.plumx.mendeley 44
gdc.plumx.scopuscites 17
gdc.scopus.citedcount 17
gdc.virtual.author Kazançoğlu, Yiğit
gdc.wos.citedcount 14
person.identifier.scopus-author-id Sariyer- Gorkem (57189867008), Kumar Mangla- Sachin Kumar (55735821600), Kazancoglu- Yigit (15848066400), Xu- Lei (57113894500), Ocal Tasar- Ceren (57205023626)
project.funder.name This research was supported in part by National Natural Science Foundation of China (Grant No. 72172148 71672125) NSFC special supporting funding of CAUC (Grant No. 3122022PT08) SAFEA High-End Foreign Experts Project (Grant No. G2022202001L) for authors Prof. Sachin Kumar Mangla and Prof. Lei Xu.
publicationvolume.volumeNumber 171
relation.isAuthorOfPublication cd2013c9-29e1-443f-8df4-2d1b140984ee
relation.isAuthorOfPublication.latestForDiscovery cd2013c9-29e1-443f-8df4-2d1b140984ee
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files