Predicting cost of defects for segmented products and customers using ensemble learning

dc.contributor.author Gorkem Sariyer
dc.contributor.author Sachin Kumar Mangla
dc.contributor.author Yigit Kazancoglu
dc.contributor.author Lei Xu
dc.contributor.author Ceren Ocal Tasar
dc.date SEP
dc.date.accessioned 2025-10-06T16:22:36Z
dc.date.issued 2022
dc.description.abstract Due to technological advances Big Data Analytics (BDA) has become increasingly important over the last few years. This has led companies to evolve BDA capabilities (BDAC) to manage operations and make better de-cisions. In this study we propose a model Clustering Based Classifier Ensemble Method for Cost of Defect Prediction (CBCEM-CoD) incorporating clustering classification prediction and learning techniques of BDA for quality management in the manufacturing industry. CBCEM-CoD (1) is fact-driven as it is based on a fundamental problem of the manufacturing industry (2) integrates different BDA techniques in a specific way when an output of one technique is used as an input of another and (3) extracts insights from real-world big data and directly offers many implications for practice. In the first stage of the CBCEM-CoD k-means and agglomerative clustering techniques are used comparatively for segmenting customers and products. CoD values of each product and customer segment are predicted using ensemble learning techniques in the second stage. The model is tested using a case data set from the kitchenware industry. As a result 53 and 720 different types of customers and products in the train data set are segmented in optimal numbers of 4 and 20 clusters. Around 89% accuracy is obtained for CoD predictions in the test data set. These results have substantial business value since they inform managers how to prioritize their focus on specific products and customer types to reduce the cost of a defect. We also highlight the importance of developing BDAC in dynamically changing environments to create a competitive advantage.
dc.identifier.doi 10.1016/j.cie.2022.108502
dc.identifier.issn 0360-8352
dc.identifier.uri http://dx.doi.org/10.1016/j.cie.2022.108502
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/7435
dc.language.iso English
dc.publisher PERGAMON-ELSEVIER SCIENCE LTD
dc.relation.ispartof Computers & Industrial Engineering
dc.source COMPUTERS & INDUSTRIAL ENGINEERING
dc.subject Big data analytics, Ensemble learning, Cost of defect, Segmentation, Decision making
dc.subject BIG DATA, DYNAMIC CAPABILITIES, OPERATIONS MANAGEMENT, FIRM PERFORMANCE, DATA ANALYTICS, ROUTINES, DEMAND
dc.title Predicting cost of defects for segmented products and customers using ensemble learning
dc.type Article
dspace.entity.type Publication
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.startpage 108502
gdc.description.volume 171
gdc.identifier.openalex W4287844696
gdc.index.type WoS
gdc.oaire.diamondjournal false
gdc.oaire.impulse 17.0
gdc.oaire.influence 3.1489875E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 1.5130862E-8
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0502 economics and business
gdc.oaire.sciencefields 05 social sciences
gdc.oaire.sciencefields 0211 other engineering and technologies
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.fwci 3.7095
gdc.openalex.normalizedpercentile 0.93
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 15
gdc.plumx.crossrefcites 6
gdc.plumx.mendeley 44
gdc.plumx.scopuscites 17
person.identifier.orcid Kazancoglu- Yigit/0000-0001-9199-671X,
project.funder.name National Natural Science Foundation of China [72172148- 71672125], NSFC [3122022PT08], SAFEA High-End Foreign Experts Project [G2022202001L]
publicationvolume.volumeNumber 171
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files