Gorkem SariyerSachin Kumar Kumar ManglaYigit KazancogluLei XuCeren Ocal TasarTasar, Ceren OcalKumar Mangla, SachinKazancoglu, YigitSariyer, GorkemXu, LeiMangla, Sachin KumarOcal Tasar, Ceren2025-10-062022036083520360-83521879-055010.1016/j.cie.2022.1085022-s2.0-85135313404https://www.scopus.com/inward/record.uri?eid=2-s2.0-85135313404&doi=10.1016%2Fj.cie.2022.108502&partnerID=40&md5=e2eaa71908e1eb9df24fe0b794915cc2https://gcris.yasar.edu.tr/handle/123456789/8664https://doi.org/10.1016/j.cie.2022.108502Due to technological advances Big Data Analytics (BDA) has become increasingly important over the last few years. This has led companies to evolve BDA capabilities (BDAC) to manage operations and make better decisions. In this study we propose a model Clustering Based Classifier Ensemble Method for Cost of Defect Prediction (CBCEM-CoD) incorporating clustering classification prediction and learning techniques of BDA for quality management in the manufacturing industry. CBCEM-CoD (1) is fact-driven as it is based on a fundamental problem of the manufacturing industry (2) integrates different BDA techniques in a specific way when an output of one technique is used as an input of another and (3) extracts insights from real-world big data and directly offers many implications for practice. In the first stage of the CBCEM-CoD k-means and agglomerative clustering techniques are used comparatively for segmenting customers and products. CoD values of each product and customer segment are predicted using ensemble learning techniques in the second stage. The model is tested using a case data set from the kitchenware industry. As a result 53 and 720 different types of customers and products in the train data set are segmented in optimal numbers of 4 and 20 clusters. Around 89% accuracy is obtained for CoD predictions in the test data set. These results have substantial business value since they inform managers how to prioritize their focus on specific products and customer types to reduce the cost of a defect. We also highlight the importance of developing BDAC in dynamically changing environments to create a competitive advantage. © 2022 Elsevier B.V. All rights reserved.Englishinfo:eu-repo/semantics/closedAccessBig Data Analytics, Cost Of Defect, Decision Making, Ensemble Learning, Segmentation, Big Data, Cluster Analysis, Competition, Data Analytics, Defects, Forecasting, K-means Clustering, Learning Systems, Manufacture, Quality Management, Sales, Statistical Tests, Big Data Analytic, Classifier Ensemble Methods, Clusterings, Cost Of Defect, Data Analytics, Data Set, Decisions Makings, Defect Prediction, Ensemble Learning, Segmentation, Decision MakingBig data, Cluster analysis, Competition, Data Analytics, Defects, Forecasting, K-means clustering, Learning systems, Manufacture, Quality management, Sales, Statistical tests, Big data analytic, Classifier ensemble methods, Clusterings, Cost of defect, Data analytics, Data set, Decisions makings, Defect prediction, Ensemble learning, Segmentation, Decision makingEnsemble LearningSegmentationBig Data AnalyticsCost of DefectDecision MakingPredicting cost of defects for segmented products and customers using ensemble learningArticle