Gamze ErdemMehmet Cemali DinçerMehmet Murat FadilogluDincer, M. CemaliFadiloglu, M. MuratErdem, GamzeC. Kahraman , S. Cebi , B. Oztaysi , S. Cevik Onar , C. Tolga , I. Ucal Sari , I. Otay2025-10-0620259789819652372, 9783031931055, 9789819662968, 9783031999963, 9783031950162, 9783031947698, 9783032004406, 9783031910074, 9783031926105, 97898196394109783031985645978303198565223673389, 236733702367-33702367-338910.1007/978-3-031-98565-2_692-s2.0-105013082603https://www.scopus.com/inward/record.uri?eid=2-s2.0-105013082603&doi=10.1007%2F978-3-031-98565-2_69&partnerID=40&md5=d6dd0e74b82f22aab14141670dbeb7b2https://gcris.yasar.edu.tr/handle/123456789/8078https://doi.org/10.1007/978-3-031-98565-2_69This literature review examines the convergence of Reinforcement Learning (RL) and Condition-Based Maintenance (CBM) emphasizing the trans- formative impact of RL methodologies on maintenance decision-making in com- plex industrial settings. By integrating insights from a diverse array of studies the review critically assesses the use of various RL techniques such as Q-learning deep reinforcement learning and policy gradient approaches in forecasting equipment failures optimizing maintenance schedules and reducing operational downtime. It outlines the shift from conventional rule-based maintenance practices to adaptive data-driven strategies that exploit real-time sensor data and probabilistic modeling. Key challenges highlighted include computational complexity the extensive training data requirements and the integration of RL models into existing industrial frameworks. Furthermore the review explores literature on CBM within multi-component systems where prevalent approaches include numerical analyses Markov Decision Processes (MDPs) and case studies all of which demonstrate notable cost reductions and decreased downtime. Relevant studies were identified through searches on databases such as Google Scholar Scopus and Web of Science. Overall this review provides a comprehensive analysis of the current state and prospects of employing reinforcement learning in condition-based maintenance offering valuable insights for both academic researchers and industry practitioners. © 2025 Elsevier B.V. All rights reserved.Englishinfo:eu-repo/semantics/closedAccessCondition-based Maintenance, Machine Learning, Reinforcement Learning, Automation, Condition Based Maintenance, Cost Benefit Analysis, Cost Reduction, Decision Making, Deep Learning, Deep Reinforcement Learning, Learning Systems, Markov Processes, Gradient Approach, Industrial Settings, Literature Reviews, Machine-learning, Maintenance Decision Making, Policy Gradient, Q-learning, Reinforcement Learning Techniques, Reinforcement Learnings, Reinforcement LearningAutomation, Condition based maintenance, Cost benefit analysis, Cost reduction, Decision making, Deep learning, Deep reinforcement learning, Learning systems, Markov processes, Gradient approach, Industrial settings, Literature reviews, Machine-learning, Maintenance decision making, Policy gradient, Q-learning, Reinforcement learning techniques, Reinforcement learnings, Reinforcement learningCondition-Based MaintenanceMachine LearningReinforcement LearningReinforcement Learning in Condition-Based Maintenance: A SurveyConference Object