Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

Damla Yüksel; Levent Kandiller; M. Fatih Tasgetiren

Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

dc.contributor.author	Damla Yüksel
dc.contributor.author	Levent Kandiller
dc.contributor.author	M. Fatih Tasgetiren
dc.date.accessioned	2025-10-06T17:48:56Z
dc.date.issued	2024
dc.description.abstract	Combining Deep Reinforcement Learning and meta-heuristic techniques represents a new research direction for enhancing the search capabilities of meta-heuristic methods in the context of production scheduling. Q-learning is a prominent reinforcement learning in which its utilization aims to direct the selection of actions thus preventing the necessity for a random exploration in the iterative process of the metaheuristics. In this study we provide Q-learning guided algorithms for the Bi-Criteria No-Wait Flowshop Scheduling Problem (NWFSP). The problem is treated as a bi-criteria combinatorial optimization problem where total flow time and makespan are optimized simultaneously. Firstly a deterministic mixed-integer linear programming (MILP) model is provided. Then Q-learning guided algorithms are developed: Bi-Criteria Iterated Greedy Algorithm with Q-Learning (BC-IG<inf>QL</inf>). Bi-Criteria Block Insertion Heuristic Algorithm with Q-Learning (BC-BIH<inf>QL</inf>). Moreover the performance of the proposed Q-learning guided algorithms is compared over a collection of Bi-Criteria Genetic Local Search Algorithms (BC-GLS) Bi-Criteria Iterated Greedy Algorithm (BC-IG) Bi-Criteria Iterated Greedy Algorithm with a Local Search (BC-IG<inf>ALL</inf>) and Bi-Criteria Variable Block Insertion Heuristic Algorithm (BC-VBIH). The complete computational experiment performed on 480 problem instances of Vallada et al. (2015) which is known as the VRF benchmark set indicates that the BC-BIH<inf>QL</inf> and the BC-IG<inf>QL</inf> algorithms outperform the BC-GLS BC-IG BC-IG<inf>ALL</inf> and BC-VBIH algorithms in comparative performance metrics. More specifically the proposed BC-BIH<inf>QL</inf> and BC-IG<inf>QL</inf> algorithms can yield more non-dominated bi-criteria solutions with the most substantial competitiveness than the remaining algorithms. At the same time both are competitive with each other on the benchmark problems. Moreover the BC-IG<inf>QL</inf> algorithm dominates almost 97% and 99% of the solutions reached by the BC-IG BC-IG<inf>ALL</inf> and BC-VBIH algorithms in small and large datasets. Similarly The BC-BIH<inf>QL</inf> algorithm dominates almost 98% and 99% of the solutions reached by the BC-IG BC-IG<inf>ALL</inf> and BC-VBIH algorithms in small and large datasets respectively. This means that among all the features that have been compared the Q-learning-guided algorithms demonstrate the highest level of competitiveness. The outcomes of this study encourage us to discover many more bi-criteria NWFSPs to reveal the trade-off between other conflicting objectives such as makespan & the number of early jobs to overcome various industries' problems. © 2024 Elsevier B.V. All rights reserved.
dc.identifier.doi	10.1016/j.swevo.2024.101617
dc.identifier.issn	22106502
dc.identifier.issn	2210-6502
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195865594&doi=10.1016%2Fj.swevo.2024.101617&partnerID=40&md5=0e4b20b42b7f838dcb32ef229cfcb66b
dc.identifier.uri	https://gcris.yasar.edu.tr/handle/123456789/8176
dc.language.iso	English
dc.publisher	Elsevier B.V.
dc.relation.ispartof	Swarm and Evolutionary Computation
dc.source	Swarm and Evolutionary Computation
dc.subject	Bi-criteria Heuristic Optimization, Bi-criteria Scheduling Problems, Makespan, Mixed-integer Linear Programming, No-wait Flowshop Scheduling Problem, Total Flow Time, Benchmarking, Combinatorial Optimization, Deep Learning, Heuristic Methods, Integer Programming, Iterative Methods, Large Datasets, Learning Algorithms, Local Search (optimization), Production Control, Reinforcement Learning, Bi-criteria, Bi-criteria Heuristic Optimization, Bi-criteria Scheduling Problem, Flow Shop Scheduling Problem, Flowshop Scheduling Problems, Heuristic Optimization, Integer Linear Programming, Makespan, Mixed Integer Linear, Mixed-integer Linear Programming, No-wait Flowshop, No-wait Flowshop Scheduling Problem, Scheduling Problem, Total Flowtime, Heuristic Algorithms
dc.subject	Benchmarking, Combinatorial optimization, Deep learning, Heuristic methods, Integer programming, Iterative methods, Large datasets, Learning algorithms, Local search (optimization), Production control, Reinforcement learning, Bi-criteria, Bi-criteria heuristic optimization, Bi-criteria scheduling problem, Flow shop scheduling problem, Flowshop scheduling problems, Heuristic optimization, Integer Linear Programming, Makespan, Mixed integer linear, Mixed-integer linear programming, No-wait flowshop, No-wait flowshop scheduling problem, Scheduling problem, Total flowtime, Heuristic algorithms
dc.title	Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops
dc.type	Article
dspace.entity.type	Publication
gdc.bip.impulseclass	C4
gdc.bip.influenceclass	C5
gdc.bip.popularityclass	C4
gdc.coar.type	text::journal::journal article
gdc.collaboration.industrial	false
gdc.description.startpage	101617
gdc.description.volume	89
gdc.identifier.openalex	W4399082303
gdc.index.type	Scopus
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	16.0
gdc.oaire.influence	2.85726E-9
gdc.oaire.isgreen	false
gdc.oaire.popularity	1.30808795E-8
gdc.oaire.publicfunded	false
gdc.openalex.collaboration	National
gdc.openalex.fwci	6.5039
gdc.openalex.normalizedpercentile	0.97
gdc.openalex.toppercent	TOP 10%
gdc.opencitations.count	10
gdc.plumx.mendeley	4
gdc.plumx.scopuscites	14
person.identifier.scopus-author-id	Yüksel- Damla (57212210455), Kandiller- Levent (6506822666), Tasgetiren- M. Fatih (6505799356)
publicationvolume.volumeNumber	89
relation.isOrgUnitOfPublication	ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery	ac5ddece-c76d-476d-ab30-e4d3029dee37

Collections

Scopus İndeksli Yayınlar Koleksiyonu

Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

Files

Collections