Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

Damla Yuksel; Levent Kandiller; Mehmet Fatih Tasgetiren; Yüksel, Damla; Taşgetiren, Mehmet Fatih; Kandiller, Levent

Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

Date

2024

Authors

Damla Yuksel

Levent Kandiller

Mehmet Fatih Tasgetiren

Yüksel, Damla

Taşgetiren, Mehmet Fatih

Kandiller, Levent

Publisher

ELSEVIER

Green Open Access

No

Publicly Funded

No

Impulse

Top 10%

Influence

Average

Popularity

Top 10%

Abstract

Combining Deep Reinforcement Learning and meta-heuristic techniques represents a new research direction for enhancing the search capabilities of meta-heuristic methods in the context of production scheduling. Q-learning is a prominent reinforcement learning in which its utilization aims to direct the selection of actions thus preventing the necessity for a random exploration in the iterative process of the metaheuristics. In this study we provide Q-learning guided algorithms for the Bi-Criteria No-Wait Flowshop Scheduling Problem (NWFSP). The problem is treated as a bi-criteria combinatorial optimization problem where total flow time and makespan are optimized simultaneously. Firstly a deterministic mixed-integer linear programming (MILP) model is provided. Then Q-learning guided algorithms are developed: Bi-Criteria Iterated Greedy Algorithm with Q-Learning (BCIGQL). Bi-Criteria Block Insertion Heuristic Algorithm with Q-Learning (BC-BIHQL). Moreover the performance of the proposed Q-learning guided algorithms is compared over a collection of Bi-Criteria Genetic Local Search Algorithms (BC-GLS) Bi-Criteria Iterated Greedy Algorithm (BC-IG) Bi-Criteria Iterated Greedy Algorithm with a Local Search (BC-IGALL) and Bi-Criteria Variable Block Insertion Heuristic Algorithm (BC-VBIH). The complete computational experiment performed on 480 problem instances of Vallada et al. (2015) which is known as the VRF benchmark set indicates that the BC-BIHQL and the BC-IGQL algorithms outperform the BC-GLS BC-IG BCIGALL and BC-VBIH algorithms in comparative performance metrics. More specifically the proposed BC-BIHQL and BC-IGQL algorithms can yield more non-dominated bi-criteria solutions with the most substantial competitiveness than the remaining algorithms. At the same time both are competitive with each other on the benchmark problems. Moreover the BC-IGQL algorithm dominates almost 97% and 99% of the solutions reached by the BC-IG BC-IGALL and BC-VBIH algorithms in small and large datasets. Similarly The BC-BIHQL algorithm dominates almost 98% and 99% of the solutions reached by the BC-IG BC-IGALL and BC-VBIH algorithms in small and large datasets respectively. This means that among all the features that have been compared the Qlearning-guided algorithms demonstrate the highest level of competitiveness. The outcomes of this study encourage us to discover many more bi-criteria NWFSPs to reveal the trade-off between other conflicting objectives such as makespan & the number of early jobs to overcome various industries' problems.

ORCID

YÜKSEL, DAMLA

Keywords

Bi-criteria scheduling problems, No -wait flowshop scheduling problem, Makespan, Total flow time, Mixed -integer linear programming, Bi-criteria heuristic optimization, TOTAL WEIGHTED TARDINESS, MULTIOBJECTIVE ELECTROMAGNETISM ALGORITHM, ITERATED GREEDY ALGORITHM, DEPENDENT SETUP TIMES, SCHEDULING PROBLEM, SINGLE-MACHINE, LOCAL SEARCH, MEMETIC ALGORITHM, OPTIMIZATION, SUBJECT, Makespan, Bi-Criteria Heuristic Optimization, Mixed -Integer Linear Programming, Bi-Criteria Scheduling Problems, No-Wait Flowshop Scheduling Problem, Mixed-Integer Linear Programming, No -Wait Flowshop Scheduling Problem, Total Flow Time

OpenCitations Citation Count

10

Source

Swarm and Evolutionary Computation

Volume

89

Start Page

101617

URI

http://dx.doi.org/10.1016/j.swevo.2024.101617
https://gcris.yasar.edu.tr/handle/123456789/6736
https://doi.org/10.1016/j.swevo.2024.101617

Collections

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

PlumX Metrics

Citations

Scopus : 14

Captures

Mendeley Readers : 4

Full item page

SCOPUS™ Citations

14

checked on Apr 10, 2026

Web of Science™ Citations

15

checked on Apr 10, 2026

Google Scholar™

Check

Q-learning guided algorithms for bi-criteria minimization of total flow time and makespan in no-wait permutation flowshops

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

Green Open Access

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

BIP! Indicators

Research Projects

Journal Issue

Abstract

Description

ORCID

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Citation Count

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

Captures

SCOPUS™ Citations

14

Web of Science™ Citations

15

Google Scholar™

OpenAlex FWCI

6.5039

Sustainable Development Goals

SDG data is not available