TY - JOUR
T1 - A hierarchical strategy-optimized double deep Q-network for dynamic reentrant hybrid flow-shop scheduling problem with multi-stage batch processing machines
AU - Ren, Xiaoyu
AU - Zhang, Jian
AU - Li, Jiwei
AU - Qin, Shengfeng
AU - Chen, Haojie
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2026/1/1
Y1 - 2026/1/1
N2 - Due to the simultaneous consideration of operations sequencing and single/batch processing machine selection, the Reentrant Hybrid Flow-shop Scheduling Problem with Batch Processing Machine is a very complex NP-hard problem. In addition, the characteristics of large-scale, disturbance factors and multi-stage batch in actual production not only significantly increase complexity, but also require scheduling methods to have fast response capabilities. Based on the above characteristics and considering the new job arrival, this study develops a Dynamic Reentrant Hybrid Flow-shop Scheduling Problem with Multi-stage Batch Processing Machines (DRHFSP-MB) and proposes a Hierarchical Strategy-optimized Double Deep Q-network (HSDDQN) for solving it. First, by incorporating the self-attention mechanism, a hierarchical double deep Q-network structure is introduced to construct two agents, namely the batching agent and the scheduling agent, for solving the job batching and scheduling subproblems of DRHFSP-MB. In addition, to address the characteristics of multi-stage batch-processing and reentrant scheduling in DRHFSP-MB, a Markov decision process considering these two agents is designed, consisting of states, actions and rewards. Furthermore, a mask-based action selection strategy combined with the ε-greedy and a soft-start target network update strategy are developed to enhance the efficiency and generalization of HSDDQN. By comparing with existing rules and deep reinforcement learning methods, extensive experiments have shown the effectiveness of HSDDQN and the proposed improvement strategies in solving DRHFSP-MB.
AB - Due to the simultaneous consideration of operations sequencing and single/batch processing machine selection, the Reentrant Hybrid Flow-shop Scheduling Problem with Batch Processing Machine is a very complex NP-hard problem. In addition, the characteristics of large-scale, disturbance factors and multi-stage batch in actual production not only significantly increase complexity, but also require scheduling methods to have fast response capabilities. Based on the above characteristics and considering the new job arrival, this study develops a Dynamic Reentrant Hybrid Flow-shop Scheduling Problem with Multi-stage Batch Processing Machines (DRHFSP-MB) and proposes a Hierarchical Strategy-optimized Double Deep Q-network (HSDDQN) for solving it. First, by incorporating the self-attention mechanism, a hierarchical double deep Q-network structure is introduced to construct two agents, namely the batching agent and the scheduling agent, for solving the job batching and scheduling subproblems of DRHFSP-MB. In addition, to address the characteristics of multi-stage batch-processing and reentrant scheduling in DRHFSP-MB, a Markov decision process considering these two agents is designed, consisting of states, actions and rewards. Furthermore, a mask-based action selection strategy combined with the ε-greedy and a soft-start target network update strategy are developed to enhance the efficiency and generalization of HSDDQN. By comparing with existing rules and deep reinforcement learning methods, extensive experiments have shown the effectiveness of HSDDQN and the proposed improvement strategies in solving DRHFSP-MB.
KW - Batch processing machine
KW - Double Deep Q-network
KW - Incompatible job family
KW - New job arrival
KW - Reentrant hybrid flow-shop scheduling problem
UR - https://www.scopus.com/pages/publications/105020665283
U2 - 10.1016/j.cie.2025.111630
DO - 10.1016/j.cie.2025.111630
M3 - Article
AN - SCOPUS:105020665283
SN - 0360-8352
VL - 211
JO - Computers and Industrial Engineering
JF - Computers and Industrial Engineering
M1 - 111630
ER -