Production Cell Operation Optimization by Reinforcement Learning

Bence, Pálvölgyi and Zsolt, János Viharos and Jenő, Csanaki and Krisztián, Meskó and Zsolt, Nagy (2026) Production Cell Operation Optimization by Reinforcement Learning. HUNGARIAN JOURNAL OF INDUSTRY AND CHEMISTRY, 54 (SI). pp. 87-94. ISSN 0133-0276

Text
1154 - Published Version
Download (505kB)

Official URL: https://doi.org/10.33927//hjic-2026-21

Abstract

Machine learning, particularly reinforcement learning, plays an increasing role in optimizing complex industrial processes. One such challenge arises in production systems, where products must be processed, often involving nontrivial scheduling and routing problems. The paper presents a reinforcement learning (RL)-based method to optimize a specific production cell, where two material-moving units and several machining units must cooperate to manufacture items that require both processing and occasional cleaning. The proposed methodology models the environment as a Markov Decision Process and employs RL algorithms to maximize throughput. Several popular RL algorithms were compared, and it was found that Maskable Proximal Policy Optimization (Maskable PPO) delivers the best performance, as agent-specific, valid and differentiated behavior is ensured for both material handling and machining units through action masking. Among the various masking strategies tested, a distinct masking approach proved to be the most effective.

Item Type:	Article
Uncontrolled Keywords:	reinforcement learning, manufacturing operation optimization, action masking, production cell control
Subjects:	T Technology / alkalmazott, műszaki tudományok > T2 Technology (General) / műszaki tudományok általában
SWORD Depositor:	MTMT SWORD
Depositing User:	MTMT SWORD
Date Deposited:	11 Jun 2026 09:02
Last Modified:	11 Jun 2026 09:02
URI:	https://real.mtak.hu/id/eprint/239853

Actions (login required)

Edit Item