Bence, Pálvölgyi and Zsolt, János Viharos and Jenő, Csanaki and Krisztián, Meskó and Zsolt, Nagy (2026) Production Cell Operation Optimization by Reinforcement Learning. HUNGARIAN JOURNAL OF INDUSTRY AND CHEMISTRY, 54 (SI). pp. 87-94. ISSN 0133-0276
|
Text
1154 - Published Version Download (505kB) |
Abstract
Machine learning, particularly reinforcement learning, plays an increasing role in optimizing complex industrial processes. One such challenge arises in production systems, where products must be processed, often involving nontrivial scheduling and routing problems. The paper presents a reinforcement learning (RL)-based method to optimize a specific production cell, where two material-moving units and several machining units must cooperate to manufacture items that require both processing and occasional cleaning. The proposed methodology models the environment as a Markov Decision Process and employs RL algorithms to maximize throughput. Several popular RL algorithms were compared, and it was found that Maskable Proximal Policy Optimization (Maskable PPO) delivers the best performance, as agent-specific, valid and differentiated behavior is ensured for both material handling and machining units through action masking. Among the various masking strategies tested, a distinct masking approach proved to be the most effective.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | reinforcement learning, manufacturing operation optimization, action masking, production cell control |
| Subjects: | T Technology / alkalmazott, műszaki tudományok > T2 Technology (General) / műszaki tudományok általában |
| SWORD Depositor: | MTMT SWORD |
| Depositing User: | MTMT SWORD |
| Date Deposited: | 11 Jun 2026 09:02 |
| Last Modified: | 11 Jun 2026 09:02 |
| URI: | https://real.mtak.hu/id/eprint/239853 |
Actions (login required)
![]() |
Edit Item |




