Journal of the Korean Society for Precision Engineering

A Dual-network-based Deep Reinforcement Learning Method for Scheduling in Manufacturing Systems with Multiple Processing Alternatives: Jun Kim; J. Korean Soc. Precis. Eng. 2026;43(5):431-442.
Published online May 1, 2026; DOI: https://doi.org/10.7736/JKSPE.026.00027

Abstract
PDF

Manufacturing systems are increasingly required to operate in high-mix, low-volume production environments, where process flexibility is crucial. One effective way to achieve this flexibility is through the use of multiple processing alternatives (MPA), allowing a product to be produced using different process plans or component structures. In MPA environments, scheduling decisions must address both the selection of processing alternatives for each product and the execution order of the resulting production tasks. Additionally, processing times often vary due to machine conditions and process variability, further complicating scheduling. This study introduces a dual-network-based deep reinforcement learning method for scheduling in manufacturing systems with multiple processing alternatives. The framework utilizes two Q-networks to learn both the selection of processing alternatives and the dispatching rules. Computational experiments demonstrate that the proposed method effectively reduces both the average makespan and its variability compared to a genetic algorithm-based approach, particularly as the problem size increases, showcasing its effectiveness in the face of processing time uncertainty.

633 View
8 Download

Path Optimization for 6-axis Robot Control Using Open Simulation-based Reinforcement Learning: Cho A Kim, Jong U Baek, Su Han Lee, Ju Yeon Lee; J. Korean Soc. Precis. Eng. 2026;43(5):421-430.
Published online May 1, 2026; DOI: https://doi.org/10.7736/JKSPE.026.00010

Abstract
PDF

The increasing adoption of industrial robot arms in advanced manufacturing has heightened the need for flexible trajectory planning methods that go beyond traditional offline programming (OLP) tools, which are often expensive, proprietary, and limiting. This study introduces an OLP-free pipeline designed to generate robot trajectory data and optimize paths for six-degree-of-freedom (6-DOF) robot arms using discrete reinforcement learning. Initially, five-axis NC code derived from CAD/CAM data is transformed into tool center point (TCP) trajectories through coordinate transformations. An analytical inverse kinematics solver then produces multiple joint solutions for each TCP pose, creating a discrete action space from which the learning agent can select feasible joint configurations along the trajectory. A reward function that considers variations in joint velocity and acceleration, as well as pose error, facilitates the simultaneous optimization of motion smoothness and tracking accuracy. The optimized trajectories are validated using an open-source physics simulator, showing enhanced motion stability, accuracy, and collision safety compared to conventional OLP-based paths. This proposed framework provides a flexible and cost-effective alternative to commercial OLP tools and lays a scalable foundation for future applications in automated and collaborative manufacturing systems.

800 View
22 Download

Optimization of Manufacturing Layout Using Deep Reinforcement Learning and Simulation: Ye Ji Choi, Minsung Kim, Byeong Soo Kim; J. Korean Soc. Precis. Eng. 2025;42(3):253-261.
Published online March 1, 2025; DOI: https://doi.org/10.7736/JKSPE.024.137

Abstract
PDF

Facility Layout Problem (FLP) aims to optimize arrangement of facilities to enhance productivity and minimize costs. Traditional methods face challenges in dealing with the complexity and non-linearity of modern manufacturing environments. This study introduced an approach combining Reinforcement Learning (RL) and simulation to optimize manufacturing line layouts. Deep Q-Network (DQN) learns to reduce unused space, improve path efficiency, and maximize space utilization by optimizing facility placement and material flow. Simulations were used to validate layouts and evaluate performance based on production output, path length, and bending frequency. This RL-based method offers a more adaptable and efficient solution for FLP than traditional techniques, addressing both physical and operational optimization.

302 View
14 Download

Optimal Eco Driving Control for Electric Vehicle based on Reinforcement Learning: Hyun Joong Kim, Dong Min Kim, Su Hyeon Kim, Heeyun Lee; J. Korean Soc. Precis. Eng. 2024;41(5):355-364.
Published online May 1, 2024; DOI: https://doi.org/10.7736/JKSPE.024.020

Abstract
PDF

Environmental issues have become a global concern recently. Countries worldwide are making efforts for carbon neutrality. In the automotive industry, focus has shifted from internal combustion engine vehicle to eco-friendly vehicles such as Electric Vehicles (EVs), Hybrid Electric Vehicles (HEVs), and Fuel Cell Electric Vehicles (FCEVs). For driving strategy, research on vehicle driving method that can reduce vehicle energy consumption, called eco-driving, has been actively conducted recently. Conventional cruise mode driving control is not considered an optimal driving strategy for various driving environments. To maximize energy efficiency, this paper conducted research on eco-driving strategy for EVs-based on reinforcement learning. A longitudinal dynamics-based electric vehicle simulator was constructed using MATLAB Simulink with a road slope. Reinforcement learning algorithms, specifically Deep Deterministic Policy Gradient (DDPG) and Deep QNetwork (DQN), were applied to minimize energy consumption of EVs with a road slope. The simulator was trained to maximize rewards and derive an optimal speed profile. In this study, we compared learning results of DDPG and DQN algorithms and confirmed tendencies by parameters in each algorithm. The simulation showed that energy efficiency of EVs was improved compared to that of cruise mode driving.

310 View
7 Download

Active Suspension Control Using Reinforcement Learning: Do-Gyeong Yuk, Jung Woo Sohn; J. Korean Soc. Precis. Eng. 2024;41(3):223-230.
Published online March 1, 2024; DOI: https://doi.org/10.7736/JKSPE.023.141

Abstract
PDF

In recent years, research on machine learning techniques that can be integrated with existing suspension control algorithms for enhanced control effects has advanced considerably. Machine learning, especially involving neural networks, often requires many samples, which makes maintaining robust performance in diverse, changing environments challenging. The present study applied reinforcement learning, which can generalize complex situations not previously encountered, to overcome this obstacle and is crucial for suspension control under varying road conditions. The effectiveness of the proposed control method was evaluated on different road conditions using the quarter-vehicle model. The impact of training data was assessed by comparing models trained under two distinct road conditions. In addition, a validation exercise on the performance of the control method that utilizes reinforcement learning demonstrated its potential for enhancing the adaptability and efficiency of suspension systems under various road conditions.

Citations

Citations to this article as recorded by

Control Characteristics of Active Suspension in Vehicles using Adaptive Control Algorithm
Jeong Seo Jang, Jung Woo Sohn
Transactions of the Korean Society for Noise and Vibration Engineering.2024; 34(5): 568. CrossRef
Suspension Mechanism Design of a Low-platform Target Robot for Evaluating Autonomous Vehicle Active Safety
Jae Sang Yoo, Do Hyeon Kim, Jayil Jeong
Journal of the Korean Society for Precision Engineering.2024; 41(5): 375. CrossRef

311 View
5 Download
Crossref

Application of Deep Reinforcement Learning to Temperature Control of a Chamber for Ultra-precision Machines: Byung-Sub Kim, Seung-Kook Ro; J. Korean Soc. Precis. Eng. 2023;40(6):467-472.
Published online June 1, 2023; DOI: https://doi.org/10.7736/JKSPE.022.124

Abstract
PDF

Deep reinforcement learning (RL) has attracted research interest in the manufacturing area in recent years, but real implemented applications are rarely found. This is because agents have to explore the given environments many times until they learn how to maximize the rewards for actions, which they provide to the environments. While training, random actions or exploration from agents may be disastrous in many real-world applications, and thus, people usually use computer generated simulation environments to train agents. In this paper, we present a RL experiment applied to temperature control of a chamber for ultra-precision machines. The RL agent was built in Python and PyTorch framework using a Deep Q-Network (DQN) algorithm and its action commands were sent to National Instruments (NI) hardware, which ran C codes with a sampling rate of 1 Hz. For communication between the agent and the NI data acquisition unit, a data pipeline was constructed from the subprocess module and Popen class. The agent was forced to learn temperature control while reducing the energy consumption through a reward function, which considers both temperature bounds and energy savings. Effectiveness of the RL approach to a multi-objective temperature control problem was demonstrated in this research.