Beyond Trial and Error: Lane Keeping with Monte Carlo Tree Search-Driven Optimization of Reinforcement Learning

Kővári, Bálint and Pelenczei, Bálint and Knáb, István Gellért and Bécsi, Tamás (2024) Beyond Trial and Error: Lane Keeping with Monte Carlo Tree Search-Driven Optimization of Reinforcement Learning. ELECTRONICS (SWITZ), 13 (11). ISSN 2079-9292

Preview

Text
electronics-13-02058-v2.pdf - Published Version
Available under License Creative Commons Attribution.
Download (3MB) | Preview

Official URL: https://doi.org/10.3390/electronics13112058

Abstract

In recent years, Reinforcement Learning (RL) has excelled in the realm of autonomous vehicle control, which is distinguished by the absence of limitations, such as specific training data or the necessity for explicit mathematical model identification. Particularly in the context of lane keeping, a diverse set of rewarding strategies yields a spectrum of realizable policies. Nevertheless, the challenge lies in discerning the optimal behavior that maximizes performance. Traditional approaches entail exhaustive training through a trial-and-error strategy across conceivable reward functions, which is a process notorious for its time-consuming nature and substantial financial implications. Contrary to conventional methodologies, the Monte Carlo Tree Search (MCTS) enables the prediction of reward function quality through Monte Carlo simulations, thereby eliminating the need for exhaustive training on all available reward functions. The findings obtained from MCTS simulations can be effectively leveraged to selectively train only the most suitable RL models. This approach helps alleviate the resource-heavy nature of traditional RL processes through altering the training pipeline. This paper validates the theoretical framework concerning the unique property of the Monte Carlo Tree Search algorithm by emphasizing its generality through highlighting crossalgorithmic and crossenvironmental capabilities while also showcasing its potential to reduce training costs.

Item Type:	Article
Uncontrolled Keywords:	autonomous vehicles; reinforcement learning; lane keeping assist systems; Monte Carlo methods; vehicle dynamics
Subjects:	H Social Sciences / társadalomtudományok > HE Transportation and Communications / Szállítás, hírközlés > HE1 Transportation / szállítás
SWORD Depositor:	MTMT SWORD
Depositing User:	MTMT SWORD
Date Deposited:	24 Sep 2024 15:28
Last Modified:	24 Sep 2024 15:28
URI:	https://real.mtak.hu/id/eprint/205718

Actions (login required)

Edit Item