A novel approach for self-driving car in partially observable environment using life long reinforcement learning

dc.authorscopusid 57955732100
dc.authorscopusid 58580896200
dc.authorscopusid 57198791439
dc.authorscopusid 35203460000
dc.authorscopusid 6602275590
dc.authorscopusid 15123985000
dc.authorscopusid 15123985000
dc.contributor.author Quadir, Md Abdul
dc.contributor.author Jaiswal, Dibyanshu
dc.contributor.author Mohan, Senthilkumar
dc.contributor.author Innab, Nisreen
dc.contributor.author Sulaiman, Riza
dc.contributor.author Alaoui, Mohammed Kbiri
dc.contributor.author Ahmadian, Ali
dc.date.accessioned 2024-05-25T11:37:34Z
dc.date.available 2024-05-25T11:37:34Z
dc.date.issued 2024
dc.department Okan University en_US
dc.department-temp [Quadir, Md Abdul; Jaiswal, Dibyanshu] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India; [Mohan, Senthilkumar] Vellore Inst Technol, Sch Comp Sci Engn & Informat Syst, Vellore 632014, Tamilnadu, India; [Innab, Nisreen] AlMaarefa Univ, Coll Appl Sci, Dept Comp Sci & Informat Syst, Riyadh, Saudi Arabia; [Sulaiman, Riza] Univ Kebangsaan Malaysia, Inst Visual Informat, Bangi 43600, Malaysia; [Alaoui, Mohammed Kbiri] King Khalid Univ, Coll Sci, Dept Math, Abha 61413, POB 9004, Saudi Arabia; [Ahmadian, Ali] Mediterranea Univ Reggio Calabria, Decis Lab, Reggio Di Calabria, Italy; [Ahmadian, Ali] Istanbul Okan Univ, Fac Engn & Nat Sci, Istanbul, Turkiye en_US
dc.description.abstract Despite ground-breaking advancements in robotics, gaming, and other challenging domains, reinforcement learning still faces significant challenges in solving dynamic, open-world problems. Since reinforcement learning algorithms usually perform poorly when exposed to new tasks outside of their data distribution, continuous learning algorithms have drawn significant attention. In parallel with work on lifelong learning algorithms, there is a need for challenging environments, properly planned trials, and metrics to measure research success. In this context, a Deep Asynchronous Autonomous Learning System (DAALS) is proposed in this paper for training a selfdriving car in a partially observable environment for real-world scenarios in a continuous state-action space. To cater to three different use cases, three different algorithms were used. To train their agents for learning and upgrading discrete state policies, DAALS used the Asynchronous Advantage Stager Reviewer (AASR) algorithm. To train its agent for continuous state spaces, DAALS also uses an Extensive Deterministic Policy Gradient (EDPG) algorithm. To train the agent in a lifelong form of learning for partially observable environments, DAALS uses a Deep Deterministic Policy Gradient Novel Lifelong Learning Algorithm (DDPGNLLA). The system offers flexibility to the user to train the agents for both discrete and continuous state-action spaces. Compared to previous models in continuous state-action spaces, Deep deterministic policy gradient lifelong learning algorithm outperforms previous models by 46.09%. Furthermore, the Deep Asynchronous Autonomous System tends to outperform all previous reinforcement learning algorithms, making our proposed approach a real-world solution. As DAALS has tested on number of different environments it provides the insights on how modern Artificial Intelligence (AI) solutions can be generalized making it one of the better solutions for AI general domain problems. en_US
dc.description.sponsorship Deanship of Scientific Research at King Khalid University; [RGP. 2/222/44] en_US
dc.description.sponsorship The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through Large Groups Project under grant number RGP. 2/222/44. en_US
dc.identifier.citationcount 0
dc.identifier.doi 10.1016/j.segan.2024.101356
dc.identifier.issn 2352-4677
dc.identifier.scopus 2-s2.0-85189427398
dc.identifier.scopusquality Q2
dc.identifier.uri https://doi.org/10.1016/j.segan.2024.101356
dc.identifier.uri https://hdl.handle.net/20.500.14517/1191
dc.identifier.volume 38 en_US
dc.identifier.wos WOS:001217637500001
dc.identifier.wosquality Q1
dc.language.iso en
dc.publisher Elsevier en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.scopus.citedbyCount 2
dc.subject Reinforcement Learning en_US
dc.subject Lifelong Learning en_US
dc.subject Self-driving car en_US
dc.subject Lifelong reinforcement learning en_US
dc.subject Partially observable Environment en_US
dc.title A novel approach for self-driving car in partially observable environment using life long reinforcement learning en_US
dc.type Article en_US
dc.wos.citedbyCount 1

Files