Show simple item record

dc.contributor.author: Oommen, John B.
dc.contributor.author: Yazidi, Anis
dc.date.accessioned: 2023-12-08T07:50:41Z
dc.date.available: 2023-12-08T07:50:41Z
dc.date.created: 2023-11-29T10:56:38Z
dc.date.issued: 2023
dc.identifier.citation: Knowledge engineering review (Print). 2023, 38. [en_US]
dc.identifier.issn: 0269-8889
dc.identifier.uri: https://hdl.handle.net/11250/3106539
dc.description.abstract: The concept of artificial barriers in Learning Automata (LA) is powerful yet under-explored, although it was first proposed in the 1980s. Introducing artificial non-absorbing barriers makes LA schemes resilient to being trapped in absorbing barriers, a phenomenon often referred to as lock-in probability, which leads to the exclusive choice of one action after convergence. Within the field of LA, and reinforcement learning in general, there is a scarcity of theoretical works and applications of schemes with artificial barriers. In this paper, we devise an LA with artificial barriers for solving a general form of stochastic bimatrix game. Classical LA systems possess absorbing barriers; they are a powerful tool in game theory and have been shown to converge to the game's Nash equilibrium under limited information. However, existing work on LA for game-theoretical problems can only solve the case where the Saddle Point of the game exists in pure strategies, and fails to reach a mixed Nash equilibrium when no Saddle Point exists in pure strategies. Furthermore, we provide experimental results that are in line with our theoretical findings. [en_US]
dc.language.iso: eng [en_US]
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/deed.no [*]
dc.title: Adaptive learning with artificial barriers yielding Nash equilibria in general games [en_US]
dc.type: Peer reviewed [en_US]
dc.type: Journal article [en_US]
dc.description.version: publishedVersion [en_US]
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 2
dc.identifier.doi: 10.1017/S0269888923000103
dc.identifier.cristin: 2204898
dc.source.journal: Knowledge engineering review (Print) [en_US]
dc.source.volume: 38 [en_US]
dc.source.pagenumber: 24 [en_US]


Attribution 4.0 International
Except where otherwise noted, this item is licensed under Attribution 4.0 International