Learning Automata with Artificial Reflecting Barriers in Games with Limited Information

Hassan, Ismail; Oommen, John B.; Yazidi, Anis

dc.contributor.author	Hassan, Ismail
dc.contributor.author	Oommen, John B.
dc.contributor.author	Yazidi, Anis
dc.date.accessioned	2023-02-02T10:39:23Z
dc.date.available	2023-02-02T10:39:23Z
dc.date.created	2022-06-06T17:08:38Z
dc.date.issued	2022-05-04
dc.identifier.issn	2334-0762
dc.identifier.uri	https://hdl.handle.net/11250/3047958
dc.description.abstract	This paper deals with the problem of solving stochastic games (which have numerous business and economic applications), using the interesting tools of Learning Automata (LA), the precursors to Reinforcement Learning (RL). Classical LA systems that possess properties of absorbing barriers, have been used as powerful tools in game theory to devise solutions that converge to the game's Nash equilibrium under limited information(Sastry, Phansalkar, and Thathachar 1994). Games with limited information are intrinsically hard because the player does not know the actions chosen of other players, neither their outcomes. The player might not be even aware of the fact that he/she is playing against an opponent. With the state-of-the-art, the numerous works in LA applicable for solving game theoretical problems, can merely solve the case where the game possesses a Saddle Point in a pure strategy. They are unable to reach mixed Nash equilibria when a Saddle Point is non-existent in pure strategies. Additionally, within the field of LA and RL in general, the theoretical and applied schemes of LA with artificial barriers are scarce, even though incorporating artificial barriers in LA has served as a powerful and yet under-explored concept, since its inception in the 1980’s. More recently, the phenomenon of introducing artificial non-absorbing barriers was pioneered, and this renders the LA schemes to be resilient to being trapped in absorbing barriers. In this paper, we devise a LA with artificial barriers for solving a general form of stochastic bimatrix games. The problem’s complexity has been augmented with the scenario that we consider games in which there is no Saddle Point. By resorting to the above-mentioned powerful concept of artificial reflecting barriers, we propose a LA that converges to an optimal mixed Nash equilibrium even though there may be no Saddle Point when a pure strategy is invoked.	en_US
dc.language.iso	eng	en_US
dc.publisher	University of Florida Press	en_US
dc.relation.ispartofseries	The International FLAIRS Conference Proceedings;Vol. 35 (2022): Proceedings of FLAIRS-35
dc.rights	Navngivelse-Ikkekommersiell 4.0 Internasjonal	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc/4.0/deed.no	*
dc.subject	Learning automata	en_US
dc.subject	Learning automata with artificial barriers	en_US
dc.subject	Games with incomplete information	en_US
dc.title	Learning Automata with Artificial Reflecting Barriers in Games with Limited Information	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	(c) 2022 Ismail Hassan, B. John Oommen, Anis Yazidi	en_US
cristin.ispublished	true
cristin.fulltext	original
dc.identifier.doi	https://doi.org/10.32473/flairs.v35i.130850
dc.identifier.cristin	2029670
dc.source.journal	The International FLAIRS Conference Proceedings	en_US
dc.source.volume	35	en_US
dc.source.issue	35	en_US
dc.source.pagenumber	6	en_US

Tilhørende fil(er)

Filnavn:: learning-automata-with-artific ...
Størrelse:: 1.402Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Publikasjoner fra Cristin [3269]
TKD - Institutt for informasjonsteknologi [945]
TKD - Department of Computer Science

Vis enkel innførsel

Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse-Ikkekommersiell 4.0 Internasjonal