Two-time scale learning automata: an efficient decision making mechanism for stochastic nonlinear resource allocation

Yazidi, Anis; Hammer, Hugo Lewi; Jonassen, Tore Møller

dc.contributor.author	Yazidi, Anis
dc.contributor.author	Hammer, Hugo Lewi
dc.contributor.author	Jonassen, Tore Møller
dc.date.accessioned	2020-02-08T17:30:03Z
dc.date.accessioned	2020-02-19T14:45:17Z
dc.date.available	2020-02-08T17:30:03Z
dc.date.available	2020-02-19T14:45:17Z
dc.date.issued	2019-04-11
dc.identifier.citation	Yazidi A., Hammer H.L., Jonassen T.M. Two-time scale learning automata: an efficient decision making mechanism for stochastic nonlinear resource allocation. Applied intelligence (Boston). 2019;49(9):3392-3405	en
dc.identifier.issn	0924-669X
dc.identifier.issn	0924-669X
dc.identifier.issn	1573-7497
dc.identifier.uri	https://hdl.handle.net/10642/8147
dc.description.abstract	The Stochastic Non-linear Fractional Equality Knapsack (NFEK) problem is a substantial resource allocation problem which admits a large set of applications such as web polling under polling constraints, and constrained estimation. The NFEK problem is usually solved by trial and error based on noisy feedback information from the environment. The available solutions to NFEK are based on the traditional family of Reward-Inaction Learning Automata (LA) scheme where the action probabilities are updated based on only the last feedback. Such an update form seems counterproductive for two reasons: 1) it only uses the last feedback and does not consider the whole history of the feedback and 2) it ignores updates whenever the last feedback does not correspond to a reward. In this paper, we rather suggest instead a learning solution that resorts to the whole history of feedback using the theory of two time-scale separation. Through comprehensive experimental results we show that the proposed solution is not only superior to the state-of-the-art in terms of peak performance but is also robust to the choice of the tuning parameters.	en
dc.language.iso	en	en
dc.publisher	Springer	en
dc.relation.ispartofseries	Applied Intelligence;Volume 49, Issue 9
dc.rights	This is a post-peer-review, pre-copyedit version of an article published in Applied Intelligence. The final authenticated version is available online at: https://dx.doi.org/10.1007/s10489-019-01453-0	en
dc.subject	Decision making uncertainties	en
dc.subject	Continuous learning automata	en
dc.subject	Two time scales	en
dc.subject	Stochastic non linear fractional equality knapsacks	en
dc.subject	Resource allocations	en
dc.title	Two-time scale learning automata: an efficient decision making mechanism for stochastic nonlinear resource allocation	en
dc.type	Journal article	en
dc.type	Peer reviewed	en
dc.date.updated	2020-02-08T17:30:03Z
dc.description.version	acceptedVersion	en
dc.identifier.doi	https://dx.doi.org/10.1007/s10489-019-01453-0
dc.identifier.cristin	1744683
dc.source.journal	Applied intelligence (Boston)

Tilhørende fil(er)

Filnavn:: APIN_TwoTimeScale.pdf
Størrelse:: 2.346Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

TKD - Institutt for informasjonsteknologi [940]
TKD - Department of Computer Science

Vis enkel innførsel