Vis enkel innførsel

dc.contributor.authorYazidi, Anis
dc.contributor.authorHammer, Hugo Lewi
dc.contributor.authorJonassen, Tore Møller
dc.date.accessioned2020-02-08T17:30:03Z
dc.date.accessioned2020-02-19T14:45:17Z
dc.date.available2020-02-08T17:30:03Z
dc.date.available2020-02-19T14:45:17Z
dc.date.issued2019-04-11
dc.identifier.citationYazidi A., Hammer H.L., Jonassen T.M. Two-time scale learning automata: an efficient decision making mechanism for stochastic nonlinear resource allocation. Applied intelligence (Boston). 2019;49(9):3392-3405en
dc.identifier.issn0924-669X
dc.identifier.issn0924-669X
dc.identifier.issn1573-7497
dc.identifier.urihttps://hdl.handle.net/10642/8147
dc.description.abstractThe Stochastic Non-linear Fractional Equality Knapsack (NFEK) problem is a substantial resource allocation problem which admits a large set of applications such as web polling under polling constraints, and constrained estimation. The NFEK problem is usually solved by trial and error based on noisy feedback information from the environment. The available solutions to NFEK are based on the traditional family of Reward-Inaction Learning Automata (LA) scheme where the action probabilities are updated based on only the last feedback. Such an update form seems counterproductive for two reasons: 1) it only uses the last feedback and does not consider the whole history of the feedback and 2) it ignores updates whenever the last feedback does not correspond to a reward. In this paper, we rather suggest instead a learning solution that resorts to the whole history of feedback using the theory of two time-scale separation. Through comprehensive experimental results we show that the proposed solution is not only superior to the state-of-the-art in terms of peak performance but is also robust to the choice of the tuning parameters.en
dc.language.isoenen
dc.publisherSpringeren
dc.relation.ispartofseriesApplied Intelligence;Volume 49, Issue 9
dc.rightsThis is a post-peer-review, pre-copyedit version of an article published in Applied Intelligence. The final authenticated version is available online at: https://dx.doi.org/10.1007/s10489-019-01453-0en
dc.subjectDecision making uncertaintiesen
dc.subjectContinuous learning automataen
dc.subjectTwo time scalesen
dc.subjectStochastic non linear fractional equality knapsacksen
dc.subjectResource allocationsen
dc.titleTwo-time scale learning automata: an efficient decision making mechanism for stochastic nonlinear resource allocationen
dc.typeJournal articleen
dc.typePeer revieweden
dc.date.updated2020-02-08T17:30:03Z
dc.description.versionacceptedVersionen
dc.identifier.doihttps://dx.doi.org/10.1007/s10489-019-01453-0
dc.identifier.cristin1744683
dc.source.journalApplied intelligence (Boston)


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel