Show simple item record

dc.contributor.authorMeng, Li
dc.contributor.authorGoodwin, Morten
dc.contributor.authorYazidi, Anis
dc.contributor.authorEngelstad, Paal E.
dc.date.accessioned2024-02-29T11:25:26Z
dc.date.available2024-02-29T11:25:26Z
dc.date.created2023-11-29T17:58:12Z
dc.date.issued2023
dc.identifier.isbn978-3-031-44239-1
dc.identifier.isbn978-3-031-44240-7
dc.identifier.issn0302-9743
dc.identifier.issn1611-3349
dc.identifier.urihttps://hdl.handle.net/11250/3120462
dc.description.abstractState representation learning aims to capture latent factors of an environment. Although some researchers realize the connections between masked image modeling and contrastive representation learning, the effort is focused on using masks as an augmentation technique to represent the latent generative factors better. Partially observable environments in reinforcement learning have not yet been carefully studied using unsupervised state representation learning methods. In this article, we create an unsupervised state representation learning scheme for partially observable states. We conducted our experiment on a previous Atari 2600 framework designed to evaluate representation learning models. A contrastive method called Spatiotemporal DeepInfomax (ST-DIM) has shown state-of-the-art performance on this benchmark but remains inferior to its supervised counterpart. Our approach improves ST-DIM when the environment is not fully observable and achieves higher F1 scores and accuracy scores than the supervised learning counterpart. The mean accuracy score averaged over categories of our approach is 66%, compared to 38% of supervised learning. The mean F1 score is 64% to 33%. The code can be found on https://github.com/mengli11235/MST_DIM.en_US
dc.language.isoengen_US
dc.publisherSpringeren_US
dc.relation.ispartofComputer Analysis of Images and Patterns. CAIP 2023
dc.relation.ispartofseriesLecture Notes in Computer Science;
dc.titleUnsupervised State Representation Learning in Partially Observable Atari Gamesen_US
dc.typeChapteren_US
dc.typePeer revieweden_US
dc.typeConference objecten_US
dc.typeJournal articleen_US
dc.description.versionacceptedVersionen_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1
dc.identifier.doihttps://doi.org/10.1007/978-3-031-44240-7_21
dc.identifier.cristin2205633
dc.source.journalLecture Notes in Computer Scienceen_US
dc.source.pagenumber212-222en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record