Vis enkel innførsel

dc.contributor.authorShrestha, Raju
dc.date.accessioned2022-05-31T08:48:22Z
dc.date.available2022-05-31T08:48:22Z
dc.date.created2022-01-26T14:29:41Z
dc.date.issued2021
dc.identifier.isbn978-1-4503-8416-2
dc.identifier.urihttps://hdl.handle.net/11250/2996980
dc.description.abstractMillions of people who are either blind or visually impaired have difficulty understanding the content in an image. To address the problem textual image descriptions or captions are provided separately or as alternative texts on the web so that the users can read them through a screen reader. However, most of the image descriptions provided are inadequate to make them accessible enough. Image descriptions could be written either manually or automatically generated using software tools. There are tools, methods, and metrics used to evaluate the quality of the generated text. However, almost all of them are word-similarity-based and generic. Even though there are standard guidelines such as WCAG2.0 and NCAM image accessibility guidelines, they are rarely used in the evaluation of image descriptions. In this paper, we propose a neural network-based framework and models for an automatic evaluation of image descriptions in terms of compliance with the NCAM guidelines. A custom dataset was created from a widely used Flickr8K dataset to train and test the models. The experimental results show the proposed framework performing very well with an average accuracy of above 98%. We believe that the framework could be helpful and useful for the authors of image descriptions in writing accessible image descriptions for the users.en_US
dc.language.isoengen_US
dc.publisherACM Digital Libraryen_US
dc.relation.ispartofProceedings of the 4th Artificial Intelligence and Cloud Computing Conference
dc.subjectImage accessibilityen_US
dc.subjectDescriptionen_US
dc.subjectCaptionen_US
dc.subjectMachine learningen_US
dc.subjectNeural networken_US
dc.subjectAutomatic evaluationen_US
dc.titleA neural network model and framework for an automatic evaluation of image descriptions based on NCAM image accessibility guidelinesen_US
dc.typeChapteren_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersionen_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1
dc.identifier.cristin1990546
dc.subject.nsiVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551en_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel