Vis enkel innførsel

dc.contributor.authorSteinhauer, Jeremyen_US
dc.contributor.authorDelcambre, Lois M.L.en_US
dc.contributor.authorLykke, Marianneen_US
dc.contributor.authorÅdland, Marit Kristineen_US
dc.date.accessioned2015-02-10T09:38:18Z
dc.date.available2015-02-10T09:38:18Z
dc.date.issued2013en_US
dc.identifier.citationSteinhauer, J., Delcambre, L. M., Lykke, M., & Ådland, M. K. (2013). Do User (Browse and Click) Sessions Relate to Their Questions in a Domain-Specific Collection?. In Research and Advanced Technology for Digital Libraries (pp. 96-107). Springer Berlin Heidelberg.en_US
dc.identifier.isbn978-3-642-40500-6en_US
dc.identifier.issn0302-9743en_US
dc.identifier.otherFRIDAID 1091791en_US
dc.identifier.urihttps://hdl.handle.net/10642/2375
dc.description.abstractWe seek to improve information retrieval in a domain-specific collection by clustering user sessions from a click log and then classifying later user sessions in real-time. As a preliminary step, we explore the main assumption of this approach: whether user sessions in such a site are related to the question that they are answering. Since a large class of machine learning algorithms use a distance measure at the core, we evaluate the suitability of common machine learning distance measures to distinguish sessions of users searching for the answer to same or different questions. We found that two distance measures work very well for our task and three others do not. As a further step, we then investigate how effective the distance measures are when used in clustering. For our dataset, we conducted a user study where we had multiple users answer the same set of questions. This data, grouped by question, was used as our gold standard for evaluating the clusters produced by the clustering algorithms. We found that the observed difference between the two classes of distance measures affected the quality of the clusterings, as expected. We also found that one of the two distance measures that worked well to differentiate sessions, worked significantly better than the other when clustering. Finally, we discuss why some distance metrics performed better than others in the two parts of our work.en_US
dc.language.isoengen_US
dc.publisherSpringeren_US
dc.relation.ispartofseriesLecture Notes in Computer Science;8092en_US
dc.subjectClick loggingen_US
dc.subjectUser sessionsen_US
dc.subjectMachine learningen_US
dc.subjectVDP::Samfunnsvitenskap: 200::Biblioteks- og informasjonsvitenskap: 320::Kunnskapsgjenfinning og organisering: 323en_US
dc.subjectVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551en_US
dc.titleDo User (Browse and Click) Sessions Relate to Their Questions in a Domain-Specific Collection?en_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionPostprint version of published article. Original article is available at www.springerlink.comen_US
dc.identifier.doihttp://dx.doi.org/10.1007/978-3-642-40501-3_10


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel