Implementing an XML Object Identification System on an archive data

Tefera, Getnet Lemma

dc.contributor.advisor	Sødring, Thomas
dc.contributor.author	Tefera, Getnet Lemma
dc.date.accessioned	2011-11-22T13:59:40Z
dc.date.available	2011-11-22T13:59:40Z
dc.date.issued	2011
dc.identifier.uri	https://hdl.handle.net/10642/982
dc.description	Joint Master Degree in Digital Library Learning (DILL)	en_US
dc.description.abstract	Despite the existence of various techniques and tools at early stage, the data quality problem was not given the attention it deserves, until recent time,1990s the data quality was restricted to certain sectors, but later following the exposition of the huge losses due to data quality related problems different works has been seen. A few scholars have been involved in exposing the data quality problem and also finding solutions; among the initiatives to study the data quality problem systematically was the total data quality management methodology. The archiving sector is not a different from the above case, in the process of archiving or long term preservation unless the data preserved is accurate and authentic its use would be of little value. This paper is the study of how to ensure the accuracy of digital archives data and it presents a data quality approach called an object identification technique as a way of ensuring that an archive data is accurate. Most of the research undertakings have been focusing on relational data, but with the increasing popularity and importance of the XML data, there is a concern for developing data quality tools and methodologies which suit the XML data need. Based on this fact the object identification technique on this study focused on an XML data. The research used the Noark data as a case study and developed a prototype of an object identification technique. The prototyped object identification technique has shown a good result upon a test on sample Noark representative data. This study is of significant in taking the initiative to create the awareness on data quality issues in the case of an archive.	en_US
dc.language.iso	eng	en_US
dc.publisher	Høgskolen i Oslo. Avdeling for journalistikk, bibliotek- og informasjonsvitenskap	en_US
dc.publisher	Universitetet i Tallinn	en_US
dc.publisher	Universitetet i Parma	en_US
dc.subject	Data quality	en_US
dc.subject	Object identification	en_US
dc.subject	Noark	en_US
dc.subject	Archives	en_US
dc.subject	XML	en_US
dc.subject	VDP::Samfunnsvitenskap: 200::Biblioteks- og informasjonsvitenskap: 320	en_US
dc.subject	VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551	en_US
dc.title	Implementing an XML Object Identification System on an archive data	en_US
dc.type	Master thesis	en_US

Tilhørende fil(er)

Filnavn:: Tefera_Getnet_Lemma.pdf
Størrelse:: 532.0Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

SAM - Joint Master Degree in Digital Library Learning (DILL) [78]

Vis enkel innførsel