dc.contributor.advisor | Sødring, Thomas | |
dc.contributor.author | Tefera, Getnet Lemma | |
dc.date.accessioned | 2011-11-22T13:59:40Z | |
dc.date.available | 2011-11-22T13:59:40Z | |
dc.date.issued | 2011 | |
dc.identifier.uri | https://hdl.handle.net/10642/982 | |
dc.description | Joint Master Degree in Digital Library Learning (DILL) | en_US |
dc.description.abstract | Despite the early existence of various techniques and tools, the data quality
problem was not given the attention it deserves. Until recently (the 1990s), concern for data
quality was restricted to certain sectors; later, following the exposure of the huge losses
caused by data quality problems, a range of work on the topic appeared. A few scholars have
been involved in exposing the data quality problem and finding solutions; among
the initiatives to study the problem systematically was the Total Data
Quality Management methodology.
The archiving sector is no different: in archiving or long-term preservation,
unless the preserved data is accurate and authentic, it will be
of little value.
This paper studies how to ensure the accuracy of data in digital archives and
presents a data quality approach, called object identification, as a way of
ensuring that archive data is accurate. Most research has
focused on relational data, but with the increasing popularity and importance of
XML data, there is a need for data quality tools and methodologies
suited to XML. For this reason, the object identification technique in
this study focuses on XML data.
The research used Noark data as a case study and developed a prototype of an
object identification technique. The prototype showed good results
when tested on representative sample Noark data.
This study is significant in taking the initiative to raise awareness of data
quality issues in archives. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Høgskolen i Oslo. Avdeling for journalistikk, bibliotek- og informasjonsvitenskap | en_US |
dc.publisher | Universitetet i Tallinn | en_US |
dc.publisher | Universitetet i Parma | en_US |
dc.subject | Data quality | en_US |
dc.subject | Object identification | en_US |
dc.subject | Noark | en_US |
dc.subject | Archives | en_US |
dc.subject | XML | en_US |
dc.subject | VDP::Samfunnsvitenskap: 200::Biblioteks- og informasjonsvitenskap: 320 | en_US |
dc.subject | VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551 | en_US |
dc.title | Implementing an XML Object Identification System on an archive data | en_US |
dc.type | Master thesis | en_US |