Data Quality Barriers for Transparency in Public Procurement
Soylu, Ahmet; Corcho, Oscar; Elvesæter, Brian; Badenes-Olmedo, Carlos; Yedro Martínez, Francisco; Kovavic, Matej; Posinkovic, Matej; Medvescek, Mitja; Makgill, Ian; Taggart, Chris; Simperl, Elena; Lech, Till Christopher; Roman, Dumitru
Peer reviewed, Journal article
Published version
View/ Open
Date
2022-02-20Metadata
Show full item recordCollections
Original version
https://doi.org/10.3390/info13020099Abstract
Governments need to be accountable and transparent for their public spending decisions in order to prevent losses through fraud and corruption as well as to build healthy and sustainable economies. Open data act as a major instrument in this respect by enabling public administrations, service providers, data journalists, transparency activists, and regular citizens to identify fraud or uncompetitive markets through connecting related, heterogeneous, and originally unconnected data sources. To this end, in this article, we present our experience in the case of Slovenia, where we successfully applied a number of anomaly detection techniques over a set of open disparate data sets integrated into a Knowledge Graph, including procurement, company, and spending data, through a linked data-based platform called TheyBuyForYou. We then report a set of guidelines for publishing high quality procurement data for better procurement analytics, since our experience has shown us that there are significant shortcomings in the quality of data being published. This article contributes to enhanced policy making by guiding public administrations at local, regional, and national levels on how to improve the way they publish and use procurement-related data; developing technologies and solutions that buyers in the public and private sectors can use and adapt to become more transparent, make markets more competitive, and reduce waste and fraud; and providing a Knowledge Graph, which is a data resource that is designed to facilitate integration across multiple data silos by showing how it adds context and domain knowledge to machine-learning-based procurement analytic.