Enabling policy making processes by unifying and reconciling corporate names in public procurement data. The CORFU technique

publication date

  • September 2015

start page

  • 28

end page

  • 38

volume

  • 41

International Standard Serial Number (ISSN)

  • 0920-5489

Electronic International Standard Serial Number (EISSN)

  • 1872-7018

abstract

  • This paper introduces the design, implementation and evaluation of the CORFU technique, which deals with ambiguities and heterogeneities in corporate names in the context of public procurement metadata. The technique is applied to the "PublicSpending.net" initiative to show how the unification of corporate names is the cornerstone of a visualization service that can help policy-makers detect and prevent upcoming necessities. Furthermore, a research study evaluating the precision, recall and robustness of the proposed technique is conducted using more than 40 million names extracted from public procurement datasets (Australia, United States and United Kingdom) and the CrocTail project. © 2015 Elsevier B.V. All rights reserved.