Big-BOE: Fusing Spanish Official Gazette with Big Data Technology Articles uri icon

publication date

  • June 2018

start page

  • 124

end page

  • 138

issue

  • 124

volume

  • 6

International Standard Serial Number (ISSN)

  • 2167-6461

abstract

  • The proliferation of new data sources, stemmed from the adoption of open-data schemes, in combination with an increasing computing capacity causes the inception of new type of analytics that process Internet of things with low-cost engines to speed up data processing using parallel computing. In this context, the article presents an initiative, called BIG-Boletin official del Estado (BOE), designed to process the Spanish official government gazette (BOE) with state-of-the-art processing engines, to reduce computation time and to offer additional speed up for big data analysts. The goal of including a big data infrastructure is to be able to process different BOE documents in parallel with specific analytics, to search for several issues in different documents. The application infrastructure processing engine is described from an architectural perspective and from performance, showing evidence on how this type of infrastructure improves the performance of different types of simple analytics as several machines cooperate.

keywords

  • analytics; big data; data fusion; efficient big data; fusion of services; open-data