Enrichment of Text Documents using Information Retrieval Techniques in a Distributed Environment Articles uri icon

authors

  • Bueno, Francisco
  • GARCIA-SERRANO, ANA
  • MARTINEZ FERNANDEZ, JOSE LUIS

publication date

  • December 2010

start page

  • 8348

end page

  • 8358

issue

  • 12

volume

  • 37

International Standard Serial Number (ISSN)

  • 0957-4174

Electronic International Standard Serial Number (EISSN)

  • 1873-6793

abstract

  • The main goal of the paper is to describe a distributed information retrieval model deployed in order to enable the different functionalities needed for the enrichment of a document. Enriching a
    document here means finding, in a distributed environment, most of the
    documents related to it. Moreover, the environment is in a context in
    which documents are news, which may arrive to the system at any time,
    and the response time is critical. We first define the architecture to
    be deployed, designed with the aim of testing the effect of different
    combination approaches for selecting and ranking a set of documents in a
    continuously changing environment. Then we discuss the different
    techniques that can be used in the approach. Finally, we describe a
    prototype version of the developed software, previously settled in EU
    project NEDINE (e-Content 2225), using Ciao and taking advantage of its
    features for the development of distributed systems, using also Java for
    interfacing the system.