Exploring stream parallel patterns in distributed MPI environments Articles uri icon

publication date

  • May 2019

start page

  • 24

end page

  • 36

volume

  • 84

International Standard Serial Number (ISSN)

  • 0167-8191

Electronic International Standard Serial Number (EISSN)

  • 1872-7336

abstract

  • In recent years, the large volumes of stream data and the near real-time requirements of data streaming applications have exacerbated the need for new scalable algorithms and programming interfaces for distributed and shared-memory platforms. To contribute in this direction, this paper presents a new distributed MPI back end for GrPPI, a C++ high-level generic interface of data-intensive and stream processing parallel patterns. This back end, as a new execution policy, supports distributed and hybrid (distributed+shared-memory) parallel executions of the Pipeline and Farm patterns, where the hybrid mode combines the MPI policy with a GrPPI shared-memory one. These patterns internally leverage distributed queues, which can be configured to use two-sided or one-sided MPI primitives to communicate items among nodes. A detailed analysis of the GrPPI MPI execution policy reports considerable benefits from the programmability, flexibility and readability points of view. The experimental evaluation of two different streaming applications with different distributed and shared-memory scenarios reports considerable performance gains with respect to the sequential versions at the expense of negligible GrPPI overheads.

keywords

  • parallel patterns; stream processing; distributed patterns; c++ programming; generic programming