Comparison of parallel implementation strategies in GPU-accelerated System-on-Chip under proton irradiation Articles uri icon

publication date

  • March 2022

start page

  • 444

end page

  • 452

issue

  • 3

volume

  • 69

International Standard Serial Number (ISSN)

  • 0018-9499

Electronic International Standard Serial Number (EISSN)

  • 1558-1578

abstract

  • Commercial off-the-shelf (COTS) system-on-chip (SoC) are becoming widespread in embedded systems. Many of them include a multicore central processing unit (CPU) and a high-end graphics processing unit (GPU). They combine high computational performance with low power consumption and flexible multilevel parallelism. This kind of device is also being considered for radiation environments where large amounts of data must be processed or compute-intensive applications must be executed. In this article, we compare three different strategies to perform matrix multiplication in the GPU of a Tegra TK1 SoC. Our aim is to analyze how the different use of the resources of the GPU influences not only the computational performance of the algorithm, but also its radiation sensitivity. Radiation experiments with protons were performed to compare the behavior of the three strategies. Experimental results show that most of the errors force a reboot of the platform. The number of errors is directly related with how the algorithms use the internal memories of the GPU and increases with the matrix size. It is also related with the number of transactions with the global memory, which in our experiments is not affected by the radiation. Results show that the smallest cross section is obtained with the fastest algorithm, even if it uses the cores of the GPU more intensively.

subjects

  • Electronics

keywords

  • embedded systems; graphics processing unit (gpu); parallelization; proton irradiation