GPU coprocessors as a service for deep learning inference in high energy physics

Shih-Chieh Hsu, Scott Hauck, MatthewTrahms, KelvinLin, Natchanon Suaysom

July 2020

PDF Project Project Custom Link

Throughput of ResNet-50 as a service in events per second versus the number of simultaneous clients.

Abstract

In the next decade, the demands for computing in large scientific experiments are expected to grow tremendously. During the same time period, CPU performance increases will be limited. At the CERN Large Hadron Collider (LHC), these two issues will confront one another as the collider is upgraded for high luminosity running. Alternative processors such as graphics processing units (GPUs) can resolve this confrontation provided that algorithms can be sufficiently accelerated. In many cases, algorithmic speedups are found to be largest through the adoption of deep learning algorithms. We present a comprehensive exploration of the use of GPU-based hardware acceleration for deep learning inference within the data reconstruction workflow of high energy physics. We present several realistic examples and discuss a strategy for the seamless integration of coprocessors so that the LHC can maintain, if not exceed, its current performance throughout its running.

Type

Preprint

Supplementary notes can be added here, including code and math.