JCSE, vol. 11, no. 2, pp. 58-68, 2017
DOI: http://dx.doi.org/10.5626/JCSE.2017.11.2.58
Warp-Based Load/Store Reordering to Improve GPU Time Predictability
Yijie Huangfu and Wei Zhang
Department of Electrical and Computer Engineering, Virginia Commonwealth University, Richmond, VA, USA
Abstract: While graphics processing units (GPUs) can improve the performance of real-time embedded applications that
require high throughput, it is challenging to estimate the worst-case execution time (WCET) of GPU programs, because
modern GPUs are designed to improve average-case performance rather than time predictability. In this paper, a
reordering framework is proposed to regulate accesses to the GPU data cache, which improves the accuracy of
GPU L1 data cache miss rate estimation at low performance overhead. With the improved cache miss rate
estimation, tighter WCET estimates can be achieved for GPU programs.
Keywords:
GPU; Data cache; WCET estimation