TY - GEN
T1 - Stream experiments
T2 - 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010
AU - Laosooksathit, Supada
AU - Leangsuksun, Chokchai
AU - Baggag, Abdelkader
AU - Chandler, Clayton
PY - 2010
Y1 - 2010
N2 - In multithreaded programming on GPUs, data transfer between CPU and GPUs is a major impendence that prevents GPU to achieve its potential. Hence, stream management framework-a latency hiding strategy introduced by CUDA, becomes our attention. Streaming allows overlapping between kernel execution time and transfer time of independent data between CPU and GPUs. For this reason, the total execution time can potentially be reduced. In this paper, we introduced performance models in order to study the utilization of streams. Moreover, we have studied two methods that are used for timing operations in CUDA, namely CUDA calls and CUDA events. CUDA call functions are functions implemented in C++, while CUDA events method is an API. Our finding shows that CUDA events method is more accurate for timing operations running on GPU than CUDA call functions.
AB - In multithreaded programming on GPUs, data transfer between CPU and GPUs is a major impendence that prevents GPU to achieve its potential. Hence, stream management framework-a latency hiding strategy introduced by CUDA, becomes our attention. Streaming allows overlapping between kernel execution time and transfer time of independent data between CPU and GPUs. For this reason, the total execution time can potentially be reduced. In this paper, we introduced performance models in order to study the utilization of streams. Moreover, we have studied two methods that are used for timing operations in CUDA, namely CUDA calls and CUDA events. CUDA call functions are functions implemented in C++, while CUDA events method is an API. Our finding shows that CUDA events method is more accurate for timing operations running on GPU than CUDA call functions.
KW - GPGPU
KW - High performance computing
KW - Latency hiding
UR - http://www.scopus.com/inward/record.url?scp=77954574912&partnerID=8YFLogxK
U2 - 10.2316/p.2010.676-066
DO - 10.2316/p.2010.676-066
M3 - Conference contribution
AN - SCOPUS:77954574912
SN - 9780889868205
T3 - Proceedings of the 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010
SP - 240
EP - 248
BT - Proceedings of the 9th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2010
PB - Acta Press
Y2 - 16 February 2010 through 18 February 2010
ER -