The Library
On the acceleration of wavefront applications using distributed many-core architectures
Tools
Pennycook, Simon J., Hammond, Simon D., Mudalige, Gihan R., Wright, Steven A. and Jarvis, Stephen A. (2012) On the acceleration of wavefront applications using distributed many-core architectures. Computer Journal, Volume 55 (Number 2). pp. 138-153. doi:10.1093/comjnl/bxr073 ISSN 0010-4620.
|
Text
WRAP_Jarvis_On_the_Acceleration_of_Wavefront_Applications_using_Distributed_Many_Core_Architectures.pdf Download (673Kb) | Preview |
Official URL: http://dx.doi.org/10.1093/comjnl/bxr073
Abstract
In this paper we investigate the use of distributed graphics processing unit (GPU)-based architectures to accelerate pipelined wavefront applications—a ubiquitous class of parallel algorithms used for the solution of a number of scientific and engineering applications. Specifically, we employ a recently developed port of the LU solver (from the NAS Parallel Benchmark suite) to investigate the performance of these algorithms on high-performance computing solutions from NVIDIA (Tesla C1060 and C2050) as well as on traditional clusters (AMD/InfiniBand and IBM BlueGene/P). Benchmark results are presented for problem classes A to C and a recently developed performance model is used to provide projections for problem classes D and E, the latter of which represents a billion-cell problem. Our results demonstrate that while the theoretical performance of GPU solutions will far exceed those of many traditional technologies, the sustained application performance is currently comparable for scientific wavefront applications. Finally, a breakdown of the GPU solution is conducted, exposing PCIe overheads and decomposition constraints. A new k-blocking strategy is proposed to improve the future performance of this class of algorithm on GPU-based architectures.
Item Type: | Journal Article | ||||
---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software | ||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science | ||||
Library of Congress Subject Headings (LCSH): | High performance computing, Parallel processing (Electronic computers), Computer architecture, Parallel programming (Computer science), Application software -- Development | ||||
Journal or Publication Title: | Computer Journal | ||||
Publisher: | Oxford University Press | ||||
ISSN: | 0010-4620 | ||||
Official Date: | February 2012 | ||||
Dates: |
|
||||
Volume: | Volume 55 | ||||
Number: | Number 2 | ||||
Page Range: | pp. 138-153 | ||||
DOI: | 10.1093/comjnl/bxr073 | ||||
Status: | Peer Reviewed | ||||
Publication Status: | Published | ||||
Access rights to Published version: | Restricted or Subscription Access | ||||
Funder: | Royal Society (Great Britain), Atomic Weapons Establishment (Great Britain) (AWE), Knowledge Transfer Partnerships (KTP) | ||||
Grant number: | IF090020/AM (RS), KTP006740 (KTP) |
Request changes or add full text files to a record
Repository staff actions (login required)
View Item |