WMTools - assessing parallel application memory utilisation at scale
Perks, O. F. J., Hammond, Simon D., Pennycook, Simon J. and Jarvis, Stephen A. (2011) WMTools - assessing parallel application memory utilisation at scale. In: Computer Performance Engineering 8th European Performance Engineering Workshop, EPEW 2011, Borrowdale, UK, October 12-13, 2011. Proceedings. Lecture Notes in Computer Science, 6977 (No.6977). Berlin, Germany: Springer, pp. 148-162. ISBN 9783642247484
Text: epew-mem-3.pdf - Published Version (421Kb). Embargoed item; access restricted to Repository staff only.
Official URL: http://dx.doi.org/10.1007/978-3-642-24749-1
Abstract
The divergence between processor and memory performance has been a well discussed aspect of computer architecture literature for some years. The recent use of multi-core processor designs has, however, brought new problems to the design of memory architectures - as more cores are added to each successive generation of processor, equivalent improvement in memory capacity and memory sub-systems must be made if the compute components of the processor are to remain sufficiently supplied with data. These issues combined with the traditional problem of designing cache-efficient code help to ensure that memory remains an on-going challenge for application and machine designers. In this paper we present a comprehensive discussion of WMTools - a trace-based toolkit designed to support the analysis of memory allocation for parallel applications. This paper features an extended discussion of the WMTrace tracing tool presented in previous work including a revised discussion on trace-compression and several refinements to the tracing methodology to reduce overheads and improve tool scalability. The second half of this paper features a case study in which we apply WMTools to five parallel scientific applications and benchmarks, demonstrating its effectiveness at recording high-water mark memory consumption as well as memory use per-function over time. An in-depth analysis is provided for an unstructured mesh benchmark which reveals significant memory allocation imbalance across its participating processes. This study demonstrates the use of WMTools in elucidating memory allocation issues in high-performance scientific codes.
| Item Type: | Book Item |
|---|---|
| Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software |
| Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science |
| Series Name: | Lecture Notes in Computer Science |
| Publisher: | Springer |
| Place of Publication: | Berlin, Germany |
| ISBN: | 9783642247484 |
| Book Title: | Computer Performance Engineering 8th European Performance Engineering Workshop, EPEW 2011, Borrowdale, UK, October 12-13, 2011. Proceedings |
| Official Date: | 12 October 2011 |
| Volume: | 6977 |
| Number: | No.6977 |
| Page Range: | pp. 148-162 |
| DOI: | 10.1007/978-3-642-24749-1 |
| Status: | Peer Reviewed |
| Publication Status: | Published |
| Access rights to Published version: | Restricted or Subscription Access |
| Date of first compliant deposit: | 1 August 2016 |
| Version or Related Resource: | Based on a paper given at 8th European Performance Engineering Workshop, EPEW 2011, Borrowdale, UK, October 12-13, 2011. |
| Conference Paper Type: | Paper |
| Title of Event: | Proceedings of the 8th European Performance Engineering Workshop (EPEW11) |
| Type of Event: | Workshop |
| Location of Event: | Borrowdale, United Kingdom |
| Date(s) of Event: | 12-13th October, 2011 |