3DyRM: a dynamic roofline model including memory latency information
Loading...
Identifiers
Publication date
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Nature
Abstract
Modern systems present complex memory hierarchies and heterogeneity among cores and processors. As a consequence, efficient programming is challenging. An easy-to-understand performance model, offering guidelines and information about the behaviour of a code, may be useful to alleviate these issues. In this paper, we present two extensions of the well-known Berkeley Roofline Model. The first of these extensions, the Dynamic Roofline Model (DyRM), takes into consideration the complexities of multicore and heterogeneous systems, offering a more detailed view of the evolution of the execution of a code. The second, the 3DyRM, also adds information about the latency of memory accesses to better represent the behaviour on systems with complex memory hierarchies. A set of tools to obtain and represent the models has been implemented. These tools obtain the needed data from hardware counters, with low overhead. Different views are displayed by the tool that can be used to extract the main features of the code. Results of studying, with these tools, the NAS Parallel Benchmarks for OpenMP on two different systems are presented.
Description
Keywords
Bibliographic citation
Lorenzo, O.G., Pena, T.F., Cabaleiro, J.C. et al. 3DyRM: a dynamic roofline model including memory latency information. J Supercomput 70, 696–708 (2014). https://doi.org/10.1007/s11227-014-1163-4
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Publisher version
https://link.springer.com/article/10.1007/s11227-014-1163-4Sponsors
Rights
Copyright Springer Science+Business Media New York 2014








