Authors: García Lorenzo, Óscar; Fernández Pena, Anselmo Tomás; Cabaleiro Domínguez, José Carlos; Pichel Campos, Juan Carlos; Fernández Rivera, Francisco
Record dates: 2025-01-23; 2025-01-23
Published: 2014
Citation: Lorenzo, O.G., Pena, T.F., Cabaleiro, J.C. et al. 3DyRM: a dynamic roofline model including memory latency information. J Supercomput 70, 696–708 (2014). https://doi.org/10.1007/s11227-014-1163-4
ISSN: 0920-8542
Handle: https://hdl.handle.net/10347/38926

Abstract: Modern systems present complex memory hierarchies and heterogeneity among cores and processors. As a consequence, efficient programming is challenging. An easy-to-understand performance model, offering guidelines and information about the behaviour of a code, may be useful to alleviate these issues. In this paper, we present two extensions of the well-known Berkeley Roofline Model. The first of these extensions, the Dynamic Roofline Model (DyRM), takes into consideration the complexities of multicore and heterogeneous systems, offering a more detailed view of the evolution of the execution of a code. The second, the 3DyRM, also adds information about the latency of memory accesses to better represent the behaviour on systems with complex memory hierarchies. A set of tools to obtain and represent the models has been implemented. These tools obtain the needed data from hardware counters, with low overhead. Different views are displayed by the tool that can be used to extract the main features of the code. Results of studying, with these tools, the NAS Parallel Benchmarks for OpenMP on two different systems are presented.

Language: eng
Rights: Copyright Springer Science+Business Media New York 2014
Keywords: Roofline model; Performance; Hardware counters; PEBS; NPB; Multicore
Title: 3DyRM: a dynamic roofline model including memory latency information
Type: journal article
DOI: 10.1007/s11227-014-1163-4
Access: open access
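
To make the abstract's idea concrete, the following is a minimal sketch, not the authors' tool. It assumes that each sampling interval yields a count of floating-point operations, a count of bytes moved, a duration, and a mean memory-access latency. The `Sample` fields, the function names, and the peak values are all hypothetical. It shows the classic roofline bound and how one interval could map to a point with the three coordinates the abstract mentions: operational intensity, performance, and latency.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One sampling interval; every field is a hypothetical input."""
    flops: float          # floating-point operations counted in the interval
    bytes_moved: float    # bytes transferred to or from memory in the interval
    seconds: float        # duration of the interval
    mean_latency: float   # mean memory-access latency (assumed to be in cycles)


def roofline_bound(intensity, peak_gflops, peak_bandwidth_gbs):
    """Classic roofline: attainable GFLOP/s at a given flops/byte."""
    return min(peak_gflops, peak_bandwidth_gbs * intensity)


def to_point(sample):
    """Map one interval to (flops/byte, GFLOP/s, latency)."""
    intensity = sample.flops / sample.bytes_moved
    gflops = sample.flops / sample.seconds / 1e9
    return intensity, gflops, sample.mean_latency


# Two made-up intervals: one memory-bound, one compute-bound.
samples = [
    Sample(flops=2e9, bytes_moved=8e9, seconds=1.0, mean_latency=300.0),
    Sample(flops=8e9, bytes_moved=2e9, seconds=1.0, mean_latency=40.0),
]
points = [to_point(s) for s in samples]

for intensity, gflops, latency in points:
    # Peak values are illustrative, not measurements of any real machine.
    bound = roofline_bound(intensity, peak_gflops=10.0, peak_bandwidth_gbs=20.0)
    print(f"{intensity:.2f} flops/byte: {gflops:.1f} of {bound:.1f} GFLOP/s, "
          f"latency {latency:.0f}")
```

Plotting such points in sampling order would give a view of how a code moves under the roofline over time, and the third coordinate separates intervals with similar intensity but different memory-access cost.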