3DyRM: a dynamic roofline model including memory latency information

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.authorGarcía Lorenzo, Óscar
dc.contributor.authorFernández Pena, Anselmo Tomás
dc.contributor.authorCabaleiro Domínguez, José Carlos
dc.contributor.authorPichel Campos, Juan Carlos
dc.contributor.authorFernández Rivera, Francisco
dc.date.accessioned2025-01-23T08:58:01Z
dc.date.available2025-01-23T08:58:01Z
dc.date.issued2014
dc.description.abstractModern systems present complex memory hierarchies and heterogeneity among cores and processors. As a consequence, efficient programming is challenging. An easy-to-understand performance model, offering guidelines and information about the behaviour of a code, may be useful to alleviate these issues. In this paper, we present two extensions of the well-known Berkeley Roofline Model. The first of these extensions, the Dynamic Roofline Model (DyRM), takes into consideration the complexities of multicore and heterogeneous systems, offering a more detailed view of the evolution of the execution of a code. The second, the 3DyRM, also adds information about the latency of memory accesses to better represent the behaviour on systems with complex memory hierarchies. A set of tools to obtain and represent the models has been implemented. These tools obtain the needed data from hardware counters, with low overhead. Different views are displayed by the tool that can be used to extract the main features of the code. Results of studying, with these tools, the NAS Parallel Benchmarks for OpenMP on two different systems are presented.
dc.description.peerreviewedSI
dc.identifier.citationLorenzo, O.G., Pena, T.F., Cabaleiro, J.C. et al. 3DyRM: a dynamic roofline model including memory latency information. J Supercomput 70, 696–708 (2014). https://doi.org/10.1007/s11227-014-1163-4
dc.identifier.doi10.1007/s11227-014-1163-4
dc.identifier.issn0920-8542
dc.identifier.urihttps://hdl.handle.net/10347/38926
dc.issue.number3
dc.journal.titleJournal of Supercomputing
dc.language.isoeng
dc.page.final708
dc.page.initial696
dc.publisherSpringer Nature
dc.relation.projectIDinfo:eu-repo/grantAgreement/MICINN//TIN2010-17541/ES/HARDWARE Y SOFTWARE PARA COMPUTACION DE ALTAS PRESTACIONES/
dc.relation.publisherversionhttps://link.springer.com/article/10.1007/s11227-014-1163-4
dc.rightsCopyright Springer Science+Business Media New York 2014
dc.rights.accessRightsopen access
dc.subjectRoofline model
dc.subjectPerformance
dc.subjectHardware counters
dc.subjectPEBS
dc.subjectNPB
dc.subjectMulticore
dc.title3DyRM: a dynamic roofline model including memory latency information
dc.typejournal article
dc.type.hasVersionAM
dc.volume.number70
dspace.entity.typePublication
relation.isAuthorOfPublication0faa7141-ea10-4a10-9414-45cd7b726fef
relation.isAuthorOfPublicationdecb372f-b9cd-4237-8dda-2c0f5c40acbe
relation.isAuthorOfPublication1959c3e1-552e-4a0b-bc17-a5f9f687ad38
relation.isAuthorOfPublicationdb334853-753e-4afc-9f4f-ad847d0353a7
relation.isAuthorOfPublicationf905807b-c6bd-4e37-97d1-2e644fc5af62
relation.isAuthorOfPublication.latestForDiscovery0faa7141-ea10-4a10-9414-45cd7b726fef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2014_SI-CMMSE_JS_Dynamic_Roofline.pdf
Size:
1.1 MB
Format:
Adobe Portable Document Format