Using an extended Roofline Model to understand data and thread affinities on NUMA systems
Loading...
Identifiers
Publication date
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Universidad de Granada
Abstract
Today’s microprocessors include multicores that feature a diverse set of compute cores and onboard memory subsystems connected by complex communication networks and protocols. The analysis of factors that affect performance in such complex systems is far from being an easy task. Anyway, it is clear that increasing data locality and affinity is one of the main challenges to reduce the access latency to data. As the number of cores increases, the influence of this issue on the performance of parallel codes is more and more important. Therefore, models to characterize the performance in such systems are broadly demanded. This paper shows the use of an extension of the well known Roofline Model adapted to the main features of the memory hierarchy present in most of the current multicore systems. Also the Roofline Model was extended to show the dynamic evolution of the execution of a given code. In order to reduce the overheads to get the information needed to obtain this dynamic Roofline Model, hardware counters present in most of the current microprocessors are used. To illustrate its use, two simple parallel vector operations, SAXPY and SDOT, were considered. Different access strides and initial location of vectors in memory modules were used to show the influence of different scenarios in terms of locality and affinity. The effect of thread migration were also considered. We conclude that the proposed Roofline Model is an useful tool to understand and characterise the behaviour of the execution of parallel codes in multicore systems
Description
Keywords
Bibliographic citation
García Lorenzo, O., Pena, T. F., Cabaleiro, J.C., Pichel, J.C. and Fernández Rivera, F. (2014). Using an extended Roofline Model to understand data and thread affinities on NUMA systems. Annals of Multicore and GPU Programming, v. 1, n. 1, pp. 37-48
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Publisher version
http://revistaseug.ugr.es/index.php/amgp/article/view/1992Sponsors
This work has been partially supported by the Ministry of
Education and Science of Spain, FEDER funds under contract
TIN 2010-17541, and Xunta de Galicia, EM2013/041. It has
been developed in the framework of the European network
HiPEAC and the Spanish network CAPAP-H
Rights
This work is licensed under a Creative Commons Attribution 3.0 License








