Study of basic vector operations on Intel Xeon Phi and NVIDIA Tesla using OpenCL

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Informacióngl
dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computacióngl
dc.contributor.areaÁrea de Enxeñaría e Arquitectura
dc.contributor.authorCoronado Barrientos, Edoardo
dc.contributor.authorIndalecio Fernández, Guillermo
dc.contributor.authorGarcía Loureiro, Antonio Jesús
dc.date.accessioned2018-11-13T08:16:47Z
dc.date.available2018-11-13T08:16:47Z
dc.date.issued2015
dc.description.abstractThe present work is an analysis of the performance of the basic vector operations AXPY, DOT and SpMV using OpenCL. The code was tested on the NVIDIA Tesla S2050 GPU and Intel Xeon Phi 3120A coprocessor. Due to the nature of the AXPY function, only two versions were implemented, the routine to be executed by the CPU and the kernel to be executed on the previously mentioned devices. It was studied how they perform for different vector’s sizes. Their results show the NVIDIA architecture better suited for the smaller vectors sizes and the Intel architecture for the larger vector’s sizes. For the DOT and SpMV functions, there are three versions implemented. The first is the CPU routine, the second one is an OpenCL kernel that uses local memory and the third one is an OpenCL kernel that only uses global memory. The kernels that use local memory are tested by varying the size of the work-group; the kernels that only uses global memory are tested by varying the arrays size. In the case of the first ones, the results show the optimum work-group size and that the NVIDIA architecture benefits from the use of local memory. For the latter kernels, the results show that larger computational loads benefits the Intel architecturegl
dc.description.peerreviewedSIgl
dc.description.sponsorshipThis work has been supported by FEDER funds and Xunta de Galicia under contract GRC 2014/008, and by Spanish Government (MCYT) under project TEC2010-17320 and TIN-2013-41129-Pgl
dc.identifier.citationCoronado Barrientos, E., Indalecio Fernández, G. and García Loureiro, A. (2015). Study of basic vector operations on Intel Xeon Phi and NVIDIA Tesla using OpenCL, v. 2, n. 1, pp. 26-40gl
dc.identifier.issn2341-3158
dc.identifier.urihttp://hdl.handle.net/10347/17698
dc.language.isoenggl
dc.publisherUniversidad de Granadagl
dc.relation.projectIDinfo:eu-repo/grantAgreement/MICINN/Plan Nacional de I+D+i 2008-2011/TEC2010-17320/ES/DESARROLLO DE HERRAMIENTAS PARA EL MODELADO Y SIMULACION DE DISPOSITIVOS SEMICONDUCTORES AVANZADOS: APLICACION AL ESTUDIO DE FLUCTUACIONES DE PARAMETROS INTRINSECOS
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN-2013-41129-P/ES/SOLUCIONES HARDWARE Y SOFTWARE PARA LA COMPUTACION DE ALTAS PRESTACIONES
dc.relation.publisherversionhttp://revistaseug.ugr.es/index.php/amgp/article/view/3056gl
dc.rightsThis work is licensed under a Creative Commons Attribution 3.0 Licensegl
dc.rights.accessRightsopen accessgl
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/
dc.subjectNvidia Teslagl
dc.subjectIntel Xeon Phigl
dc.subjectOpenCLgl
dc.subjectKernelgl
dc.subjectVector operationsgl
dc.subjectAXPYgl
dc.subjectDOTgl
dc.subjectSPMVgl
dc.titleStudy of basic vector operations on Intel Xeon Phi and NVIDIA Tesla using OpenCLgl
dc.typejournal articlegl
dc.type.hasVersionVoRgl
dspace.entity.typePublication
relation.isAuthorOfPublication67acc331-d835-4cbb-9789-f7eebbcc253d
relation.isAuthorOfPublication7c94bda5-3924-4484-9121-f327b8d2962c
relation.isAuthorOfPublication.latestForDiscovery67acc331-d835-4cbb-9789-f7eebbcc253d

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2015_amgp_coronado_study.pdf
Size:
1.41 MB
Format:
Adobe Portable Document Format
Description: