ImpactU - Detalle del Producto

Improving the execution time of a lattice-boltzmann kernel using the NVIDIA G80 architecture

Acceso Cerrado

Idioma: Inglés

Publicado: 01/07/2009

APC (est): No disponible

JSON

HTML

BibTeX

Abstract:

Control statements such as loops and branches poseserious challenges for their efficient utilization onGraphic Processing Units (GPUs) as those controlstatements will lead to a serialization of threads andconsequently ruin the occupancy and parallelism on GPUs. Unlike traditional central processingunits (CPUs), the GPU cannot leave the controlstatements to the CPU because fine-grain statementscheduling between GPU and CPU cannot begranted, as the GPU acts as a co-processing device.This paper analyzes the impact for using two leveltransformation techniques, namely loop/branchsplitting, which improves the register utilizationto manage the control statements on GPUs, inorder to implement the Lattice Boltzmann Method(LBM) benchmark application. Results executed inthe NVIDIA G80 architecture illustrate that thesetechniques are very efficient in term of parallelismand can lead to an increase in occupancy and adrastic improvement in performance, compared tonon-split version of the programs.

Tópico:

Lattice Boltzmann Simulation Studies

Citaciones:

Citaciones por año:

No hay datos de citaciones disponibles

Altmétricas:

Información de la Fuente:

FuenteTecnura	Cuartil año de publicaciónNo disponible	Volumen13
Issue25	Páginas5 - 13	pISSN0123-921X
ISSNNo disponible	Perfil OpenAlexhttps://openalex.org/S2737955884

Enlaces e Identificadores:

Oaipmh URL	https://repository.udistrital.edu.co/server/oai/request?verb=GetRecord&metadataPrefix=dim&identifier=oai:repository.udistrital.edu.co:11349/20476	Scholar URL	https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=info%3ATnBrzK5Fn_gJ%3Ascholar.google.com&btnG=	Dspace URL	https://repository.udistrital.edu.co/handle/11349/20476
Doi URL	https://doi.org/10.14483/22487638.6665	Openalex URL	https://openalex.org/W1488200372	Pdf URL	https://www.redalyc.org/pdf/2570/257020617002.pdf
Uri URL	http://hdl.handle.net/11349/20476

Artículo de revista