High performance FDTD code implementation for GPGPU supercomputers

Zakirov A.V.; Levchenko V. D.; Perepelkina A.Y.; Zempo Yasunari

Abstract:

An implementation of FDTD (Finite Difference Time Domain) method for solution of optical and other electrodynamic problems of high computational cost is described. The implementation is based on LRnLA (Locally Recursive non-Locally Asynchronous) algorithm DiamondTorre, which is developed specifically for GPGPU (General Purpose Graphical Processing Unit) hardware. The specifics of the DiamondTorre algorithms for staggered grid (Yee cell) and many-GPU devices are shown. The algorithm is implemented in software for real physics calculation. The software performance is estimated through algorithms parameters and computer model. The real performance is tested on one GPU device, as well as on many-GPU cluster. The performance of up to 0.65・10¹² cell updates per second for 3D domain with 0.3・10¹² Yee cells total is achieved.

Keywords:

LRnLA algorithms, GPU, CUDA, FDTD, supercomputer

Publication language: english, pages: 22

Research direction:

Programming, parallel computing, multimedia

English source text:

List of publications citation:

Export link to publication in format:

View statistics (updated once a day)
over the last 30 days — 12 (-6), total hit from 01.09.2019 — 786

About authors:

Zakirov Andrey Vladimirovich, , Кинтех Лаб

Levchenko Vadim Dmitrievich, , ,

Perepelkina Anastasia Yurievna, , ,

Zempo Yasunari, , Университет Хосей, Токио