Matrix-free finite-element computations on graphics processors with adaptively refined unstructured meshes
2017 (English)In: Proc. 25th High Performance Computing Symposium, San Diego, CA: The Society for Modeling and Simulation International, 2017Conference paper (Refereed)
This paper concerns efficient matrix-free finite-element algorithms on modern manycore processors such as graphics cards (GPUs) as an alternative to sparse matrix-vector products. In matrix-free finite element algorithms, the assembly and solution phases are merged, yielding a significantly lower memory bandwidth footprint, with a corresponding increase in efficiency on bandwidth limited processors. Additionally, no system matrix must be assembled or stored in memory.
We present a GPU parallelization of the matrix-free method including a novel algorithm for resolving hanging-node constraints on the GPU, capable of simulation on adaptively refined grids. For second-order elements and higher in 3D, our GPU implementation of the adaptive algorithm is between 1.8 and 2.3 times faster than an existing optimized CPU version, on comparable hardware. Compared to a matrix-based implementation using CUSPARSE, we get a speedup of 8 and can solve problems 8 times larger in 3D.
Place, publisher, year, edition, pages
San Diego, CA: The Society for Modeling and Simulation International, 2017.
Computer Science Computational Mathematics
IdentifiersURN: urn:nbn:se:uu:diva-320146OAI: oai:DiVA.org:uu-320146DiVA: diva2:1088816
HPC 2017, April 23–26, Virginia Beach, VA