Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU
Helmholtz Zentrum Hereon, Geesthacht, Germany.;Tech Univ Munich, Munich, Germany..
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computational Science.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Numerical Analysis.ORCID iD: 0000-0001-8406-835X
2022 (English)In: HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2022 / [ed] Varbanescu, AL Bhatele, A Luszczek, P Marc, B, Springer Nature, 2022, Vol. 13289, p. 133-152Conference paper, Published paper (Refereed)
Abstract [en]

This contribution presents an efficient algorithm for resolving hanging-node constraints on the fly for high-order finite-element computations on adaptively refined meshes, using matrix-free implementations. We concentrate on unstructured hex-dominated meshes and on multi-component elements with nodal Lagrange shape functions in at least one of their components. The application of general constraints is split up into two distinct operators, one specialized in the hanging-node part and a generic one for the remaining constraints, such as Dirichlet boundary conditions. The former implements in-face interpolations efficiently by a sequence of 1D interpolations with sum factorization according to the refinement configuration of the cell. We discuss ways to efficiently encode and decode such refinement configurations. Furthermore, we present distinct differences in the interpolation step on GPU and CPU, as well as compare different vectorization strategies for the latter. Experimental comparisons with a state-of-the-art algorithm that does not exploit the tensor-product structure show that, on CPUs, the additional costs of cells with hanging-node constraints can be reduced by a factor of 5-10 for a Laplace operator evaluation with high-order elements (k = 3) and affine meshes. For non-affine meshes, the costs for the application of hanging-node constraints can be completely hidden behind the memory transfer. The algorithm has been integrated into the open-source finite-element library deal.II.

Place, publisher, year, edition, pages
Springer Nature, 2022. Vol. 13289, p. 133-152
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349
Keywords [en]
Adaptively refined meshes, Finite element methods, High order, Hanging-node constraints, Matrix-free operator evaluation, Node-level optimization, SIMD vectorization, Manycore optimizations
National Category
Computational Mathematics Computer Sciences
Identifiers
URN: urn:nbn:se:uu:diva-488240DOI: 10.1007/978-3-031-07312-0_7ISI: 000871773100007ISBN: 978-3-031-07312-0 (electronic)ISBN: 978-3-031-07311-3 (print)OAI: oai:DiVA.org:uu-488240DiVA, id: diva2:1709969
Conference
37th International Supercomputing Conference on High Performance Computing (ISC High Performance Computing), MAY 29-JUN 02, 2022, Hamburg, GERMANY
Available from: 2022-11-10 Created: 2022-11-10 Last updated: 2022-11-10Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Ljungkvist, KarlKronbichler, Martin

Search in DiVA

By author/editor
Ljungkvist, KarlKronbichler, Martin
By organisation
Computational ScienceNumerical Analysis
Computational MathematicsComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 36 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf