uu.seUppsala universitets publikationer
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
BETA
Ljungkvist, Karl
Publikationer (7 of 7) Visa alla publikationer
Kronbichler, M. & Ljungkvist, K. (2019). Multigrid for matrix-free high-order finite element computations on graphics processors. ACM Transactions on Parallel Computing, 6(1), 2:1-32, Article ID 2.
Öppna denna publikation i ny flik eller fönster >>Multigrid for matrix-free high-order finite element computations on graphics processors
2019 (Engelska)Ingår i: ACM Transactions on Parallel Computing, ISSN 2329-4949, Vol. 6, nr 1, s. 2:1-32, artikel-id 2Artikel i tidskrift (Refereegranskat) Published
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Identifikatorer
urn:nbn:se:uu:diva-390587 (URN)10.1145/3322813 (DOI)000472838200002 ()
Projekt
UPMARCeSSENCE
Tillgänglig från: 2019-06-24 Skapad: 2019-08-13 Senast uppdaterad: 2019-08-30Bibliografiskt granskad
Ljungkvist, K. (2017). Matrix-free finite-element computations on graphics processors with adaptively refined unstructured meshes. In: Proc. 25th High Performance Computing Symposium: . Paper presented at HPC 2017, April 23–26, Virginia Beach, VA (pp. 1-12). San Diego, CA: The Society for Modeling and Simulation International
Öppna denna publikation i ny flik eller fönster >>Matrix-free finite-element computations on graphics processors with adaptively refined unstructured meshes
2017 (Engelska)Ingår i: Proc. 25th High Performance Computing Symposium, San Diego, CA: The Society for Modeling and Simulation International, 2017, s. 1-12Konferensbidrag, Publicerat paper (Refereegranskat)
Ort, förlag, år, upplaga, sidor
San Diego, CA: The Society for Modeling and Simulation International, 2017
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Identifikatorer
urn:nbn:se:uu:diva-320146 (URN)978-1-5108-3822-2 (ISBN)
Konferens
HPC 2017, April 23–26, Virginia Beach, VA
Projekt
UPMARC
Tillgänglig från: 2017-04-23 Skapad: 2017-04-16 Senast uppdaterad: 2018-01-13Bibliografiskt granskad
Ljungkvist, K. & Kronbichler, M. (2017). Multigrid for matrix-free finite element computations on graphics processors.
Öppna denna publikation i ny flik eller fönster >>Multigrid for matrix-free finite element computations on graphics processors
2017 (Engelska)Rapport (Övrigt vetenskapligt)
Serie
Technical report / Department of Information Technology, Uppsala University, ISSN 1404-3203 ; 2017-006
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Identifikatorer
urn:nbn:se:uu:diva-320073 (URN)
Projekt
UPMARCeSSENCE
Tillgänglig från: 2017-04-20 Skapad: 2017-04-13 Senast uppdaterad: 2018-01-13Bibliografiskt granskad
Ljungkvist, K. (2015). Techniques for finite element methods on modern processors. (Licentiate dissertation). Uppsala University
Öppna denna publikation i ny flik eller fönster >>Techniques for finite element methods on modern processors
2015 (Engelska)Licentiatavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

In this thesis, methods for efficient utilization of modern computer hardware for numerical simulation are considered. In particular, we study techniques for speeding up the execution of finite-element methods.

One of the greatest challenges in finite-element computation is how to efficiently perform the the system matrix assembly efficiently in parallel, due to its complicated memory access pattern. The main difficulty lies in the fact that many entries of the matrix are being updated concurrently by several parallel threads. We consider transactional memory, an exotic hardware feature for concurrent update of shared variables, and conduct benchmarks on a prototype processor supporting it. Our experiments show that transactions can both simplify programming and provide good performance for concurrent updates of floating point data.

Furthermore, we study a matrix-free approach to finite-element computation which avoids the matrix assembly. Motivated by its computational properties, we implement the matrix-free method for execution on graphics processors, using either atomic updates or a mesh coloring approach to handle the concurrent updates. A performance study shows that on the GPU, the matrix-free method is faster than a matrix-based implementation for many element types, and allows for solution of considerably larger problems. This suggests that the matrix-free method can speed up execution of large realistic simulations.

Ort, förlag, år, upplaga, sidor
Uppsala University, 2015
Serie
IT licentiate theses / Uppsala University, Department of Information Technology, ISSN 1404-5117 ; 2015-001
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Forskningsämne
Beräkningsvetenskap
Identifikatorer
urn:nbn:se:uu:diva-242186 (URN)
Handledare
Projekt
UPMARCeSSENCE
Tillgänglig från: 2015-01-18 Skapad: 2015-01-22 Senast uppdaterad: 2018-01-11Bibliografiskt granskad
Ljungkvist, K. (2014). Matrix-free finite-element operator application on graphics processing units. In: Euro-Par 2014: Parallel Processing Workshops, Part II. Paper presented at 7th Workshop on Unconventional High-Performance Computing (pp. 450-461). Springer
Öppna denna publikation i ny flik eller fönster >>Matrix-free finite-element operator application on graphics processing units
2014 (Engelska)Ingår i: Euro-Par 2014: Parallel Processing Workshops, Part II, Springer, 2014, s. 450-461Konferensbidrag, Publicerat paper (Refereegranskat)
Ort, förlag, år, upplaga, sidor
Springer, 2014
Serie
Lecture Notes in Computer Science ; 8806
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Identifikatorer
urn:nbn:se:uu:diva-238380 (URN)10.1007/978-3-319-14313-2_38 (DOI)000354785000038 ()978-3-319-14312-5 (ISBN)
Konferens
7th Workshop on Unconventional High-Performance Computing
Projekt
UPMARCeSSENCE
Tillgänglig från: 2014-12-11 Skapad: 2014-12-11 Senast uppdaterad: 2018-01-11Bibliografiskt granskad
Ljungkvist, K., Tillenius, M., Black-Schaffer, D., Holmgren, S., Karlsson, M. & Larsson, E. (2011). Using hardware transactional memory for high-performance computing. In: Proc. 25th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum. Paper presented at IPDPS Workshop on Multi-Threaded Architectures and Applications (pp. 1660-1667). Piscataway, NJ: IEEE
Öppna denna publikation i ny flik eller fönster >>Using hardware transactional memory for high-performance computing
Visa övriga...
2011 (Engelska)Ingår i: Proc. 25th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum, Piscataway, NJ: IEEE , 2011, s. 1660-1667Konferensbidrag, Publicerat paper (Refereegranskat)
Ort, förlag, år, upplaga, sidor
Piscataway, NJ: IEEE, 2011
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:uu:diva-158551 (URN)10.1109/IPDPS.2011.322 (DOI)978-1-61284-425-1 (ISBN)
Konferens
IPDPS Workshop on Multi-Threaded Architectures and Applications
Projekt
eSSENCEUPMARC
Tillgänglig från: 2011-09-01 Skapad: 2011-09-10 Senast uppdaterad: 2018-01-12Bibliografiskt granskad
Ljungkvist, K., Tillenius, M., Holmgren, S., Karlsson, M. & Larsson, E. (2010). Early results using hardware transactional memory for high-performance computing applications. In: Proc. 3rd Swedish Workshop on Multi-Core Computing (pp. 93-97). Göteborg, Sweden: Chalmers University of Technology
Öppna denna publikation i ny flik eller fönster >>Early results using hardware transactional memory for high-performance computing applications
Visa övriga...
2010 (Engelska)Ingår i: Proc. 3rd Swedish Workshop on Multi-Core Computing, Göteborg, Sweden: Chalmers University of Technology , 2010, s. 93-97Konferensbidrag, Publicerat paper (Övrigt vetenskapligt)
Ort, förlag, år, upplaga, sidor
Göteborg, Sweden: Chalmers University of Technology, 2010
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:uu:diva-134615 (URN)
Projekt
eSSENCEUPMARC
Tillgänglig från: 2010-11-18 Skapad: 2010-11-29 Senast uppdaterad: 2018-01-12Bibliografiskt granskad
Organisationer

Sök vidare i DiVA

Visa alla publikationer