uu.seUppsala universitets publikasjoner
Endre søk
Link to record
Permanent link

Direct link
BETA
Ljungkvist, Karl
Publikasjoner (7 av 7) Visa alla publikasjoner
Kronbichler, M. & Ljungkvist, K. (2019). Multigrid for matrix-free high-order finite element computations on graphics processors. ACM Transactions on Parallel Computing, 6(1), 2:1-32, Article ID 2.
Åpne denne publikasjonen i ny fane eller vindu >>Multigrid for matrix-free high-order finite element computations on graphics processors
2019 (engelsk)Inngår i: ACM Transactions on Parallel Computing, ISSN 2329-4949, Vol. 6, nr 1, s. 2:1-32, artikkel-id 2Artikkel i tidsskrift (Fagfellevurdert) Published
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-390587 (URN)10.1145/3322813 (DOI)000472838200002 ()
Prosjekter
UPMARCeSSENCE
Tilgjengelig fra: 2019-06-24 Laget: 2019-08-13 Sist oppdatert: 2019-08-30bibliografisk kontrollert
Ljungkvist, K. (2017). Matrix-free finite-element computations on graphics processors with adaptively refined unstructured meshes. In: Proc. 25th High Performance Computing Symposium: . Paper presented at HPC 2017, April 23–26, Virginia Beach, VA (pp. 1-12). San Diego, CA: The Society for Modeling and Simulation International
Åpne denne publikasjonen i ny fane eller vindu >>Matrix-free finite-element computations on graphics processors with adaptively refined unstructured meshes
2017 (engelsk)Inngår i: Proc. 25th High Performance Computing Symposium, San Diego, CA: The Society for Modeling and Simulation International, 2017, s. 1-12Konferansepaper, Publicerat paper (Fagfellevurdert)
sted, utgiver, år, opplag, sider
San Diego, CA: The Society for Modeling and Simulation International, 2017
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-320146 (URN)978-1-5108-3822-2 (ISBN)
Konferanse
HPC 2017, April 23–26, Virginia Beach, VA
Prosjekter
UPMARC
Tilgjengelig fra: 2017-04-23 Laget: 2017-04-16 Sist oppdatert: 2018-01-13bibliografisk kontrollert
Ljungkvist, K. & Kronbichler, M. (2017). Multigrid for matrix-free finite element computations on graphics processors.
Åpne denne publikasjonen i ny fane eller vindu >>Multigrid for matrix-free finite element computations on graphics processors
2017 (engelsk)Rapport (Annet vitenskapelig)
Serie
Technical report / Department of Information Technology, Uppsala University, ISSN 1404-3203 ; 2017-006
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-320073 (URN)
Prosjekter
UPMARCeSSENCE
Tilgjengelig fra: 2017-04-20 Laget: 2017-04-13 Sist oppdatert: 2018-01-13bibliografisk kontrollert
Ljungkvist, K. (2015). Techniques for finite element methods on modern processors. (Licentiate dissertation). Uppsala University
Åpne denne publikasjonen i ny fane eller vindu >>Techniques for finite element methods on modern processors
2015 (engelsk)Licentiatavhandling, med artikler (Annet vitenskapelig)
Abstract [en]

In this thesis, methods for efficient utilization of modern computer hardware for numerical simulation are considered. In particular, we study techniques for speeding up the execution of finite-element methods.

One of the greatest challenges in finite-element computation is how to efficiently perform the the system matrix assembly efficiently in parallel, due to its complicated memory access pattern. The main difficulty lies in the fact that many entries of the matrix are being updated concurrently by several parallel threads. We consider transactional memory, an exotic hardware feature for concurrent update of shared variables, and conduct benchmarks on a prototype processor supporting it. Our experiments show that transactions can both simplify programming and provide good performance for concurrent updates of floating point data.

Furthermore, we study a matrix-free approach to finite-element computation which avoids the matrix assembly. Motivated by its computational properties, we implement the matrix-free method for execution on graphics processors, using either atomic updates or a mesh coloring approach to handle the concurrent updates. A performance study shows that on the GPU, the matrix-free method is faster than a matrix-based implementation for many element types, and allows for solution of considerably larger problems. This suggests that the matrix-free method can speed up execution of large realistic simulations.

sted, utgiver, år, opplag, sider
Uppsala University, 2015
Serie
IT licentiate theses / Uppsala University, Department of Information Technology, ISSN 1404-5117 ; 2015-001
HSV kategori
Forskningsprogram
Beräkningsvetenskap
Identifikatorer
urn:nbn:se:uu:diva-242186 (URN)
Veileder
Prosjekter
UPMARCeSSENCE
Tilgjengelig fra: 2015-01-18 Laget: 2015-01-22 Sist oppdatert: 2018-01-11bibliografisk kontrollert
Ljungkvist, K. (2014). Matrix-free finite-element operator application on graphics processing units. In: Euro-Par 2014: Parallel Processing Workshops, Part II. Paper presented at 7th Workshop on Unconventional High-Performance Computing (pp. 450-461). Springer
Åpne denne publikasjonen i ny fane eller vindu >>Matrix-free finite-element operator application on graphics processing units
2014 (engelsk)Inngår i: Euro-Par 2014: Parallel Processing Workshops, Part II, Springer, 2014, s. 450-461Konferansepaper, Publicerat paper (Fagfellevurdert)
sted, utgiver, år, opplag, sider
Springer, 2014
Serie
Lecture Notes in Computer Science ; 8806
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-238380 (URN)10.1007/978-3-319-14313-2_38 (DOI)000354785000038 ()978-3-319-14312-5 (ISBN)
Konferanse
7th Workshop on Unconventional High-Performance Computing
Prosjekter
UPMARCeSSENCE
Tilgjengelig fra: 2014-12-11 Laget: 2014-12-11 Sist oppdatert: 2018-01-11bibliografisk kontrollert
Ljungkvist, K., Tillenius, M., Black-Schaffer, D., Holmgren, S., Karlsson, M. & Larsson, E. (2011). Using hardware transactional memory for high-performance computing. In: Proc. 25th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum. Paper presented at IPDPS Workshop on Multi-Threaded Architectures and Applications (pp. 1660-1667). Piscataway, NJ: IEEE
Åpne denne publikasjonen i ny fane eller vindu >>Using hardware transactional memory for high-performance computing
Vise andre…
2011 (engelsk)Inngår i: Proc. 25th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum, Piscataway, NJ: IEEE , 2011, s. 1660-1667Konferansepaper, Publicerat paper (Fagfellevurdert)
sted, utgiver, år, opplag, sider
Piscataway, NJ: IEEE, 2011
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-158551 (URN)10.1109/IPDPS.2011.322 (DOI)978-1-61284-425-1 (ISBN)
Konferanse
IPDPS Workshop on Multi-Threaded Architectures and Applications
Prosjekter
eSSENCEUPMARC
Tilgjengelig fra: 2011-09-01 Laget: 2011-09-10 Sist oppdatert: 2018-01-12bibliografisk kontrollert
Ljungkvist, K., Tillenius, M., Holmgren, S., Karlsson, M. & Larsson, E. (2010). Early results using hardware transactional memory for high-performance computing applications. In: Proc. 3rd Swedish Workshop on Multi-Core Computing (pp. 93-97). Göteborg, Sweden: Chalmers University of Technology
Åpne denne publikasjonen i ny fane eller vindu >>Early results using hardware transactional memory for high-performance computing applications
Vise andre…
2010 (engelsk)Inngår i: Proc. 3rd Swedish Workshop on Multi-Core Computing, Göteborg, Sweden: Chalmers University of Technology , 2010, s. 93-97Konferansepaper, Publicerat paper (Annet vitenskapelig)
sted, utgiver, år, opplag, sider
Göteborg, Sweden: Chalmers University of Technology, 2010
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-134615 (URN)
Prosjekter
eSSENCEUPMARC
Tilgjengelig fra: 2010-11-18 Laget: 2010-11-29 Sist oppdatert: 2018-01-12bibliografisk kontrollert
Organisasjoner