Scientific Computing on Multicore Architectures
Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Avdelningen för beräkningsvetenskap. Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Tillämpad beräkningsvetenskap.
2014 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Computer simulations are an indispensable tool for scientists to gain new insights about nature. Simulations of natural phenomena are usually large, and limited by the available computer resources. By using the computer resources more efficiently, larger and more detailed simulations can be performed, and more information can be extracted to help advance human knowledge.

The topic of this thesis is how to make best use of modern computers for scientific computations. The challenge here is the high level of parallelism that is required to fully utilize the multicore processors in these systems.

Starting from the basics, the primitives for synchronizing between threads are investigated. Hardware transactional memory is a new construct for this, which is evaluated for a new use of importance for scientific software: atomic updates of floating point values. The evaluation includes experiments on real hardware and comparisons against standard methods.

Higher level programming models for shared memory parallelism are then considered. The state of the art for efficient use of multicore systems is dynamically scheduled task-based systems, where tasks can depend on data. In such systems, the software is divided up into many small tasks that are scheduled asynchronously according to their data dependencies. This enables a high level of parallelism, and avoids global barriers.
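As an illustration of scheduling by data dependencies, the following minimal Python sketch infers each task's predecessors from the data it reads and writes, and lets every task wait only for those predecessors instead of a global barrier. The class `DataflowExecutor`, its `submit` signature, and the data names are hypothetical and chosen for this example; they are not the interface of the library described in the thesis.

```python
from concurrent.futures import ThreadPoolExecutor, wait


class DataflowExecutor:
    """Hypothetical sketch: tasks start once the tasks they depend on are done."""

    def __init__(self, workers=4):
        self.pool = ThreadPoolExecutor(max_workers=workers)
        self.last_write = {}  # data name -> future of the latest writer
        self.reads = {}       # data name -> futures of readers since that write

    def submit(self, fn, reads=(), writes=()):
        deps = []
        for name in reads:
            if name in self.last_write:            # read-after-write
                deps.append(self.last_write[name])
        for name in writes:
            if name in self.last_write:            # write-after-write
                deps.append(self.last_write[name])
            deps.extend(self.reads.get(name, []))  # write-after-read

        def run():
            # Simplification: the worker thread blocks on this task's own
            # predecessors. This cannot deadlock here because the pool starts
            # tasks in submission order and dependencies only point backwards.
            wait(deps)
            return fn()

        future = self.pool.submit(run)
        for name in reads:
            self.reads.setdefault(name, []).append(future)
        for name in writes:
            self.last_write[name] = future
            self.reads[name] = []
        return future


data = {}
ex = DataflowExecutor()
ex.submit(lambda: data.update(a=2), writes=["a"])
ex.submit(lambda: data.update(b=3), writes=["b"])  # independent of the first task
ex.submit(lambda: data.update(c=data["a"] * data["b"]), reads=["a", "b"], writes=["c"])
final = ex.submit(lambda: data.update(a=data["c"] + 1), reads=["c"], writes=["a"])
final.result()
ex.pool.shutdown()
```

The first two tasks may run concurrently, while the last two are ordered purely by their declared accesses. A production runtime would typically keep waiting tasks out of the worker threads rather than blocking them as this sketch does.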

A new system for managing task dependencies is developed in this thesis, based on data versioning. The system is implemented as a reusable software library, and shown to be as efficient as, or more efficient than, other shared-memory task-based systems in experimental comparisons.
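One way to picture dependency management by data versioning is sketched below in Python. Each data handle carries a version counter, each task records the version it needs of every handle it accesses, and a task becomes runnable once those versions are reached. The names `Handle`, `Task`, `submit`, and `run` are assumptions made for this illustration, and the single-threaded loop stands in for a real parallel scheduler; the abstract does not specify how the thesis library implements this.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Handle:
    name: str
    version: int = 0     # accesses completed so far
    scheduled: int = 0   # accesses registered so far
    last_write: int = 0  # value of `scheduled` right after the latest registered write


@dataclass
class Task:
    name: str
    fn: Callable
    required: list = field(default_factory=list)  # (handle, version) pairs


def submit(queue, name, fn, reads=(), writes=()):
    task = Task(name, fn)
    for h in reads:
        # A read only waits for the latest write registered before it.
        task.required.append((h, h.last_write))
        h.scheduled += 1
    for h in writes:
        # A write waits for every earlier access, reads included.
        task.required.append((h, h.scheduled))
        h.scheduled += 1
        h.last_write = h.scheduled
    queue.append(task)


def run(queue):
    order, pending = [], list(queue)
    while pending:
        # Scan from the back to show that versions, not submission order,
        # decide which task may run.
        for task in reversed(pending):
            if all(h.version >= v for h, v in task.required):
                task.fn()
                for h, _ in task.required:
                    h.version += 1  # completing an access bumps the version
                pending.remove(task)
                order.append(task.name)
                break
        else:
            raise RuntimeError("no runnable task")
    return order


data = {"x": 0, "y": 0, "z": 0}
x, y, z = Handle("x"), Handle("y"), Handle("z")
q = []
submit(q, "init_x", lambda: data.update(x=1), writes=[x])
submit(q, "copy", lambda: data.update(y=data["x"] + 1), reads=[x], writes=[y])
submit(q, "bump_x", lambda: data.update(x=data["x"] + 10), writes=[x])
submit(q, "init_z", lambda: data.update(z=5), writes=[z])
order = run(q)
```

Here `init_z` runs first even though it was submitted last, because it touches no shared handle, while the three tasks on `x` and `y` are forced into a valid order by the version checks alone. No explicit task-to-task edges are stored.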

The developed runtime system is then extended to distributed memory machines, and used for implementing a parallel version of a software package for global climate simulations. By running the optimized and parallelized version on eight servers, an equally sized problem can be solved over 100 times faster than with the original sequential version. The parallel version also allowed significantly larger problems to be solved, which were previously unreachable due to memory constraints.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2014. p. 47
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 1139
Keywords [en]
multicore, scientific computing, shared memory parallelism, task-based programming, parallel programming model, task scheduling, data versioning
Research subject
Scientific Computing
Identifiers
URN: urn:nbn:se:uu:diva-221241
ISBN: 978-91-554-8928-1 (print)
OAI: oai:DiVA.org:uu-221241
DiVA, id: diva2:708164
Public defence
2014-05-23, Room 2446, Polacksbacken, Lägerhyddsvägen 2, Uppsala, 10:15 (English)
Projects
UPMARC, eSSENCE
Available from: 2014-04-29 Created: 2014-03-26 Last updated: 2018-01-11 Bibliographically approved
List of papers
1. Using hardware transactional memory for high-performance computing
2011 (English). In: Proc. 25th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum, Piscataway, NJ: IEEE, 2011, p. 1660-1667. Conference paper, published paper (Refereed)
Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2011
Identifiers
urn:nbn:se:uu:diva-158551 (URN), 10.1109/IPDPS.2011.322 (DOI), 978-1-61284-425-1 (ISBN)
Conference
IPDPS Workshop on Multi-Threaded Architectures and Applications
Projects
eSSENCE, UPMARC
Available from: 2011-09-01 Created: 2011-09-10 Last updated: 2018-01-12 Bibliographically approved
2. SuperGlue: A shared memory framework using data versioning for dependency-aware task-based parallelization
2014 (English). Report (Other academic)
Series
Technical report / Department of Information Technology, Uppsala University, ISSN 1404-3203 ; 2014-010
Identifiers
urn:nbn:se:uu:diva-221155 (URN)
Projects
UPMARC, eSSENCE
Available from: 2014-04-03 Created: 2014-03-25 Last updated: 2018-01-11 Bibliographically approved
3. Resource-aware task scheduling
2015 (English). In: ACM Transactions on Embedded Computing Systems, ISSN 1539-9087, E-ISSN 1558-3465, Vol. 14, no. 1, p. 5:1-25, article id 5. Article in journal (Refereed), Published
Identifiers
urn:nbn:se:uu:diva-216287 (URN), 10.1145/2638554 (DOI), 000349302200005 ()
Projects
UPMARC, eSSENCE
Available from: 2015-01-21 Created: 2014-01-20 Last updated: 2018-01-11 Bibliographically approved
4. A task parallel implementation of an RBF-generated finite difference method for the shallow water equations on the sphere
2014 (English). Report (Other academic)
Series
Technical report / Department of Information Technology, Uppsala University, ISSN 1404-3203 ; 2014-011
Identifiers
urn:nbn:se:uu:diva-221156 (URN)
Projects
eSSENCE, UPMARC
Available from: 2014-04-03 Created: 2014-03-25 Last updated: 2018-01-11 Bibliographically approved
5. Programming models based on data versioning for dependency-aware task-based parallelisation
2012 (English). In: Proc. 15th International Conference on Computational Science and Engineering, Los Alamitos, CA: IEEE Computer Society, 2012, p. 275-280. Conference paper, published paper (Refereed)
Place, publisher, year, edition, pages
Los Alamitos, CA: IEEE Computer Society, 2012
Identifiers
urn:nbn:se:uu:diva-187446 (URN), 10.1109/ICCSE.2012.45 (DOI), 000317475000038 (), 978-1-4673-5165-2 (ISBN)
Conference
CSE 2012
Projects
eSSENCE, UPMARC
Available from: 2013-01-24 Created: 2012-12-06 Last updated: 2018-01-16 Bibliographically approved

Open Access in DiVA

Full text: FULLTEXT01.pdf (523 kB), 471 downloads
Checksum (SHA-512):
51e6dcd20512408555f5faed7c43c3478c5ab2b377e21ce5d012be180135a627fb9308e0533970a3fb10407e43913a1dc5bcd12cad95d45eaf9bfaaf3e0a5a17
Type: fulltext. MIME type: application/pdf

Author

Tillenius, Martin