uu.seUppsala University Publications
Change search
ReferencesLink to record
Permanent link

Direct link
Application Task and Data Placement in Embedded Multi-core NUMA Architectures: Optimization techniques for the Samsung 16-SRP
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2013 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The evolution of microprocessors  has lead to a situation where more memory is integrated closer to the computational cores. This has created architectures where memory latencies vary depending on the calling cores location. Such architectures are referred to as Non-Uniform Memory Access (NUMA) architectures. This adds further complexity to the already complex environment of developing parallel applications.

In this paper I research effective task and data placement optimization techniques for a Samsung Multi-Processor System-on-Chip (MPSoC) prototype. The research was structured  by first conducting a series of extreme case micro benchmarks to gain insight of hardware behavior. These insights was then used to optimize two applications from the imaging domain; a 2D image blurring application and a 3D Seeded Region Growing (SRG) application.

The results from conducted benchmarks show that a wide range of factors are of importance when optimizing applications for the Samsung 16-SRP architec- ture. Although NUMA penalties exists, reducing congestion at the memory controllers and in the DMA channels are of importance to overall execution time. I propose task and data distribution schemes that work well for benchmarks with static and dynamic workloads. Clustered hierarchical work queues with work stealing have shown to be an effective approach to optimizing applications with a dynamic workload.

For future research it would be interesting to run further micro benchmarks of the system under congestion. To gain further verification of suggested task and data distribution schemes suggested in this thesis it would be of interest to apply them to more applications.

Place, publisher, year, edition, pages
UPTEC IT, ISSN 1401-5749 ; 13 006
National Category
Engineering and Technology
URN: urn:nbn:se:uu:diva-199614OAI: oai:DiVA.org:uu-199614DiVA: diva2:620342
Educational program
Master of Science Programme in Information Technology Engineering
Available from: 2013-05-08 Created: 2013-05-08 Last updated: 2013-05-08Bibliographically approved

Open Access in DiVA

fulltext(13668 kB)526 downloads
File information
File name FULLTEXT01.pdfFile size 13668 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 526 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 221 hits
ReferencesLink to record
Permanent link

Direct link