Genomics in the Cloud
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2021 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

The continued cost reduction for sequencing genomics data is causing an exponential growth in the amount of data available. Moving both storage and computation of this data to the cloud has been a common trend, but the way to do it is not always obvious. This report compares three alternatives for running ad-hoc queries in a cloud-based setting: two solutions using data lakes and one solution using a relational database hosted in the cloud. The data lake solutions proved easy to set up and fully functional for querying genomics data. The relational database was more complicated to set up, but the queries were more time-efficient and more cost-efficient when performing more than 1200 queries per month on at least 100 GB of data. To make cloud computing possible for genomics data, the data had to be transformed into a file format supported by the cloud providers. For this purpose the Parquet file format was chosen, tested, and proven to work well.

Place, publisher, year, edition, pages
2021, p. 52
Series
UPTEC IT, ISSN 1401-5749 ; 21030
Keywords [en]
Cloud, IT, Genomics, GCP, AWS
National Category
Identifiers
URN: urn:nbn:se:uu:diva-453802
OAI: oai:DiVA.org:uu-453802
DiVA, id: diva2:1596566
External cooperation
Data Ductus
Educational program
Master of Science Programme in Information Technology Engineering
Supervisor
Examiner
Available from: 2021-09-23. Created: 2021-09-22. Last updated: 2021-09-23. Bibliographically approved.

Open Access in DiVA

fulltext (506 kB), 819 downloads
File information
File: FULLTEXT01.pdf. File size: 506 kB. Checksum: SHA-512
b3288b561f978a8f5296421494d536756caf90827aa1fc35c57eb8a840b0e87c9d9d74d53198bd803c8fc8910650e35c57d02e3f3b97fe6fc9632f1d2b8ea381
Type: fulltext. Mimetype: application/pdf
