uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Analyzing the impact of data compression in Hive
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2014 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

Executing expensive queries over many large tables can be prohibitively time consuming in conventional relational databases. Hadoop and its data warehouse Hive is a powerful alternative for large scale data processing. Conventionally, data is stored in Hive without compression. There is value in storing the data with compression, if the overhead of compression does not negatively impact the query processing time. This paper describes through experiments using imports, transformations and exports of Hive data in various file formats and with different compression techniques how this can be achieved.

Place, publisher, year, edition, pages
2014. , 36 p.
Series
IT, 14074
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:uu:diva-269235OAI: oai:DiVA.org:uu-269235DiVA: diva2:882559
Educational program
Bachelor Programme in Computer Science
Supervisors
Examiners
Available from: 2015-12-15 Created: 2015-12-15 Last updated: 2016-02-11Bibliographically approved

Open Access in DiVA

fulltext(733 kB)432 downloads
File information
File name FULLTEXT01.pdfFile size 733 kBChecksum SHA-512
cfa7566fc636a2092f9ea2d9dc2319886d7a62f6adaa827aaa63d4e1a13db543763e5821e1542ccc39797acacd6af94d1459173303252b79d41c5651f1e81de7
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 432 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 2149 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf