Clustering of DNA sequence reads from repeat regions using defined nucleotide positions (DNPs)
Student paper other, 20 credits / 30 HE creditsStudent thesis
Sequencing genomes with a high frequency of repeat regions is a difficult task.The aim of the project was to develop an algorithm to speed up the sequencing process of highly repetitive genome. By using specific differences between the repeats called defined nucleotide positions (DNPs), cluster DNA sequence reads into contigs. The strategy used in the development of the algorithm resulted in a quite complex algorithm. Test runs of the algorithm showed that there is still work to be done to get a desirable result.
Place, publisher, year, edition, pages
2010. , 45 p.
DNP, repeat regions, SolidClusters, algorithm
IdentifiersURN: urn:nbn:se:uu:diva-137220OAI: oai:DiVA.org:uu-137220DiVA: diva2:377934