Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Tools and considerations for mitochondrial haplogroup assignment
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Immunology, Genetics and Pathology, Medicinsk genetik och genomik. Armed Forces DNA Identification Laboratory, Dover, Delaware, USA.ORCID iD: 0000-0001-6857-923X
Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria.
Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria; .
Armed Forces DNA Identification Laboratory, Dover, Delaware, USA.
Show others and affiliations
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Phylogenetic information can be used to infer maternal biogeographic ancestry and provide a valuable quality control (QC) check of mitochondrial DNA (mtDNA) data, identifying errors in the data such as artificial recombination and phantom mutations. Additionally, the phylogeny guides mtDNA nomenclature, which is crucial for forensic searches. This study compared three haplogrouping tools and their ability to provide proper guidance based on the predicted phylogeny. The Mitochondrial Haplogrouper tool of the AFDIL-QIAGEN mtDNA Expert (AQME), a plug-in for the CLC Genomics Workbench, and the web-based tools HaploGrep2 and EMPOP were included in the evaluation. Haplogroups were determined for 92 diverse mtDNA haplotypes by the three tools based on four regions: the entire mitochondrial genome (mitogenome), control region (CR), hypervariable segment 1 and 2 (HVS1-2) regions, and HVS1 only. There were only two differences (out of 92) between all three tools when using the mitogenome, and in these instances the haplogroups were less precise by one or two nodes. Haplogroup assignments for the CR and HVS1-2 were similar; though less precise haplogroups resulted for these HVS1-2 haplotypes compared to the CR due to haplogroup-diagnostic mutations outside the queried region. As an important QC aspect, the comparison between the haplogroup predictions of authentic and artificial haplotypes showed that it is possible to identify recombinant mitogenome haplotypes from two large (~8500-bp) amplicons, but the differentiation is more difficult with HVS1-2 “artificial” haplotypes. Overall, the tools performed similarly, but EMPOP’s SAM2 produced more precise haplogroup predictions than AQME and HaploGrep2 across all haplogroups and regions. An important consideration when using any haplogrouping tool is that the haplogroup identified is only a prediction, particularly when based on smaller regions. All three tools provided valuable phylogenetic information to enable QC of mtDNA data, but critical review of the predicted haplogroup may be required for certain applications.

Keywords [en]
Mitochondrial DNA, haplogroup, quality control, artificial recombination
National Category
Genetics and Genomics Evolutionary Biology Forensic Science
Research subject
Biology with Specialisation in Human Evolution and Genetics; Biology with specialization in Molecular Biology
Identifiers
URN: urn:nbn:se:uu:diva-470689OAI: oai:DiVA.org:uu-470689DiVA, id: diva2:1647794
Available from: 2022-03-28 Created: 2022-03-28 Last updated: 2025-02-01
In thesis
1. Mitochondrial Genome Analysis Using Next Generation Sequencing for Forensic Applications
Open this publication in new window or tab >>Mitochondrial Genome Analysis Using Next Generation Sequencing for Forensic Applications
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Mitochondrial DNA (mtDNA) analysis plays a specialized role in forensic applications, overcoming certain limitations of autosomal DNA markers. The high copy number and uniparental inheritance pattern of mtDNA are advantageous in cases involving shed hairs and aged skeletal elements, especially decades-old missing persons cases. Though the discriminatory power of mtDNA is limited by common haplotypes, next generation sequencing (NGS) offers feasible access to entire mitochondrial genome (mitogenome) data that can provide increased resolution of common haplotypes to unique sequences. The primary implementation challenge of mitogenome analysis is a lack of forensic-quality reference data, which are required to determine the evidentiary weight of a match. A better understanding of NGS methods and data analysis is also necessary to ensure the generation of reliable mitogenome data. Furthermore, appropriate quality control (QC) measures must be established as analysis can be complicated by nuclear mtDNA segments (NUMTs), misalignment of homopolymer regions, sequencing errors, and other artefacts. Including such false variants in mtDNA haplotypes can lead to erroneous conclusions based on misinterpreted data.

This thesis aimed to address the implementation challenges of mitogenome analysis and facilitate the transition to NGS in forensic laboratories. Paper I assessed the feasibility of generating forensic-quality mitogenome data from whole genome sequencing (WGS) data, which are valuable sources of mitogenome haplotypes for population studies.  Due to NUMT interference, a 10% variant frequency threshold was necessary to produce haplotypes consistent with high-quality mitogenome datasets. Since length heteroplasmy (LHP) can also complicate mtDNA data analysis, Paper II characterized LHP in data generated on two NGS platforms as well as with Sanger-type sequencing. Different patterns of LHP were observed across sequencing technologies, further supporting current guidelines to ignore LHP in database searches and match comparisons in forensic analyses. Phylogenetic information can provide a valuable QC check of mtDNA data, identifying errors like artificial recombination and phantom mutations. Therefore, three haplogrouping tools were examined in Paper III, comparing their ability to predict an accurate haplogroup based on different mitogenome target ranges. The tools performed similarly, but EMPOP’s SAM2 algorithm produced more precise haplogroup predictions than the other two tools regardless of phylogeny or interpretation range. Building upon the previous three studies, Paper IV characterized 934 forensic-quality Swedish mitogenomes from a population genetics perspective. The complete mitogenome data demonstrated high haplotype diversity (0.9996) with a random match probability of 0.15%. In summary, these papers combine important insights to facilitate the application of mitogenome NGS analysis in forensic laboratories.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2022. p. 73
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Medicine, ISSN 1651-6206 ; 1827
Keywords
Mitochondrial DNA, next generation sequencing, massively parallel sequencing, forensic genetics, nuclear mitochondrial DNA segment (NUMT), length heteroplasmy, haplogroup, quality control, population data
National Category
Forensic Science Genetics and Genomics Biochemistry Molecular Biology
Research subject
Biology with specialization in Molecular Biology
Identifiers
urn:nbn:se:uu:diva-470324 (URN)978-91-513-1455-6 (ISBN)
Public defence
2022-05-19, Room C8:321, Biomedicinskt centrum (BMC), Husargatan 3, Uppsala, 13:00 (English)
Opponent
Supervisors
Available from: 2022-04-28 Created: 2022-03-28 Last updated: 2025-02-20

Open Access in DiVA

No full text in DiVA

Authority records

Allen, Marie

Search in DiVA

By author/editor
Sturk-Andreaggi, KimberlyAllen, Marie
By organisation
Medicinsk genetik och genomikScience for Life Laboratory, SciLifeLab
Genetics and GenomicsEvolutionary BiologyForensic Science
Sturk-Andreaggi, K. (2022). SUPPLEMENTARY MATERIALS for Tools and considerations for mitochondrial haplogroup assignment.

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 394 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf