uu.seUppsala University Publications
Change search
ReferencesLink to record
Permanent link

Direct link
A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them
Univ Bielefeld, Dept Anim Behav, Postfach 100131, D-33501 Bielefeld, Germany.;British Antarctic Survey, Madingley Rd, Cambridge CB3 OET, England..
Uppsala University, Disciplinary Domain of Science and Technology, Biology, Department of Cell and Molecular Biology, Molecular Evolution. Uppsala University, Science for Life Laboratory, SciLifeLab.
British Antarctic Survey, Madingley Rd, Cambridge CB3 OET, England..
British Antarctic Survey, Madingley Rd, Cambridge CB3 OET, England..
Show others and affiliations
2016 (English)In: Molecular Ecology Resources, ISSN 1755-098X, E-ISSN 1755-0998, Vol. 16, no 4, 909-921 p.Article in journal (Refereed) Published
Abstract [en]

Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N-50: 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling.

Place, publisher, year, edition, pages
2016. Vol. 16, no 4, 909-921 p.
Keyword [en]
Antarctic fur seal, Arctocephalus gazella, cross-validation, draft genome, single nucleotide polymorphism, SNP array
National Category
URN: urn:nbn:se:uu:diva-308282DOI: 10.1111/1755-0998.12502ISI: 000383281100007PubMedID: 26683564OAI: oai:DiVA.org:uu-308282DiVA: diva2:1049277
EU, FP7, Seventh Framework Programme, PCIG-GA-2011-303618Knut and Alice Wallenberg FoundationSwedish Research Council Formas, 231-2012-450German Research Foundation (DFG)
Available from: 2016-11-24 Created: 2016-11-24 Last updated: 2016-11-24Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Martinez-Barrio, AlvaroWolf, Jochen B. W.
By organisation
Molecular EvolutionScience for Life Laboratory, SciLifeLabEvolutionary Biology
In the same journal
Molecular Ecology Resources

Search outside of DiVA

GoogleGoogle Scholar

Altmetric score

Total: 156 hits
ReferencesLink to record
Permanent link

Direct link