Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
The revolutionary partnership of computation and biology
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Medical Biochemistry and Microbiology.
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The organization of living beings is complex. Science uses modeling in order to gain a deeper understanding, and to be able to manipulate the processes of living organisms. To this purpose, I used and developed computational tools to investigate and model different relevant biological phenomena. 

In paper I, I utilized whole-genome data from wild and domesticated European rabbit (Oryctolagus cuniculus sp.) populations to identify segregating insertions of endogenous retroviruses and compare their variation along the host phylogeny and domestication history. The results from this study highlight the importance of genomic modeling beyond reference organisms and reference individuals, and provide deep insights regarding strategies for variant analyses in host population comparative genomics. In paper IV, I studied the process of exaptation of foreign genetic elements at broad-scale by observing the presence and characteristics of retroviral env gene, syncytin, across vertebrates. I searched a library of more than 150 chromosome-length assemblies covering 17 taxonomical orders for syncytin homologs, where I identified and syntenically aligned over 300 loci insertions, including not previously known insertions. Additionally, three-dimensional structures of the recovered sequences were predicted using AlphaFold2. Phylogenomics analyses suggest a complex dynamic of multiple retroviral insertions at different time points with sequence conservation specific to clades that share a similar histo-physiological placental type.

In paper II, I expanded the scope to encompass translational medicine by developing an unsupervised machine learning methodology for detecting anomalies in biomedical signals, MindReader, which I applied primarily to electroencephalogram. In paper III, I developed a hidden Markov model implementation that includes a hypothesis generator for stream time-domain signals, which is used as a dependency for paper II. The work in this thesis substantiates that a combination of biological knowledge, cutting-edge technology, and robust algorithmic design constitute the primordial factors for scientific advancement.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2023. , p. 51
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Medicine, ISSN 1651-6206 ; 1916
National Category
Bioinformatics (Computational Biology) Other Basic Medicine
Identifiers
URN: urn:nbn:se:uu:diva-473354ISBN: 978-91-513-1516-4 (print)OAI: oai:DiVA.org:uu-473354DiVA, id: diva2:1654037
Public defence
2023-04-26, Room A1:107a, BMC, Husargatan 3, Uppsala, 09:00 (English)
Opponent
Supervisors
Available from: 2023-04-04 Created: 2022-04-26 Last updated: 2024-04-03Bibliographically approved
List of papers
1. Whole-genome comparison of endogenous retrovirus segregation across wild and domestic host species populations
Open this publication in new window or tab >>Whole-genome comparison of endogenous retrovirus segregation across wild and domestic host species populations
2018 (English)In: Proceedings of the National Academy of Sciences of the United States of America, ISSN 0027-8424, E-ISSN 1091-6490, Vol. 115, no 43, p. 11012-11017, article id 201815056Article in journal (Refereed) Published
Abstract [en]

Although recent advances in sequencing and computational analyses have facilitated use of endogenous retroviruses (ERVs) for deciphering coevolution among retroviruses and their hosts, sampling effects from different host populations present major challenges. Here we utilize available whole-genome data from wild and domesticated European rabbit (Oryctolagus cuniculus sp.) populations, sequenced as DNA pools by paired-end Illumina technology, for identifying segregating reference as well as nonreference ERV loci, to reveal their variation along the host phylogeny and domestication history. To produce new viruses, retroviruses must insert a proviral DNA copy into the host nuclear DNA. Occasional proviral insertions into the host germline have been passed down through generations as inherited ERVs during millions of years. These ERVs represent retroviruses that were active at the time of infection and thus present a remarkable record of historical virus–host associations. To examine segregating ERVs in host populations, we apply a reference library search strategy for anchoring ERV-associated short-sequence read pairs from pooled whole-genome sequences to reference genome assembly positions. We show that most ERVs segregate along host phylogeny but also uncover radiation of some ERVs, identified as segregating loci among wild and domestic rabbits. The study targets pertinent issues regarding genome sampling when examining virus–host evolution from the genomic ERV record and offers improved scope regarding common strategies for single-nucleotide variant analyses in host population comparative genomics.

Keywords
endogenous, retrovirus, host population, segregation, comparative genomics, evolution
National Category
Evolutionary Biology
Identifiers
urn:nbn:se:uu:diva-362814 (URN)10.1073/pnas.1815056115 (DOI)000448040500066 ()30297425 (PubMedID)
Funder
Swedish Research Council, VR-M 2015-02429
Note

Correction in: PNAS, vol. 115, issue 52, pages E12465. DOI: 10.1073/pnas.1820237116

Available from: 2018-10-10 Created: 2018-10-10 Last updated: 2023-03-20Bibliographically approved
2. MindReader: unsupervised electroencephalographic reader
Open this publication in new window or tab >>MindReader: unsupervised electroencephalographic reader
Show others...
2023 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Background: Electroencephalogram (EEG) interpretation plays a critical role in the clinical assessment of neurological conditions, including epilepsy. Manual analysis requires highly specialized and heavily trained personnel. Moreover, the rate of capturing abnormal events makes interpretation time-consuming, resource-hungry, and, overall, an expensive process.

Automatic detection offers the potential to improve the quality of patient care by shortening the time to diagnosis, managing big data, and optimizing the allocation of human resources.

Findings: We present MindReader, an unsupervised method for EEG signals. First, MindReader processes the signal through an autoencoder in order to detect EEG abnormalities. Next, patterns are hypothesized by a Hidden Markov Model. Our algorithm automatically generates labels for non-pathological phases, thus reducing the search space for trained personnel.

Conclusions: MindReader is effective in detecting EEG abnormalities in focal and generalized epilepsy.

National Category
Computer Sciences
Research subject
Bioinformatics
Identifiers
urn:nbn:se:uu:diva-473344 (URN)
Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2023-03-17Bibliographically approved
3. HiddenMarkovModelReaders: A Julia implementation of a Hidden Markov Model and unsupervised hypothesis generation for signal processing
Open this publication in new window or tab >>HiddenMarkovModelReaders: A Julia implementation of a Hidden Markov Model and unsupervised hypothesis generation for signal processing
(English)Manuscript (preprint) (Other academic)
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-498539 (URN)
Available from: 2023-03-17 Created: 2023-03-17 Last updated: 2023-03-20Bibliographically approved
4. Broad-scale in silico assessment retroviral exaptated gene: syncytin
Open this publication in new window or tab >>Broad-scale in silico assessment retroviral exaptated gene: syncytin
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Syncytin is a fossil protein exapted from retroviruses that fulfills a pivotal role during trophoblast implantation and placental metabolite exchange. However, little is yet known about the distribution of syncytin across vertebrates. Here, we searched a library of more than 150 high-quality assemblies across 17 taxonomical orders for syncytin homologs. We identified and syntenically aligned over 300 loci insertions, including not previously known insertions. Additionally, we predicted the tridimensional structures of the recover sequences using AlphaFold2. Sequence conservation and phylogenomics analyses suggest a complex dynamic of multiple retroviral insertions at different time points with sequence conservation specific to clades that share a similar histo-physiological placental type. This research has widened our knowledge about the physiology of placentation through a better understanding of the evolutionary role of syncytin.

National Category
Bioinformatics and Computational Biology
Research subject
Bioinformatics
Identifiers
urn:nbn:se:uu:diva-473342 (URN)
Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

UUThesis_Rivas,D-2023(1132 kB)398 downloads
File information
File name FULLTEXT01.pdfFile size 1132 kBChecksum SHA-512
fd44d781f68a42075649b52d043ab6d15ed0e5a51f2e41617ed86852d9510c283ecc3b2fe3070d3973aaa66712e3a2e7f39db694b515bd74392fae8f1c00a44e
Type fulltextMimetype application/pdf

Authority records

Rivas-Carrillo, Salvador Daniel

Search in DiVA

By author/editor
Rivas-Carrillo, Salvador Daniel
By organisation
Department of Medical Biochemistry and Microbiology
Bioinformatics (Computational Biology)Other Basic Medicine

Search outside of DiVA

GoogleGoogle Scholar
Total: 401 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 739 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf