Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
Link to record
Permanent link

Direct link
Rivas-Carrillo, Salvador Daniel
Publications (7 of 7) Show all publications
Rivas-Carrillo, S. D., Akkuratov, E. E., Valdez Ruvalcaba, H., Vargas-Sanchez, A., Komorowski, J., San-Juan, D. & Grabherr, M. G. (2023). MindReader: Unsupervised Classification of Electroencephalographic Data. Sensors, 23(6), Article ID 2971.
Open this publication in new window or tab >>MindReader: Unsupervised Classification of Electroencephalographic Data
Show others...
2023 (English)In: Sensors, E-ISSN 1424-8220, Vol. 23, no 6, article id 2971Article in journal (Refereed) Published
Abstract [en]

Electroencephalogram (EEG) interpretation plays a critical role in the clinical assessment of neurological conditions, most notably epilepsy. However, EEG recordings are typically analyzed manually by highly specialized and heavily trained personnel. Moreover, the low rate of capturing abnormal events during the procedure makes interpretation time-consuming, resource-hungry, and overall an expensive process. Automatic detection offers the potential to improve the quality of patient care by shortening the time to diagnosis, managing big data and optimizing the allocation of human resources towards precision medicine. Here, we present MindReader, a novel unsupervised machine-learning method comprised of the interplay between an autoencoder network, a hidden Markov model (HMM), and a generative component: after dividing the signal into overlapping frames and performing a fast Fourier transform, MindReader trains an autoencoder neural network for dimensionality reduction and compact representation of different frequency patterns for each frame. Next, we processed the temporal patterns using a HMM, while a third and generative component hypothesized and characterized the different phases that were then fed back to the HMM. MindReader then automatically generates labels that the physician can interpret as pathological and non-pathological phases, thus effectively reducing the search space for trained personnel. We evaluated MindReader’s predictive performance on 686 recordings, encompassing more than 980 h from the publicly available Physionet database. Compared to manual annotations, MindReader identified 197 of 198 epileptic events (99.45%), and is, as such, a highly sensitive method, which is a prerequisite for clinical use.

Place, publisher, year, edition, pages
MDPI, 2023
Keywords
electroencephalography, machine learning, precision medicine, unsupervised learning
National Category
Computer Sciences
Research subject
Artificial Intelligence
Identifiers
urn:nbn:se:uu:diva-498534 (URN)10.3390/s23062971 (DOI)000959947500001 ()36991682 (PubMedID)
Funder
NIH (National Institutes of Health), OD010425NIH (National Institutes of Health), HHSN272201300010CeSSENCE - An eScience Collaboration
Available from: 2023-03-17 Created: 2023-03-17 Last updated: 2023-04-19Bibliographically approved
Rivas-Carrillo, S. D., Akkuratov, E. E., Valdez Ruvalcaba, H., Vargas-Sanchez, A., San-Juan, D. & Grabherr, M. G. (2023). MindReader: unsupervised electroencephalographic reader. , 23(6), Article ID 2971.
Open this publication in new window or tab >>MindReader: unsupervised electroencephalographic reader
Show others...
2023 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Background: Electroencephalogram (EEG) interpretation plays a critical role in the clinical assessment of neurological conditions, including epilepsy. Manual analysis requires highly specialized and heavily trained personnel. Moreover, the rate of capturing abnormal events makes interpretation time-consuming, resource-hungry, and, overall, an expensive process.

Automatic detection offers the potential to improve the quality of patient care by shortening the time to diagnosis, managing big data, and optimizing the allocation of human resources.

Findings: We present MindReader, an unsupervised method for EEG signals. First, MindReader processes the signal through an autoencoder in order to detect EEG abnormalities. Next, patterns are hypothesized by a Hidden Markov Model. Our algorithm automatically generates labels for non-pathological phases, thus reducing the search space for trained personnel.

Conclusions: MindReader is effective in detecting EEG abnormalities in focal and generalized epilepsy.

National Category
Computer Sciences
Research subject
Bioinformatics
Identifiers
urn:nbn:se:uu:diva-473344 (URN)
Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2023-03-17Bibliographically approved
Rivas-Carrillo, S. D. (2023). The revolutionary partnership of computation and biology. (Doctoral dissertation). Uppsala: Acta Universitatis Upsaliensis
Open this publication in new window or tab >>The revolutionary partnership of computation and biology
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The organization of living beings is complex. Science uses modeling in order to gain a deeper understanding, and to be able to manipulate the processes of living organisms. To this purpose, I used and developed computational tools to investigate and model different relevant biological phenomena. 

In paper I, I utilized whole-genome data from wild and domesticated European rabbit (Oryctolagus cuniculus sp.) populations to identify segregating insertions of endogenous retroviruses and compare their variation along the host phylogeny and domestication history. The results from this study highlight the importance of genomic modeling beyond reference organisms and reference individuals, and provide deep insights regarding strategies for variant analyses in host population comparative genomics. In paper IV, I studied the process of exaptation of foreign genetic elements at broad-scale by observing the presence and characteristics of retroviral env gene, syncytin, across vertebrates. I searched a library of more than 150 chromosome-length assemblies covering 17 taxonomical orders for syncytin homologs, where I identified and syntenically aligned over 300 loci insertions, including not previously known insertions. Additionally, three-dimensional structures of the recovered sequences were predicted using AlphaFold2. Phylogenomics analyses suggest a complex dynamic of multiple retroviral insertions at different time points with sequence conservation specific to clades that share a similar histo-physiological placental type.

In paper II, I expanded the scope to encompass translational medicine by developing an unsupervised machine learning methodology for detecting anomalies in biomedical signals, MindReader, which I applied primarily to electroencephalogram. In paper III, I developed a hidden Markov model implementation that includes a hypothesis generator for stream time-domain signals, which is used as a dependency for paper II. The work in this thesis substantiates that a combination of biological knowledge, cutting-edge technology, and robust algorithmic design constitute the primordial factors for scientific advancement.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2023. p. 51
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Medicine, ISSN 1651-6206 ; 1916
National Category
Bioinformatics (Computational Biology) Other Basic Medicine
Identifiers
urn:nbn:se:uu:diva-473354 (URN)978-91-513-1516-4 (ISBN)
Public defence
2023-04-26, Room A1:107a, BMC, Husargatan 3, Uppsala, 09:00 (English)
Opponent
Supervisors
Available from: 2023-04-04 Created: 2022-04-26 Last updated: 2024-04-03Bibliographically approved
Rivas-Carrillo, S. D., Pettersson, M., Rubin, C.-J. & Jern, P. (2018). Whole-genome comparison of endogenous retrovirus segregation across wild and domestic host species populations. Proceedings of the National Academy of Sciences of the United States of America, 115(43), 11012-11017, Article ID 201815056.
Open this publication in new window or tab >>Whole-genome comparison of endogenous retrovirus segregation across wild and domestic host species populations
2018 (English)In: Proceedings of the National Academy of Sciences of the United States of America, ISSN 0027-8424, E-ISSN 1091-6490, Vol. 115, no 43, p. 11012-11017, article id 201815056Article in journal (Refereed) Published
Abstract [en]

Although recent advances in sequencing and computational analyses have facilitated use of endogenous retroviruses (ERVs) for deciphering coevolution among retroviruses and their hosts, sampling effects from different host populations present major challenges. Here we utilize available whole-genome data from wild and domesticated European rabbit (Oryctolagus cuniculus sp.) populations, sequenced as DNA pools by paired-end Illumina technology, for identifying segregating reference as well as nonreference ERV loci, to reveal their variation along the host phylogeny and domestication history. To produce new viruses, retroviruses must insert a proviral DNA copy into the host nuclear DNA. Occasional proviral insertions into the host germline have been passed down through generations as inherited ERVs during millions of years. These ERVs represent retroviruses that were active at the time of infection and thus present a remarkable record of historical virus–host associations. To examine segregating ERVs in host populations, we apply a reference library search strategy for anchoring ERV-associated short-sequence read pairs from pooled whole-genome sequences to reference genome assembly positions. We show that most ERVs segregate along host phylogeny but also uncover radiation of some ERVs, identified as segregating loci among wild and domestic rabbits. The study targets pertinent issues regarding genome sampling when examining virus–host evolution from the genomic ERV record and offers improved scope regarding common strategies for single-nucleotide variant analyses in host population comparative genomics.

Keywords
endogenous, retrovirus, host population, segregation, comparative genomics, evolution
National Category
Evolutionary Biology
Identifiers
urn:nbn:se:uu:diva-362814 (URN)10.1073/pnas.1815056115 (DOI)000448040500066 ()30297425 (PubMedID)
Funder
Swedish Research Council, VR-M 2015-02429
Note

Correction in: PNAS, vol. 115, issue 52, pages E12465. DOI: 10.1073/pnas.1820237116

Available from: 2018-10-10 Created: 2018-10-10 Last updated: 2023-03-20Bibliographically approved
Rivas-Carrillo, S. D., Zhu, W., Dudchenko, O., Weisz, D., MacManes, M., Lieberman Aiden, E., . . . Kaur, P.Broad-scale in silico assessment retroviral exaptated gene: syncytin.
Open this publication in new window or tab >>Broad-scale in silico assessment retroviral exaptated gene: syncytin
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Syncytin is a fossil protein exapted from retroviruses that fulfills a pivotal role during trophoblast implantation and placental metabolite exchange. However, little is yet known about the distribution of syncytin across vertebrates. Here, we searched a library of more than 150 high-quality assemblies across 17 taxonomical orders for syncytin homologs. We identified and syntenically aligned over 300 loci insertions, including not previously known insertions. Additionally, we predicted the tridimensional structures of the recover sequences using AlphaFold2. Sequence conservation and phylogenomics analyses suggest a complex dynamic of multiple retroviral insertions at different time points with sequence conservation specific to clades that share a similar histo-physiological placental type. This research has widened our knowledge about the physiology of placentation through a better understanding of the evolutionary role of syncytin.

National Category
Bioinformatics and Computational Biology
Research subject
Bioinformatics
Identifiers
urn:nbn:se:uu:diva-473342 (URN)
Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2025-02-07Bibliographically approved
Rivas-Carrillo, S. D., Punga, T. & Grabherr, M. G.Chapulin: a leap forward on mobile element and structural variant identification.
Open this publication in new window or tab >>Chapulin: a leap forward on mobile element and structural variant identification
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Transposable elements represent a substantial proportion of eukaryotic genomes, where they can disrupt or enhance gene expression on the host. However, identification at population scale where often short sequencing signals are available is challenging. Current approaches rely on parsing sequence alignment files looking for anomalies on read length, read orientation and read depth, but they are often slow and complicated to install. Here, we present the Chapulin, a portable cross-platform, open-sourced Rust application for structural variant identification and characterization, including transposable elements. By using concurrent computing and native execution, Chapulin identifies a large fraction of mobile element insertions while outperforming existing transposable element tools. Chapulin was designed to be versatile and robust, in order to accommodate the demands of current data, such as population-scale studies or clinical samples

Keywords
genetic structural variation, population genetics, Chapulin, bioinformatics, research tool
National Category
Pharmaceutical and Medical Biotechnology
Research subject
Bioinformatics
Identifiers
urn:nbn:se:uu:diva-473341 (URN)
Available from: 2022-04-25 Created: 2022-04-25 Last updated: 2025-02-17Bibliographically approved
Rivas-Carrillo, S. D. & Grabherr, M. G.HiddenMarkovModelReaders: A Julia implementation of a Hidden Markov Model and unsupervised hypothesis generation for signal processing.
Open this publication in new window or tab >>HiddenMarkovModelReaders: A Julia implementation of a Hidden Markov Model and unsupervised hypothesis generation for signal processing
(English)Manuscript (preprint) (Other academic)
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-498539 (URN)
Available from: 2023-03-17 Created: 2023-03-17 Last updated: 2023-03-20Bibliographically approved
Organisations

Search in DiVA

Show all publications