Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Decoding depression: a comprehensive multi-cohort exploration of blood DNA methylation using machine learning and deep learning approaches
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Surgical Sciences, Functional Pharmacology and Neuroscience.ORCID iD: 0009-0008-8549-1350
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Surgical Sciences, Functional Pharmacology and Neuroscience.ORCID iD: 0000-0001-7112-0921
2024 (English)In: Translational Psychiatry, E-ISSN 2158-3188, Vol. 14, no 1, article id 287Article in journal (Refereed) Published
Abstract [en]

The causes of depression are complex, and the current diagnosis methods rely solely on psychiatric evaluations with no incorporation of laboratory biomarkers in clinical practices. We investigated the stability of blood DNA methylation depression signatures in six different populations using six public and two domestic cohorts (n = 1942) conducting mega-analysis and meta-analysis of the individual studies. We evaluated 12 machine learning and deep learning strategies for depression classification both in cross-validation (CV) and in hold-out tests using merged data from 8 separate batches, constructing models with both biased and unbiased feature selection. We found 1987 CpG sites related to depression in both mega- and meta-analysis at the nominal level, and the associated genes were nominally related to axon guidance and immune pathways based on enrichment analysis and eQTM data. Random forest classifiers achieved the highest performance (AUC 0.73 and 0.76) in CV and hold-out tests respectively on the batch-level processed data. In contrast, the methylation showed low predictive power (all AUCs < 0.57) for all classifiers in CV and no predictive power in hold-out tests when used with harmonized data. All models achieved significantly better performance (>14% gain in AUCs) with pre-selected features (selection bias), with some of the models (joint autoencoder-classifier) reaching AUCs of up to 0.91 in the final testing regardless of data preparation. Different algorithmic feature selection approaches may outperform limma, however, random forest models perform well regardless of the strategy. The results provide an overview over potential future biomarkers for depression and highlight many important methodological aspects for DNA methylation-based depression profiling including the use of machine learning strategies.

Place, publisher, year, edition, pages
Springer Nature, 2024. Vol. 14, no 1, article id 287
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:uu:diva-535236DOI: 10.1038/s41398-024-02992-yISI: 001272549200003PubMedID: 39009577OAI: oai:DiVA.org:uu-535236DiVA, id: diva2:1884864
Available from: 2024-07-18 Created: 2024-07-18 Last updated: 2024-11-28Bibliographically approved
In thesis
1. Biomarkers for depression: genetic, epigenetic, and expression evidence
Open this publication in new window or tab >>Biomarkers for depression: genetic, epigenetic, and expression evidence
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Depression is a very prevalent disorder affecting between 2 to 21% of the world population. This thesis extends the knowledge on the biological aspects of depression, aiming to identify and validate markers of genetic, epigenetic, and gene expression origin.

In Study I, the main focus was depression-related gene MAD1L1 that was previously linked to depression by SNPs and frequently mentioned as a stress-related marker. We identified that depression-related SNPs in MAD1L1 affect DNA methylation levels at cg02825527, cg18302629, and cg19624444 that were associated with depressive phenotypes in independent cohorts.

In Study II, we investigated whether GWAS catalog depression SNPs located in Olink-detectable genes could be replicated in a UKBiobank cohort and whether these associations are supported by DNA methylation and transcriptome. We validated eight depression SNPs and found very weak evidence that TNXB may be related to depression.

Study III was based on comparison of different depression -OMIC layers, including genetics, DNA methylation, and transcriptome. We explored how the identified genes from different -OMICs overlap, are functionally related and if they could show patterns in drugs and clinical trials. Only three genes were supported by evidence at all three -OMIC levels and included: FOXP1, VPS41, and AKTIP. Different -OMIC levels showed involvement of multiple systems in depression.

In Study IV, we used the Neuro Exploratory panel (Olink) to identify depression proteomic changes in blood. We took antidepressant intake into the account and validated associations in the independent datasets. We identified several proteins that showed nominally different levels between depression risk groups in the adolescent cohort. Validation of identified markers yielded that only PPP3R1 was also differentially expressed in prefrontal cortex and whole blood in the independent open-access cohorts with matching association directions.

In Study V, we used the entire blood DNA methylation as a depression marker. We investigated stability of DNA-methylation in eight independent datasets with meta-analysis and compared common machine learning and deep learning strategies for the depression detection purposes. We found 1987 CpG sites related to depression in both mega- and meta-analysis at the nominal level. Random forest classifiers achieved the best performance in identifying depression based on DNA methylation data in blood (AUC 0.73 and 0.76) in CV and hold-out tests respectively on the batch-level processed data.

Overall, the thesis supports multiple depression genetic, epigenetic, and expression markers. However, identified individual and systemic depression changes show high variability, which is in agreement with previous studies and observations.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2024. p. 98
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Medicine, ISSN 1651-6206 ; 2093
Keywords
Depression, Biomarkers, Epigenetics, DNA methylation, Proteomics, Transcriptomics, Genetics, Machine learning
National Category
Bioinformatics (Computational Biology) Psychiatry Neurosciences
Research subject
Psychiatry; Bioinformatics
Identifiers
urn:nbn:se:uu:diva-540129 (URN)978-91-513-2270-4 (ISBN)
Public defence
2024-12-04, room A1:111a, Uppsala biomedicinska centrum (BMC), Husargatan 3, Uppsala, 13:00 (English)
Opponent
Supervisors
Available from: 2024-11-13 Created: 2024-10-10 Last updated: 2024-11-13

Open Access in DiVA

fulltext(2964 kB)1 downloads
File information
File name FULLTEXT01.pdfFile size 2964 kBChecksum SHA-512
d8c5ccabe022ebd5a026cd59b67dc5fc2bac54fc34e317890b3479745846a0a844f8455a8f79315de59f87fce3c554e9a00dd86a001349d394d2418b48262104
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMed

Authority records

Sokolov, Aleksandr V.Schiöth, Helgi

Search in DiVA

By author/editor
Sokolov, Aleksandr V.Schiöth, Helgi
By organisation
Functional Pharmacology and Neuroscience
In the same journal
Translational Psychiatry
Bioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 1 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 43 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf