Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Comprehensive Peak Characterization (CPC) in Untargeted LC-MS Analysis
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Pharmacy, Department of Medicinal Chemistry, Analytical Pharmaceutical Chemistry.ORCID iD: 0000-0002-5682-7408
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Pharmacy, Department of Medicinal Chemistry, Analytical Pharmaceutical Chemistry.
Univ Lubeck, Affiliated Inst, Eurac Res, Inst Biomed, I-39100 Bolzano, Italy..ORCID iD: 0000-0002-6977-7147
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Pharmacy, Department of Medicinal Chemistry, Analytical Pharmaceutical Chemistry.
Show others and affiliations
2022 (English)In: Metabolites, E-ISSN 2218-1989, Vol. 12, no 2, article id 137Article in journal (Refereed) Published
Abstract [en]

LC-MS-based untargeted metabolomics is heavily dependent on algorithms for automated peak detection and data preprocessing due to the complexity and size of the raw data generated. These algorithms are generally designed to be as inclusive as possible in order to minimize the number of missed peaks. This is known to result in an abundance of false positive peaks that further complicate downstream data processing and analysis. As a consequence, considerable effort is spent identifying features of interest that might represent peak detection artifacts. Here, we present the CPC algorithm, which allows automated characterization of detected peaks with subsequent filtering of low quality peaks using quality criteria familiar to analytical chemists. We provide a thorough description of the methods in addition to applying the algorithms to authentic metabolomics data. In the example presented, the algorithm removed about 35% of the peaks detected by XCMS, a majority of which exhibited a low signal-to-noise ratio. The algorithm is made available as an R-package and can be fully integrated into a standard XCMS workflow.

Place, publisher, year, edition, pages
MDPI AG MDPI, 2022. Vol. 12, no 2, article id 137
Keywords [en]
metabolomics, untargeted, peak characterization, peak detection, XCMS, false peaks, peak filtering, data processing, algorithm, data quality
National Category
Biochemistry Molecular Biology
Identifiers
URN: urn:nbn:se:uu:diva-470228DOI: 10.3390/metabo12020137ISI: 000762518100001PubMedID: 35208212OAI: oai:DiVA.org:uu-470228DiVA, id: diva2:1646303
Funder
Swedish Research Council
Note

R-package available at: https://www.github.com/krispir/cpc/

Available from: 2022-03-22 Created: 2022-03-22 Last updated: 2025-02-20Bibliographically approved
In thesis
1. Development of analytical methods for the determination of the small molecule component of complex biological systems
Open this publication in new window or tab >>Development of analytical methods for the determination of the small molecule component of complex biological systems
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The research field of untargeted metabolomics aims to determine the relative abundance of all small metabolites in a biological system in order to find biomarkers or make biological inference with regards to the internal or external stimuli. This is no trivial aim, as the small metabolites are both vast in numbers and extremely diverse in their chemical properties. As such, no single analytical method exist that is able to capture the entire metabolome on its own. In addition, the data generated from such experiments is both immense in volume and very complex. This forces researchers to use algorithmic data processing methods to extract the informative part of this data. Such algorithms are, however, both difficult to parametrize and designed to be highly inclusive, the combination of which often leads to errors. One such algorithm is the peak picking procedures used to find chromatographic peaks in liquid chromatography-mass spectrometry (LC-MS) data.

In this thesis, four papers are included that focus both on the development of new methods for sample analysis and data processing as well as the application of such, and other, methods in two interdisciplinary research projects. The first paper describes the development and application of a protocol for LC-MS based untargeted analysis of guinea pig perilymph. The focus of the study was to investigate the biochemical processes underlying the protective effect of hydrogen gas on noise-induced hearing loss (NIHL) in guinea pigs exposed to impulse noise. This study sparked two research projects based on limitations observed during the analytical work. The first limitation was that of limited chemical coverage in the analysis when sample volumes are highly limited. The second paper describes the design and validation of a novel separation method for the sequential analysis of both hydrophilic and lipophilic compounds in biological samples. The second limitation observed was the abundance of false peaks reported by peak picking software. These have a negative effect on both downstream data processing as well as data analysis and metabolite identification. The third paper describes the development of a new algorithm for comprehensive peak characterization in untargeted analytical data with the purpose of filtering such false peaks. Both methods presented in the second and third paper were applied to the analysis of guinea pigs perilymph samples in a follow-up study on the attenuating effect of hydrogen gas on NIHL in guinea pigs exposed to broad band continuous noise.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2022. p. 59
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Pharmacy, ISSN 1651-6192 ; 306
Keywords
metabolomics, data processing, peak characterization, algorithm, multivariate data analysis, chromatography, sequential columns, HILIC, RPLC, liquid chromatography, mass spectrometry, LC, MS, noise-induced hearing loss, NIHL, guinea pig, perilymph, method development, validation, R, C++
National Category
Analytical Chemistry
Research subject
Analytical Pharmaceutical Chemistry
Identifiers
urn:nbn:se:uu:diva-461320 (URN)978-91-513-1374-0 (ISBN)
Public defence
2022-02-18, B22, BMC, Husargatan 3, Uppsala, 09:15 (English)
Opponent
Supervisors
Available from: 2022-01-27 Created: 2021-12-14 Last updated: 2023-07-17

Open Access in DiVA

fulltext(2983 kB)943 downloads
File information
File name FULLTEXT01.pdfFile size 2983 kBChecksum SHA-512
df4d9fcc5fa850e860d2df04449c6846c7aaa450584e4814be68876eb0eb3604377be056d19780a5c2565824a42f1b9cf6f6e47714f92a51a66d719ed52fb64e
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMed

Authority records

Pirttilä, KristianBalgoma, DavidPettersson, CurtHedeland, Mikael

Search in DiVA

By author/editor
Pirttilä, KristianBalgoma, DavidRainer, JohannesPettersson, CurtHedeland, Mikael
By organisation
Analytical Pharmaceutical Chemistry
In the same journal
Metabolites
BiochemistryMolecular Biology

Search outside of DiVA

GoogleGoogle Scholar
Total: 946 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 248 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf