Model-driven data curation pipeline for LC-MS-based untargeted metabolomics.
Metabolomics
; 19(3): 15, 2023 03 01.
Article
em En
| MEDLINE
| ID: mdl-36856823
INTRODUCTION: There is still no community consensus regarding strategies for data quality review in liquid chromatography mass spectrometry (LC-MS)-based untargeted metabolomics. Assessing the analytical robustness of data, which is relevant for inter-laboratory comparisons and reproducibility, remains a challenge despite the wide variety of tools available for data processing. OBJECTIVES: The aim of this study was to provide a model to describe the sources of variation in LC-MS-based untargeted metabolomics measurements, to use it to build a comprehensive curation pipeline, and to provide quality assessment tools for data quality review. METHODS: Human serum samples (n=392) were analyzed by ultraperformance liquid chromatography coupled to high-resolution mass spectrometry (UPLC-HRMS) using an untargeted metabolomics approach. The pipeline and tools used to process this dataset were implemented as part of the open source, publicly available TidyMS Python-based package. RESULTS: The model was applied to understand data curation practices used by the metabolomics community. Sources of variation, which are often overlooked in untargeted metabolomic studies, were identified in the analysis. New tools were used to characterize certain types of variations. CONCLUSION: The developed pipeline allowed confirming data robustness by comparing the experimental results with expected values predicted by the model. New quality control practices were introduced to assess the analytical quality of data.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Metabolômica
/
Curadoria de Dados
Tipo de estudo:
Prognostic_studies
Limite:
Humans
Idioma:
En
Revista:
Metabolomics
Ano de publicação:
2023
Tipo de documento:
Article
País de afiliação:
Argentina
País de publicação:
Estados Unidos