Metabolite profiling and beyond: approaches for the rapid processing and annotation of human blood serum mass spectrometry data
In this paper, we describe data processing and metabolite identification approaches which lead to a rapid and semi-automated interpretation of metabolomics experiments. Data from metabolite fingerprinting using LC-ESIQ-TOF/MS were processed with several open-source software packages, including XCMS and CAMERA to detect features and group features into compound spectra. Next, we describe the automatic scheduling of tandem mass spectrometry (MS) acquisitions to acquire a large number of MS/MS spectra, and the subsequent processing and computer-assisted annotation towards identification using the R packages MetShot, Rdisop, and the MetFusion application. We also implement a simple retention time prediction model using predicted lipophilicity logD, which predicts retention times within 42 s (6 min gradient) for most compounds in our setup. We putatively identified 44 common metabolites including several amino acids and phospholipids at metabolomics standards initiative (MSI) levels two and three and confirmed the majority of them by comparison with authentic standards at MSI level one. To aid both data integration within and data sharing between laboratories, we integrated data from two labs and mapped retention times between the chromatographic systems. Despite the different MS instrumentation and different chromatographic gradient programs, the mapped retention times agree within 26 s (20 min gradient) for 90 % of the mapped features.