Sign In

A subscription to JoVE is required to view this content. Sign in or start your free trial.

In This Article

  • Summary
  • Abstract
  • Introduction
  • Protocol
  • Representative Results
  • Discussion
  • Acknowledgements
  • Materials
  • References
  • Reprints and Permissions

Summary

Clinical metaproteomics offers insights into the human microbiome and its contributions to disease. We harnessed the computational power of the Galaxy platform to develop a modular bioinformatics workflow that facilitates complex mass spectrometry-based metaproteomic analysis and characterization of diverse clinical sample types relevant to studies of disease.

Abstract

Clinical metaproteomics reveals host-microbiome interactions underlying diseases. However, challenges to this approach exist. In particular, the characterization of microbial proteins present in low abundance relative to host proteins is difficult. Other significant challenges are attributed to using very large protein sequence databases, which impedes sensitivity and accuracy during peptide and protein identification from mass spectrometry data in addition to retrieving taxonomy and functional annotations and performing statistical analysis. To address these problems, we present an integrated bioinformatics workflow for mass spectrometry-based metaproteomics that combines custom protein sequence database generation, peptide-spectrum match generation and verification, quantification, taxonomic and functional annotations, and statistical analysis. This workflow also offers characterization of human proteins (while prioritizing microbial proteins), thus offering insights into host-microbe dynamics in disease. The tools and workflow are deployed in the Galaxy ecosystem, enabling the development, optimization, and dissemination of these computational resources. We have applied this workflow for metaproteomic analysis of numerous clinical sample types, such as nasopharyngeal swabs and bronchoalveolar lavage fluid. Here, we demonstrate its utility via the analysis of residual fluid from cervical swabs. The complete workflow and accompanying training resources are accessible on the Galaxy Training Network to equip non-experts and experienced researchers with the necessary knowledge and tools to analyze their data.

Introduction

Mass spectrometry (MS)-based metaproteomics identifies and quantifies microbial and human proteins from clinical samples. This approach provides a new understanding of microbiome responses to disease and uncovers potential mediators of host-microbiome interactions1,2. Although metaproteomic analysis of clinical samples can uncover the microbiome's interactions with its host environment, the field still faces many challenges. One main challenge is the relatively high abundance of host (human) proteins, which hampers the identification of lower abundant microbial proteins. Moreover, MS-based metaproteomics d....

Protocol

MS/MS spectral data were obtained from de-identified residual PTF samples that were collected using procedures that followed institutional board-approved guidelines and regulations, as previously described21,29,30.

NOTE: Figure 1 provides an overview of the complete workflow, which consists of five modules. All inputs, outputs, and software tools are summarized in

Representative Results

The general protocol described here was demonstrated on MS/MS files obtained from a subset of PTF samples21. Do et al.21 analyzed four MS/MS files from PTF samples that were collected following procedures described by Boylan et al.29and Afiuni-Zadel et al.30. This workflow prioritizes microbial proteins but offers the flexibility for the characterization of human proteins in parallel with microbial proteins21.......

Discussion

Clinical metaproteomics research offers potential breakthroughs for clinical studies, but challenges in its implementation persist. The lower abundance of microbial proteins relative to the host proteins in most samples hinders the detection and characterization of non-host proteins6,10. Dependence on large protein sequence databases for accurate peptide and protein identification and quantification, along with complexities of taxonomically and functionally annot.......

Acknowledgements

We thank Dr. Amy Skubitz and Dr. Kristin Boylan (University of Minnesota) for the pilot data sets and Dr. Paul Piehowski, Dr. Tao Liu, and Dr. Karin Rodland (Pacific Northwest National Laboratories (PNNL)) for their expertise in the sample collection, and processing of the PTF samples and generation of the TMT-labeled MS data used in this study. This project was funded in part by the Minnesota Ovarian Cancer Alliance (MOCA), the National Institutes of Health/National Cancer Institute Grant Number: 5R01CA262153 (A.P.N.S.), 1R21CA267707 (P.D.J and T.J.G.), and the National Institutes of Health/National Cancer Institute Grant Number: P30CA077598 (P.D.J. and T.J.G.).

....

Materials

NameCompanyCatalog NumberComments
Collapse CollectionGalaxyPGalaxy Version 5.1.1Combines a dataset list collection into a single file (in the order of the list)
Concatenate datasetsGalaxyPGalaxy Version 0.1.1Concatenate files tail-to-head
CutGalaxyPGalaxy Version 1.0.2Cut (select) specified columns from a file
FASTA Merge Files and Filter Unique SequencesGalaxyPGalaxy Version 1.2.0Concatenate FASTA database files together
FastaCLIGalaxyPGalaxy Version 4.0.41+galaxy1Appends decoy sequences to FASTA files
FASTA-to-TablularGalaxyPGalaxy Version 1.1.0Convert FASTA-formatted sequences to TAB-delimited format
FilterGalaxyPGalaxy Version 1.1.1Filter columns using simple expressions
Filter TabularGalaxyPGalaxy Version 3.3.0Filter a tabular file via line filters
Galaxy Europe (EU) serverGalaxyPhttps://usegalaxy.eu/
GroupGalaxyPGalaxy Version 2.1.4Group a file by a particular column and perform aggregate functions
Identification ParametersGalaxyPGalaxy Version 4.0.41+galaxy1Set identification parameters for SearchGUI/PeptideShaker
Learning Pathway: Clinical metaproteomics workflows within GalaxyGalaxyPhttps://training.galaxyproject.org/training-material/learning-pathways/clinical-metaproteomics.html
MaxQuantGalaxyPGalaxy Version 2.0.3.0+galaxy0 (Discovery module); Galaxy Version 1.6.17.0+galaxy4 (Quantification module)Quantitative proteomics software package for analysis of large mass spectrometric data files
MetaNovoGalaxyPGalaxy Version 1.9.4+galaxy4Search MS/MS data against a FASTA database (of known proteins) to produce a targeted database (of matched proteins) for mass spectrometry analysis
msconvertGalaxyPGalaxy Version 3.0.20287.2Convert and/or filter mass spectrometry files
MSstatsTMTGalaxyPGalaxy Version 2.0.0+galaxy1R-based package for detection of differentially abundant proteins in shotgun mass spectrometry-based proteomic experiments using tandem mass tag (TMT) labeling
PepQuery2GalaxyPGalaxy Version 2.0.2+galaxy0Peptide-centric search engine for identification and/or validating known and novel peptides of interest
PeptideShakerGalaxyPGalaxy Version 2.0.33+galaxy1Interpret results from SearchGUI for protein identification
Protein Database DownloaderGalaxyPGalaxy Version 0.3.4Download specified protein sequences as a FASTA file
Query TabularGalaxyPGalaxy Version 3.3.0Load tabular files intoa  SQLite database
Remove beginningGalaxyPGalaxy Version 1.0.0Remove the specified number of (header) lines from a file
SearchGUIGalaxyPGalaxy Version 4.0.41+galaxy1Run search engines on MGF peak lists and prepare results for input to Peptide Shaker
SelectGalaxyPGalaxy Version 1.0.4Select lines that match an expression
UnipeptGalaxyPGalaxy Version 4.5.1Retrieve UniProt entries and taxonomic information for tryptic peptides
UniProtGalaxyPGalaxy Version 2.3.0Download proteome as a XML (UniProtXML) or FASTA file from UniProtKB

References

  1. Zhang, X., Li, L., Butcher, J., Stintzi, A., Figeys, D. Advancing functional and translational microbiome research using meta-omics approaches. Microbiome. 7 (1), 154 (2019).
  2. Van Den Bossche, T., et al.

Explore More Articles

bioinformaticsclinical analysismetaproteomicsmicrobiomemass spectrometry

This article has been published

Video Coming Soon

JoVE Logo

Privacy

Terms of Use

Policies

Research

Education

ABOUT JoVE

Copyright © 2025 MyJoVE Corporation. All rights reserved