Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.
Method Article
* Wspomniani autorzy wnieśli do projektu równy wkład.
Open searching enables the identification of glycopeptides decorated with previously unknown glycan compositions. Within this article, a streamlined approach for undertaking open searching and subsequent glycan-focused glycopeptide searches are presented for bacterial samples using Acinetobacter baumannii as a model.
Protein glycosylation is increasingly recognized as a common modification within bacterial organisms, contributing to prokaryotic physiology and optimal infectivity of pathogenic species. Due to this, there is increasing interest in characterizing bacterial glycosylation and a need for high-throughput analytical tools to identify these events. Although bottom-up proteomics readily enables the generation of rich glycopeptide data, the breadth and diversity of glycans observed in prokaryotic species make the identification of bacterial glycosylation events extremely challenging.
Traditionally, the manual determination of glycan compositions within bacterial proteomic datasets made this a largely bespoke analysis restricted to field-specific experts. Recently, open searching-based approaches have emerged as a powerful alternative for the identification of unknown modifications. By analyzing the frequency of unique modifications observed on peptide sequences, open searching techniques allow the identification of common glycans attached to peptides within complex samples. This article presents a streamlined workflow for the interpretation and analysis of glycoproteomic data, demonstrating how open searching techniques can be used to identify bacterial glycopeptides without prior knowledge of the glycan compositions.
Using this approach, glycopeptides within samples can rapidly be identified to understand glycosylation differences. Using Acinetobacter baumannii as a model, these approaches enable the comparison of glycan compositions between strains and the identification of novel glycoproteins. Taken together, this work demonstrates the versatility of open database-searching techniques for the identification of bacterial glycosylation, making the characterization of these highly diverse glycoproteomes easier than ever before.
Protein glycosylation, the process of attaching carbohydrates to protein molecules, is one of the most common post-translational modifications (PTMs) in nature1,2. Across all domains of life, a range of complex machinery has evolved dedicated to the generation of glycoproteins that impact a myriad of cellular functions1,3,4,5. While protein glycosylation occurs on a range of amino acids6,7, N-linked and O-linked glycosylation events are two dominant forms observed in nature. N-linked glycosylation involves the attachment of glycans to a nitrogen atom of asparagine (Asn) residues, while in O-linked glycosylation, glycans are attached to an oxygen atom of serine (Ser), threonine (Thr), or tyrosine (Tyr) residues7. Despite the similarities in residues targeted by glycosylation systems, the differences within the glycans attached to proteins result in glycosylation being the most chemically diverse class of PTMs found in nature.
While eukaryotic glycosylation systems possess glycan diversity, these systems are typically restricted in the number of unique carbohydrates utilized. The resulting diversity stems from how these carbohydrates are arranged into glycans8,9,10,11,12. In contrast, bacterial and archaeal species possess virtually unlimited glycan diversity due to the sheer array of unique sugars generated within these systems2,10,13,14,15,16,17. These differences in the glycan diversity observed across domains of life represent a significant analytical challenge for the characterization and identification of glycosylation events. For eukaryotic glycosylation, the ability to anticipate glycan compositions has facilitated the growing interest in glycobiology; yet, the same is not true for bacterial glycosylation, which is still largely restricted to study by specialized laboratories. As the accessibility of mass spectrometry (MS) instrumentation has increased in the biosciences, MS-based approaches are now the primary method for glycoproteomic analysis.
MS has emerged as the quintessential tool for the characterization of glycosylation, with both top-down and bottom-up approaches now commonly used to characterize glycoproteins6. While top-down proteomics is used to assess global glycosylation patterns of specific proteins18,19, bottom-up approaches are used to enable the glycan-specific characterization of glycopeptides, even from complex mixtures6,20,21,22,23. For the analysis of glycopeptides, the generation of informative fragmentation information is essential for the characterization of glycosylation events24,25. A range of fragmentation approaches is now routinely accessible on instruments, including resonance ion trap-based collision-induced dissociation (IT-CID), beam-type collision-induced dissociation (CID), and electron transfer dissociation (ETD). Each approach possesses different strengths and weaknesses for glycopeptide analysis25,26, with significant progress over the last decade in applying these fragmentation approaches to analyze glycosylation6,20. However, for bacterial glycosylation analysis, the critical limitation has not been the ability to fragment glycopeptides but rather the inability to predict the potential glycan compositions within samples. Within these systems, the unknown nature of diverse bacterial glycans limits the identification of glycopeptides, even with glycosylation-focused searching tools now commonplace for the analysis of eukaryotic glycopeptides, such as O-Pair27, GlycopeptideGraphMS28, and GlycReSoft29. To overcome this issue, an alternative searching method is required, with the use of open searching tools emerging as a powerful approach for the study of bacterial glycosylation30.
Open searching, also known as blind or wildcard searching, allows the identification of peptides with unknown or unexpected PTMs21,30,31,32. Open searches utilize a variety of computational techniques, including curated modification searches, multistep database searches, or wide-mass tolerant searching33,34,35,36,37. Although open searching has great potential, its use has typically been hindered by the significant increase in analysis times and loss in sensitivity of the detection of unmodified peptides compared to restricted searches31,32. The decrease in the detection of unmodified peptide-spectral matches (PSMs) is a result of the increased false-positive PSM rates associated with these techniques, which requires increased stringent filtering to maintain the desired false discovery rates (FDRs)33,34,35,36,37. Recently, several tools have become available that significantly improve the accessibility of open searching, including Byonic31,38, Open-pFind39, ANN-SoLo40, and MSFragger21,41. These tools enable the robust identification of glycosylation events by significantly reducing analysis times and implementing approaches to handle heterogeneous glycan compositions.
This article presents a streamlined method for the identification of bacterial glycopeptides by open searching, using the Gram-negative nosocomial pathogen, Acinetobacter baumannii, as a model. A. baumannii possesses a conserved O-linked glycosylation system responsible for the modification of multiple protein substrates, known as the PglL protein glycosylation system42,43,44. While similar proteins are targeted for glycosylation between strains, the PglL glycosylation system is highly variable due to the biosynthesis of the glycan used for protein glycosylation being derived from the capsule locus (known as the K-locus)44,45,46. This results in diverse glycans (also known as a K-unit), derived from single or limited polymerized K-units, being added to protein substrates30,44,46. Within this work, the use of the open searching tool, MSfragger, within the software FragPipe, is used to identify glycans across A. baumannii strains. By combining open searching and manual curation, "glycan-focused searches" can be undertaken to further improve the identification of bacterial glycopeptides. Together, this multistep identification approach enables the identification of glycopeptides without extensive experience in the characterization of novel glycosylation events.
NOTE: The preparation and analysis of bacterial glycopeptide samples can be divided into four sections (Figure 1). For this study, the glycosylation of three sequenced A. baumannii strains was assessed (Table 1). Proteome FASTA databases of each of these strains are accessible via Uniprot. Refer to Table 2 for the composition of buffers used in this protocol.
1. Preparation of protein samples for proteomic analysis
2. Processing of proteome samples
3. LC-MS of proteome/glycopeptide-enriched samples
4. Analysis of proteome/glycopeptide-enriched samples
To illustrate the utility of open searching for bacterial glycopeptide analysis, the chemical diversity of O-linked glycans within three strains of A. baumannii-AB307-0294, ACICU, and D1279779-was assessed. The O-linked glycoproteomes are highly variable between A. baumannii strains as the glycans used for glycosylation are derived from the highly variable capsule loci44,45,46. This chemical d...
Open searching is an effective and systematic method for the identification of unknown modifications. While the identification of unknown glycans within bacterial proteome samples has traditionally been a time-consuming and technically specialized undertaking, the recent developments of tools such as MSfragger21,41 and Byonic31,38 now enable the quick and effective identification of delta masses for furth...
The authors have no conflicts of interest.
N.E.S is supported by an Australian Research Council Future Fellowship (FT200100270) and an ARC Discovery Project Grant (DP210100362). We thank the Melbourne Mass Spectrometry and Proteomics Facility of The Bio21 Molecular Science and Biotechnology Institute for access to MS instrumentation.
Name | Company | Catalog Number | Comments |
14 G Kel-F Hub point style 3 | Hamilton company | hanc90514 | |
2-Chloroacetamide | Sigma Aldrich Pty Ltd | C0267-100G | |
Acetonitrile | Sigma Aldrich Pty Ltd | 34851-4L | |
Ammonium hydroxide (28%) | Sigma Aldrich Pty Ltd | 338818-100ML | |
BCA Protein Assay Reagent A | Pierce | 23228 | |
BCA Protein Assay Reagent B | Pierce | 23224 | |
C8 Empore SPE | Sigma Aldrich Pty Ltd | 66882-U | An alterative vendor for C8 material is Affinisep (https://www.affinisep.com/about-us/) |
Formic acid | Sigma Aldrich Pty Ltd | 5.33002 | |
Isopropanol | Sigma Aldrich Pty Ltd | 650447-2.5L | |
Methanol | Fisher Chemical | M/4058/17 | |
SDB-RPS Empore SPE (Reversed-Phase Sulfonate) | Sigma Aldrich Pty Ltd | 66886-U | An alterative vendor for SDB-RPS is Affinisep (https://www.affinisep.com/about-us/) |
Sodium Deoxycholate | Sigma Aldrich Pty Ltd | D6750-100G | |
ThermoMixer C | Eppendorf | 2232000083 | |
trifluoroacetic acid | Sigma Aldrich Pty Ltd | 302031-10X1ML | |
Tris 2-carboxyethyl phosphine hydrochloride | Sigma Aldrich Pty Ltd | C4706-2G | |
Tris(hydroxymethyl)aminomethane | Sigma Aldrich Pty Ltd | 252859-500G | |
Trypsin/Lys-C protease mixture | Promega | V5073 | |
Vacuum concentrator | Labconco | 7810040 | |
ZIC-HILIC material | Merck | 1504580001 | Resin for use in single use SPE columns can be obtain by emptying a larger form column and using the free resin |
Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE
Zapytaj o uprawnieniaThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone