Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.
Method Article
This protocol illustrates how to explore, compare, and interpret human protein glycomes with online resources.
The Glyco@Expasy initiative was launched as a collection of interdependent databases and tools spanning several aspects of knowledge in glycobiology. In particular, it aims at highlighting interactions between glycoproteins (such as cell surface receptors) and carbohydrate-binding proteins mediated by glycans. Here, major resources of the collection are introduced through two illustrative examples centered on the N-glycome of the human Prostate Specific Antigen (PSA) and the O-glycome of human serum proteins. Through different database queries and with the help of visualization tools, this article shows how to explore and compare content in a continuum to gather and correlate otherwise scattered pieces of information. Collected data are destined to feed more elaborate scenarios of glycan function. Glycoinformatics introduced here is, therefore, proposed as a means to either strengthen, shape, or refute assumptions on the specificity of a protein glycome in a given context.
Glycans, proteins to which they are attached (glycoproteins) and proteins to which they bind (lectins or carbohydrate-binding proteins) are the main molecular actors at the cell surface1. Despite this central role in cell-cell communication, large-scale studies, including glycomics, glycoproteomics, or glycan-interactomics data are still scarce compared to their counterpart in genomics and proteomics.
Until recently, methods for characterizing the branching structures of complex carbohydrates while still being conjugated to the carrier protein had not been developed. The biosynthesis of glycoproteins is a non-template-driven process in which the monosaccharide donors, the accepting glycoprotein substrates, and the glycosyltransferases and glycosidases play an interactive role. The resulting glycoproteins can bear complex structures with multiple branching points where each monosaccharide component can be one of the several types present in nature1. The non-template-driven process imposes biochemical analysis as the only option for generating oligosaccharide structural data. The analytical process of glycan structures attached to a native protein is often challenging as it requires sensitive, quantitative, and robust technologies to determine monosaccharide composition, linkages, and branching sequences2.
In this context, mass spectrometry (MS) is the most widely used technique in glycomics and glycoproteomics experiments. As time goes, these are carried out in higher throughput settings and data is now accumulating in databases. Glycan structures in various formats3, populate GlyTouCan4, the universal glycan data repository where each structure is associated with a stable identifier irrespective of the level of precision with which the glycan is defined (e.g., possibly missing linkage type or ambiguous composition). Very similar structures are collected but their minor differences are clearly reported. Glycoproteins are described and curated in GlyConnect5 and GlyGen6, two databases cross-referencing each other. MS data supporting structural pieces of evidence are increasingly stored in GlycoPOST7. For a wider coverage of online resources, chapter 52 of the reference manual, Essentials of Glycobiology, is dedicated to glycoinformatics8. Interestingly, glycopeptide identification software has proliferated in recent years9,10 though not to the benefit of reproducibility. The latter concern prompted the leaders of the HUPO GlycoProteomics Initiative (HGI) to set a software challenge in 2019. The MS data obtained from processing complex mixtures of N- and O-glycosylated human serum proteins in CID, ETD, and EThcD fragmentation modes, were made available to competitors whether software users or developers. The full report on the results of this challenge11 is only outlined here. To begin with, a spread of identifications was observed. It was mainly interpreted as caused by the diversity of methods implemented in search engines, of their settings, and how outputs were filtered, and peptide "counted". The experimental design may also have put some software and approaches at a (dis)advantage. Importantly, participants using the same software reported inconsistent results, thereby highlighting serious reproducibility issues. It was concluded by comparing different submissions that some software solutions perform better than others and some search strategies yield better results. This feedback is likely to guide the improvement of automated glycopeptide data analysis methods and will in turn, impact database content.
The expansion of glycoinformatics led to creating web portals that provide information and access to multiple similar or complementing resources. The most recent and up-to-date are described in a chapter of the Comprehensive Glycoscience book series12 and through cooperation, a solution to data sharing and information exchange is offered in an open access mode. One such portal was developed which was originally called Glycomics@ExPASy13 and renamed Glyco@Expasy, following the major overhaul of the Expasy platform14 that has hosted a large collection of tools and databases used across several -omics for decades, the most popular item being UniProt15-the universal protein knowledgebase. Glyco@Expasy offers a didactic discovery of the purpose and usage of databases and tools, based on a visual categorization and a display of their interdependencies. The following protocol illustrates procedures to explore glycomics and glycoproteomics data with a selection of resources from this portal that makes the connection between glycoproteomics and glycan-interactomics explicit via glycomics. As it is, glycomics experiments produce structures where monosaccharides are fully defined and linkages partially or fully determined, but their protein site attachment is poorly, if at all, characterized. In contrast, glycoproteomics experiments generate precise site attachment information but with a poor resolution of glycan structures, often limited to monosaccharide compositions. This information is pieced together in the GlyConnect database. Furthermore, search tools in GlyConnect can be used to detect potential glycan ligands which are described along with the proteins recognizing them in UniLectin16, linked to GlyConnect via glycans. The protocol presented here is divided into two sections to cover questions specific to N-linked and O-linked glycans and glycoproteins.
NOTE: A device with an Internet connection (larger screen preferred) and an up-to-date Web browser such as Chrome or Firefox is required. Using Safari or Edge may not be as reliable.
1. From a protein N-glycome in GlyConnect to a lectin of UniLectin
2. Exploring and comparing O-glycomes in GlyConnect
The first part of the protocol (section 1) showed how to investigate the specificity or the commonality of N-glycans attached on Asn-69 of the human Prostate Specific Antigen (PSA) using the GlyConnect platform. Tissue-dependent (urine and seminal fluid), as well as isoform-dependent (normal and high pI) variations in glycan expression, were emphasized using two visualizing tools (Figure 4 and Figure 5).
First, GlyCon...
GlyConnect Octopus as a tool for revealing unexpected correlations
GlyConnect Octopus was originally designed to query the database with a loose definition of glycans. Indeed, the literature often reports the main characteristics of glycans in a glycome such as being fucosylated or sialylated, being made of two or more antennae, etc. Furthermore, glycans whether N- or O-linked are classified in cores, as detailed in the reference manual Essentials of Glycobiology1, that are ...
The authors declare no conflicts of interest.
The author warmly acknowledges past and present members of the Proteome Informatics Group involved in developing the resources used in this tutorial, specifically, Julien Mariethoz and Catherine Hayes for GlyConnect, François Bonnardel for UniLectin, Davide Alocci, and Frederic Nikitin for the Octopus, and Thibault Robin for Compozitor and final touch on Octopus.
The development of the glyco@Expasy project is supported by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI) and is currently complemented by the Swiss National Science Foundation (SNSF: 31003A_179249). ExPASy is maintained by the Swiss Institute of Bioinformatics and hosted at the Vital-IT Competency Center. The author also acknowledges Anne Imberty for outstanding cooperation on the UniLectin platform jointly supported by ANR PIA Glyco@Alps (ANR-15-IDEX-02), Alliance Campus Rhodanien Co-funds (http://campusrhodanien.unige-cofunds.ch) Labex Arcane/CBH-EUR-GS (ANR-17-EURE-0003).
Name | Company | Catalog Number | Comments |
internet connection | user's choice | ||
recent version of web browser | user's choice |
Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE
Zapytaj o uprawnieniaThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone