Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.
Method Article
* Wspomniani autorzy wnieśli do projektu równy wkład.
This article describes the procedure for the identification and characterization of a gene family in grapevine applied to the family of Arabidopsis Tóxicos in Levadura (ATL) E3 ubiquitin ligases.
Classification and nomenclature of genes in a family can significantly contribute to the description of the diversity of encoded proteins and to the prediction of family functions based on several features, such as the presence of sequence motifs or of particular sites for post-translational modification and the expression profile of family members in different conditions. This work describes a detailed protocol for gene family characterization. Here, the procedure is applied to the characterization of the Arabidopsis Tóxicos in Levadura (ATL) E3 ubiquitin ligase family in grapevine. The methods include the genome-wide identification of family members, the characterization of gene localization, structure, and duplication, the analysis of conserved protein motifs, the prediction of protein localization and phosphorylation sites as well as gene expression profiling across the family in different datasets. Such procedure, which could be extended to further analyses depending on experimental purposes, could be applied to any gene family in any plant species for which genomic data are available, and it provides valuable information to identify interesting candidates for functional studies, giving insights into the molecular mechanisms of plant adaptation to their environment.
During the last decade, much research has been carried out in grapevine genomics. Grapevine is a recognized economically relevant crop, which has become a model for research on fruit development and on the responses of woody plants to biotic and abiotic stresses. In this context, the release of the Vitis vinifera cv. PN40024 genome in 20071 and its updated version in 20112 led to a rapid accumulation of "Omics"-scale data and to a burst of high-throughput studies. Based on the published sequence data, the comprehensive analysis of a given gene family (generally composed of proteins sharing conserved motifs, structural and/or functional similarities and evolutionary relationships), can now be performed to uncover its molecular functions, evolution, and gene expression profiles. These analyses can contribute to understanding how gene families control physiological processes at a genome-wide level.
Many aspects of the plant life cycle are regulated by ubiquitin-mediated degradation of key proteins, which require a fine-tuned turnover to ensure regular cellular processes. Important components of the ubiquitin-mediated degradation process are the E3 ubiquitin ligases, which are responsible for system flexibility, thanks to the recruitment of specific targets3. Accordingly, these enzymes represent a huge gene family, with around 1,400 E3 ligase-encoding genes predicted in Arabidopsis thaliana genome4, each E3 ubiquitin ligase acting for the ubiquitination of specific target proteins. Despite the importance of substrate-specific ubiquitination in cellular regulation in plants, little is known about how the ubiquitination pathway is regulated and target proteins have been identified only in a few cases. The deciphering of such specificity and regulation mechanisms relies first on the identification and characterization of the different components of the system, in particular the E3 ligases. Among ubiquitin ligases, the ATL subfamily is characterized by 91 members identified in A. thaliana displaying a RING-H2 finger domain5,6, some of them playing a role in defense and hormone responses7.
The first crucial step to define the members of a new gene family is the precise definition of the family features, such as consensus motifs, key domains, and protein sequence characteristics. Indeed, the reliable retrieval of all gene family members based on BLAST analysis requires some mandatory sequence characteristics, in particular protein domains responsible for protein function/activity, serving as protein signature. This can be facilitated by previous characterization of the same gene family in other plant species or achieved by analyzing different genes putatively belonging to the same family in different plant species, to isolate common sequences. The family members can then be individually named following common rules settled by international consortia for a given plant species. In grapevine, for instance, such procedure is subjected to the recommendations of the Super-Nomenclature Committee for Grape Gene Annotation (sNCGGa), establishing the construction of a phylogenetic tree including V. vinifera and A. thaliana gene family members to allow gene annotation based on nucleotide sequences8.
Chromosome localization of family members and gene duplication survey allow highlighting the presence of whole-genome or tandem duplicated genes. Such information appears useful to unravel putative gene functions, since it might show functional redundancy or reveal different situations, i.e., non-functionalization, neo-functionalization, or sub-functionalization9. Both neo- and sub-functionalization are important events that create genetic novelty, providing new cellular components for plant adaptation to changing environments10. In particular, duplications of ancestral genes and production of new genes were very frequent during the evolution of the grapevine genome and newly formed genes originating from proximal and tandem duplications in grapevine were more likely to produce new functions11.
Another key factor in deciphering gene family function is the transcriptomic profile. The availability of public databases giving access to a huge amount of transcriptomic data can be thus exploited to assign putative functions to gene family members using large-scale in silico expression analyses. Indeed, the peculiar expression of some genes in specific plant organs or in response to certain stresses can give some hints regarding the putative roles of the corresponding proteins in defined conditions, and give support to hypotheses about possible sub-functionalization of duplicated genes to respond to different challenges. For that purpose, it is important to consider several datasets: these can be already available gene expression matrixes, such as the genome-wide transcriptomic atlas of grapevine organs and developmental stages12, or can be built ad hoc by retrieving transcriptomic datasets for the particular plant species subjected to defined stresses. Moreover, a simple approach using two matrices, one with pairwise similarity data and the other one with pairwise co-expression coefficients can be applied to evaluate the relationships between sequence similarity and expression patterns within a gene family.
The aim of this work is to provide a global approach, defining gene structure, conserved protein motifs, chromosomal location, gene duplications, and expression patterns, as well the prediction of protein localization and phosphorylation sites, to attain an exhaustive characterization of a gene family in plants. Such a comprehensive approach is applied here to the characterization of the ATL E3 ubiquitin ligase family in grapevine. According to the emerging role of ATL subfamily members in regulating key cellular processes7, this work can well assist the identification of strong candidates for functional studies, and eventually unravel the molecular mechanisms governing the adaptation of this important crop to its environment.
1. Identification of Putative ATL Gene Family Member(s)
2. Manual Inspection of the PSI-BLAST-identified Family Members
3. Analysis of Protein Physical Parameters and Domains
4. Chromosomal Distribution, Duplications, and Exon-intron Organization
5. Phylogenetic Analysis and Nomenclature
6. Grapevine Organ and Stage Expression Profiling
7. Expression Profiling in Response to Biotic and Abiotic Stresses
8. Analysis of the Relationships Between Paralogous Sequence Divergence and Gene Co-expression
The VIT_05s0077g01970 gene, identified as the most similar to A. thaliana ATL2 (At3g16720) through a BLASTp search, was used as probe to survey the ATL family members in the grapevine genome (V. vinifera cv Pinot Noir PN40024). The PSI-BLAST analysis converged after a few cycles revealing a list of putative genes belonging to the grapevine ATL gene family (Figure 1A). The presence of the canonical RING-H2 domain for each candidate was evalua...
In the genomic era, many gene families have been deeply characterized in several plant species. This information is preliminary to functional studies and provide a frame to investigate further the role of different members in a family. In this context, there is also a need for a nomenclature system allowing to uniquely identify each member in a family, avoiding the redundancy and confusions that may arise when names are assigned independently to different genes by different research groups.
Af...
The authors have nothing to disclose.
The work was supported by the University of Verona within the frame of Joint Project 2014 (Characterization of the ATL gene family in grapevine and of its involvement in resistance to Plasmopara viticola).
Name | Company | Catalog Number | Comments |
Personal computer | |||
Basic Local Alignment Search Tool (BLAST) | https://blast.ncbi.nlm.nih.gov/Blast.cgi | ||
Molecular Evolutionary Genetics Analysis (MEGA) | http://www.megasoftware.net/ | ||
Motif-based sequence analysis tools (MEME) | http://meme-suite.org/ | ||
Geneious | Biomatters Limited | http://www.geneious.com/ | |
ProtParam Tool | http://web.expasy.org/protparam/ | ||
ngLOC | http://genome.unmc.edu/ngLOC/index.html | ||
TargetP v1.1 Server | http://www.cbs.dtu.dk/services/TargetP/ | ||
Protein Prowler | http://bioinf.scmb.uq.edu.au:8080/pprowler_webapp_1-2/ | ||
MUsite | http://musite.sourceforge.net/ | ||
Pfam | http://pfam.xfam.org/ | ||
TMHMM Server v. 2.0 | http://www.cbs.dtu.dk/services/TMHMM/ | ||
ProtScale | http://web.expasy.org/protscale/ | ||
Grape Genome Database (CRIBI) | http://genomes.cribi.unipd.it/grape/ | ||
PhenoGram | http://visualization.ritchielab.psu.edu/phenograms/plot | ||
MCScanX | http://chibba.pgml.uga.edu/mcscan2/ | ||
Interactive Tree Of Life (iTOL) | http://itol.embl.de/ | ||
UniProt | http://www.uniprot.org/ | ||
Phylogeny.fr | http://www.phylogeny.fr/index.cgi | ||
MUSCLE | http://www.ebi.ac.uk/Tools/msa/muscle/ | ||
Gblocks Server | http://molevol.cmima.csic.es/castresana/Gblocks_server.html | ||
Vitis vinifera cv. Corvina gene expression Atlas datamatrix | https://www.researchgate.net/publication/273383414_54sample_ datamatrix_geneIDs_Fasoli2012 | ||
Multi Experiment Viewer (MeV) | http://mev.tm4.org/#/welcome | ||
Sequence Read Archive (SRA) | https://www.ncbi.nlm.nih.gov/sra | ||
R | https://www.r-project.org/ | ||
EMBOSS Needle (EMBL-EBI) | http://www.ebi.ac.uk/Tools/psa/emboss_needle/ |
Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE
Zapytaj o uprawnieniaThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone