Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.
Method Article
* Wspomniani autorzy wnieśli do projektu równy wkład.
This article describes the application of untargeted metabolomics, transcriptomics and multivariate statistical analysis to grape berry transcripts and metabolites in order to gain insight into the terroir concept, i.e., the impact of the environment on berry quality traits.
Terroir refers to the combination of environmental factors that affect the characteristics of crops such as grapevine (Vitis vinifera) according to particular habitats and management practices. This article shows how certain terroir signatures can be detected in the berry metabolome and transcriptome of the grapevine cultivar Corvina using multivariate statistical analysis. The method first requires an appropriate sampling plan. In this case study, a specific clone of the Corvina cultivar was selected to minimize genetic differences, and samples were collected from seven vineyards representing three different macro-zones during three different growing seasons. An untargeted LC-MS metabolomics approach is recommended due to its high sensitivity, accompanied by efficient data processing using MZmine software and a metabolite identification strategy based on fragmentation tree analysis. Comprehensive transcriptome analysis can be achieved using microarrays containing probes covering ~99% of all predicted grapevine genes, allowing the simultaneous analysis of all differentially expressed genes in the context of different terroirs. Finally, multivariate data analysis based on projection methods can be used to overcome the strong vintage-specific effect, allowing the metabolomics and transcriptomics data to be integrated and analyzed in detail to identify informative correlations.
Large-scale data analysis based on the genomes, transcriptomes, proteomes and metabolomes of plants provides unprecedented insight into the behavior of complex systems, such as the terroir characteristics of wine which reflect the interactions between grapevine plants and their environment. Because the terroir of a wine can be distinct even when identical grapevine clones are grown in different vineyards, genomics analysis is of little use because the clonal genomes are identical. Instead it is necessary to look at correlations between gene expression and the metabolic properties of the berries, which determine the quality traits of wine. The analysis of gene expression at the level of the transcriptome benefits from the similar chemical properties of all transcripts, which facilitates quantitative analysis by exploiting universal characteristics such as hybridization to immobilized probes on microarrays. In contrast, universal analytical methods in proteomics and metabolomics are more challenging because of the huge physical and chemical diversity of individual proteins and metabolites. In the case of metabolomics this diversity is even more extreme because individual metabolites differ vastly in size, polarity, abundance and volatility, so no single extraction process or analytical method offers a holistic approach.
Among the analytical platforms suitable for non-volatile metabolites, those based on high performance liquid chromatography coupled to mass spectrometry (HPLC-MS) are much more sensitive than alternatives such as HPLC with ultraviolet or diode array detectors (HPLC-UV, HPLC-DAD) or nuclear magnetic resonance (NMR) spectroscopy, but quantitative analysis by HPLC-MS can be influenced by phenomena such as the matrix effect and ion suppression/enhancement1-3. The investigation of such effects during the analysis of Corvina grape berries by HPLC-MS using an electrospray ionization source (HPLC-ESI-MS), showed that sugars and other molecules with the lowest retention times were strongly underreported, probably also reflecting the large number of molecules in this zone, and that the abundance of other molecules could be underestimated, overestimated or unaffected by the matrix effect, but the data normalization for the matrix effect seemed to have limited impact on the overall results4,5. The method described herein is optimized for the analysis of medium-polarity metabolites that accumulate at high levels in grape berries during ripening, and which are significantly impacted by the terroir. They include anthocyanins, flavonols, flavan-3-ols, procyanidins, other flavonoids, resveratrol, stilbenes, hydroxycinnamic acids and hydroxybenzoic acids, which together determine the color, taste and health-related properties of wines. Other metabolites, such as sugars and aliphatic organic acids, are ignored because quantitation by HPLC-MS is unreliable due to the matrix effect and ion suppression phenomena5. Within the polarity range selected by this method, the approach is untargeted in that it aims to detect as many different metabolites as possible6.
Transcriptomics methods that allow thousands of grapevine transcripts to be monitored simultaneously are facilitated by the availability of the complete grapevine genome sequence7,8. Early transcriptomics methods based on high-throughput cDNA sequencing have evolved with the advent of next-generation sequencing into a collection of procedures collectively described as RNA-Seq technology. This approach is rapidly becoming the method of choice for transcriptomics studies. However, a large body of literature based on microarray, which allow thousands of transcripts to be quantified in parallel by hybridization, has accumulated for grapevine. Indeed, before RNA-Seq became a mainstream technology, many dedicated commercial microarray platforms had been developed allowing grapevine transcriptome to be inspected in great detail. Among the vast variety of platforms, only two allowed genome-wide transcriptome analysis9. The most evolved array allowed the hybridization of up to 12 independent samples on a single device, thus reducing the costs of each experiment. The 12 sub-arrays each comprised 135,000 60-mer probes representing 29,549 grapevine transcripts. This device has been used in a large number of studies10-24. These two platforms have now been discontinued but a new custom microarray has recently been designed and represents a more recent development as it contains an even greater number of probes representing additional newly discovered grapevine genes25.
The large-sale datasets produced by transcriptomics and metabolomics analysis require suitable statistical methods for data analysis, including multivariate techniques to determine correlations between different forms of data. The most widely used multivariate techniques are those based on projection, and these can be unsupervised, such as principal component analysis (PCA), or supervised, such as bidirectional orthogonal projection to latent structures discriminant analysis (O2PLS-DA)26. The protocol presented in this article utilizes PCA for exploratory data analysis and O2PLS-DA to identify differences between groups of samples.
1. Select Appropriate Materials and Construct a Sampling Plan
AM | BA | BM | CS | FA | MN | PM | |
Macro-zone | Soave | Lake Garda | Valpolicella | Lake Garda | Valpolicella | Valpolicella | Soave |
Height (m) | 250 | 120 | 450 | 100 | 130 | 250 | 130 |
Rootstock | 41B | S04 | K5BB | 420A | 420A | K5BB | 41B |
Row direction | E-W | N-S | E-W | E-W | E-W | N-S | N-S |
Training system | Overhead System (Pergola) | Overhead System (Pergola) | Vertical Shoot Positioning (Guyot) | Overhead System (Pergola) | Overhead System (Pergola) | Vertical Shoot Positioning (Guyot) | Vertical Shoot Positioning (Guyot) |
Soil type | Silty clay | Loam | Clay | Loam | Clay Loam | Silt loam | Clay loam |
Planting layout (m) | 3.20 x 1.00 | 4.50 x 0.80 | 4.00 x 1.25 | 3.50 x 1.20 | 3.50 x 0.75 | 2.80 x 1.00 | 1.80 x 0.80 |
Total lime % | 3.9 | 19.3 | 18.3 | 14.4 | 31 | 5.9 | 27.9 |
Active lime % | 0.5 | 2.6 | 9.4 | 6.3 | 11.3 | 3.1 | 8.3 |
Sand % | 15 | 47 | 66 | 42 | 29 | 13 | 36 |
Loam % | 43 | 36 | 21 | 37 | 39 | 67 | 36 |
Clay % | 42 | 17 | 13 | 21 | 32 | 20 | 28 |
Soil pH | 8.3 | 7.9 | 7.8 | 8.2 | 8.2 | 7.8 | 7.9 |
Organic substance (%) | 2.9 | 2.5 | 2.2 | 1.2 | 2.9 | 1.6 | 2.5 |
Exchangeable phosphorus (mg/kg) | 26 | 73 | 73 | 68 | 48 | 47 | 64 |
Exchangeable potassium (mg/kg) | 190 | 376 | 620 | 230 | 168 | 154 | 126 |
Exchangeable magnesium (mg/kg) | 272 | 468 | 848 | 623 | 294 | 293 | 183 |
Exchangeable calcium (mg/kg) | 6500 | 5380 | 7358 | 6346 | 4652 | 10055 | 2878 |
Berry Reducing Sugars 2006 | 211.25 ± 1.20 | 176.20 ± 0.42 | 187.40 ± 0.00 | 203.70 ± 1.13 | 212.55 ± 0.64 | 195.20 ± 0.00 | 211.65 ± 0.64 |
Berry Reducing Sugars 2007 | 190.00 ± 1.27 | 165.25 ± 0.49 | 153.00 ± 0.42 | 203.60 ± 0.71 | 210.90 ± 0.71 | 192.25 ± 0.64 | 188.70 ± 1.84 |
Berry Reducing Sugars 2008 | 191.35 ± 0.64 | 178.90 ± 0.57 | 170.05 ± 0.49 | 205.15 ± 1.48 | 188.70 ± 0.57 | 169.35± 0.49 | 108.05 ± 1.06 |
Berry pH 2006 | 3.01 ± 0.01 | 2.96 ± 0.01 | 2.84 ± 0.00 | 2.9 ± 0.00 | 2.98 ± 0.00 | 3.02 ± 0.00 | 3.06 ± 0.01 |
Berry pH 2007 | 2.97 ± 0.00 | 3.00 ± 0.00 | 2.74 ± 0.00 | 3.07 ± 0.01 | 2.98 ± 0.00 | 2.87 ± 0.01 | 3.09 ± 0.00 |
Berry pH 2008 | 2.83 ± 0.00 | 3.04 ± 0.01 | 2.71 ± 0.00 | 2.98 ± 0.01 | 2.98 ± 0.00 | 2.82 ± 0.00 | 3.11 ± 0.00 |
Harvesting Date | 2006 | 2007 | 2008 | ||||
Veraison | 8-Aug | 18-Jul | 12-Aug | ||||
Mid Ripening | 4-Sep | 8-Aug | 2-Sep | ||||
Ripe | 18-Sep | 29-Aug | 23-Sep |
Table 1: Principal features of each vineyard and sample collection dates. m = meters, E-W = Eat-West, N-S = North-South.
Figure 1: Schematic representation of the sampling procedure. The three wine production macro-zones are located in the surroundings of the city of Verona, Veneto region, Italy. The three time points are veraison (V) representing the onset of ripening in viticulture, mid-ripening (MR) and ripe berries (R). Please click here to view a larger version of this figure.
2. Prepare Berry Powder Extracts, Analyze the Metabolites and Process the Data
Mass spectrometer components | Function | Parameters |
Electrospray Ionization Source | Nebulizing Gas | 50 psi, 350 °C |
Drying gas | 10 L min-1 | |
Ion trap and detector | Scan | Full scan mode, 13,000 m/z per second, range 50-1,500 m/z |
Target mass | 400 m/z | |
Collision Gas | Helium | |
Vacuum pressure | 1.4 x 10-5 mbar | |
Capillary source | +4,000 V | |
End plate offset | -500 V | |
Skimmer | -40 V | |
Cap exit | -121 V | |
Oct 1 DC | -12 V | |
Oct 2 DC | -1.7 V | |
Lens 1 | +5 V | |
Lens 2 | +60 V | |
ICC for positive ionization mode | 20.000 | |
ICC for negative ionization mode | 7,000 |
Table 2: Principal parameter set for acquiring mass spectra.
Operation | Selection | Function | Parameters | Values |
Peak Detection | Mass detection | Centroid | Noise level | 3,500 |
Chromatogram builder | Highest data point | min time span | 0.15 | |
min height | 4,000 | |||
m/z tolerance | 0.3 | |||
Peak Detection | Peak deconvolution | Local minimum search | Chromatographic threshold | 70 |
Search minimum in RT range (min) | 0:50 | |||
Minimum relative height | 15% | |||
Minimum absolute height | 4,000 | |||
Min ratio of peak top/edge | 2 | |||
Duration range (min) | 0-10 | |||
Isotopes | Isotopic peaks grouper | - | m/z tolerance | 1.2 |
RT tolerance | 0:50 | |||
monotonic shape | No | |||
Maximum charge | 3 | |||
Representative isotope | No | |||
Alignment | Join aligner | - | m/z tolerance | 1.2 |
Weight for m/z | 10 | |||
Retention time tolerance | 0:50 | |||
Weight for RT | 5 | |||
Require same charge state | No | |||
Require same ID | No | |||
Compare isotope pattern | No | |||
Gap filling | Peak finder | - | Intensity tolerance | 20% |
m/z tolerance | 0.9 | |||
Retention time tolerance | 0:40 | |||
RT correction | No | |||
Filtering | Duplicate Peak filter | - | m/z tolerance | 1.2 |
RT tolerance | 0:30 | |||
Require same identification | No |
Table 3: Mzmine workflow with specific values to process negative LC-MS grape berry data files.
3. Prepare Berry Powder Extracts for Transcriptome Analysis and Process the Data
Metric Name | Upper Limit | Lower Limit | Description |
AnyColorPrcntFeatNonUnif | 1.00 | NA | Percentage of features that are feature non-uniformity outliers in either channel |
DetectionLimit | 2.00 | 0.10 | Average plus 1 standard deviation of the spike ins below the linear concentration range |
absGE1E1aSlope | 1.20 | 0.90 | Absolute of slope of fit for Signal vs. Concentration of E1a probes |
MedCVProcSignal | 8.00 | NA | Median %CV for the Processed Signal |
gNegCtrlAveBGSubSig | 5.00 | -10.00 | Average of background subtracted signal of all inlier negative controls (BGSubSignal is calculated by substracting a value called BGUsed from the feature mean signal) |
gNegCtrlAveNetSig | 40.00 | NA | Average of net signal of all inlier negative controls |
gNegCtrlSDevBGSubSig | 10.00 | NA | Standard deviation of background subtracted signals of all inlier negative controls |
gNonCntrlMedCVProcSignal | 8.00 | NA | Median %CV for the Processed Signal of the non-control probes |
gSpatialDetrendRMSFilter | 15.00 | NA | Residual of background detrending fit |
Table 4: Principal parameters to be checked to verify the quality of microarray hybridization.
4. Carry Out the Detailed Statistical Analysis of the Metabolomics and Transcriptomics Data
The case study described in this article yielded a final data matrix comprising 552 signals (m/z features) including molecular ions plus their isotopes, adducts and some fragments, relatively quantified among 189 samples (7 vineyards x 3 ripening stages x 3 growing seasons x 3 biological replicates). The total number for data points was therefore 104,328. Fragmentation tree analysis resulted in the annotation of 282 m/z features, corresponding to metabolites plus adducts...
This article describes the metabolomics, transcriptomics and statistical analysis protocols used to interpret the grape berry terroir concept. Metabolomics analysis by HPLC-ESI-MS is sensitive enough to detect large numbers of metabolites simultaneously, but relative quantitation is affected by the matrix effect and ion suppression/enhancement. However, a similar approach has already been used to describe the ripening and post-harvest withering of Corvina berries, and the correction of matrix effects had a limited impact...
The authors have nothing to disclose.
This work benefited from the networking activities coordinated within the EU-funded COST ACTION FA1106 "An integrated systems approach to determine the developmental mechanisms controlling fleshy fruit quality in tomato and grapevine". This work was supported by the 'Completamento del Centro di Genomica Funzionale Vegetale' project funded by the CARIVERONA Bank Foundation and by the 'Valorizzazione dei Principali Vitigni Autoctoni Italiani e dei loro Terroir (Vigneto)' project funded by the Italian Ministry of Agricultural and Forestry Policies. SDS was financed by the Italian Ministry of University and Research FIRB RBFR13GHC5 project "The Epigenomic Plasticity of Grapevine in Genotype per Environment Interactions".
Name | Company | Catalog Number | Comments |
Mill Grinder | IKA | IKA A11 basic | |
HPLC Autosampler | Beckman Coulter | - | System Gold 508 Autosampler |
HPLC System | Beckman Coulter | - | System Gold 127 Solvent Module HPLC |
C18 Guard Column | Grace | - | Alltima HP C18 (7.5 mm x 2.1 mm; 5 μm) Guard Column |
C18 Column | Grace | - | Alltima HP C18 (150 mm x 2.1 mm; 3 μm) Column |
Mass Spectometer | Bruker Daltonics | - | Bruker Esquire 6000; The mass spectometer was equipped with an ESI source and the analyzer was an ion trap. |
Extraction solvents and HPLC buffers | Sigma | 34966 | Methanol LC-MS grade |
Sigma | 94318 | Formic acid LC-MS grade | |
Sigma | 34967 | Acetonitrile LC-MS grade | |
Sigma | 39253 | Water LC-MS grade | |
Minisart RC 4 Syringe filters (0.2 μm) | Sartorius | 17764 | |
Softwares for data collection (a) and processing (b) | Bruker Daltonics | - | Bruker Daltonics Esquire 5.2 Control (a); Esquire 3.2 Data Analysis and MzMine 2.2 softwares (b) |
Spectrum Plant Total RNA kit | Sigma-Aldrich | STRN250-1KT | For total RNA extractino from grape pericarps |
Nanodrop 1000 | Thermo Scientific | 1000 | |
BioAnalyzer 2100 | Agilent Technologies | G2939A | |
RNA 6000 Nano Reagents | Agilent Technologies | 5067-1511 | |
RNA Chips | Agilent Technologies | 5067-1511 | |
Agilent Gene Expression Wash Buffer 1 | Agilent Technologies | 5188-5325 | |
Agilent Gene Expression Wash Buffer 2 | Agilent Technologies | 5188-5326 | |
LowInput QuickAmp Labeling kit One-Color | Agilent Technologies | 5190-2305 | |
Kit RNA Spike In - One-Color | Agilent Technologies | 5188-5282 | |
Gene Expression Hybridization Kit | Agilent Technologies | 5188-5242 | |
RNeasy Mini Kit (50) | Qiagen | 74104 | For cRNA Purification |
Agilent SurePrint HD 4X44K 60-mer Microarray | Agilent Technologies | G2514F-048771 | |
eArray | Agilent Technologies | - | https://earray.chem.agilent.com/earray/ |
Gasket slides | Agilent Technologies | G2534-60012 | Enable Agilent SurePrint Microarray 4-array Hybridization |
Thermostatic bath | Julabo | - | |
Hybridization Chamber | Agilent Technologies | G2534-60001 | |
Microarray Hybridization Oven | Agilent Technologies | G2545A | |
Hybridization Oven Rotator Rack | Agilent Technologies | G2530-60029 | |
Rotator Rack Conversion Rod | Agilent Technologies | G2530-60030 | |
Staining kit | Bio-Optica | 10-2000 | Slide-staining dish and Slide rack |
Magnetic stirrer device | AREX Heating Magnetic Stirrer | F20540163 | |
Thermostatic Oven | Thermo Scientific | Heraeus - 6030 | |
Agilent Microarray Scanner | Agilent Technologies | G2565CA | |
Scanner Carousel, 48-position | Agilent Technologies | G2505-60502 | |
Slide Holders | Agilent Technologies | G2505-60525 | |
Feature extraction software v11.5 | Agilent Technologies | - | inside the Agilent Microarray Scanner G2565CA |
SIMCA + V13 Software | Umetrics |
Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE
Zapytaj o uprawnieniaThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone