JoVE Logo

Zaloguj się

Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.

W tym Artykule

  • Podsumowanie
  • Streszczenie
  • Wprowadzenie
  • Protokół
  • Wyniki
  • Dyskusje
  • Ujawnienia
  • Podziękowania
  • Materiały
  • Odniesienia
  • Przedruki i uprawnienia

Podsumowanie

The protocol describes a method of predicting o-cresol concentration during the production of polyphenylene ether using near-infrared spectroscopy and partial least squares regression. To describe the process more clearly and completely, an example of predicting the o-cresol concentration during the production of polyphenylene is used to clarify the steps.

Streszczenie

Unlike macroscopic process variables, near-infrared spectroscopy provides process information at the molecular level and can significantly improve the prediction of the components in industrial processes. The ability to record spectra for solid and liquid samples without any pretreatment is advantageous and the method is widely used. However, the disadvantages of analyzing high-dimensional near-infrared spectral data include information redundancy and multicollinearity of the spectral data. Thus, we propose to use partial least squares regression method, which has traditionally been used to reduce the data dimensionality and eliminate the collinearity between the original features. We implement the method for predicting the o-cresol concentration during the production of polyphenylene ether. The proposed approach offers the following advantages over component regression prediction methods: 1) partial least squares regression solves the multicollinearity problem of the independent variables and effectively avoids overfitting, which occurs in a regression analysis due to the high correlation between the independent variables; 2) the use of the near-infrared spectra results in high accuracy because it is a non-destructive and non-polluting method to obtain information at microscopic and molecular scales.

Wprowadzenie

Near infrared (NIR) spectroscopy (NIRS) has gained wide acceptance as a fast, efficient, non-destructive, and non-polluting modern analytical technology; the method has been used during the past several years for product quality detection and analysis and chemical component measurement in industrial processes. The most essential specialty of the method is its ability to record spectra for solid and liquid samples without any pre-processing, making NIRS especially suitable for the direct and rapid detection and analysis of natural and synthetic products1,2. Unlike traditional sensors that measure process variables (e.g., temperature, pressure, liquid level, etc.) at a macroscopic scale and inevitably suffer the external noise and background interference, NIRS detects the structural information of the chemical composition at microscopic and molecular scales. Thus, essential information can be measured more accurately and effectively than with other methods3,4.

Polyphenyl ether, as one of the engineering plastics, are widely used due to its heat resistance, flame retardant, insulation, electrical properties, dimensional stability, impact resistance, creep resistance, mechanical strength and other properties5. More importantly, it is non-toxic and harmless compared to other engineering plastics. At present, 2,6-xylenol is one of the basic raw materials for the synthesis of polyphenylene ether, and it is usually prepared by catalyzed alkylation of phenol with methanol method6. There are two main products of this preparation method, o-cresol and 2,6-xylenol. After a series of separation and extraction steps, 2,6 xylenol is used to produce polyphenylene ether. However, trace amounts of o-cresol remain in 2,6-xylenol. O-cresol does not participate in the synthesis of polyphenylene ether and will remain in the polyphenylene ether product, resulting in a decrease in product quality or even the substandard. At present, most companies still analyze the compositions of complex organic mixtures such as liquid phase polyphenyl ether products containing impurities (e.g., o-cresol) by physical or chemical separation analysis such as chromatography7,8. The separation principle of chromatography is the use of the mixture of compositions in the fixed phase and the flow phase in the dissolution, analysis, adsorption, desorption or other affinity of the minor differences in the performance. When the two phases move relative to each other, the compositions are separated by the above actions repeatedly in the two phases. Depending on the object, it usually takes a few minutes to a few tens of minutes to complete a complex material separation operation. It can be seen that the measurement efficiency is low.

Nowadays, the measurement of product quality and the advanced control technology based on this analysis for the modern fine process chemical materials industry is the key direction to further improve product quality. In the process industry of polyphenyl ether production, real-time measurement of o-cresol content in polyphenylene ether product is of great development significance. Chromatographic analysis clearly cannot meet the requirements of advanced control technology for real-time measurement of substances and signal feedback. Therefore, we propose the partial least squares regression (PLSR) method to establish a linear model between the NIRS data and the o-cresol concentration, which realize the online measurement of o-cresol content in the liquid polyphenylene ether product of outlet.

The pre-processing for NIRS plays the most important role prior to multivariate statistical modeling. NIRS wavenumbers in the NIR spectrum and the particle sizes of biological samples are comparable, so it is known for unexpected scatter effects that has influence on the recorded sample spectra. By performing appropriate pre-processing methods, these effects are easy to be eliminated largely9. The most commonly used pre-processing techniques in NIRS are categorized as scatter correction and spectral derivative methods. First group of methods includes multiplicative scatter correction, detrending, standard normal variate transformations, and normalization. The spectral derivation methods include the use of the first and second derivatives.

Prior to developing a quantitative regression model, it is important to remove the unsystematic scatter variations from the NIRS data because they have a significant influence on the accuracy of the predictive model, its complexity and parsimony. The selection of a suitable pre-processing method should always depend on the subsequent modeling step. Here, if the NIR spectral dataset does not follow the Lambert-Beer law, then other factors tend to compensate for the non-ideal behavior of the prediction for predicted components. The disadvantage of the existence of such needless factors leads to the increase of model complexity, even most likely, a reduction in the robustness. Thus, the application of spectral derivatives and a conventional normalization to the spectral data is an essential part of the method.

After spectral preprocessing, the NIRS data with a high signal-to-noise ratio and low background interference are obtained. Modern NIRS analysis provides the rapid acquisition of large amounts of absorbance over an appropriate spectral range. The chemical composition of the sample is then predicted by extracting the relevant variables using the information contained in the spectral curve. Generally, NIRS is combined with multivariate analysis techniques for qualitative or quantitative analyses10. A multivariate linear regression (MLR) analysis is commonly used for developing and mining the mathematical relationship between the data and the components in industrial processes and has been widely used in NIRS analysis.

However, there are two fundamental problems when implementing an MLR for preprocessed NIRS data. One problem is the variable redundancy. The high dimensionality of the NIRS data often renders the prediction of a dependent variable unreliable because variables are included that have no correlation with the components. These redundant variables reduce the information efficiency of the spectral data and affect the accuracy of the model. In order to eliminate the variable redundancy, it is essential to develop and maximize the correlation between the NIRS data and the predicted components.

Another problem is the issue of multicollinearity in the NIRS data. One of the important assumptions of multiple linear regression models is that there is no linear relationship between any of the explanatory variables of the regression model. If this linear relationship exists, it is proved that there is multicollinearity in the linear regression model and the assumption is violated. In multiple linear regressions, such as an ordinary least squares regression (OLSR), multiple correlations between the variables affect the parameter estimation, increase the model error, and affect the stability of the model. To eliminate the multilinear correlation between the NIR spectral data, we use variable selection methods that maximize the inherent variability of the samples.

Here, we propose to use the PLSR, which is a generalization of multiple linear regression that has been widely used in the field of NIRS11,12. The PLSR integrates the basic functions of the MLR, canonical correlation analysis (CCA), and principal component analysis (PCA) and combines the forecasting analysis with a non-model data connotation analysis. The PLSR can be divided into two parts. The first part selects the components of the characteristic variables and the predicted components by partial least squares analysis (PLS). PLS maximizes the inherent variability of principal components by making the covariance of the principal components and predicted components as large as possible when extracting the principal components. Next, the OLSR model of o-cresol concentration is established for the principal components selected. PLSR is suitable for the analysis of noisy data with numerous independent variables that are strongly collinear and highly correlated and for the simultaneous modeling of several response variables. Also, PLSR extracts the effective information of the sample spectra, overcomes the problem of multicollinearity, and has the advantages of strong stability and high prediction accuracy13,14.

The following protocol describes the process of using the PLSR model for measuring the o-cresol concentration using NIR spectral data. The reliability and accuracy of the model are evaluated quantitatively by using the determination coefficient (figure-introduction-9341), the prediction correlation coefficient (figure-introduction-9451) and the mean square prediction error of cross-validation (MSPECV). Moreover, to intuitively show the advantages of the PLSR, the evaluation indicators are visualized in several plots for a qualitative analysis. Finally, evaluation indicators of an experiment are presented in table format to quantitatively illustrate the reliability and precision of the PLSR model.

Protokół

1. NIR spectrum data acquisition with Fourier transform (FT)-NIR process spectrometer

  1. Install the liquid phase optical fiber probe of the near-infrared spectrometer at the outlet of the polyphenyl ether product. And open the OPUS software on the upper computer connected to the instrument and start to configure the measurement.
  2. Connecting to spectrometer
    1. On the Measure menu, select the Optic Setup and Service command, or click the icon from the toolbar.
    2. On the dialog that opens, click the Optical Bench tab.
    3. Check whether the spectrometer settings are ok. If yes, close the dialog. If no, continue with step 4.
    4. From the Configuration drop-down list, select the particular spectrometer type.
    5. Enter the spectrometer’s IP address into the Optical Bench URL entry field.
    6. Click the Connect button.
  3. Setting up measurement parameters
    1. On the Measure menu, select the Measurement command, or click the icon from the toolbar.
    2. On the dialog that opens, define the measurement parameters on the different tabs.
      NOTE: Details on the individual measurement parameters are described in the OPUS Reference Manual.
    3. Click the Accept & Exit button.
  4. Storing experiment file
    1. On the Measure menu, select the Advanced Measurement command. Then, click the Advanced tab.
    2. On the dialog that opens, define the resolution as 4 cm-1.
    3. Define the number of scans as 16 scans in the Sample/Background Scan Time entry fields.
    4. Define the path to automatically store the measuring data from 4,000 cm-1-12,500 cm-1.
    5. Determine the data type for the result spectrum as Absorbance.
    6. Click the Save button.
    7. On the dialog that opens, define a name for the experiment file and save this name.
  5. Measuring background spectrum
    1. On the Measure menu, select the Advanced Measurement command.
    2. Click the Optic tab.
    3. On the dialog that opens, click the Aperture setting drop-down list and select the same value used to acquire a sample spectrum.
    4. Click the Basic tab.
    5. On the dialog that opens, click the Background Single Channel button.
  6. Measuring sample spectrum
    1. Place the sample into the optical path of the spectrometer. The way in which this is done depends on the spectrometer configuration.
    2. On the Measure menu, select the Advanced Measurement command.
    3. Click the Basic tab.
    4. On the dialog that opens, define the sample description and sample form in the particular entry field. This information is stored together with the spectrum.
    5. Click the Sample Single Channel button to start online measurement. And save the NIR spectrum of each scan as OPUS file.
  7. Collect the polyphenylene samples every 6 h and test the o-cresol concentration with liquid chromatography in the laboratory of industry to obtain a chemical reference value.
    NOTE: Laboratory staff of industry field take each polyphenyl ether sample from the outlet of the liquid phase polyphenyl ether. The o-cresol content in each sample was measured three times by liquid chromatography. Then, the mean value of the results of the three times analysis was taken as the reference value of the o-cresol content to reduce the accidental error.
  8. Obtain 600 chemical reference values of o-cresol concentration in the laboratory. The calibration range of o-cresol concentration is from 42.1063 mg/1 g polyphenyl ether product to 51.6763 mg/1 g polyphenyl ether product.
  9. Combine the NIR spectra at the given test times with the chemical reference values of the o-cresol concentration.
  10. Use the software OPUS to read the original spectral set as shown in Figure 1.
    1. On the File menu, click the Load File command.
    2. On the dialog that opens, select the particular spectrum file.
    3. Click the Open button. The spectrum is displayed in the spectrum window.

2. NIR spectroscopy data pre-processing

  1. With the spectral preprocessing function in, obtain spectral dataset preprocessed with first-order derivative.
    1. Open The Unscrambler which is a multivariate data analysis and experimental design software, select the Import command under File. Import the OPUS file as original NIR spectral dataset.
    2. Select Transform command under Modify. And select the Savitzky Golay Derivatives under Derivatives.
    3. Define the Samples and Variables as All Samples and All Variables in Scope. And define the number of Smoothing points as 13 and the Derivative as 1st derivative in Parameters.
    4. Click OK to start the derivative.
      CAUTION: The increase of smoothness can reduce the sharp fluctuations of the curve, reduce the noise effect but also weaken the characteristics of the curve and make the curve distorted. Therefore, the appropriate smoothness selected according to the observation of the actual fluctuation intensity of the curve and the effect after processing.
  2. Perform vector normalization on the sample spectra to normalize the value of the absorbance.
    1. Select the Normalization command under Modify.
    2. Define the Samples and Variables as All Samples and All Variables in Scope.
    3. Select Vector normalization in the Type.
    4. Click OK to perform vector normalization.

3. Establishment of PLSR model

  1. Creation of the NIR spectral data set
    1. Open Uncrambler.exe, select Export under File with the Matlab files to export the preprocessed spectral data set into .mat File and to obtain the spectral data set X automatically with 2203 variables.
    2. Obtain a complete NIR spectral dataset X (a matrix of 600 rows and 2203 columns) and the corresponding chemical reference values Y (a vector of 600 rows) in the form of .mat file for subsequent analysis and modeling.
  2. Selection of the appropriate number of principal components
    1. Open Matlab and import the .mat file containing the preprocessed near-infrared spectral data into the workspace by dragging the .mat file to the workspace.
      NOTE: The .mat file stores the near-infrared spectral data X as an independent variable and the o-cresol content of the product as a dependent variable in the form of two matrices.
    2. Open the programmed .m file in the Editor. Click Open under the Editor option, select the compiled .m file in the file storage directory, and then click Confirm.
    3. Extract 15 principal components according to the optimization objective of Equation 1 and the OLSR model between the extracted principal components and the predicted values of the o-cresol concentration with the program containing the command plsregress() in Matlab.
      [XL, YL, XS, YS, BETA, PCTVAR, MSE] = plsregress(X,Y,ncomp,’CV’,k);
      Consult the MATLAB help document to get the usage details and the return value.
      NOTE: figure-protocol-8650 Equation 1figure-protocol-8762
      figure-protocol-8838 , and figure-protocol-8912 is the ith principal components of the NIR spectral data;
      figure-protocol-9066 is the projection of the ith principal components of the NIR spectral data;
      figure-protocol-9238 is the Pearson correlation coefficient for the ith principal components and the o-cresol concentration.
    4. Obtain the figure-protocol-9452 value of the NIR spectral data and the predicted values for the different principal components using Equation 2.
      NOTE: figure-protocol-9664 Equation 2
      figure-protocol-9768 is the sum of squares due to error and is defined as figure-protocol-9891 ;
      figure-protocol-9971 is the total sum of squares and is defined as figure-protocol-10087;
      figure-protocol-10166 is the reference value of the o-cresol concentration of test dataset;
      figure-protocol-10314 is the predicted value of the o-cresol concentration of test dataset;
      figure-protocol-10462 is the mean value of reference value of the o-cresol concentration of test dataset;
      figure-protocol-10624 is the number of samples of test dataset.
    5. Determine the figure-protocol-10761 values and the trend with increasing number of principal components as shown in Figure 2. Select 10 as the appropriate number of principal components with the figure-protocol-11018 value of 0.9917.
      NOTE:figure-protocol-11116 value is the proportion of the variance in the dependent variable that is predictable by the independent variables. The higher the figure-protocol-11317 value is, the higher the goodness-of-fit is and vice versa.
  3. Validation of the goodness-of-fit and accuracy of the PLSR model with 10 principal components by using the command plsregress().
    1. Repeat the modeling process with 10 principal components as steps 3.2.1-3.2.5 with 10 principal components.
    2. Evaluate the model based on a 10-fold cross-validation using the plots of the percent variance explained in the NIR spectral data, the residuals, and the MSPECV.
    3. Plot the percent variance explained in NIR spectral data, the residuals, and the MSPECV as Figures 3, 4, and 5.
    4. Tabulate the evaluation indicators of figure-protocol-12140,figure-protocol-12208, and MSPE of 10-fold cross validation for the PLSR model for a quantitative analysis as shown in Table 1.
      NOTE: The equations of figure-protocol-12430 and MSPE are shown as Equation 3 and Equation 4.
      figure-protocol-12589 Equation 3
      figure-protocol-12678 Equation 4
      figure-protocol-12767 is the covariance of reference value and predicted value of o-cresol concentration; figure-protocol-12921 is the standard deviation of reference value of o-cresol concentration;
      figure-protocol-13071 is the standard deviation of predicted value of o-cresol concentration.

Wyniki

The predicted value of o-cresol Impurity in polyphenyl ether products is obtained by PLSR-based near-infrared spectroscopy. Figure 2 and Figure 3 respectively show the reliability of the method in the feature selection stage from the curve of the decision coefficient and the error interpretation percentage increasing with the number of principal components.

Specifically, please note that in the ...

Dyskusje

This protocol describes the process of performing the PLSR on the measurement of the o-cresol concentration remaining in the liquid product of polyphenylene ether with NIRS.

The two critical steps in this process are the pre-processing of the original NIR spectral data and the variables selection of the high-dimensional NIR spectral data.

Generally, the non-systematic background interference leads to the non-systematic scattering deviation or baseline drift of NIR s...

Ujawnienia

The authors have nothing to disclose.

Podziękowania

This work was supported by the National Natural Science Foundation of China (Grant Nos. 61722306 and 61473137) and National First-class Discipline Program of Light Industry Technology and Engineering (LITE2018-025).

Materiały

NameCompanyCatalog NumberComments
MPA II Multi Purpose FT-NIR AnalyzerBruker1
Fiber Optic Probes(Liquid phase)Bruker1
Liquid chromatography analyzer /1
Laboratory Equipment and Supplies(e.g. test tube, etc.)/
MATLABMathWork1
OPUSBruker1
Principal computerDELL1
The UnscramblerCAMO1

Odniesienia

  1. Nicolai, B. M., et al. Nondestructive measurement of fruit and vegetable quality by means of NIR spectroscopy: A review. Postharvest Biology and Technology. 46 (2), 99-118 (2007).
  2. Chang, C. W., Laird, D. A., Mausbach, M. J., Hurburgh, C. R. Near-infrared reflectance spectroscopy-principal components regression analyses of soil properties. Soil Science Society of America Journal. 65 (2), 480-490 (2001).
  3. Chen, Y., et al. Near-infrared spectroscopy for rapid evaluation of different processing products of Sophora japonica. L. Spectroscopy Letters. 51 (1), 37-44 (2018).
  4. Cayuela, J. A., Garcia, J. F. Nondestructive measurement of squalene in olive oil by near infrared spectroscopy. LWT-FOOD SCIENCE AND TECHNOLOGY. 88, 103-108 (2018).
  5. Joaquim, M., Rudnick, R. L., Shubkin, R. L. Polyphenyl Ether Lubricants. Synthetic Lubricants and High-performance Functional. , 239 (1999).
  6. Grabowska, H., Kaczmarczyk, W., Wrzyszcz, J. Synthesis of 2,6-Xylenol by Alkylation of Phenol with Methanol. Applied Catalysis. 47 (2), 351-355 (1989).
  7. Jeon, D. B., et al. Determination of volatile organic compounds, catechins, caffeine and theanine in Jukro tea at three growth stages by chromatographic and spectrometric methods. FOOD CHEMISTRY. 219, 443-452 (2016).
  8. Davidyuk, E. I., Demchenko, V. F., Klisenko, M. A. Rapid group separation and identification of chlorinated organic compounds by high performance liquid chromatography. JOURNAL OF ANALYTICAL CHEMISTRY. 52 (11), 1058-1065 (1997).
  9. Rinnan, A., Berg, F., Engelsen, S. B. Review of the most common pre-processing techniques for near-infrared spectra. TrAC Trends in Analytical Chemistry. 28 (10), 1201-1222 (2009).
  10. Zou, X. B., Zhao, J. W., Povey, M. J. W., Holmes, M., Mao, H. P. Variables selection methods in near-infrared spectroscopy. Analytica Chimica Acta. (1-2), 14-32 (2010).
  11. Dunn, B. W., Beecher, H. G., Batten, G. D., Ciavarella, S. The potential of near-infrared reflectance spectroscopy for soil analysis - a case study from the Riverine Plain of south-eastern Australia. Australian Journal of Experimental Agriculture. 42 (5), 607-614 (2002).
  12. Wang, C. K., Zhang, T. L., Pan, X. Z. Potential of visible and near-infrared reflectance spectroscopy for the determination of rare earth elements in soil. Geoderma. 306, 120-126 (2017).
  13. Gatius, F., Miralbes, C., David, C., Puy, J. Comparison of CCA and PLS to explore and model NIR data. Chemometrics and Intelligent Laboratory Systems. , 76-82 (2017).
  14. Wold, S., Sjostrom, M., Eriksson, L. PLS-regression: a basic tool of chemometrics. Chemometrics & Intelligent Laboratory. 58 (2), 109-130 (2001).
  15. Douglas, R. K., Nawar, S., Alamar, M. C., Mouazen, A. M., Coulon, F. Rapid prediction of total petroleum hydrocarbons concentration in contaminated soil using vis-NIR spectroscopy and regression techniques. SCIENCE OF THE TOTAL ENVIRONMENT. 616, 147-155 (2017).
  16. Grassi, S., Alamprese, C. Advances in NIR spectroscopy applied to process analytical technology in food industries. CURRENT OPINION IN FOOD SCIENCE. 22 (SI), 17-21 (2018).
  17. Trung, T., Downes, G., Meder, R., Allison, B. Pulp mill and chemical recovery control with advanced analysers - from trees to final product. APPITA. 68 (1), 39-46 (2015).
  18. Vann, L., Sheppard, J. Use of near-infrared spectroscopy (NIRs) in the biopharmaceutical industry for real-time determination of critical process parameters and integration of advanced feedback control strategies using MIDUS control. Journal of Industrial Microbiology& Biotechnology. 44 (12), 1589-1603 (2017).
  19. Modrono, S., Soldado, A., Martinez-Fernandez, A., de la Roza-Delgado, B. Handheld NIRS sensors for routine compound feed quality control: Real time analysis and field monitoring. TALANTA. 162, 597-603 (2017).

Przedruki i uprawnienia

Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE

Zapytaj o uprawnienia

Przeglądaj więcej artyków

O cresol ConcentrationOnline MeasurementNear infrared SpectroscopyNIR Detection TechnologyPartial Least Squares RegressionOPUS SoftwareAbsorbance SpectrumSpectral Pre processingMultivariate Data AnalysisSavitzky Golay DerivativeExperimental DesignScan Parameters

This article has been published

Video Coming Soon

JoVE Logo

Prywatność

Warunki Korzystania

Zasady

Badania

Edukacja

O JoVE

Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone