When the total least squares(TLS)solution is used to solve the parameters in the errors-in-variables(EIV)model,the obtained parameter estimations will be unreliable in the observations containing systematic errors.To ...When the total least squares(TLS)solution is used to solve the parameters in the errors-in-variables(EIV)model,the obtained parameter estimations will be unreliable in the observations containing systematic errors.To solve this problem,we propose to add the nonparametric part(systematic errors)to the partial EIV model,and build the partial EIV model to weaken the influence of systematic errors.Then,having rewritten the model as a nonlinear model,we derive the formula of parameter estimations based on the penalized total least squares criterion.Furthermore,based on the second-order approximation method of precision estimation,we derive the second-order bias and covariance of parameter estimations and calculate the mean square error(MSE).Aiming at the selection of the smoothing factor,we propose to use the U curve method.The experiments show that the proposed method can mitigate the influence of systematic errors to a certain extent compared with the traditional method and get more reliable parameter estimations and its precision information,which validates the feasibility and effectiveness of the proposed method.展开更多
Scientific forecasting water yield of mine is of great significance to the safety production of mine and the colligated using of water resources. The paper established the forecasting model for water yield of mine, co...Scientific forecasting water yield of mine is of great significance to the safety production of mine and the colligated using of water resources. The paper established the forecasting model for water yield of mine, combining neural network with the partial least square method. Dealt with independent variables by the partial least square method, it can not only solve the relationship between independent variables but also reduce the input dimensions in neural network model, and then use the neural network which can solve the non-linear problem better. The result of an example shows that the prediction has higher precision in forecasting and fitting.展开更多
Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used parti...Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used partial least square regression (PLSR) models to relate forest parameters, i.e. canopy closure density and above ground tree biomass, to Landsat ETM+ data. The established models were optimized according to the variable importance for projection (VIP) criterion and the bootstrap method, and their performance was compared using several statistical indices. All variables selected by the VIP criterion passed the bootstrap test (p〈0.05). The simplified models without insignificant variables (VIP 〈1) performed as well as the full model but with less computation time. The relative root mean square error (RMSE%) was 29% for canopy closure density, and 58% for above ground tree biomass. We conclude that PLSR can be an effective method for estimating canopy closure density and above ground biomass.展开更多
The Laser Induced Breakdown Spectroscopy (LIBS) is a fast, non-contact, no sample preparation analytic technology;it is very suitable for on-line analysis of alloy composition. In the copper smelting industry, analysi...The Laser Induced Breakdown Spectroscopy (LIBS) is a fast, non-contact, no sample preparation analytic technology;it is very suitable for on-line analysis of alloy composition. In the copper smelting industry, analysis and control of the copper alloy concentration affect the quality of the products greatly, so LIBS is an efficient quantitative analysis tech- nology in the copper smelting industry. But for the lead brass, the components of Pb, Al and Ni elements are very low and the atomic emission lines are easily submerged under copper complex characteristic spectral lines because of the matrix effects. So it is difficult to get the online quantitative result of these important elements. In this paper, both the partial least squares (PLS) method and the calibration curve (CC) method are used to quantitatively analyze the laser induced breakdown spectroscopy data which is obtained from the standard lead brass alloy samples. Both the major and trace elements were quantitatively analyzed. By comparing the two results of the different calibration method, some useful results were obtained: both for major and trace elements, the PLS method was better than the CC method in quantitative analysis. And the regression coefficient of PLS method is compared with the original spectral data with background interference to explain the advantage of the PLS method in the LIBS quantitative analysis. Results proved that the PLS method used in laser induced breakdown spectroscopy was suitable for simultaneous quantitative analysis of different content elements in copper smelting industry.展开更多
Accurately approximating higher order derivatives is an inherently difficult problem. It is shown that a random variable shape parameter strategy can improve the accuracy of approximating higher order derivatives with...Accurately approximating higher order derivatives is an inherently difficult problem. It is shown that a random variable shape parameter strategy can improve the accuracy of approximating higher order derivatives with Radial Basis Function methods. The method is used to solve fourth order boundary value problems. The use and location of ghost points are examined in order to enforce the extra boundary conditions that are necessary to make a fourth-order problem well posed. The use of ghost points versus solving an overdetermined linear system via least squares is studied. For a general fourth-order boundary value problem, the recommended approach is to either use one of two novel sets of ghost centers introduced here or else to use a least squares approach. When using either ghost centers or least squares, the random variable shape parameter strategy results in significantly better accuracy than when a constant shape parameter is used.展开更多
水浸出物是茶叶质量评价的重要指标之一。该研究提出利用近红外光谱法结合偏最小二乘算法(Partial least squares,PLS)快速检测乌龙茶中水浸出物含量。利用近红外光谱仪采集60份乌龙茶样品的光谱信息,通过Savitzky-Golay(SG)滤波器对原...水浸出物是茶叶质量评价的重要指标之一。该研究提出利用近红外光谱法结合偏最小二乘算法(Partial least squares,PLS)快速检测乌龙茶中水浸出物含量。利用近红外光谱仪采集60份乌龙茶样品的光谱信息,通过Savitzky-Golay(SG)滤波器对原始光谱数据进行预处理;采用连续投影算法(Successive projections algorithm,SPA)对采集的SG预处理光谱进行特征波长选择,基于SG预处理光谱和SPA法优化的特征光谱建立乌龙茶中水浸出物含量的PLS定量模型。结果显示,利用SPA法优化出14个特征波长建立SPA-PLS模型的性能最佳。在预测集中的相关系数为0.8966,预测均方根误差为0.8034%,剩余预测偏差为4.11。结果表明采用近红外光谱结合SPAPLS算法快速检测乌龙茶中水浸出物含量是可行的。展开更多
As an effective and universal acaricide, amitraz is widely used on beehives against varroasis caused by the mite Varroa jacobsoni. Its residues in honey pose a great danger to human health. In this study, a sensitive,...As an effective and universal acaricide, amitraz is widely used on beehives against varroasis caused by the mite Varroa jacobsoni. Its residues in honey pose a great danger to human health. In this study, a sensitive, rapid, and environmentally friendly surface-enhanced Raman spectroscopy method (SERS) was developed for the determination of trace amount of amitraz in honey with the use of silver nanorod (AgNR) array substrate. The AgNR array substrate fabricated by an oblique angle deposition technique exhibited an excellent SERS activity with an enhancement factor of -10^7. Density function theory was employed to assign the characteristic peak of amitraz. The detection of amitraz was further explored and amitraz in honey at concentrations as low as 0.08 mg/kg can be identified. Specifically, partial least square regression analysis was employed to correlate the SERS spectra in full-wavelength with Camitraz to afford a multiple-quantitative amitraz predicting model. Preliminary results show that the predicted concentrations of amitraz in honey samples are in good agreement with their real concentrations. Compared with the conventional univariate quantitative model based on single peak’s intensity, the proposed multiple-quantitative predicting model integrates all the characteristic peaks of amitraz, thus offering an improved detecting accuracy and anti-interference ability.展开更多
Human serum albumin(HSA)injectable product is a severely afflicted area on drug safety due to its high price and restricted supply.Raman spectroscopy performances high specificity on HSA detection and it is even possi...Human serum albumin(HSA)injectable product is a severely afflicted area on drug safety due to its high price and restricted supply.Raman spectroscopy performances high specificity on HSA detection and it is even possible to determine HSA injectable products noninvasively.In this study,we developed a noninvasive rapid screening method for of HSA injectable products by using portable Raman spectrometer.Qualitative models were established by using principal component analysis combined with classical least squares(PCA-CLS)algorithm,while quanti-tative model was established by using partial least squares(PLS)algorithm.Model transfer in different instruments of both the same and different apparatus modules was further discussed in this paper.A total of 34 HSA injectable samples collected from markets were used for verification.The identification results showed 100%accuracy and the predicted concentrations of those identified as true HSA were consistent with their labeled concentrations.The quantitative results also indicated that model transfer was excellent in the same apparatus modules of Raman spectrometer at all concentration levels,and still good enough in the different apparatus modules although the relative standard deviation(RSD)value showed a little increasing trend at low HSA concentration level.In conclusion,the method was proved to be feasible and efficient for screening HSA injections,especially on its screening speed and the consideration of glass containers.Moreover,with inspiring results on the model transfer,the method could be used as a universal screening mean to different Raman instruments.展开更多
The effluent total phosphorus(ETP) is an important parameter to evaluate the performance of wastewater treatment process(WWTP). In this study, a novel method, using a data-derived soft-sensor method, is proposed to ob...The effluent total phosphorus(ETP) is an important parameter to evaluate the performance of wastewater treatment process(WWTP). In this study, a novel method, using a data-derived soft-sensor method, is proposed to obtain the reliable values of ETP online. First, a partial least square(PLS) method is introduced to select the related secondary variables of ETP based on the experimental data. Second, a radial basis function neural network(RBFNN) is developed to identify the relationship between the related secondary variables and ETP. This RBFNN easily optimizes the model parameters to improve the generalization ability of the soft-sensor. Finally, a monitoring system, based on the above PLS and RBFNN, named PLS-RBFNN-based soft-sensor system, is developed and tested in a real WWTP. Experimental results show that the proposed monitoring system can obtain the values of ETP online and own better predicting performance than some existing methods.展开更多
基金supported by the National Natural Science Foundation of China,Nos.41874001 and 41664001Support Program for Outstanding Youth Talents in Jiangxi Province,No.20162BCB23050National Key Research and Development Program,No.2016YFB0501405。
文摘When the total least squares(TLS)solution is used to solve the parameters in the errors-in-variables(EIV)model,the obtained parameter estimations will be unreliable in the observations containing systematic errors.To solve this problem,we propose to add the nonparametric part(systematic errors)to the partial EIV model,and build the partial EIV model to weaken the influence of systematic errors.Then,having rewritten the model as a nonlinear model,we derive the formula of parameter estimations based on the penalized total least squares criterion.Furthermore,based on the second-order approximation method of precision estimation,we derive the second-order bias and covariance of parameter estimations and calculate the mean square error(MSE).Aiming at the selection of the smoothing factor,we propose to use the U curve method.The experiments show that the proposed method can mitigate the influence of systematic errors to a certain extent compared with the traditional method and get more reliable parameter estimations and its precision information,which validates the feasibility and effectiveness of the proposed method.
基金Supported by "863" Program of P. R. China(2002AA2Z4291)
文摘Scientific forecasting water yield of mine is of great significance to the safety production of mine and the colligated using of water resources. The paper established the forecasting model for water yield of mine, combining neural network with the partial least square method. Dealt with independent variables by the partial least square method, it can not only solve the relationship between independent variables but also reduce the input dimensions in neural network model, and then use the neural network which can solve the non-linear problem better. The result of an example shows that the prediction has higher precision in forecasting and fitting.
基金supported by the 948 Program of the State Forestry Administration (2009-4-43)the National Natura Science Foundation of China (No.30870420)
文摘Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used partial least square regression (PLSR) models to relate forest parameters, i.e. canopy closure density and above ground tree biomass, to Landsat ETM+ data. The established models were optimized according to the variable importance for projection (VIP) criterion and the bootstrap method, and their performance was compared using several statistical indices. All variables selected by the VIP criterion passed the bootstrap test (p〈0.05). The simplified models without insignificant variables (VIP 〈1) performed as well as the full model but with less computation time. The relative root mean square error (RMSE%) was 29% for canopy closure density, and 58% for above ground tree biomass. We conclude that PLSR can be an effective method for estimating canopy closure density and above ground biomass.
文摘The Laser Induced Breakdown Spectroscopy (LIBS) is a fast, non-contact, no sample preparation analytic technology;it is very suitable for on-line analysis of alloy composition. In the copper smelting industry, analysis and control of the copper alloy concentration affect the quality of the products greatly, so LIBS is an efficient quantitative analysis tech- nology in the copper smelting industry. But for the lead brass, the components of Pb, Al and Ni elements are very low and the atomic emission lines are easily submerged under copper complex characteristic spectral lines because of the matrix effects. So it is difficult to get the online quantitative result of these important elements. In this paper, both the partial least squares (PLS) method and the calibration curve (CC) method are used to quantitatively analyze the laser induced breakdown spectroscopy data which is obtained from the standard lead brass alloy samples. Both the major and trace elements were quantitatively analyzed. By comparing the two results of the different calibration method, some useful results were obtained: both for major and trace elements, the PLS method was better than the CC method in quantitative analysis. And the regression coefficient of PLS method is compared with the original spectral data with background interference to explain the advantage of the PLS method in the LIBS quantitative analysis. Results proved that the PLS method used in laser induced breakdown spectroscopy was suitable for simultaneous quantitative analysis of different content elements in copper smelting industry.
文摘Accurately approximating higher order derivatives is an inherently difficult problem. It is shown that a random variable shape parameter strategy can improve the accuracy of approximating higher order derivatives with Radial Basis Function methods. The method is used to solve fourth order boundary value problems. The use and location of ghost points are examined in order to enforce the extra boundary conditions that are necessary to make a fourth-order problem well posed. The use of ghost points versus solving an overdetermined linear system via least squares is studied. For a general fourth-order boundary value problem, the recommended approach is to either use one of two novel sets of ghost centers introduced here or else to use a least squares approach. When using either ghost centers or least squares, the random variable shape parameter strategy results in significantly better accuracy than when a constant shape parameter is used.
文摘水浸出物是茶叶质量评价的重要指标之一。该研究提出利用近红外光谱法结合偏最小二乘算法(Partial least squares,PLS)快速检测乌龙茶中水浸出物含量。利用近红外光谱仪采集60份乌龙茶样品的光谱信息,通过Savitzky-Golay(SG)滤波器对原始光谱数据进行预处理;采用连续投影算法(Successive projections algorithm,SPA)对采集的SG预处理光谱进行特征波长选择,基于SG预处理光谱和SPA法优化的特征光谱建立乌龙茶中水浸出物含量的PLS定量模型。结果显示,利用SPA法优化出14个特征波长建立SPA-PLS模型的性能最佳。在预测集中的相关系数为0.8966,预测均方根误差为0.8034%,剩余预测偏差为4.11。结果表明采用近红外光谱结合SPAPLS算法快速检测乌龙茶中水浸出物含量是可行的。
基金supported by the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province (No.16KJB510009 and No.17KJB510017)Jiangsu Province Natural Science Foundation of China (BK20150228)
文摘As an effective and universal acaricide, amitraz is widely used on beehives against varroasis caused by the mite Varroa jacobsoni. Its residues in honey pose a great danger to human health. In this study, a sensitive, rapid, and environmentally friendly surface-enhanced Raman spectroscopy method (SERS) was developed for the determination of trace amount of amitraz in honey with the use of silver nanorod (AgNR) array substrate. The AgNR array substrate fabricated by an oblique angle deposition technique exhibited an excellent SERS activity with an enhancement factor of -10^7. Density function theory was employed to assign the characteristic peak of amitraz. The detection of amitraz was further explored and amitraz in honey at concentrations as low as 0.08 mg/kg can be identified. Specifically, partial least square regression analysis was employed to correlate the SERS spectra in full-wavelength with Camitraz to afford a multiple-quantitative amitraz predicting model. Preliminary results show that the predicted concentrations of amitraz in honey samples are in good agreement with their real concentrations. Compared with the conventional univariate quantitative model based on single peak’s intensity, the proposed multiple-quantitative predicting model integrates all the characteristic peaks of amitraz, thus offering an improved detecting accuracy and anti-interference ability.
基金Youth Develop-ment Research Foundation(No.2015C03)of Na-tional Institutes of Food and Drug Control,P.R.China.
文摘Human serum albumin(HSA)injectable product is a severely afflicted area on drug safety due to its high price and restricted supply.Raman spectroscopy performances high specificity on HSA detection and it is even possible to determine HSA injectable products noninvasively.In this study,we developed a noninvasive rapid screening method for of HSA injectable products by using portable Raman spectrometer.Qualitative models were established by using principal component analysis combined with classical least squares(PCA-CLS)algorithm,while quanti-tative model was established by using partial least squares(PLS)algorithm.Model transfer in different instruments of both the same and different apparatus modules was further discussed in this paper.A total of 34 HSA injectable samples collected from markets were used for verification.The identification results showed 100%accuracy and the predicted concentrations of those identified as true HSA were consistent with their labeled concentrations.The quantitative results also indicated that model transfer was excellent in the same apparatus modules of Raman spectrometer at all concentration levels,and still good enough in the different apparatus modules although the relative standard deviation(RSD)value showed a little increasing trend at low HSA concentration level.In conclusion,the method was proved to be feasible and efficient for screening HSA injections,especially on its screening speed and the consideration of glass containers.Moreover,with inspiring results on the model transfer,the method could be used as a universal screening mean to different Raman instruments.
基金Supported by the National Science Foundation of China(61622301,61533002)Beijing Natural Science Foundation(4172005)Major National Science and Technology Project(2017ZX07104)
文摘The effluent total phosphorus(ETP) is an important parameter to evaluate the performance of wastewater treatment process(WWTP). In this study, a novel method, using a data-derived soft-sensor method, is proposed to obtain the reliable values of ETP online. First, a partial least square(PLS) method is introduced to select the related secondary variables of ETP based on the experimental data. Second, a radial basis function neural network(RBFNN) is developed to identify the relationship between the related secondary variables and ETP. This RBFNN easily optimizes the model parameters to improve the generalization ability of the soft-sensor. Finally, a monitoring system, based on the above PLS and RBFNN, named PLS-RBFNN-based soft-sensor system, is developed and tested in a real WWTP. Experimental results show that the proposed monitoring system can obtain the values of ETP online and own better predicting performance than some existing methods.