The purpose of this study was to establish a method able to accurately estimate the long-term exposure levels of individuals to fine particulate matter (PM2.5) in Jiujiang City (China) by constructing land use regression (LUR) models. Subsequently, the accuracy of the models was further verified. PM2.5 concentrations were collected daily from seven monitoring stations from September 1 to 14, 2023, for the construction of daily LUR models. The constructed models used PM2.5 concentration as the dependent variable, while land use, elevation, population density and road length were used as the predictive variables. Subsequently, twenty volunteers were invited to participate, with their daily PM2.5 exposure estimated based on their work address and home address, allowing their average exposure levels to be calculated. Furthermore, volunteers wore portable PM2.5 detectors continuously for a 14-day period and the average measured PM2.5 level was used as a comparative standard. Results showed that the adjusted R² values for the 14 daily models ranged from 0.85 to 0.94, with the R² values from leave-one-out cross-validation tests all greater than 0.61, indicating good prediction accuracy. No significant differences were observed between the estimates of the LUR modeling method and the measurements from the portable PM2.5 detectors (p > 0.05). This study aimed to develop a novel method for the accurate and convenient measurement of individual long-term PM2.5 exposure levels for epidemiological studies in urban environments comparable to that of Jiujiang City.
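The leave-one-out cross-validation mentioned above can be illustrated with a minimal sketch. It assumes a single predictor and ordinary least squares; the station values and the names `fit_line` and `loocv_r2` are made up for illustration and are not taken from the study, which used several predictors.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def loocv_r2(xs, ys):
    """Leave-one-out R^2: each station is predicted by a model fitted to the others."""
    n = len(xs)
    my = sum(ys) / n
    press = 0.0  # sum of squared held-out prediction errors
    for i in range(n):
        a, b = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (a + b * xs[i])) ** 2
    ss_tot = sum((y - my) ** 2 for y in ys)
    return 1.0 - press / ss_tot


# Hypothetical values for seven stations (not data from the study).
road_km = [1.2, 2.5, 3.1, 4.8, 5.0, 6.3, 7.7]
pm25 = [18.0, 21.5, 22.0, 27.1, 26.4, 30.2, 33.9]
print(round(loocv_r2(road_km, pm25), 3))
```

With only seven stations, each held-out prediction uses six points, which is why the cross-validated R² is typically lower than the adjusted R² of the full fit.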
This article presents a mathematical model addressing a scenario involving a hybrid nanofluid flow between two infinite parallel plates. One plate remains stationary, while the other moves downward at a squeezing velocity. The space between these plates contains a Darcy-Forchheimer porous medium. A mixture of water-based fluid with gold (Au) and silicon dioxide (SiO2) nanoparticles is formulated. In contrast to the conventional Fourier's heat flux equation, this study employs the Cattaneo-Christov heat flux equation. A uniform magnetic field is applied perpendicular to the flow direction, invoking magnetohydrodynamic (MHD) effects. Further, the model accounts for Joule heating, which is the heat generated when an electric current passes through the fluid. The problem is solved via NDSolve in MATHEMATICA. Numerical and statistical analyses are conducted to provide insights into the behavior of the nanomaterials between the parallel plates with respect to the flow, energy transport, and skin friction. The findings of this study have potential applications in enhancing cooling systems and optimizing thermal management strategies. It is observed that the squeezing motion generates additional pressure gradients within the fluid, which enhances the flow rate but reduces the frictional drag. Consequently, the fluid is pushed more vigorously between the plates, increasing the flow velocity. As the fluid experiences higher flow rates due to the increased squeezing effect, it spends less time in the region between the plates. The thermal relaxation, however, abruptly changes the temperature, leading to a decrease in the temperature fluctuations.
High-dimensional heterogeneous data have attracted increasing attention and discussion in the past decade. In the context of heterogeneity, semiparametric regression emerges as a popular method to model this type of data in statistics. In this paper, we leverage the benefits of expectile regression for computational efficiency and analytical robustness in heterogeneity, and propose a regularized partially linear additive expectile regression model with a nonconvex penalty, such as SCAD or MCP, for high-dimensional heterogeneous data. We focus on a more realistic scenario where the regression error exhibits a heavy-tailed distribution with only finite moments. This scenario challenges the classical sub-Gaussian distribution assumption and is more prevalent in practical applications. Under certain regularity conditions, we demonstrate that with probability tending to one, the oracle estimator is one of the local minima of the induced optimization problem. Our theoretical analysis suggests that the dimensionality of linear covariates that our estimation procedure can handle is fundamentally limited by the moment condition of the regression error. Computationally, given the nonconvex and nonsmooth nature of the induced optimization problem, we have developed a two-step algorithm. Finally, our method's effectiveness is demonstrated through its high estimation accuracy and effective model selection, as evidenced by Monte Carlo simulation studies and a real-data application. Furthermore, by taking various expectile weights, our method effectively detects heterogeneity and explores the complete conditional distribution of the response variable, underscoring its utility in analyzing high-dimensional heterogeneous data.
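For readers unfamiliar with expectiles, the sketch below shows the asymmetric squared loss behind expectile regression and how varying the expectile weight moves the fitted location. It covers only the intercept-only case; the function names are hypothetical, and the penalized partially linear additive model of the paper is not reproduced here.

```python
def expectile_loss(residual, tau):
    """Asymmetric squared loss: weight tau on non-negative residuals, 1 - tau on negative ones."""
    weight = tau if residual >= 0 else 1.0 - tau
    return weight * residual ** 2


def sample_expectile(ys, tau, tol=1e-12, max_iter=1000):
    """Minimise the summed expectile loss over a constant m by iterating a weighted mean."""
    m = sum(ys) / len(ys)  # tau = 0.5 reproduces the ordinary mean
    for _ in range(max_iter):
        weights = [tau if y >= m else 1.0 - tau for y in ys]
        new_m = sum(w * y for w, y in zip(weights, ys)) / sum(weights)
        if abs(new_m - m) < tol:
            return new_m
        m = new_m
    return m


data = [1.0, 2.0, 3.0, 4.0]
for tau in (0.1, 0.5, 0.9):
    print(tau, sample_expectile(data, tau))
```

Because the loss is differentiable everywhere, the minimiser can be found by reweighted averaging, which is the computational convenience usually cited for expectiles over quantiles.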
Piezo actuators are widely used in ultra-precision fields because of their high response and nano-scale step length. However, their hysteresis characteristics seriously affect the accuracy and stability of piezo actuators. Existing methods for fitting hysteresis loops include operator class, differential equation class, and machine learning class. The modeling cost of operator class and differential equation class methods is high, the model complexity is high, and the process of machine learning, such as neural network calculation, is opaque. The physical model framework cannot be directly extracted. Therefore, the sparse identification of nonlinear dynamics (SINDy) algorithm is proposed to fit hysteresis loops. Furthermore, the SINDy algorithm is improved. While the SINDy algorithm builds an orthogonal candidate database for modeling, the sparse regression model is simplified, and the Relay operator is introduced for piecewise fitting to solve the distortion problem of the SINDy algorithm fitting singularities. The Relay-SINDy algorithm proposed in this paper is applied to fitting hysteresis loops. Good performance is obtained with the experimental results of open and closed loops. Compared with the existing methods, the modeling cost and model complexity are reduced, and the modeling accuracy of the hysteresis loop is improved.
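A relay operator in its generic two-threshold form can be sketched as follows. How the paper couples it to SINDy is not stated in the abstract; the thresholds `alpha` and `beta`, the initial state, and the idea of using the relay state to pick the ascending or descending branch for piecewise fitting are assumptions made for illustration.

```python
def relay(inputs, alpha, beta, initial=-1):
    """Two-threshold relay: switches to +1 when the input reaches beta,
    to -1 when it drops to alpha, and otherwise keeps its previous state."""
    state = initial
    outputs = []
    for u in inputs:
        if u >= beta:
            state = 1
        elif u <= alpha:
            state = -1
        outputs.append(state)
    return outputs


# One up-and-down sweep of a hypothetical drive signal.
sweep = [0.0, 0.4, 0.8, 0.4, 0.0, -0.4, -0.8, -0.4, 0.0]
branch = relay(sweep, alpha=-0.5, beta=0.5)
print(branch)  # [-1, -1, 1, 1, 1, 1, -1, -1, -1]
```

The same input value (for example 0.0) maps to different outputs depending on history, which is the memory effect that a single smooth function of the input cannot represent.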
The abstract provided offers a succinct overview of the research paper's focus on the significance of statistics, specifically regression analysis, across diverse fields. The emphasis on regression analysis indicates its importance as a statistical method that helps researchers understand relationships between variables and make predictions based on data. The inclusion of multiple disciplines, such as health sciences, social sciences, environmental studies, economics, engineering, clinical psychology, social psychology, developmental psychology, cognitive psychology, and education highlights the interdisciplinary relevance of regression analysis. This breadth suggests that the findings and methodologies discussed in the paper may have wide applications, benefiting various sectors by enhancing the quality of research outcomes. The mention of “methodologies and data analysis techniques” indicates that the paper will likely delve into specific statistical approaches, offering a comprehensive examination of how regression analysis is applied in real-world scenarios. This nuance is essential, as it demonstrates the research's commitment to not only presenting theoretical insights but also practical applications. Furthermore, the abstract states that regression analysis “enhances the validity of findings” and “informs data-driven decision-making.” This assertion underlines the critical role that robust statistical methods play in ensuring that research conclusions are reliable and applicable. The ability of regression analysis to provide clarity and support informed decisions makes it a valuable tool in both academic and professional settings.
The abstract effectively outlines the paper's exploration of regression analysis in various fields, underscoring its importance in enhancing research validity and facilitating informed decision-making. The interdisciplinary nature of the research broadens its appeal and emphasizes the need for rigorous statistical approaches in addressing complex issues across different domains.
Identification of the ice channel is a basic technology for developing intelligent ships in ice-covered waters, and it is important for ensuring the safety and economy of navigation. In the Arctic, merchant ships with low ice class often navigate in channels opened up by icebreakers. Navigation in an ice channel depends to a large extent on the captain's maneuvering skill and experience. The ship may get stuck if steered into ice fields off the channel. Under this circumstance, it is very important to study how to identify the boundary lines of ice channels with a reliable method. In this paper, a two-stage ice channel identification method is developed based on image segmentation and corner point regression. The first stage employs the image segmentation method to extract channel regions. In the second stage, an intelligent corner regression network is proposed to extract the channel boundary lines from the channel region. A non-intelligent angle-based filtering and clustering method is also proposed and compared with the corner point regression network. The training and evaluation of the segmentation method and corner regression network are carried out on synthetic and real ice channel datasets. The evaluation results show that the accuracy of the method using the corner point regression network in the second stage reaches 73.33% on the synthetic ice channel dataset and 70.66% on the real ice channel dataset, and the processing speed can reach up to 14.58 frames per second.
Despite the maturity of ensemble numerical weather prediction (NWP), the resulting forecasts are still, more often than not, under-dispersed. As such, forecast calibration tools have become popular. Among those tools, quantile regression (QR) is highly competitive in terms of both flexibility and predictive performance. Nevertheless, a long-standing problem of QR is quantile crossing, which greatly limits the interpretability of QR-calibrated forecasts. On this point, this study proposes a non-crossing quantile regression neural network (NCQRNN), for calibrating ensemble NWP forecasts into a set of reliable quantile forecasts without crossing. The overarching design principle of NCQRNN is to add on top of the conventional QRNN structure another hidden layer, which imposes a non-decreasing mapping between the combined output from nodes of the last hidden layer to the nodes of the output layer, through a triangular weight matrix with positive entries. The empirical part of the work considers a solar irradiance case study, in which four years of ensemble irradiance forecasts at seven locations, issued by the European Centre for Medium-Range Weather Forecasts, are calibrated via NCQRNN, as well as via an eclectic mix of benchmarking models, ranging from the naïve climatology to the state-of-the-art deep-learning and other non-crossing models. Formal and stringent forecast verification suggests that the forecasts post-processed via NCQRNN attain the maximum sharpness subject to calibration, amongst all competitors. Furthermore, the proposed conception to resolve quantile crossing is remarkably simple yet general, and thus has broad applicability as it can be integrated with many shallow- and deep-learning-based neural networks.
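The non-decreasing mapping described above can be illustrated with a small sketch. This is one simple way to obtain ordered outputs from a triangular matrix with positive entries and is not claimed to be the exact NCQRNN layer: it assumes the first raw output is the lowest quantile, turns the remaining raw outputs into positive increments with a softplus, and uses a lower-triangular matrix of ones. The names `softplus` and `non_crossing_quantiles` are hypothetical.

```python
import math


def softplus(x):
    """Smooth map from any real number to a positive one (numerically stable form)."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def non_crossing_quantiles(raw):
    """Map unconstrained network outputs to ordered quantile forecasts.

    The first raw value is taken as the lowest quantile; every later raw value
    becomes a positive increment. Multiplying by a lower-triangular matrix of
    ones is then a running sum, so the outputs cannot decrease.
    """
    increments = [raw[0]] + [softplus(r) for r in raw[1:]]
    k = len(increments)
    tri = [[1.0 if j <= i else 0.0 for j in range(k)] for i in range(k)]
    return [sum(tri[i][j] * increments[j] for j in range(k)) for i in range(k)]


# Raw outputs in arbitrary order still yield ordered quantiles.
print(non_crossing_quantiles([3.0, -2.0, 0.5, -10.0]))
```

Because the ordering is enforced by the structure of the layer rather than by a penalty term, it holds for every input, including ones unseen during training.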
The performance of lithium-ion batteries (LIBs) gradually declines over time, making it critical to predict the battery's state of health (SOH) in real time. This paper presents a model that incorporates health indicators and ensemble Gaussian process regression (EGPR) to predict the SOH of LIBs. Firstly, the degradation process of an LIB is analyzed through indirect health indicators (HIs) derived from voltage and temperature during discharge. Next, the parameters in the EGPR model are optimized using the gannet optimization algorithm (GOA), and the EGPR is employed to estimate the SOH of LIBs. Finally, the proposed model is tested under various experimental scenarios and compared with other machine learning models. The effectiveness of the EGPR model is demonstrated using the National Aeronautics and Space Administration (NASA) LIB data. The root mean square error (RMSE) is maintained within 0.20%, and the mean absolute error (MAE) is below 0.16%, illustrating the proposed approach's excellent predictive accuracy and wide applicability.
The picking efficiency of seismic first breaks (FBs) has been greatly accelerated by deep learning (DL) technology. However, the picking accuracy and efficiency of DL methods still face huge challenges in low signal-to-noise ratio (SNR) situations. To address this issue, we propose a regression approach to pick FBs based on a bidirectional long short-term memory (BiLSTM) neural network by learning the implicit Eikonal equation of 3D inhomogeneous media with rugged topography in the target region. We employ a regressive model that represents the relationships among the elevation of shots, offset and the elevation of receivers with their seismic traveltime to predict the unknown FBs from common-shot gathers with sparsely distributed traces. Different from image segmentation methods, which automatically extract image features and classify FBs from seismic data, the proposed method can learn the inner relationship between field geometry and FBs. In addition, the results predicted by the regressive model are continuous values of FBs rather than the discrete ones of the binary distribution. The picking results of synthetic data show that the proposed method has low dependence on label data and can obtain reliable and similar predicted results using two types of label data with large differences. The picking results of 9380 shots for 3D seismic data generated by vibroseis indicate that the proposed method can still accurately predict FBs in low SNR data. The subsequent stacked profiles further illustrate the reliability and effectiveness of the proposed method. The results of model data and field seismic data demonstrate that the proposed regression method is a robust first-break picker with high potential for field application.
Purpose: The purpose of this study is to develop and compare model choice strategies in the context of logistic regression. Model choice means the choice of the covariates to be included in the model. Design/methodology/approach: The study is based on Monte Carlo simulations. The methods are compared in terms of three measures of accuracy: specificity and two kinds of sensitivity. A loss function combining sensitivity and specificity is introduced and used for a final comparison. Findings: The choice of method depends on how much the users emphasize sensitivity against specificity. It also depends on the sample size. For a typical logistic regression setting with a moderate sample size and a small to moderate effect size, either BIC, BICc or Lasso seems to be optimal. Research limitations: Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data. Thus, more simulations are needed. Practical implications: Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper. Alternatively, they could run their own simulations and calculate the loss function. Originality/value: This is a systematic comparison of model choice algorithms and heuristics in the context of logistic regression. The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.
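To make the accuracy measures concrete, the sketch below scores one covariate selection against a known set of truly relevant covariates, as is possible in a simulation. The abstract does not define its two kinds of sensitivity or its loss function, so the single sensitivity, the weighted-sum loss, and the names `selection_accuracy` and `selection_loss` are all assumptions for illustration.

```python
def selection_accuracy(selected, relevant, all_covariates):
    """Sensitivity and specificity of a covariate selection against a known truth."""
    selected, relevant = set(selected), set(relevant)
    irrelevant = set(all_covariates) - relevant
    # Sensitivity: share of truly relevant covariates that were selected.
    sensitivity = len(selected & relevant) / len(relevant)
    # Specificity: share of irrelevant covariates that were correctly left out.
    specificity = len(irrelevant - selected) / len(irrelevant)
    return sensitivity, specificity


def selection_loss(sensitivity, specificity, weight=0.5):
    """Hypothetical loss: weighted sum of the two error rates (weight is the emphasis on sensitivity)."""
    return weight * (1.0 - sensitivity) + (1.0 - weight) * (1.0 - specificity)


covariates = ["x1", "x2", "x3", "x4", "x5"]
sens, spec = selection_accuracy({"x1", "x4"}, {"x1", "x2"}, covariates)
print(sens, spec, selection_loss(sens, spec))
```

Changing `weight` mimics the point made in the findings: a user who cares more about missing a real effect than about including a spurious one would rank the selection methods differently.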
Ignimbrites have been widely used as building materials in many historical and touristic structures in the Kayseri region of Türkiye. Their diverse colours and textures make them a popular choice for modern construction as well. However, ignimbrites are particularly vulnerable to atmospheric conditions, such as freeze-thaw cycles, due to their high porosity, which is a result of their formation process. When water enters the pores of the ignimbrites, it can freeze during cold weather. As the water freezes and expands, it generates internal stress within the stone, causing micro-cracks to develop. Over time, repeated freeze-thaw (F-T) cycles lead to the growth of these micro-cracks into larger cracks, compromising the structural integrity of the ignimbrites and eventually making them unsuitable for use as building materials. Determining the long-term F-T performance of ignimbrites typically requires extensive experimental testing over prolonged freeze-thaw cycles. To streamline this process, developing accurate predictive equations becomes crucial. In this study, such equations were formulated using classical regression analyses and artificial neural networks (ANN) based on data obtained from these experiments, allowing for the prediction of the F-T performance of ignimbrites and other similar building stones without the need for lengthy testing. Uniaxial compressive strength, ultrasonic propagation velocity, apparent porosity and mass loss of ignimbrites after long-term F-T were determined.
Following the F-T cycles, the disintegration rate was evaluated using decay function approaches, while uniaxial compressive strength (UCS) values were predicted with minimal input parameters through both regression and ANN analyses. The ANN and regression models created for this purpose were first built with a single input and then extended to combinations of two and three inputs. The predictive performance of the ANN models was compared with that of the regression models using the coefficient of determination (R²) as the evaluation criterion. Higher R² values (0.87) were obtained with the models built with artificial neural networks. The results of the study indicate that ANNs can produce results close to experimental outcomes in predicting the long-term F-T performance of ignimbrite samples.
In the railway system, fasteners have the functions of damping, maintaining the track distance, and adjusting the track level. Therefore, routine maintenance and inspection of fasteners are important to ensure the safe operation of track lines. Currently, assessment methods for fastener tightness include manual observation, acoustic wave detection, and image detection. These methods suffer from low accuracy and efficiency and are prone to interference and misjudgment, so accurate, stable, and fast detection methods are lacking. Aiming at the small deformation characteristics and large elastic change of fasteners from full loosening to full tightening, this study proposes high-precision surface-structured light technology for fastener detection, fastener deformation feature extraction based on the center-line projection distance, and a fastener tightness regression method based on neural networks. First, the method uses a 3D camera to obtain a fastener point cloud and then segments the elastic rod area based on iterative closest point algorithm registration. Principal component analysis is used to calculate the normal vector of the segmented elastic rod surface and extract the point on the centerline of the elastic rod. The point is projected onto the upper surface of the bolt to calculate the projection distance. Subsequently, the mapping relationship between the projection distance sequence and fastener tightness is established, and the influence of each parameter on the fastener tightness prediction is analyzed. Finally, a fastener detection scene was set up in the track experimental base, data were collected, and the algorithm was verified. The results showed that the RMSE between the fastener tightness regression value produced by the algorithm and the actual measured value was 0.2196 mm, a significant improvement over other tightness detection methods, achieving effective fastener tightness regression.
This study aims to predict the undrained shear strength of remolded soil samples using non-linear regression analyses, fuzzy logic, and artificial neural network modeling. A total of 1306 undrained shear strength results from 230 different remolded soil test settings reported in 21 publications were collected, utilizing six different measurement devices. Although water content, plastic limit, and liquid limit were used as input parameters for fuzzy logic and artificial neural network modeling, liquidity index or water content ratio was considered as an input parameter for non-linear regression analyses. In non-linear regression analyses, 12 different regression equations were derived for the prediction of undrained shear strength of remolded soil. Feed-forward backpropagation and the TANSIG transfer function were used for artificial neural network modeling, while the Mamdani inference system was preferred with trapezoidal and triangular membership functions for fuzzy logic modeling. The experimental results of 914 tests were used for training of the artificial neural network models, 196 for validation and 196 for testing. It was observed that the accuracy of the artificial neural network and fuzzy logic modeling was higher than that of the non-linear regression analyses. Furthermore, a simple and reliable regression equation was proposed for assessments of undrained shear strength values with higher coefficients of determination.
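The reported counts (914 training, 196 validation, 196 testing, out of 1306) are consistent with a 70/15/15 split, as the short check below shows. The 15% shares are inferred from the counts and are an assumption; the abstract states only the counts, and `split_sizes` is a hypothetical helper.

```python
def split_sizes(n_total, validation_share=0.15, test_share=0.15):
    """Return (train, validation, test) counts; training takes whatever remains."""
    n_val = round(n_total * validation_share)
    n_test = round(n_total * test_share)
    return n_total - n_val - n_test, n_val, n_test


print(split_sizes(1306))  # (914, 196, 196)
```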
Objective: Previous studies on the association between lipid profiles and chronic kidney disease (CKD) have yielded inconsistent results and no defined thresholds for blood lipids. Methods: A prospective cohort study including 32,351 subjects who completed baseline and follow-up surveys over 5 years was conducted. Restricted cubic splines and Cox models were used to examine the association between the lipid profiles and CKD. A regression discontinuity design was used to determine the cutoff value of lipid profiles that was significantly associated with increased risk of CKD. Results: Over a median follow-up time of 2.2 (0.5, 4.2) years, 648 (2.00%) subjects developed CKD. The lipid profiles that were significantly and linearly related to CKD included total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), TC/HDL-C, and TG/HDL-C, whereas low-density lipoprotein cholesterol (LDL-C) and LDL-C/HDL-C were nonlinearly correlated with CKD. TC, TG, TC/HDL-C, and TG/HDL-C showed an upward jump at the cutoff value, increasing the risk of CKD by 0.90%, 1.50%, 2.30%, and 1.60%, respectively, whereas HDL-C showed a downward jump at the cutoff value, reducing this risk by 1.0%. Females and participants with dyslipidemia had a higher risk of CKD, and the cutoff values differed across population subgroups. Conclusion: There was a significant association between lipid profiles and CKD in a prospective cohort from Northwest China, while TG, TC/HDL-C, and TG/HDL-C showed a stronger risk association. The specific cutoff values of lipid profiles may provide a clinical reference for screening or diagnosing CKD risk.
Concentrate copper grade (CCG) is one of the important production indicators of copper flotation processes, and keeping the CCG at the set value is of great significance to the economic benefit of copper flotation industrial processes. This paper addresses the fluctuation problem of CCG through an operational optimization method. Firstly, a density-based affinity propagation algorithm is proposed so that more ideal working condition categories can be obtained for the complex raw ore properties. Next, a Bayesian network (BN) is applied to explore the relationship between the operational variables and the CCG. Based on the analysis results of the BN, a weighted Gaussian process regression model is constructed to predict the CCG with higher prediction accuracy. To ensure the predicted CCG is close to the set value with a smaller magnitude of operation adjustments and a smaller uncertainty of the prediction results, an index-oriented adaptive differential evolution (IOADE) algorithm is proposed, and the convergence performance of IOADE is superior to that of the traditional differential evolution and adaptive differential evolution methods. Finally, the effectiveness and feasibility of the proposed methods are verified by experiments on a copper flotation industrial process.
Accurately estimating blasting vibration during rock blasting is the foundation of blasting vibration management. In this study, Tuna Swarm Optimization (TSO), Whale Optimization Algorithm (WOA), and Cuckoo Search (CS) were used to optimize two hyperparameters in support vector regression (SVR). Based on these methods, three hybrid models to predict peak particle velocity (PPV) for bench blasting were developed. Eighty-eight samples were collected to establish the PPV database, eight initial blasting parameters were chosen as input parameters for the prediction model, and the PPV was the output parameter. As predictive performance evaluation indicators, the coefficient of determination (R²), root mean square error (RMSE), mean absolute error (MAE), and a10-index were selected. The normalized mutual information value is then used to evaluate the impact of various input parameters on the PPV prediction outcomes. According to the research findings, TSO, WOA, and CS can all enhance the predictive performance of the SVR model. The TSO-SVR model provides the most accurate predictions. The performances of the optimized hybrid SVR models are superior to that of the unoptimized traditional prediction model. The maximum charge per delay impacts the PPV prediction value the most.
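The four evaluation indicators can be computed as in the sketch below. The a10-index is taken here in its common form, the share of samples whose predicted-to-measured ratio lies within 0.9 to 1.1; the abstract does not spell out its definition, so that form and the sample values are assumptions.

```python
import math


def r2_score(measured, predicted):
    """Coefficient of determination: 1 - residual sum of squares / total sum of squares."""
    mean = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot


def rmse(measured, predicted):
    """Root mean square error."""
    return math.sqrt(sum((m - p) ** 2 for m, p in zip(measured, predicted)) / len(measured))


def mae(measured, predicted):
    """Mean absolute error."""
    return sum(abs(m - p) for m, p in zip(measured, predicted)) / len(measured)


def a10_index(measured, predicted):
    """Share of samples whose predicted/measured ratio lies within [0.9, 1.1]."""
    hits = sum(1 for m, p in zip(measured, predicted) if 0.9 <= p / m <= 1.1)
    return hits / len(measured)


# Hypothetical PPV values, not data from the study.
measured = [10.0, 20.0, 30.0, 40.0]
predicted = [10.5, 19.0, 36.0, 40.0]
print(r2_score(measured, predicted), rmse(measured, predicted),
      mae(measured, predicted), a10_index(measured, predicted))
```

R², RMSE and MAE summarise average error, whereas the a10-index counts how many individual predictions fall within a 10% band, so a model can score well on one and poorly on the other.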
In oil and gas exploration, elucidating the complex interdependencies among geological variables is paramount. Our study introduces the application of sophisticated regression analysis methods, aiming not just at predicting geophysical logging curve values but also at innovatively mitigating hydrocarbon depletion observed in geochemical logging. Through a rigorous assessment, we explore the efficacy of eight regression models, bifurcated into linear and nonlinear groups, to accommodate the multifaceted nature of geological datasets. Our linear model suite encompasses the Standard Equation, Ridge Regression, Least Absolute Shrinkage and Selection Operator, and Elastic Net, each presenting distinct advantages. The Standard Equation serves as a foundational benchmark, whereas Ridge Regression implements penalty terms to counteract overfitting, thus bolstering model robustness in the presence of multicollinearity. The Least Absolute Shrinkage and Selection Operator performs variable selection to streamline models, enhancing their interpretability, while Elastic Net amalgamates the merits of Ridge Regression and Least Absolute Shrinkage and Selection Operator, offering a harmonized solution to model complexity and comprehensibility. On the nonlinear front, Gradient Descent, Kernel Ridge Regression, Support Vector Regression, and Piecewise Function-Fitting methods introduce innovative approaches. Gradient Descent assures computational efficiency in optimizing solutions, Kernel Ridge Regression leverages the kernel trick to navigate nonlinear patterns, and Support Vector Regression is proficient in forecasting extremities, pivotal for exploration risk assessment. The Piecewise Function-Fitting approach, tailored for geological data, facilitates adaptable modeling of variable interrelations, accommodating abrupt data trend shifts. Our analysis identifies Ridge Regression, particularly when augmented by Piecewise Function-Fitting, as superior in recouping hydrocarbon losses, underscoring its utility in resource quantification refinement. Meanwhile, Kernel Ridge Regression emerges as a noteworthy strategy in ameliorating porosity-logging curve prediction for well A, evidencing its aptness for intricate geological structures. This research attests to the scientific ascendancy and broad-spectrum relevance of these regression techniques over conventional methods while heralding new horizons for their deployment in the oil and gas sector. The insights garnered from these advanced modeling strategies are set to transform geological and engineering practices in hydrocarbon prediction, evaluation, and recovery.
The burning of crop residues in fields is a significant global biomass burning activity, a key element of the terrestrial carbon cycle, and an important source of atmospheric trace gases and aerosols. Accurate estimation of cropland burned area is both crucial and challenging, especially for the small and fragmented burned scars in China. Here we developed an automated burned area mapping algorithm implemented using Sentinel-2 MultiSpectral Instrument (MSI) data, and tested its effectiveness on the Songnen Plain, Northeast China, as a case study using satellite images from 2020. We employed a logistic regression method to integrate multiple spectral data into a synthetic indicator, and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer (MODIS) MCD64A1 burned area product. The overall accuracy of the single-variable logistic regression was 77.38% to 86.90% and 73.47% to 97.14% for the 52TCQ and 51TYM cases, respectively. In comparison, multiple-variable logistic regression of Sentinel-2 images improved the accuracy of the burned area map to 87.14% and 98.33% for the 52TCQ and 51TYM cases, respectively. The balance of omission error and commission error was also improved. The integration of multiple spectral data combined with a logistic regression method proves to be effective for burned area detection, offering a highly automated process with an automatic threshold determination mechanism. This method exhibits excellent extensibility and flexibility, taking the image tile as the operating unit. It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.
Partial Differential Equations (PDEs) are among the most fundamental tools employed to model dynamic systems. Existing PDE modeling methods are typically derived from established knowledge and known phenomena, which is time-consuming and labor-intensive. Recently, discovering governing PDEs from collected actual data via Physics Informed Neural Networks (PINNs) has provided a more efficient way to analyze new dynamic systems and establish PDE models. This study proposes Sequentially Threshold Least Squares-Lasso (STLasso), a module constructed by incorporating Lasso regression into the Sequentially Threshold Least Squares (STLS) algorithm, which can complete sparse regression of PDE coefficients under the constraint of the l0 norm. It further introduces PINN-STLasso, a physics informed neural network combined with Lasso sparse regression, able to find underlying PDEs from data with reduced data requirements and better interpretability. In addition, this research conducts experiments on canonical inverse PDE problems and compares the results to several recent methods. The results demonstrate that the proposed PINN-STLasso outperforms other methods, achieving lower error rates even with less data.
Objective This study employs the Geographically and Temporally Weighted Regression (GTWR) model to assess the impact of meteorological elements and imported cases on dengue fever outbreaks, emphasizing the spatial-temporal variability of these factors in border regions. Methods We conducted a descriptive analysis of dengue fever’s temporal-spatial distribution in Yunnan border areas. Utilizing annual data from 2013 to 2019, with each county in the Yunnan border serving as a spatial unit, we constructed a GTWR model to investigate the determinants of dengue fever and their spatio-temporal heterogeneity in this region. Results The GTWR model, proving more effective than Ordinary Least Squares (OLS) analysis, identified significant spatial and temporal heterogeneity in factors influencing dengue fever’s spread along the Yunnan border. Notably, the GTWR model revealed a substantial variation in the relationship between indigenous dengue fever incidence, meteorological variables, and imported cases across different counties. Conclusion In the Yunnan border areas, local dengue incidence is affected by temperature, humidity, precipitation, wind speed, and imported cases, with these factors’ influence exhibiting notable spatial and temporal variation.
Abstract: The purpose of this study was to establish a method able to accurately estimate the long-term exposure levels of individuals to fine particulate matter (PM2.5) in Jiujiang City (China) by constructing land use regression (LUR) models. Subsequently, the accuracy of models was further verified. PM2.5 concentrations were continuously collected daily from seven monitoring stations for the construction of daily LUR models from September 1 to 14, 2023. The constructed models used PM2.5 concentrations as the dependent variable, while land use, elevation, population density and road length were used as the predictive variables. Subsequently, twenty volunteers were invited to participate, with their daily PM2.5 exposure estimated based on their work address and home address, allowing their average exposure levels to be calculated. Furthermore, volunteers wore portable PM2.5 detectors continuously for a 14-day period and the average measured PM2.5 level was used as a comparative standard. Results showed that the adjusted R2 values for the 14 daily models ranged from 0.85 to 0.94, with the R2 values generated from leave-one-out cross-validation tests all greater than 0.61, indicating good prediction accuracy. No significant differences were observed in the measurement accuracy of the LUR modeling method and measurements using a portable PM2.5 detector (p > 0.05). This study aimed to develop a novel method for the accurate and convenient measurement of individual long-term PM2.5 exposure levels for epidemiological studies in urban environments comparable to that of Jiujiang City.
Abstract: This article presents a mathematical model addressing a scenario involving a hybrid nanofluid flow between two infinite parallel plates. One plate remains stationary, while the other moves downward at a squeezing velocity. The space between these plates contains a Darcy-Forchheimer porous medium. A mixture of water-based fluid with gold (Au) and silicon dioxide (SiO2) nanoparticles is formulated. In contrast to the conventional Fourier's heat flux equation, this study employs the Cattaneo-Christov heat flux equation. A uniform magnetic field is applied perpendicular to the flow direction, invoking magnetohydrodynamic (MHD) effects. Further, the model accounts for Joule heating, which is the heat generated when an electric current passes through the fluid. The problem is solved via NDSolve in MATHEMATICA. Numerical and statistical analyses are conducted to provide insights into the behavior of the nanomaterials between the parallel plates with respect to the flow, energy transport, and skin friction. The findings of this study have potential applications in enhancing cooling systems and optimizing thermal management strategies. It is observed that the squeezing motion generates additional pressure gradients within the fluid, which enhances the flow rate but reduces the frictional drag. Consequently, the fluid is pushed more vigorously between the plates, increasing the flow velocity. As the fluid experiences higher flow rates due to the increased squeezing effect, it spends less time in the region between the plates. The thermal relaxation, however, abruptly changes the temperature, leading to a decrease in the temperature fluctuations.
Funding: Supported by the Hangzhou Joint Fund of the Zhejiang Provincial Natural Science Foundation of China (LHZY24A010002) and the MOE Project of Humanities and Social Sciences (21YJCZH235).
Abstract: High-dimensional heterogeneous data have attracted increasing attention and discussion in the past decade. In the context of heterogeneity, semiparametric regression emerges as a popular method to model this type of data in statistics. In this paper, we leverage the benefits of expectile regression for computational efficiency and analytical robustness in heterogeneity, and propose a regularized partially linear additive expectile regression model with a nonconvex penalty, such as SCAD or MCP, for high-dimensional heterogeneous data. We focus on a more realistic scenario where the regression error exhibits a heavy-tailed distribution with only finite moments. This scenario challenges the classical sub-Gaussian distribution assumption and is more prevalent in practical applications. Under certain regularity conditions, we demonstrate that with probability tending to one, the oracle estimator is one of the local minima of the induced optimization problem. Our theoretical analysis suggests that the dimensionality of linear covariates that our estimation procedure can handle is fundamentally limited by the moment condition of the regression error. Computationally, given the nonconvex and nonsmooth nature of the induced optimization problem, we have developed a two-step algorithm. Finally, our method’s effectiveness is demonstrated through its high estimation accuracy and effective model selection, as evidenced by Monte Carlo simulation studies and a real-data application. Furthermore, by taking various expectile weights, our method effectively detects heterogeneity and explores the complete conditional distribution of the response variable, underscoring its utility in analyzing high-dimensional heterogeneous data.
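As a point of reference for the expectile weights mentioned in this abstract, the following is a minimal sketch of how a single sample expectile can be computed, not the regularized partially linear additive model the paper proposes. It uses the standard asymmetric squared loss, where observations above the current estimate receive weight tau and the rest receive 1 - tau; the function name `expectile`, its parameters, and the sample data are hypothetical.

```python
def expectile(values, tau, tol=1e-12, max_iter=100):
    """Return the tau-expectile of a sample via iteratively reweighted means."""
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    estimate = sum(values) / len(values)  # the mean is the 0.5-expectile
    for _ in range(max_iter):
        # Asymmetric weights: tau above the estimate, 1 - tau at or below it.
        weights = [tau if v > estimate else 1.0 - tau for v in values]
        updated = sum(w * v for w, v in zip(weights, values)) / sum(weights)
        if abs(updated - estimate) < tol:
            return updated
        estimate = updated
    return estimate


# Hypothetical sample with one large value in the upper tail.
sample = [1.0, 2.0, 3.0, 4.0, 10.0]
low, mid, high = (expectile(sample, t) for t in (0.1, 0.5, 0.9))
print(low, mid, high)
```

Varying tau moves the estimate from the lower to the upper part of the distribution, which is the sense in which a range of expectile weights can expose heterogeneity.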
Funding: National Natural Science Foundation of China (62203118).
Abstract: Piezo actuators are widely used in ultra-precision fields because of their high response and nano-scale step length. However, their hysteresis characteristics seriously affect the accuracy and stability of piezo actuators. Existing methods for fitting hysteresis loops fall into operator-based, differential-equation-based, and machine-learning-based classes. The operator-based and differential-equation-based methods have high modeling cost and high model complexity, while machine learning processes, such as neural network calculation, are opaque, so the physical model framework cannot be directly extracted. Therefore, the sparse identification of nonlinear dynamics (SINDy) algorithm is proposed to fit hysteresis loops. Furthermore, the SINDy algorithm is improved. While the SINDy algorithm builds an orthogonal candidate database for modeling, the sparse regression model is simplified, and the Relay operator is introduced for piecewise fitting to solve the distortion problem of the SINDy algorithm when fitting singularities. The Relay-SINDy algorithm proposed in this paper is applied to fitting hysteresis loops. Good performance is obtained in open-loop and closed-loop experiments. Compared with the existing methods, the modeling cost and model complexity are reduced, and the modeling accuracy of the hysteresis loop is improved.
Abstract: The abstract provided offers a succinct overview of the research paper’s focus on the significance of statistics, specifically regression analysis, across diverse fields. The emphasis on regression analysis indicates its importance as a statistical method that helps researchers understand relationships between variables and make predictions based on data. The inclusion of multiple disciplines, such as health sciences, social sciences, environmental studies, economics, engineering, clinical psychology, social psychology, developmental psychology, cognitive psychology, and education highlights the interdisciplinary relevance of regression analysis. This breadth suggests that the findings and methodologies discussed in the paper may have wide applications, benefiting various sectors by enhancing the quality of research outcomes. The mention of “methodologies and data analysis techniques” indicates that the paper will likely delve into specific statistical approaches, offering a comprehensive examination of how regression analysis is applied in real-world scenarios. This nuance is essential, as it demonstrates the research’s commitment to not only presenting theoretical insights but also practical applications. Furthermore, the abstract states that regression analysis “enhances the validity of findings” and “informs data-driven decision-making.” This assertion underlines the critical role that robust statistical methods play in ensuring that research conclusions are reliable and applicable. The ability of regression analysis to provide clarity and support informed decisions makes it a valuable tool in both academic and professional settings. The abstract effectively outlines the paper’s exploration of regression analysis in various fields, underscoring its importance in enhancing research validity and facilitating informed decision-making. The interdisciplinary nature of the research broadens its appeal and emphasizes the need for rigorous statistical approaches in addressing complex issues across different domains.
Funding: Financially supported by the National Key Research and Development Program (Grant No. 2022YFE0107000), the General Projects of the National Natural Science Foundation of China (Grant No. 52171259), and the High-Tech Ship Research Project of the Ministry of Industry and Information Technology (Grant No. [2021]342).
Abstract: Identification of the ice channel is a basic technology for developing intelligent ships in ice-covered waters and is important for ensuring the safety and economy of navigation. In the Arctic, merchant ships with low ice class often navigate in channels opened up by icebreakers. Navigation in the ice channel depends to a large extent on the captain's maneuvering skills and experience. The ship may get stuck if steered into ice fields off the channel. Under these circumstances, it is very important to study how to identify the boundary lines of ice channels with a reliable method. In this paper, a two-stage ice channel identification method is developed based on image segmentation and corner point regression. The first stage employs the image segmentation method to extract channel regions. In the second stage, an intelligent corner regression network is proposed to extract the channel boundary lines from the channel region. A non-intelligent angle-based filtering and clustering method is also proposed and compared with the corner point regression network. The training and evaluation of the segmentation method and corner regression network are carried out on synthetic and real ice channel datasets. The evaluation results show that the accuracy of the method using the corner point regression network in the second stage reaches 73.33% on the synthetic ice channel dataset and 70.66% on the real ice channel dataset, and the processing speed can reach up to 14.58 frames per second.
Funding: Supported by the National Natural Science Foundation of China (Project No. 42375192), the China Meteorological Administration Climate Change Special Program (CMA-CCSP, Project No. QBZ202315), and the Vector Stiftung through the Young Investigator Group "Artificial Intelligence for Probabilistic Weather Forecasting."
Abstract: Despite the maturity of ensemble numerical weather prediction (NWP), the resulting forecasts are still, more often than not, under-dispersed. As such, forecast calibration tools have become popular. Among those tools, quantile regression (QR) is highly competitive in terms of both flexibility and predictive performance. Nevertheless, a long-standing problem of QR is quantile crossing, which greatly limits the interpretability of QR-calibrated forecasts. On this point, this study proposes a non-crossing quantile regression neural network (NCQRNN), for calibrating ensemble NWP forecasts into a set of reliable quantile forecasts without crossing. The overarching design principle of NCQRNN is to add on top of the conventional QRNN structure another hidden layer, which imposes a non-decreasing mapping between the combined output from nodes of the last hidden layer to the nodes of the output layer, through a triangular weight matrix with positive entries. The empirical part of the work considers a solar irradiance case study, in which four years of ensemble irradiance forecasts at seven locations, issued by the European Centre for Medium-Range Weather Forecasts, are calibrated via NCQRNN, as well as via an eclectic mix of benchmarking models, ranging from the naïve climatology to the state-of-the-art deep-learning and other non-crossing models. Formal and stringent forecast verification suggests that the forecasts post-processed via NCQRNN attain the maximum sharpness subject to calibration, amongst all competitors. Furthermore, the proposed conception to resolve quantile crossing is remarkably simple yet general, and thus has broad applicability as it can be integrated with many shallow- and deep-learning-based neural networks.
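The non-decreasing mapping described in this abstract can be illustrated with a deliberately simplified sketch. It is not the NCQRNN architecture itself: it assumes the simplest triangular weight matrix (all ones on and below the diagonal) and uses a softplus to make every increment positive, so that each quantile is the previous one plus a positive step. The names `softplus` and `non_crossing_quantiles` and the input values are hypothetical.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x)); always strictly positive."""
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)


def non_crossing_quantiles(raw_outputs):
    """Map unconstrained outputs to non-decreasing quantile forecasts.

    The first raw output is taken as the lowest quantile. Each later raw
    output is turned into a positive increment and accumulated, which is
    the same as multiplying the increments by a lower-triangular matrix
    of ones (an assumed special case of positive triangular weights).
    """
    quantiles = [raw_outputs[0]]
    for raw in raw_outputs[1:]:
        quantiles.append(quantiles[-1] + softplus(raw))
    return quantiles


# Hypothetical raw network outputs for four quantile levels.
forecast = non_crossing_quantiles([300.0, -2.0, 0.5, -5.0])
print(forecast)
```

Because every increment is positive, the outputs cannot cross regardless of the raw values, which is the property the abstract attributes to the added layer.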
Funding: Supported by the Fundamental Research Program of Shanxi Province (No. 202203021211088) and the Shanxi Provincial Natural Science Foundation (No. 202204021301049).
Abstract: The performance of lithium-ion batteries (LIBs) gradually declines over time, making it critical to predict the battery’s state of health (SOH) in real time. This paper presents a model that incorporates health indicators and ensemble Gaussian process regression (EGPR) to predict the SOH of LIBs. Firstly, the degradation process of an LIB is analyzed through indirect health indicators (HIs) derived from voltage and temperature during discharge. Next, the parameters in the EGPR model are optimized using the gannet optimization algorithm (GOA), and the EGPR is employed to estimate the SOH of LIBs. Finally, the proposed model is tested under various experimental scenarios and compared with other machine learning models. The effectiveness of the EGPR model is demonstrated using the National Aeronautics and Space Administration (NASA) LIB dataset. The root mean square error (RMSE) is maintained within 0.20%, and the mean absolute error (MAE) is below 0.16%, illustrating the proposed approach’s excellent predictive accuracy and wide applicability.
Funding: Financially supported by the National Key R&D Program of China (2018YFA0702504), the National Natural Science Foundation of China (42174152), the Strategic Cooperation Technology Projects of China National Petroleum Corporation (CNPC) and China University of Petroleum-Beijing (CUPB) (ZLZX2020-03), and the R&D Department of China National Petroleum Corporation (2022DQ0604-01).
Abstract: The picking efficiency of seismic first breaks (FBs) has been greatly accelerated by deep learning (DL) technology. However, the picking accuracy and efficiency of DL methods still face huge challenges in low signal-to-noise ratio (SNR) situations. To address this issue, we propose a regression approach to pick FBs based on a bidirectional long short-term memory (BiLSTM) neural network by learning the implicit Eikonal equation of 3D inhomogeneous media with rugged topography in the target region. We employ a regressive model that represents the relationships among the elevation of shots, offset and the elevation of receivers with their seismic traveltime to predict the unknown FBs from common-shot gathers with sparsely distributed traces. Different from image segmentation methods, which automatically extract image features and classify FBs from seismic data, the proposed method can learn the inner relationship between field geometry and FBs. In addition, the results predicted by the regressive model are continuous values of FBs rather than the discrete ones of the binary distribution. The picking results on synthetic data show that the proposed method has low dependence on label data, and can obtain reliable and similar predicted results using two types of label data with large differences. The picking results of 9380 shots for 3D seismic data generated by vibroseis indicate that the proposed method can still accurately predict FBs in low SNR data. The subsequent stacked profiles further illustrate the reliability and effectiveness of the proposed method. The results of model data and field seismic data demonstrate that the proposed regression method is a robust first-break picker with high potential for field application.
Abstract: Purpose: The purpose of this study is to develop and compare model choice strategies in the context of logistic regression. Model choice means the choice of the covariates to be included in the model. Design/methodology/approach: The study is based on Monte Carlo simulations. The methods are compared in terms of three measures of accuracy: specificity and two kinds of sensitivity. A loss function combining sensitivity and specificity is introduced and used for a final comparison. Findings: The choice of method depends on how much the users emphasize sensitivity against specificity. It also depends on the sample size. For a typical logistic regression setting with a moderate sample size and a small to moderate effect size, either BIC, BICc or Lasso seems to be optimal. Research limitations: Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data. Thus, more simulations are needed. Practical implications: Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper. Alternatively, they could run their own simulations and calculate the loss function. Originality/value: This is a systematic comparison of model choice algorithms and heuristics in the context of logistic regression. The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.
Abstract: Ignimbrites have been widely used as building materials in many historical and touristic structures in the Kayseri region of Türkiye. Their diverse colours and textures make them a popular choice for modern construction as well. However, ignimbrites are particularly vulnerable to atmospheric conditions, such as freeze-thaw cycles, due to their high porosity, which is a result of their formation process. When water enters the pores of the ignimbrites, it can freeze during cold weather. As the water freezes and expands, it generates internal stress within the stone, causing micro-cracks to develop. Over time, repeated freeze-thaw (F-T) cycles lead to the growth of these micro-cracks into larger cracks, compromising the structural integrity of the ignimbrites and eventually making them unsuitable for use as building materials. Determining the long-term F-T performance of ignimbrites typically requires extensive experimental testing over prolonged freeze-thaw cycles. To streamline this process, developing accurate predictive equations becomes crucial. In this study, such equations were formulated using classical regression analyses and artificial neural networks (ANN) based on data obtained from these experiments, allowing for the prediction of the F-T performance of ignimbrites and other similar building stones without the need for lengthy testing. Uniaxial compressive strength, ultrasonic propagation velocity, apparent porosity and mass loss of ignimbrites after long-term F-T were determined. Following the F-T cycles, the disintegration rate was evaluated using decay function approaches, while uniaxial compressive strength (UCS) values were predicted with minimal input parameters through both regression and ANN analyses. The ANN and regression models created for this purpose started with a single input value and were then developed with combinations of two and three inputs. The predictive performance of the ANN models was assessed by comparing them to the regression models using the coefficient of determination (R2) as the evaluation criterion. As a result of the study, higher R2 values (0.87) were obtained in models built with artificial neural networks. The results of the study indicate that ANN usage can produce results close to experimental outcomes in predicting the long-term F-T performance of ignimbrite samples.
Funding: Supported by the Fundamental Research Funds for the Central Universities of China (Grant No. 2023JBMC014).
Abstract: In the railway system, fasteners have the functions of damping, maintaining the track distance, and adjusting the track level. Therefore, routine maintenance and inspection of fasteners are important to ensure the safe operation of track lines. Currently, assessment methods for fastener tightness include manual observation, acoustic wave detection, and image detection. These have limitations such as low accuracy and efficiency and susceptibility to interference and misjudgment, and accurate, stable, and fast detection methods are lacking. Aiming at the small deformation characteristics and large elastic change of fasteners from full loosening to full tightening, this study proposes high-precision surface-structured light technology for fastener detection, fastener deformation feature extraction based on the center-line projection distance, and a fastener tightness regression method based on neural networks. First, the method uses a 3D camera to obtain a fastener point cloud and then segments the elastic rod area based on iterative closest point algorithm registration. Principal component analysis is used to calculate the normal vector of the segmented elastic rod surface and extract the point on the centerline of the elastic rod. The point is projected onto the upper surface of the bolt to calculate the projection distance. Subsequently, the mapping relationship between the projection distance sequence and fastener tightness is established, and the influence of each parameter on the fastener tightness prediction is analyzed. Finally, a fastener detection scene was set up in the track experimental base, data were collected, and the algorithm was verified. The results showed that the RMSE between the fastener tightness regression value produced by the algorithm and the actual measured value was 0.2196 mm, a significant improvement over other tightness detection methods, realizing an effective fastener tightness regression.
Abstract: This study aims to predict the undrained shear strength of remolded soil samples using non-linear regression analyses, fuzzy logic, and artificial neural network modeling. A total of 1306 undrained shear strength results from 230 different remolded soil test settings reported in 21 publications were collected, utilizing six different measurement devices. Although water content, plastic limit, and liquid limit were used as input parameters for fuzzy logic and artificial neural network modeling, liquidity index or water content ratio was considered as an input parameter for non-linear regression analyses. In non-linear regression analyses, 12 different regression equations were derived for the prediction of undrained shear strength of remolded soil. Feed-forward backpropagation and the TANSIG transfer function were used for artificial neural network modeling, while the Mamdani inference system was preferred with trapezoidal and triangular membership functions for fuzzy logic modeling. The experimental results of 914 tests were used for training of the artificial neural network models, 196 for validation and 196 for testing. It was observed that the accuracy of the artificial neural network and fuzzy logic modeling was higher than that of the non-linear regression analyses. Furthermore, a simple and reliable regression equation was proposed for assessments of undrained shear strength values with higher coefficients of determination.
Funding: Supported by the Municipal Science and Technology Program of Wuwei City, China (WW2202RPZ037) and the Fundamental Research Funds for the Central Universities in China (Grant No. lzujbky-2018-69).
Abstract: Objective Previous studies on the association between lipid profiles and chronic kidney disease (CKD) have yielded inconsistent results and no defined thresholds for blood lipids. Methods A prospective cohort study including 32,351 subjects who completed baseline and follow-up surveys over 5 years was conducted. Restricted cubic splines and Cox models were used to examine the association between the lipid profiles and CKD. A regression discontinuity design was used to determine the cutoff values of lipid profiles that were significantly associated with an increased risk of CKD. Results Over a median follow-up time of 2.2 (0.5, 4.2) years, 648 (2.00%) subjects developed CKD. The lipid profiles that were significantly and linearly related to CKD included total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), TC/HDL-C, and TG/HDL-C, whereas low-density lipoprotein cholesterol (LDL-C) and LDL-C/HDL-C were nonlinearly correlated with CKD. TC, TG, TC/HDL-C, and TG/HDL-C showed an upward jump at the cutoff value, increasing the risk of CKD by 0.90%, 1.50%, 2.30%, and 1.60%, respectively, whereas HDL-C showed a downward jump at the cutoff value, reducing this risk by 1.0%. Females and participants with dyslipidemia had a higher risk of CKD, while the cutoff values differed across population characteristics. Conclusion There was a significant association between lipid profiles and CKD in a prospective cohort from Northwest China, while TG, TC/HDL-C, and TG/HDL-C showed a stronger risk association. The specific cutoff values of lipid profiles may provide a clinical reference for screening or diagnosing CKD risk.
Funding: Supported in part by the National Key Research and Development Program of China (2021YFC2902703) and the National Natural Science Foundation of China (62173078, 61773105, 61533007, 61873049, 61873053, 61703085, 61374147).
Abstract: Concentrate copper grade (CCG) is one of the important production indicators of copper flotation processes, and keeping the CCG at the set value is of great significance to the economic benefit of copper flotation industrial processes. This paper addresses the fluctuation problem of CCG through an operational optimization method. Firstly, a density-based affinity propagation algorithm is proposed so that more ideal working condition categories can be obtained for the complex raw ore properties. Next, a Bayesian network (BN) is applied to explore the relationship between the operational variables and the CCG. Based on the analysis results of the BN, a weighted Gaussian process regression model is constructed to predict the CCG so that a higher prediction accuracy can be obtained. To ensure the predicted CCG is close to the set value with a smaller magnitude of the operation adjustments and a smaller uncertainty of the prediction results, an index-oriented adaptive differential evolution (IOADE) algorithm is proposed, and the convergence performance of IOADE is superior to the traditional differential evolution and adaptive differential evolution methods. Finally, the effectiveness and feasibility of the proposed methods are verified by experiments on a copper flotation industrial process.
Funding: Financially supported by the National Natural Science Foundation of China (Grant No. 42072309), the Fundamental Research Funds for National University, China University of Geosciences (Wuhan) (Grant No. CUGDCJJ202217), the Knowledge Innovation Program of Wuhan-Basic Research (Grant No. 2022020801010199), and the Hubei Key Laboratory of Blasting Engineering Foundation (HKLBEF202002).
Abstract: Accurately estimating blasting vibration during rock blasting is the foundation of blasting vibration management. In this study, Tuna Swarm Optimization (TSO), Whale Optimization Algorithm (WOA), and Cuckoo Search (CS) were used to optimize two hyperparameters in support vector regression (SVR). Based on these methods, three hybrid models to predict peak particle velocity (PPV) for bench blasting were developed. Eighty-eight samples were collected to establish the PPV database, eight initial blasting parameters were chosen as input parameters for the prediction model, and the PPV was the output parameter. As predictive performance evaluation indicators, the coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and a10-index were selected. The normalized mutual information value is then used to evaluate the impact of various input parameters on the PPV prediction outcomes. According to the research findings, TSO, WOA, and CS can all enhance the predictive performance of the SVR model. The TSO-SVR model provides the most accurate predictions. The performances of the optimized hybrid SVR models are superior to the unoptimized traditional prediction model. The maximum charge per delay impacts the PPV prediction value the most.
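The four evaluation indicators named in this abstract can be sketched in a few lines. This is an illustration only, using made-up numbers rather than the study's data; it assumes the common definition of the a10-index as the fraction of samples whose predicted-to-measured ratio lies between 0.90 and 1.10. The function name `regression_metrics` and the sample values are hypothetical.

```python
import math


def regression_metrics(measured, predicted):
    """Return R2, RMSE, MAE and a10-index for paired measured/predicted values."""
    n = len(measured)
    mean_measured = sum(measured) / n
    residuals = [m - p for m, p in zip(measured, predicted)]
    ss_res = sum(r * r for r in residuals)                    # residual sum of squares
    ss_tot = sum((m - mean_measured) ** 2 for m in measured)  # total sum of squares
    # a10-index (assumed definition): share of predictions within +/-10% of the
    # measured value; measured values must be non-zero.
    within_10 = sum(1 for m, p in zip(measured, predicted) if 0.9 <= p / m <= 1.1)
    return {
        "r2": 1.0 - ss_res / ss_tot,
        "rmse": math.sqrt(ss_res / n),
        "mae": sum(abs(r) for r in residuals) / n,
        "a10": within_10 / n,
    }


# Hypothetical PPV values, for illustration only.
measured_ppv = [2.0, 4.0, 6.0, 8.0]
predicted_ppv = [2.1, 3.0, 6.3, 8.0]
metrics = regression_metrics(measured_ppv, predicted_ppv)
print(metrics)
```

R2, RMSE and MAE summarize overall error, while the a10-index reports how often a prediction lands close to the measurement, which is why the four are often reported together.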
Abstract: In oil and gas exploration, elucidating the complex interdependencies among geological variables is paramount. Our study applies advanced regression analysis methods, aiming not only to predict geophysical logging curve values but also to mitigate the hydrocarbon depletion observed in geochemical logging. Through a rigorous assessment, we explore the efficacy of eight regression models, divided into linear and nonlinear groups, to accommodate the multifaceted nature of geological datasets. Our linear model suite encompasses the Standard Equation, Ridge Regression, Least Absolute Shrinkage and Selection Operator, and Elastic Net, each presenting distinct advantages. The Standard Equation serves as a foundational benchmark, whereas Ridge Regression adds penalty terms to counteract overfitting, thus bolstering model robustness in the presence of multicollinearity. The Least Absolute Shrinkage and Selection Operator performs variable selection to streamline models and enhance their interpretability, while Elastic Net combines the merits of Ridge Regression and the Least Absolute Shrinkage and Selection Operator, balancing model complexity and comprehensibility. On the nonlinear front, Gradient Descent, Kernel Ridge Regression, Support Vector Regression, and Piecewise Function-Fitting methods introduce innovative approaches. Gradient Descent provides computational efficiency in optimizing solutions, Kernel Ridge Regression leverages the kernel trick to capture nonlinear patterns, and Support Vector Regression is proficient in forecasting extreme values, which is pivotal for exploration risk assessment. The Piecewise Function-Fitting approach, tailored for geological data, facilitates adaptable modeling of variable interrelations and accommodates abrupt shifts in data trends. Our analysis identifies Ridge Regression, particularly when augmented by Piecewise Function-Fitting, as superior in recovering hydrocarbon losses, underscoring its utility in refining resource quantification. Meanwhile, Kernel Ridge Regression emerges as a noteworthy strategy for improving porosity-logging curve prediction for well A, evidencing its suitability for intricate geological structures. This research attests to the scientific advantages and broad relevance of these regression techniques over conventional methods and opens new horizons for their deployment in the oil and gas sector. The insights gained from these modeling strategies are expected to improve geological and engineering practices in hydrocarbon prediction, evaluation, and recovery.
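The different penalty behaviors described in the abstract above can be illustrated for the simplest case of one centered predictor with no intercept, where each estimator has a closed form. This is a sketch under stated assumptions, not the study's implementation: the objective scaling (half the squared error plus the penalty) and the names `lam_l1` and `lam_l2` are choices made here for illustration.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam, returning 0.0 inside [-lam, lam]."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def one_predictor_fits(x, y, lam_l1, lam_l2):
    """Closed-form slopes for a single centered predictor, no intercept.

    Assumed objectives (beta is the slope):
      OLS:         0.5 * sum((y - x*beta)^2)
      Ridge:       OLS + 0.5 * lam_l2 * beta^2
      Lasso:       OLS + lam_l1 * |beta|
      Elastic Net: OLS + lam_l1 * |beta| + 0.5 * lam_l2 * beta^2
    """
    sxx = sum(xi * xi for xi in x)
    sxy = sum(xi * yi for xi, yi in zip(x, y))
    return {
        "ols": sxy / sxx,
        "ridge": sxy / (sxx + lam_l2),                 # proportional shrinkage
        "lasso": soft_threshold(sxy, lam_l1) / sxx,    # can become exactly zero
        "elastic_net": soft_threshold(sxy, lam_l1) / (sxx + lam_l2),
    }


fits = one_predictor_fits([-1.0, 0.0, 1.0], [-2.0, 0.0, 2.0],
                          lam_l1=1.0, lam_l2=2.0)
print(fits)
```

Ridge shrinks the slope without ever setting it to zero, whereas the soft-threshold step lets Lasso and Elastic Net drop a weak predictor entirely, which is the variable-selection behavior the abstract refers to.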
Funding: Under the auspices of the National Natural Science Foundation of China (No. 42101414) and the Natural Science Fund for Outstanding Young Scholars in Jilin Province (No. 20230508106RC).
Abstract: The burning of crop residues in fields is a significant global biomass burning activity, a key element of the terrestrial carbon cycle, and an important source of atmospheric trace gases and aerosols. Accurate estimation of cropland burned area is both crucial and challenging, especially for the small and fragmented burned scars in China. Here we developed an automated burned area mapping algorithm based on Sentinel-2 MultiSpectral Instrument (MSI) data and tested its effectiveness on the Songnen Plain, Northeast China, using satellite images from 2020. We employed a logistic regression method to integrate multiple spectral data into a synthetic indicator, and compared the results with manually interpreted burned area reference maps and the Moderate-Resolution Imaging Spectroradiometer (MODIS) MCD64A1 burned area product. The overall accuracy of the single-variable logistic regression was 77.38% to 86.90% and 73.47% to 97.14% for the 52TCQ and 51TYM cases, respectively. In comparison, multiple-variable logistic regression of Sentinel-2 images improved the accuracy of the burned area map to 87.14% and 98.33% for the 52TCQ and 51TYM cases, respectively. The balance between omission error and commission error was also improved. The integration of multiple spectral data with a logistic regression method proves to be effective for burned area detection, offering a highly automated process with an automatic threshold determination mechanism. The method is extensible and flexible because it takes the image tile as the operating unit. It is suitable for burned area detection at a regional scale and can also be implemented with other satellite data.
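The idea of combining several spectral variables into one synthetic burn indicator through logistic regression can be sketched as below. The two feature columns, their values, the learning rate, and the 0.5 cutoff are all hypothetical; the abstract does not list the spectral variables or describe how its automatic threshold is determined.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(features, labels, learning_rate=0.5, epochs=2000):
    """Fit logistic regression by batch gradient descent on the log loss."""
    n, dim = len(features), len(features[0])
    weights, bias = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(features, labels):
            error = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x))) - y
            grad_b += error
            for j in range(dim):
                grad_w[j] += error * x[j]
        bias -= learning_rate * grad_b / n
        weights = [w - learning_rate * g / n for w, g in zip(weights, grad_w)]
    return weights, bias


def burn_probability(pixel, weights, bias):
    """Synthetic indicator: probability that a pixel is burned."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, pixel)))


# Hypothetical training pixels: (spectral_index_a, spectral_index_b).
burned = [(0.10, 0.60), (0.20, 0.70), (0.15, 0.50), (0.30, 0.65)]
unburned = [(0.60, 0.10), (0.70, 0.20), (0.50, 0.15), (0.65, 0.30)]
weights, bias = fit_logistic(burned + unburned, [1] * 4 + [0] * 4)

# Assumed fixed cutoff of 0.5 for illustration only.
print(burn_probability((0.2, 0.6), weights, bias) > 0.5)
```

In a single-variable setting the same code runs with one feature per pixel, which mirrors the single-variable versus multiple-variable comparison reported in the abstract.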
Abstract: Partial differential equations (PDEs) are among the most fundamental tools employed to model dynamic systems. Existing PDE modeling methods are typically derived from established knowledge and known phenomena, which makes them time-consuming and labor-intensive. Recently, discovering governing PDEs from collected data via Physics Informed Neural Networks (PINNs) has provided a more efficient way to analyze new dynamic systems and establish PDE models. This study proposes Sequentially Threshold Least Squares-Lasso (STLasso), a module constructed by incorporating Lasso regression into the Sequentially Threshold Least Squares (STLS) algorithm, which can complete sparse regression of PDE coefficients under an ℓ0-norm constraint. It further introduces PINN-STLasso, a physics informed neural network combined with Lasso sparse regression, which can find underlying PDEs from data with reduced data requirements and better interpretability. In addition, this research conducts experiments on canonical inverse PDE problems and compares the results with several recent methods. The results demonstrate that the proposed PINN-STLasso outperforms the other methods, achieving lower error rates even with less data.
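The STLS building block mentioned in the abstract above alternates a least-squares fit with pruning of small coefficients. The sketch below shows plain STLS only; it includes neither the Lasso step of STLasso nor the neural network of PINN-STLasso. The candidate library (1, t, t²), the target relation, the noise term, and the threshold of 0.1 are hypothetical; in PDE discovery the library would hold candidate terms such as derivatives of the solution.

```python
import math


def solve_least_squares(columns, y):
    """Solve the normal equations by Gauss-Jordan elimination with pivoting."""
    k = len(columns)
    aug = [
        [sum(a * b for a, b in zip(columns[i], columns[j])) for j in range(k)]
        + [sum(a * b for a, b in zip(columns[i], y))]
        for i in range(k)
    ]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]


def stls(columns, y, threshold, max_iterations=10):
    """Sequentially thresholded least squares over candidate library columns."""
    active = list(range(len(columns)))
    coefficients = [0.0] * len(columns)
    for _ in range(max_iterations):
        if not active:
            break
        fitted = solve_least_squares([columns[i] for i in active], y)
        coefficients = [0.0] * len(columns)
        for index, value in zip(active, fitted):
            coefficients[index] = value
        kept = [i for i in active if abs(coefficients[i]) >= threshold]
        if kept == active:  # support unchanged: converged
            break
        for i in active:
            if i not in kept:
                coefficients[i] = 0.0  # prune small terms, then refit
        active = kept
    return coefficients


# Hypothetical data: y = 2 - 0.5 * t^2 plus a small deterministic disturbance.
t = [i / 10 for i in range(20)]
library = [[1.0] * len(t), t, [ti ** 2 for ti in t]]
y = [2.0 - 0.5 * ti ** 2 + 0.005 * math.sin(37 * i) for i, ti in enumerate(t)]
coefficients = stls(library, y, threshold=0.1)
print(coefficients)
```

The spurious coefficient on `t` is pruned to exactly zero and the remaining terms are refit, which is how hard thresholding approximates a sparsity (ℓ0-type) constraint.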
Funding: Supported by the National Science and Technology Infrastructure Platform, National Population and Health Science Data Sharing Service Platform, Public Health Science Data Center [NCMI-ZB01N-201905].
Abstract: Objective: This study employs the Geographically and Temporally Weighted Regression (GTWR) model to assess the impact of meteorological elements and imported cases on dengue fever outbreaks, emphasizing the spatial-temporal variability of these factors in border regions. Methods: We conducted a descriptive analysis of the temporal-spatial distribution of dengue fever in the Yunnan border areas. Using annual data from 2013 to 2019, with each county along the Yunnan border serving as a spatial unit, we constructed a GTWR model to investigate the determinants of dengue fever and their spatio-temporal heterogeneity in this region. Results: The GTWR model proved more effective than Ordinary Least Squares (OLS) analysis and identified significant spatial and temporal heterogeneity in the factors influencing the spread of dengue fever along the Yunnan border. Notably, the GTWR model revealed substantial variation across counties in the relationship between indigenous dengue fever incidence, meteorological variables, and imported cases. Conclusion: In the Yunnan border areas, local dengue incidence is affected by temperature, humidity, precipitation, wind speed, and imported cases, and the influence of these factors exhibits notable spatial and temporal variation.
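GTWR differs from OLS in that each location and year gets its own regression, with observations weighted by how close they are in space and time. The sketch below computes such weights with a Gaussian kernel over a combined spatio-temporal distance, a commonly used formulation; the abstract does not state which kernel, scale factors, or bandwidth the study used, so `space_scale`, `time_scale`, `bandwidth`, and the coordinates are assumptions.

```python
import math


def gtwr_weights(target, observations, space_scale=1.0, time_scale=1.0,
                 bandwidth=1.0):
    """Gaussian kernel weights for one regression point in space and time.

    target and each observation are (x, y, t) tuples. The squared
    spatio-temporal distance is assumed to be
        space_scale * ((x - x0)^2 + (y - y0)^2) + time_scale * (t - t0)^2.
    """
    x0, y0, t0 = target
    weights = []
    for x, y, t in observations:
        squared_distance = (space_scale * ((x - x0) ** 2 + (y - y0) ** 2)
                            + time_scale * (t - t0) ** 2)
        weights.append(math.exp(-squared_distance / bandwidth ** 2))
    return weights


# Hypothetical county centroids (arbitrary units) and observation years.
observations = [(0.0, 0.0, 2015), (3.0, 4.0, 2015), (0.0, 0.0, 2017)]
weights = gtwr_weights((0.0, 0.0, 2015), observations, bandwidth=5.0)
print(weights)
```

These weights would then enter a weighted least-squares fit at each county and year, so that nearby and recent observations influence the local coefficients most.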