The ocean plays an important role in maintaining the equilibrium of Earth’s ecology and providing humans access to a wealth of resources.To obtain a high-precision underwater image classification model,we propose a c...The ocean plays an important role in maintaining the equilibrium of Earth’s ecology and providing humans access to a wealth of resources.To obtain a high-precision underwater image classification model,we propose a classification model that combines an EfficientnetB0 neural network and a two-hidden-layer random vector functional link network(EfficientnetB0-TRVFL).The features of underwater images were extracted using the EfficientnetB0 neural network pretrained via ImageNet,and a new fully connected layer was trained on the underwater image dataset using the transfer learning method.Transfer learning ensures the initial performance of the network and helps in the development of a high-precision classification model.Subsequently,a TRVFL was proposed to improve the classification property of the model.Net construction of the two hidden layers exhibited a high accuracy when the same hidden layer nodes were used.The parameters of the second hidden layer were obtained using a novel calculation method,which reduced the outcome error to improve the performance instability caused by the random generation of parameters of RVFL.Finally,the TRVFL classifier was used to classify features and obtain classification results.The proposed EfficientnetB0-TRVFL classification model achieved 87.28%,74.06%,and 99.59%accuracy on the MLC2008,MLC2009,and Fish-gres datasets,respectively.The best convolutional neural networks and existing methods were stacked up through box plots and Kolmogorov-Smirnov tests,respectively.The increases imply improved systematization properties in underwater image classification tasks.The image classification model offers important performance advantages and better stability compared with existing methods.展开更多
In the contemporary era, the proliferation of information technology has led to an unprecedented surge in data generation, with this data being dispersed across a multitude of mobile devices. Facing these situations a...In the contemporary era, the proliferation of information technology has led to an unprecedented surge in data generation, with this data being dispersed across a multitude of mobile devices. Facing these situations and the training of deep learning model that needs great computing power support, the distributed algorithm that can carry out multi-party joint modeling has attracted everyone’s attention. The distributed training mode relieves the huge pressure of centralized model on computer computing power and communication. However, most distributed algorithms currently work in a master-slave mode, often including a central server for coordination, which to some extent will cause communication pressure, data leakage, privacy violations and other issues. To solve these problems, a decentralized fully distributed algorithm based on deep random weight neural network is proposed. The algorithm decomposes the original objective function into several sub-problems under consistency constraints, combines the decentralized average consensus (DAC) and alternating direction method of multipliers (ADMM), and achieves the goal of joint modeling and training through local calculation and communication of each node. Finally, we compare the proposed decentralized algorithm with several centralized deep neural networks with random weights, and experimental results demonstrate the effectiveness of the proposed algorithm.展开更多
Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex int...Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex internal chemical systems of LIBs and the nonlinear degradation of their performance,direct measurement of SOH and RUL is challenging.To address these issues,the Twin Support Vector Machine(TWSVM)method is proposed to predict SOH and RUL.Initially,the constant current charging time of the lithium battery is extracted as a health indicator(HI),decomposed using Variational Modal Decomposition(VMD),and feature correlations are computed using Importance of Random Forest Features(RF)to maximize the extraction of critical factors influencing battery performance degradation.Furthermore,to enhance the global search capability of the Convolution Optimization Algorithm(COA),improvements are made using Good Point Set theory and the Differential Evolution method.The Improved Convolution Optimization Algorithm(ICOA)is employed to optimize TWSVM parameters for constructing SOH and RUL prediction models.Finally,the proposed models are validated using NASA and CALCE lithium-ion battery datasets.Experimental results demonstrate that the proposed models achieve an RMSE not exceeding 0.007 and an MAPE not exceeding 0.0082 for SOH and RUL prediction,with a relative error in RUL prediction within the range of[-1.8%,2%].Compared to other models,the proposed model not only exhibits superior fitting capability but also demonstrates robust performance.展开更多
现代的大型复杂结构,如大坝、高层建筑、桥梁及海洋平台等,处于复杂的环境载荷作用下,这些环境载荷往往是无法测量的。在仅有输出响应时,应用随机减量法RDT获得自由衰减响应信号,而后用时域复指数拟合法、ITD法、特征系统实现算法ERA等...现代的大型复杂结构,如大坝、高层建筑、桥梁及海洋平台等,处于复杂的环境载荷作用下,这些环境载荷往往是无法测量的。在仅有输出响应时,应用随机减量法RDT获得自由衰减响应信号,而后用时域复指数拟合法、ITD法、特征系统实现算法ERA等算法获得结构的模态参数是一种有效的方法。但在数据量有限时,随机减量函数的平均次数过少,导致RD函数的收敛性较差。为此提出了利用Vector Random Decrement技术(VRDT)提取自由衰减响应信号,而后利用特征系统实现算法ERA求得模态参数的方法,新算法能够有效地提高模态参数识别精度。数值算例验证了所提算法的有效性。展开更多
In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (...In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (RF) and Neural Network (NN) as the main statistical tools were reviewed. The aim was to examine and compare these nonparametric classification methods on the following attributes: robustness to training data, sensitivity to changes, data fitting, stability, ability to handle large data sizes, sensitivity to noise, time invested in parameter tuning, and accuracy. The performances, strengths and shortcomings of each of the algorithms were examined, and finally, a conclusion was arrived at on which one has higher performance. It was evident from the literature reviewed that RF is too sensitive to small changes in the training dataset and is occasionally unstable and tends to overfit in the model. KNN is easy to implement and understand but has a major drawback of becoming significantly slow as the size of the data in use grows, while the ideal value of K for the KNN classifier is difficult to set. SVM and RF are insensitive to noise or overtraining, which shows their ability in dealing with unbalanced data. Larger input datasets will lengthen classification times for NN and KNN more than for SVM and RF. Among these nonparametric classification methods, NN has the potential to become a more widely used classification algorithm, but because of their time-consuming parameter tuning procedure, high level of complexity in computational processing, the numerous types of NN architectures to choose from and the high number of algorithms used for training, most researchers recommend SVM and RF as easier and wieldy used methods which repeatedly achieve results with high accuracies and are often faster to implement.展开更多
To solve the precision and reliability problem of various machinery equipments and military vehicles, some military organisations, the industrial sector and the academia at home and abroad begin to pay attention to th...To solve the precision and reliability problem of various machinery equipments and military vehicles, some military organisations, the industrial sector and the academia at home and abroad begin to pay attention to the statistical distribution of machining dimensions, material properties and service loads, and the system reliability optimization design with constraints and reliability optimization design of various mechanical parts is studied in this way. However, the above researches focus on solving the strength and the life problem, and no studies have been done on the discrete degree and discrete pattern of other performance indicators. The concept of using a random vector to describe the mechanical parts performance indicators is presented; characteristics between the value of the vector variance matrix determinant and the sum of the diagonal covariance matrix in describing the performance indicators of vector dispersion are studied and compared. A clutch diaphragm spring is set as an example, the geometric dimension indicator is described with random vector, and the applicability of using variance matrix determinant and variance matrix trace of geometric dimension vector to describe discrete degree of random vector is studied by using Monte-Carlo simulation method and component discrete degree perturbation method. Also, the effects of different components of diaphragm spring geometric dimension vector on the value of covariance matrix determinant and the sum of covariance matrix diagonal of diaphragm spring performance indicators vector are analyzed. The present study shows that the impacts of the dispersion of diaphragm spring cone angle on every performance dispersion are all ranked first, and far exceed that of other dimension dispersion. So it must be strictly controlled in the production process. The result of the research work provides a reference for the design of diaphragm spring, and also it presents a proper method for researching the performance of other mechanical parts.展开更多
Image signals are always disturbed by noise during their transmission, such as in mobile or network communication. The received image quality is significantly influenced by noise. Thus, image signal denoising is an in...Image signals are always disturbed by noise during their transmission, such as in mobile or network communication. The received image quality is significantly influenced by noise. Thus, image signal denoising is an indispensable step during image processing. As we all know, most commonly used methods of image denoising is Bayesian wavelet transform estimators. The Performance of various estimators, such as maximum a posteriori (MAP), or minimum mean square error (MMSE) is strongly dependent on correctness of the proposed model for original data distribution. Therefore, the selection of a proper model for distribution of wavelet coefficients is important in wavelet-based image denoising. This paper presents a new image denoising algorithm based on the modeling of wavelet coefficients in each subband with multivariate Radial Exponential probability density function (PDF) with local variances. Generally these multivariate extensions do not result in a closed form expression, and the solution requires numerical solutions. However, we drive a closed form MMSE shrinkage functions for a Radial Exponential random vectors in additive white Gaussian noise (AWGN). The estimator is motivated and tested on the problem of wavelet-based image denoising. In the last, proposed, the same idea is applied to the dual-tree complex wavelet transform (DT-CWT), This Transform is an over-complete wavelet transform.展开更多
Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 ...Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 amino acid residues are extracted as research object and thefixed-length pattern of 12 amino acids are selected. When using the same characteristic parameters and the same test method, Random Forest algorithm is more effective than Support Vector Machine. In addition, because of Random Forest algorithm doesn’t produce overfitting phenomenon while the dimension of characteristic parameters is higher, we use Random Forest based on higher dimension characteristic parameters to predictβ-hairpin motifs. The better prediction results are obtained;the overall accuracy and Matthew’s correlation coefficient of 5-fold cross-validation achieve 83.3% and 0.59, respectively.展开更多
This research paper proposes a filter to remove Random Valued Impulse Noise (RVIN) based on Global Threshold Vector Outlyingness Ratio (GTVOR) that is applicable for real time image processing. This filter works with ...This research paper proposes a filter to remove Random Valued Impulse Noise (RVIN) based on Global Threshold Vector Outlyingness Ratio (GTVOR) that is applicable for real time image processing. This filter works with the algorithm that breaks the images into various decomposition levels using Discrete Wavelet Transform (DWT) and searches for the noisy pixels using the outlyingness of the pixel. This algorithm has the capability of differentiating high frequency pixels and the “noisy pixel” using the threshold as well as window adjustments. The damage and the loss of information are prevented by means of interior mining. This global threshold based algorithm uses different thresholds for different quadrants of DWT and thus helps in recovery of noisy image even if it is 90% affected. Experimental results exhibit that this method outperforms other existing methods for accurate noise detection and removal, at the same time chain of connectivity is not lost.展开更多
Making the distinction between different plantation tree species is crucial for creating reliable and trustworthy information, which is critical in forestry administration and upkeep. Over the years, forest delineatio...Making the distinction between different plantation tree species is crucial for creating reliable and trustworthy information, which is critical in forestry administration and upkeep. Over the years, forest delineation and mapping have been done using the conventional techniques, such as the utilization of ground truth facts together with orthophotos. These techniques have been proven to be very precise, but they are expensive, cumbersome, and challenging to employ in remote regions. To resolve this shortfall, this research investigates the potential of data from the commercial, PlanetScope CubeSat and the freely available, Sentinel 2 data from Copernicus to discriminate commercial forest tree species in the Usutu Forest, Eswatini. Two approaches for image classification, Random Forest (RF) and the Support Vector Machine (SVM) were investigated at different levels of the forest database classification which is the genus (family of tree species) and species levels. The result of the study indicates that, the Sentinel 2 images had the highest species classification accuracy compared to the PlanetScope image. Both classification methods achieved a 94% maximum OA and 0.90 kappa value at the genus level with the Sentinel 2 imagery. At the species level, the Sentinel 2 imagery again showed highly acceptable results with the SVM method, with an OA of 82%. The PlanetScope images performed badly with less than 64% OA for both RF and SVM at the genus level and poorer at the species level with a low OA figure, 47% and 53% for the SVM and RF respectively. Our results suggest that the freely available Sentinel 2 data together with the SVM method has a high potential for identifying differences between commercial tree species than the PlanetScope. The study uncovered that both classification methods are highly capable of classifying species under the gum genus group (esmi, egxu, and egxn) using both imageries. However, it was difficult to separate species types under the pine genus group, particularly discriminating the hybrid species such as pech and pell since pech is a hybrid species for pell.展开更多
1st cases of COVID-19 were reported in March 2020 in Bangladesh and rapidly increased daily. So many steps were taken by the Bangladesh government to reduce the outbreak of COVID-19, such as masks, gatherings, local m...1st cases of COVID-19 were reported in March 2020 in Bangladesh and rapidly increased daily. So many steps were taken by the Bangladesh government to reduce the outbreak of COVID-19, such as masks, gatherings, local movements, international movements, etc. The data was collected from the World Health Organization. In this research, different variables have been used for analysis, for instance, new cases, new deaths, masks, schools, business, gatherings, domestic movement, international travel, new test, positive rate, test per case, new vaccination smoothed, new vaccine, total vaccination, and stringency index. Machine learning algorithms were used to predict and build the model, such as linear regression, K-nearest neighbours, decision trees, random forests, and support vector machines. Accuracy and Mean Square error (MSE) were used to test the model. A hyperparameter was also applied to find the optimum values of parameters. After computing the analysis, the result showed that the linear regression algorithm performs the best overall among the algorithms listed, with the highest testing accuracy and the lowest RMSE before and after hyper-tuning. The highest accuracy and lowest MSE were used for the best model, and for this data set, Linear regression got the highest accuracy, 0.98 and 0.97 and the lowest MSE, 4.79 and 4.04, respectively.展开更多
Every second, a large volume of useful data is created in social media about the various kind of online purchases and in another forms of reviews. Particularly, purchased products review data is enormously growing in ...Every second, a large volume of useful data is created in social media about the various kind of online purchases and in another forms of reviews. Particularly, purchased products review data is enormously growing in different database repositories every day. Most of the review data are useful to new customers for theier further purchases as well as existing companies to view customers feedback about various products. Data Mining and Machine Leaning techniques are familiar to analyse such kind of data to visualise and know the potential use of the purchased items through online. The customers are making quality of products through their sentiments about the purchased items from different online companies. In this research work, it is analysed sentiments of Headphone review data, which is collected from online repositories. For the analysis of Headphone review data, some of the Machine Learning techniques like Support Vector Machines, Naive Bayes, Decision Trees and Random Forest Algorithms and a Hybrid method are applied to find the quality via the customers’ sentiments. The accuracy and performance of the taken algorithms are also analysed based on the three types of sentiments such as positive, negative and neutral.展开更多
基金support of the National Key R&D Program of China(No.2022YFC2803903)the Key R&D Program of Zhejiang Province(No.2021C03013)the Zhejiang Provincial Natural Science Foundation of China(No.LZ20F020003).
文摘The ocean plays an important role in maintaining the equilibrium of Earth’s ecology and providing humans access to a wealth of resources.To obtain a high-precision underwater image classification model,we propose a classification model that combines an EfficientnetB0 neural network and a two-hidden-layer random vector functional link network(EfficientnetB0-TRVFL).The features of underwater images were extracted using the EfficientnetB0 neural network pretrained via ImageNet,and a new fully connected layer was trained on the underwater image dataset using the transfer learning method.Transfer learning ensures the initial performance of the network and helps in the development of a high-precision classification model.Subsequently,a TRVFL was proposed to improve the classification property of the model.Net construction of the two hidden layers exhibited a high accuracy when the same hidden layer nodes were used.The parameters of the second hidden layer were obtained using a novel calculation method,which reduced the outcome error to improve the performance instability caused by the random generation of parameters of RVFL.Finally,the TRVFL classifier was used to classify features and obtain classification results.The proposed EfficientnetB0-TRVFL classification model achieved 87.28%,74.06%,and 99.59%accuracy on the MLC2008,MLC2009,and Fish-gres datasets,respectively.The best convolutional neural networks and existing methods were stacked up through box plots and Kolmogorov-Smirnov tests,respectively.The increases imply improved systematization properties in underwater image classification tasks.The image classification model offers important performance advantages and better stability compared with existing methods.
文摘In the contemporary era, the proliferation of information technology has led to an unprecedented surge in data generation, with this data being dispersed across a multitude of mobile devices. Facing these situations and the training of deep learning model that needs great computing power support, the distributed algorithm that can carry out multi-party joint modeling has attracted everyone’s attention. The distributed training mode relieves the huge pressure of centralized model on computer computing power and communication. However, most distributed algorithms currently work in a master-slave mode, often including a central server for coordination, which to some extent will cause communication pressure, data leakage, privacy violations and other issues. To solve these problems, a decentralized fully distributed algorithm based on deep random weight neural network is proposed. The algorithm decomposes the original objective function into several sub-problems under consistency constraints, combines the decentralized average consensus (DAC) and alternating direction method of multipliers (ADMM), and achieves the goal of joint modeling and training through local calculation and communication of each node. Finally, we compare the proposed decentralized algorithm with several centralized deep neural networks with random weights, and experimental results demonstrate the effectiveness of the proposed algorithm.
基金funded by the Pyramid Talent Training Project of Beijing University of Civil Engineering and Architecture under Grant GJZJ20220802。
文摘Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex internal chemical systems of LIBs and the nonlinear degradation of their performance,direct measurement of SOH and RUL is challenging.To address these issues,the Twin Support Vector Machine(TWSVM)method is proposed to predict SOH and RUL.Initially,the constant current charging time of the lithium battery is extracted as a health indicator(HI),decomposed using Variational Modal Decomposition(VMD),and feature correlations are computed using Importance of Random Forest Features(RF)to maximize the extraction of critical factors influencing battery performance degradation.Furthermore,to enhance the global search capability of the Convolution Optimization Algorithm(COA),improvements are made using Good Point Set theory and the Differential Evolution method.The Improved Convolution Optimization Algorithm(ICOA)is employed to optimize TWSVM parameters for constructing SOH and RUL prediction models.Finally,the proposed models are validated using NASA and CALCE lithium-ion battery datasets.Experimental results demonstrate that the proposed models achieve an RMSE not exceeding 0.007 and an MAPE not exceeding 0.0082 for SOH and RUL prediction,with a relative error in RUL prediction within the range of[-1.8%,2%].Compared to other models,the proposed model not only exhibits superior fitting capability but also demonstrates robust performance.
文摘现代的大型复杂结构,如大坝、高层建筑、桥梁及海洋平台等,处于复杂的环境载荷作用下,这些环境载荷往往是无法测量的。在仅有输出响应时,应用随机减量法RDT获得自由衰减响应信号,而后用时域复指数拟合法、ITD法、特征系统实现算法ERA等算法获得结构的模态参数是一种有效的方法。但在数据量有限时,随机减量函数的平均次数过少,导致RD函数的收敛性较差。为此提出了利用Vector Random Decrement技术(VRDT)提取自由衰减响应信号,而后利用特征系统实现算法ERA求得模态参数的方法,新算法能够有效地提高模态参数识别精度。数值算例验证了所提算法的有效性。
文摘In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (RF) and Neural Network (NN) as the main statistical tools were reviewed. The aim was to examine and compare these nonparametric classification methods on the following attributes: robustness to training data, sensitivity to changes, data fitting, stability, ability to handle large data sizes, sensitivity to noise, time invested in parameter tuning, and accuracy. The performances, strengths and shortcomings of each of the algorithms were examined, and finally, a conclusion was arrived at on which one has higher performance. It was evident from the literature reviewed that RF is too sensitive to small changes in the training dataset and is occasionally unstable and tends to overfit in the model. KNN is easy to implement and understand but has a major drawback of becoming significantly slow as the size of the data in use grows, while the ideal value of K for the KNN classifier is difficult to set. SVM and RF are insensitive to noise or overtraining, which shows their ability in dealing with unbalanced data. Larger input datasets will lengthen classification times for NN and KNN more than for SVM and RF. Among these nonparametric classification methods, NN has the potential to become a more widely used classification algorithm, but because of their time-consuming parameter tuning procedure, high level of complexity in computational processing, the numerous types of NN architectures to choose from and the high number of algorithms used for training, most researchers recommend SVM and RF as easier and wieldy used methods which repeatedly achieve results with high accuracies and are often faster to implement.
文摘To solve the precision and reliability problem of various machinery equipments and military vehicles, some military organisations, the industrial sector and the academia at home and abroad begin to pay attention to the statistical distribution of machining dimensions, material properties and service loads, and the system reliability optimization design with constraints and reliability optimization design of various mechanical parts is studied in this way. However, the above researches focus on solving the strength and the life problem, and no studies have been done on the discrete degree and discrete pattern of other performance indicators. The concept of using a random vector to describe the mechanical parts performance indicators is presented; characteristics between the value of the vector variance matrix determinant and the sum of the diagonal covariance matrix in describing the performance indicators of vector dispersion are studied and compared. A clutch diaphragm spring is set as an example, the geometric dimension indicator is described with random vector, and the applicability of using variance matrix determinant and variance matrix trace of geometric dimension vector to describe discrete degree of random vector is studied by using Monte-Carlo simulation method and component discrete degree perturbation method. Also, the effects of different components of diaphragm spring geometric dimension vector on the value of covariance matrix determinant and the sum of covariance matrix diagonal of diaphragm spring performance indicators vector are analyzed. The present study shows that the impacts of the dispersion of diaphragm spring cone angle on every performance dispersion are all ranked first, and far exceed that of other dimension dispersion. So it must be strictly controlled in the production process. The result of the research work provides a reference for the design of diaphragm spring, and also it presents a proper method for researching the performance of other mechanical parts.
文摘Image signals are always disturbed by noise during their transmission, such as in mobile or network communication. The received image quality is significantly influenced by noise. Thus, image signal denoising is an indispensable step during image processing. As we all know, most commonly used methods of image denoising is Bayesian wavelet transform estimators. The Performance of various estimators, such as maximum a posteriori (MAP), or minimum mean square error (MMSE) is strongly dependent on correctness of the proposed model for original data distribution. Therefore, the selection of a proper model for distribution of wavelet coefficients is important in wavelet-based image denoising. This paper presents a new image denoising algorithm based on the modeling of wavelet coefficients in each subband with multivariate Radial Exponential probability density function (PDF) with local variances. Generally these multivariate extensions do not result in a closed form expression, and the solution requires numerical solutions. However, we drive a closed form MMSE shrinkage functions for a Radial Exponential random vectors in additive white Gaussian noise (AWGN). The estimator is motivated and tested on the problem of wavelet-based image denoising. In the last, proposed, the same idea is applied to the dual-tree complex wavelet transform (DT-CWT), This Transform is an over-complete wavelet transform.
文摘Based on the research of predictingβ-hairpin motifs in proteins, we apply Random Forest and Support Vector Machine algorithm to predictβ-hairpin motifs in ArchDB40 dataset. The motifs with the loop length of 2 to 8 amino acid residues are extracted as research object and thefixed-length pattern of 12 amino acids are selected. When using the same characteristic parameters and the same test method, Random Forest algorithm is more effective than Support Vector Machine. In addition, because of Random Forest algorithm doesn’t produce overfitting phenomenon while the dimension of characteristic parameters is higher, we use Random Forest based on higher dimension characteristic parameters to predictβ-hairpin motifs. The better prediction results are obtained;the overall accuracy and Matthew’s correlation coefficient of 5-fold cross-validation achieve 83.3% and 0.59, respectively.
文摘This research paper proposes a filter to remove Random Valued Impulse Noise (RVIN) based on Global Threshold Vector Outlyingness Ratio (GTVOR) that is applicable for real time image processing. This filter works with the algorithm that breaks the images into various decomposition levels using Discrete Wavelet Transform (DWT) and searches for the noisy pixels using the outlyingness of the pixel. This algorithm has the capability of differentiating high frequency pixels and the “noisy pixel” using the threshold as well as window adjustments. The damage and the loss of information are prevented by means of interior mining. This global threshold based algorithm uses different thresholds for different quadrants of DWT and thus helps in recovery of noisy image even if it is 90% affected. Experimental results exhibit that this method outperforms other existing methods for accurate noise detection and removal, at the same time chain of connectivity is not lost.
文摘Making the distinction between different plantation tree species is crucial for creating reliable and trustworthy information, which is critical in forestry administration and upkeep. Over the years, forest delineation and mapping have been done using the conventional techniques, such as the utilization of ground truth facts together with orthophotos. These techniques have been proven to be very precise, but they are expensive, cumbersome, and challenging to employ in remote regions. To resolve this shortfall, this research investigates the potential of data from the commercial, PlanetScope CubeSat and the freely available, Sentinel 2 data from Copernicus to discriminate commercial forest tree species in the Usutu Forest, Eswatini. Two approaches for image classification, Random Forest (RF) and the Support Vector Machine (SVM) were investigated at different levels of the forest database classification which is the genus (family of tree species) and species levels. The result of the study indicates that, the Sentinel 2 images had the highest species classification accuracy compared to the PlanetScope image. Both classification methods achieved a 94% maximum OA and 0.90 kappa value at the genus level with the Sentinel 2 imagery. At the species level, the Sentinel 2 imagery again showed highly acceptable results with the SVM method, with an OA of 82%. The PlanetScope images performed badly with less than 64% OA for both RF and SVM at the genus level and poorer at the species level with a low OA figure, 47% and 53% for the SVM and RF respectively. Our results suggest that the freely available Sentinel 2 data together with the SVM method has a high potential for identifying differences between commercial tree species than the PlanetScope. The study uncovered that both classification methods are highly capable of classifying species under the gum genus group (esmi, egxu, and egxn) using both imageries. However, it was difficult to separate species types under the pine genus group, particularly discriminating the hybrid species such as pech and pell since pech is a hybrid species for pell.
文摘1st cases of COVID-19 were reported in March 2020 in Bangladesh and rapidly increased daily. So many steps were taken by the Bangladesh government to reduce the outbreak of COVID-19, such as masks, gatherings, local movements, international movements, etc. The data was collected from the World Health Organization. In this research, different variables have been used for analysis, for instance, new cases, new deaths, masks, schools, business, gatherings, domestic movement, international travel, new test, positive rate, test per case, new vaccination smoothed, new vaccine, total vaccination, and stringency index. Machine learning algorithms were used to predict and build the model, such as linear regression, K-nearest neighbours, decision trees, random forests, and support vector machines. Accuracy and Mean Square error (MSE) were used to test the model. A hyperparameter was also applied to find the optimum values of parameters. After computing the analysis, the result showed that the linear regression algorithm performs the best overall among the algorithms listed, with the highest testing accuracy and the lowest RMSE before and after hyper-tuning. The highest accuracy and lowest MSE were used for the best model, and for this data set, Linear regression got the highest accuracy, 0.98 and 0.97 and the lowest MSE, 4.79 and 4.04, respectively.
文摘Every second, a large volume of useful data is created in social media about the various kind of online purchases and in another forms of reviews. Particularly, purchased products review data is enormously growing in different database repositories every day. Most of the review data are useful to new customers for theier further purchases as well as existing companies to view customers feedback about various products. Data Mining and Machine Leaning techniques are familiar to analyse such kind of data to visualise and know the potential use of the purchased items through online. The customers are making quality of products through their sentiments about the purchased items from different online companies. In this research work, it is analysed sentiments of Headphone review data, which is collected from online repositories. For the analysis of Headphone review data, some of the Machine Learning techniques like Support Vector Machines, Naive Bayes, Decision Trees and Random Forest Algorithms and a Hybrid method are applied to find the quality via the customers’ sentiments. The accuracy and performance of the taken algorithms are also analysed based on the three types of sentiments such as positive, negative and neutral.