Social media(SM)based surveillance systems,combined with machine learning(ML)and deep learning(DL)techniques,have shown potential for early detection of epidemic outbreaks.This review discusses the current state of SM...Social media(SM)based surveillance systems,combined with machine learning(ML)and deep learning(DL)techniques,have shown potential for early detection of epidemic outbreaks.This review discusses the current state of SM-based surveillance methods for early epidemic outbreaks and the role of ML and DL in enhancing their performance.Since,every year,a large amount of data related to epidemic outbreaks,particularly Twitter data is generated by SM.This paper outlines the theme of SM analysis for tracking health-related issues and detecting epidemic outbreaks in SM,along with the ML and DL techniques that have been configured for the detection of epidemic outbreaks.DL has emerged as a promising ML technique that adaptsmultiple layers of representations or features of the data and yields state-of-the-art extrapolation results.In recent years,along with the success of ML and DL in many other application domains,both ML and DL are also popularly used in SM analysis.This paper aims to provide an overview of epidemic outbreaks in SM and then outlines a comprehensive analysis of ML and DL approaches and their existing applications in SM analysis.Finally,this review serves the purpose of offering suggestions,ideas,and proposals,along with highlighting the ongoing challenges in the field of early outbreak detection that still need to be addressed.展开更多
Social media platforms have lately emerged as a promising tool for predicting the outbreak of epidemics by analyzing information on them with the help of machine learning techniques.Many analytical and statistical mod...Social media platforms have lately emerged as a promising tool for predicting the outbreak of epidemics by analyzing information on them with the help of machine learning techniques.Many analytical and statistical models are available to infer a variety of user sentiments in posts on social media.The amount of data generated by social media platforms,such as Twitter,that can be used to track diseases is increasing rapidly.This paper proposes a method for the classication of tweets related to the outbreak of dengue using machine learning algorithms.An articial neural network(ANN)-based method is developed using Global Vector(GloVe)embedding to use the data in tweets for the automatic and efcient identication and classication of dengue.The proposed method classies tweets related to the outbreak of dengue into positives and negatives.Experiments were conducted to assess the proposed ANN model based on performance evaluation matrices(confusion matrices).The results show that the GloVe vectors can efciently capture a sufcient amount of information for the classier to accurately identify and classify tweets as relevant or irrelevant to dengue outbreaks.The proposed method can help healthcare professionals and researchers track and analyze epidemic outbreaks through social media in real time.展开更多
Social networking services(SNSs)provide massive data that can be a very influential source of information during pandemic outbreaks.This study shows that social media analysis can be used as a crisis detector(e.g.,und...Social networking services(SNSs)provide massive data that can be a very influential source of information during pandemic outbreaks.This study shows that social media analysis can be used as a crisis detector(e.g.,understanding the sentiment of social media users regarding various pandemic outbreaks).The novel Coronavirus Disease-19(COVID-19),commonly known as coronavirus,has affected everyone worldwide in 2020.Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions.This study focuses on identifying COVID-19 patients using tweets without requiring medical records to find the COVID-19 pandemic in Twitter messages(tweets).For this purpose,we propose herein an intelligent model using traditional machine learning-based approaches,such as support vector machine(SVM),logistic regression(LR),naïve Bayes(NB),random forest(RF),and decision tree(DT)with the help of the term frequency inverse document frequency(TF-IDF)to detect the COVID-19 pandemic in Twitter messages.The proposed intelligent traditional machine learning-based model classifies Twitter messages into four categories,namely,confirmed deaths,recovered,and suspected.For the experimental analysis,the tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches.A benchmark dataset for COVID-19 on Twitter messages is developed and can be used for future research studies.The experiments show that the results of the proposed approach are promising in detecting the COVID-19 pandemic in Twitter messages with overall accuracy,precision,recall,and F1 score between 70%and 80%and the confusion matrix for machine learning approaches(i.e.,SVM,NB,LR,RF,and DT)with the TF-IDF feature extraction technique.展开更多
基金authors are thankful to the Deanship of Scientific Research at Najran University for funding this work,under the Research Groups Funding Program Grant Code(NU/RG/SERC/12/27).
文摘Social media(SM)based surveillance systems,combined with machine learning(ML)and deep learning(DL)techniques,have shown potential for early detection of epidemic outbreaks.This review discusses the current state of SM-based surveillance methods for early epidemic outbreaks and the role of ML and DL in enhancing their performance.Since,every year,a large amount of data related to epidemic outbreaks,particularly Twitter data is generated by SM.This paper outlines the theme of SM analysis for tracking health-related issues and detecting epidemic outbreaks in SM,along with the ML and DL techniques that have been configured for the detection of epidemic outbreaks.DL has emerged as a promising ML technique that adaptsmultiple layers of representations or features of the data and yields state-of-the-art extrapolation results.In recent years,along with the success of ML and DL in many other application domains,both ML and DL are also popularly used in SM analysis.This paper aims to provide an overview of epidemic outbreaks in SM and then outlines a comprehensive analysis of ML and DL approaches and their existing applications in SM analysis.Finally,this review serves the purpose of offering suggestions,ideas,and proposals,along with highlighting the ongoing challenges in the field of early outbreak detection that still need to be addressed.
文摘Social media platforms have lately emerged as a promising tool for predicting the outbreak of epidemics by analyzing information on them with the help of machine learning techniques.Many analytical and statistical models are available to infer a variety of user sentiments in posts on social media.The amount of data generated by social media platforms,such as Twitter,that can be used to track diseases is increasing rapidly.This paper proposes a method for the classication of tweets related to the outbreak of dengue using machine learning algorithms.An articial neural network(ANN)-based method is developed using Global Vector(GloVe)embedding to use the data in tweets for the automatic and efcient identication and classication of dengue.The proposed method classies tweets related to the outbreak of dengue into positives and negatives.Experiments were conducted to assess the proposed ANN model based on performance evaluation matrices(confusion matrices).The results show that the GloVe vectors can efciently capture a sufcient amount of information for the classier to accurately identify and classify tweets as relevant or irrelevant to dengue outbreaks.The proposed method can help healthcare professionals and researchers track and analyze epidemic outbreaks through social media in real time.
基金supported by a grant from the Research Center of the Female Scientific and Medical Colleges,Deanship of Scientific Research,King Saud University。
文摘Social networking services(SNSs)provide massive data that can be a very influential source of information during pandemic outbreaks.This study shows that social media analysis can be used as a crisis detector(e.g.,understanding the sentiment of social media users regarding various pandemic outbreaks).The novel Coronavirus Disease-19(COVID-19),commonly known as coronavirus,has affected everyone worldwide in 2020.Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions.This study focuses on identifying COVID-19 patients using tweets without requiring medical records to find the COVID-19 pandemic in Twitter messages(tweets).For this purpose,we propose herein an intelligent model using traditional machine learning-based approaches,such as support vector machine(SVM),logistic regression(LR),naïve Bayes(NB),random forest(RF),and decision tree(DT)with the help of the term frequency inverse document frequency(TF-IDF)to detect the COVID-19 pandemic in Twitter messages.The proposed intelligent traditional machine learning-based model classifies Twitter messages into four categories,namely,confirmed deaths,recovered,and suspected.For the experimental analysis,the tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches.A benchmark dataset for COVID-19 on Twitter messages is developed and can be used for future research studies.The experiments show that the results of the proposed approach are promising in detecting the COVID-19 pandemic in Twitter messages with overall accuracy,precision,recall,and F1 score between 70%and 80%and the confusion matrix for machine learning approaches(i.e.,SVM,NB,LR,RF,and DT)with the TF-IDF feature extraction technique.