期刊文献+

Unsupervised Methods to Classify Real Data from Offshore Wells

Unsupervised Methods to Classify Real Data from Offshore Wells
在线阅读 下载PDF
导出
摘要 In the petroleum industry, sensor data and information are valuable. It can detect, predict and help to understand processes during oil production. Offshore wells require more attention. Once workovers, maintenance, and intervention are more costly than onshore wells. Coupling data-driven methods for well-monitoring applications, two unsupervised classification methods, one statistical and one machine learning-based, are proposed to detect anomalies in well data. The novelty is presented by applying a Control Chart us</span><span style="font-family:Verdana;">ing a 3 standard deviations window for the Permanent Downhole Gauge Pr</span><span style="font-family:Verdana;">es</span><span style="font-family:Verdana;">sure sensor (P-PDG), and a Fuzzy C-means algorithm to classify data from pr</span><span style="font-family:Verdana;">essure and temperature sensors in an offshore field. The main goal in structuring a classified data set is using it to train machine learning models to monitor and manage petroleum production. Modeling applications for early fault detection systems in offshore production, based on real-time data from production sensors, require classified data sets. Then, labeling two target classes</span></span><span style="font-family:Verdana;">:</span><span style="font-family:""><span style="font-family:Verdana;"> “normal” and “fault” is a key step to be implemented in order to train the machine learning models. Therefore, this paper applies two methodologies to classify a real-time data set to create a training data set divided into “normal” </span><span style="font-family:Verdana;">and “fault” classes. Thus, it is possible to visualize the abnormal events poi</span><span style="font-family:Verdana;">nted out by the methodologies and compare how sensible is each method. In addition, </span></span><span style="font-family:Verdana;">it </span><span style="font-family:""><span style="font-family:Verdana;">is proposed a random forest application to test the performance of the classified data sets from both methods. The results have shown that the con</span><span style="font-family:Verdana;">trol chart method presents higher sensibility than fuzzy c-means, however, th</span><span style="font-family:Verdana;">e </span><span style="font-family:Verdana;">differences between are insignificant. The random forest performance displ</span><span style="font-family:Verdana;">ayed sensitivity and specificity values of 99.91% and 100% for the data set classified by the control chart method and 94.01% and 99.98% for the data set classified by fuzzy c-means algorithm. In the petroleum industry, sensor data and information are valuable. It can detect, predict and help to understand processes during oil production. Offshore wells require more attention. Once workovers, maintenance, and intervention are more costly than onshore wells. Coupling data-driven methods for well-monitoring applications, two unsupervised classification methods, one statistical and one machine learning-based, are proposed to detect anomalies in well data. The novelty is presented by applying a Control Chart us</span><span style="font-family:Verdana;">ing a 3 standard deviations window for the Permanent Downhole Gauge Pr</span><span style="font-family:Verdana;">es</span><span style="font-family:Verdana;">sure sensor (P-PDG), and a Fuzzy C-means algorithm to classify data from pr</span><span style="font-family:Verdana;">essure and temperature sensors in an offshore field. The main goal in structuring a classified data set is using it to train machine learning models to monitor and manage petroleum production. Modeling applications for early fault detection systems in offshore production, based on real-time data from production sensors, require classified data sets. Then, labeling two target classes</span></span><span style="font-family:Verdana;">:</span><span style="font-family:""><span style="font-family:Verdana;"> “normal” and “fault” is a key step to be implemented in order to train the machine learning models. Therefore, this paper applies two methodologies to classify a real-time data set to create a training data set divided into “normal” </span><span style="font-family:Verdana;">and “fault” classes. Thus, it is possible to visualize the abnormal events poi</span><span style="font-family:Verdana;">nted out by the methodologies and compare how sensible is each method. In addition, </span></span><span style="font-family:Verdana;">it </span><span style="font-family:""><span style="font-family:Verdana;">is proposed a random forest application to test the performance of the classified data sets from both methods. The results have shown that the con</span><span style="font-family:Verdana;">trol chart method presents higher sensibility than fuzzy c-means, however, th</span><span style="font-family:Verdana;">e </span><span style="font-family:Verdana;">differences between are insignificant. The random forest performance displ</span><span style="font-family:Verdana;">ayed sensitivity and specificity values of 99.91% and 100% for the data set classified by the control chart method and 94.01% and 99.98% for the data set classified by fuzzy c-means algorithm.
作者 Antônio Orestes De Salvo Castro Mayara De Jesus Rocha Santos Fabiana Rodrigues Leta Cláudio Benevenuto C. Lima Gilson Brito Alves Lima Antônio Orestes De Salvo Castro;Mayara De Jesus Rocha Santos;Fabiana Rodrigues Leta;Cláudio Benevenuto C. Lima;Gilson Brito Alves Lima(Rio de Janeiro State University (UERJ), Rio de Janeiro, Brazil;Department of Engineering of Fluminense Federal University (UFF), Niterói, Brazil;Petróleo Brasileiro S.A, Rio de Janeiro, Brazil)
出处 《American Journal of Operations Research》 2021年第5期227-241,共15页 美国运筹学期刊(英文)
关键词 Unsupervised Classification Fuzzy C-Means Control Chart Unsupervised Classification Fuzzy C-Means Control Chart
  • 相关文献

参考文献2

二级参考文献8

共引文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部