Abstract
Two data quality problems that must be addressed in data mining are analyzed: incomplete data, in which some values are missing, and contaminated data. Statistical methods are proposed for handling each of the two problems.
Source
《山东大学学报(理学版)》
CAS
CSCD
Peking University Core Journals
2005, Issue 3, pp. 57-61 (5 pages)
Journal of Shandong University (Natural Science)
Funding
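Shandong Province Soft Science Funded Project (A2004241)
Shandong Province Social Science Planning Research Project (04BJZ46)
Shandong University Humanities and Social Sciences Youth Development Fund Project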