
A Hadoop-based parallel association rule algorithm (cited by: 13)

A parallel algorithm for mining frequent item sets on Hadoop
Abstract: Parallel computing is necessary when mining association rules from large databases. Traditional parallel approaches such as MPI cannot handle node failure and have difficulty balancing load. This paper therefore proposes a design that implements parallel association rule mining on the Hadoop architecture. Theoretical analysis and experiments show that Hadoop-based parallel association rule mining can handle node failure and can balance load across nodes.
Source: Journal of Tianjin University of Technology, 2011, No. 1, pp. 25-28, 32 (5 pages)
Funding: National 863 Program (2009AA01Z152); Tianjin Natural Science Foundation (08JCYBJC12400); Innovation Fund for Small and Medium Enterprises (08ZXCXGX15000)
Keywords: association rules; Hadoop; cloud computing; data mining
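
The record does not reproduce the paper's algorithm, so the following is only a minimal sketch of how one support-counting pass over itemsets could be phrased in MapReduce style, the programming model Hadoop implements. It runs in a single Python process rather than on a cluster; the function names (`map_phase`, `reduce_phase`, `frequent_itemsets`), the sample transactions, and the support threshold are all illustrative assumptions, not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(transactions, k):
    """Map step: emit (itemset, 1) for every k-item subset of each transaction."""
    for items in transactions:
        # Sort and deduplicate so the same itemset always yields the same key.
        for itemset in combinations(sorted(set(items)), k):
            yield itemset, 1


def reduce_phase(pairs, min_support):
    """Shuffle + reduce step: sum the counts per itemset and keep the frequent ones."""
    counts = defaultdict(int)
    for itemset, one in pairs:
        counts[itemset] += one
    return {s: c for s, c in counts.items() if c >= min_support}


def frequent_itemsets(splits, k, min_support):
    """Run the map step over every input split, then reduce the combined output.

    On a real cluster each split would go to a separate map task; here the
    splits are simply processed one after another (an assumption made to keep
    the sketch runnable without Hadoop).
    """
    pairs = (pair for split in splits for pair in map_phase(split, k))
    return reduce_phase(pairs, min_support)


# Hypothetical data: two input splits of two transactions each.
splits = [
    [["bread", "milk"], ["bread", "beer", "milk"]],
    [["milk", "beer"], ["bread", "milk", "eggs"]],
]
result = frequent_itemsets(splits, k=2, min_support=2)
print(result)
```

Because each map call depends only on its own split, a failed map task can be re-run on another node without affecting the others, and splits can be handed to whichever node is free. This is the property of the MapReduce model that the abstract's claims about node failure and load balance rest on.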

References (8)

  • 1 Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases[C] // Proc of ACM SIGMOD International Conference on Management of Data. Washington DC: ACM Press, 1993: 207-216.
  • 2 Savasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules in large databases[C] // Proc of the 21st International Conference on Very Large Databases. San Francisco: Morgan Kaufmann Publishers, 1995: 432-444.
  • 3 Han Jiawei, Pei Jian, Yin Yiwen. Mining frequent patterns without candidate generation[C] // Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 2000: 1-12.
  • 4 Agrawal R, Shafer J. Parallel mining of association rules[J]. IEEE Trans on Knowledge and Data Engineering, 1996, 8(6): 962-969.
  • 5 Cheung D W, Ng V T, Fu A W, et al. Efficient mining of association rules in distributed databases[J]. IEEE Trans on Knowledge and Data Engineering, 1996, 8(6): 911-922.
  • 6 Yang Ming, Sun Zhihui, Ji Genlin. Fast mining of global frequent itemsets[J]. Journal of Computer Research and Development, 2003, 40(4): 620-626. (cited by: 35)
  • 7 Apache. Welcome to Apache Hadoop[EB/OL]. (2010-10-15)[2010-11-02]. http://hadoop.apache.org/.
  • 8 Dean J, Ghemawat S. MapReduce: simplified data processing on large clusters[C] // Proc of the 6th Symposium on Operating Systems Design and Implementation. San Francisco, CA, 2004.

Secondary references (2)

  • 1 Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases[C]. The ACM SIGMOD International Conf on Management of Data, Washington, DC, 1993.
  • 2 Lu Songfeng, Lu Zhengding. Fast mining of maximum frequent itemsets[J]. Journal of Software, 2001, 12(2): 293-297. (cited by: 113)

Documents sharing references (34)

Co-cited documents (120)

Citing documents (13)

Secondary citing documents (63)
