An Improved Prediction Algorithm Based on Greedy EM (一种基于贪心EM的改进预测算法)

EM-based Greedy Algorithm for Improved Forecasting


Abstract: This paper studies motif-finding (motif prediction) algorithms. The greedy EM prediction algorithm is analyzed and optimized to form a new prediction method. The work focuses on parameter initialization, re-partitioning of the parameter model, and the introduction of Kd-tree hierarchical clustering, which together yield the new PKG algorithm. The prediction results indicate that the PKG algorithm has some advantage when predicting motifs in larger data sets, and that it shows stronger search and classification capability in particular for sequences from the same species. Search efficiency improves markedly without affecting time complexity.
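For readers unfamiliar with the EM baseline that the abstract builds on, the following is a minimal sketch of expectation maximization for motif finding under a one-occurrence-per-sequence assumption. It is not the PKG algorithm: the abstract gives no details of the greedy initialization, the parameter-model re-partitioning, or the Kd-tree clustering, so none of these are reproduced here. The names `em_motif`, `best_sites`, `seed`, and `pseudo`, the uniform background, and the 0.7/0.1 seed weights are all illustrative assumptions.

```python
import math

ALPHABET = "ACGT"
BACKGROUND = 0.25  # assumption: uniform background base frequency


def em_motif(seqs, seed, iters=20, pseudo=0.1):
    """Run EM from a seed window, assuming one motif site per sequence.

    Returns the final position weight matrix (PWM) and, for each sequence,
    the site posteriors computed in the last E-step.
    """
    width = len(seed)
    # Initialise the PWM so that the seed letter is favoured in each column.
    pwm = [{a: (0.7 if a == seed[j] else 0.1) for a in ALPHABET}
           for j in range(width)]
    posteriors = []
    for _ in range(iters):
        # Expected letter counts per motif column, started at a pseudocount.
        counts = [{a: pseudo for a in ALPHABET} for _ in range(width)]
        posteriors = []
        for s in seqs:
            # E-step: motif-vs-background likelihood ratio of every window.
            ratios = [
                math.prod(pwm[j][s[i + j]] / BACKGROUND for j in range(width))
                for i in range(len(s) - width + 1)
            ]
            total = sum(ratios)
            z = [r / total for r in ratios]  # posterior over start positions
            posteriors.append(z)
            for i, zi in enumerate(z):
                for j in range(width):
                    counts[j][s[i + j]] += zi
        # M-step: renormalise expected counts into column probabilities.
        pwm = [{a: col[a] / sum(col.values()) for a in ALPHABET}
               for col in counts]
    return pwm, posteriors


def best_sites(posteriors):
    """Most probable motif start position in each sequence."""
    return [max(range(len(z)), key=z.__getitem__) for z in posteriors]
```

A call such as `em_motif(seqs, "TATAAT")` takes a list of DNA strings and a seed window. Because the result of EM depends strongly on the seed, the choice of starting point matters, which is consistent with the abstract's emphasis on parameter initialization.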

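Author: Zhang Fei (张斐)

Affiliation: Shaanxi Police Officer Vocational College (陕西警官职业学院)

Source: Value Engineering (《价值工程》), 2011, No. 17, pp. 141-142 (2 pages)

Funding: National Natural Science Foundation of China (grant no. 30600329)

Keywords: motif finding; EM-based greedy algorithm; PKG algorithm

Classification code: TP39 [Automation and Computer Technology: Computer Application Technology]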
