摘要
提出了一种新的DNA序列的2-D图形表示方法,并证明了它的非退化性,随后结合图形表示给出DNA序列的12个正规化的ALE指标.在此基础上,结合双核苷酸计数和符号序列LZ复杂度,将DNA序列转化为一个29维的数值向量.对23个物种的β球蛋白基因和18个物种的线粒体NADH脱氢酶序列进行的系统发生分析,证明了所提方法的有效性.
We first propose a new 2-D graphical representation for a DNA sequence and prove that it has nondegeneracy. Then we transform a DNA sequence into a 12- dimensional vector whose components are normalized ALE- indices. Combining ALE- indices with counts of the dinucleotide and the LZ complexity,we characterize a DNA sequence by a 29-dimensional numerical vector. The phylogenetic analysis on two separate datasets( the β- globin genes of 23 species and the NADH dehydrogenase genes of 18 species) shows the effectiveness of the proposed method.
出处
《渤海大学学报(自然科学版)》
CAS
2014年第4期307-312,324,共7页
Journal of Bohai University:Natural Science Edition
基金
国家自然科学基金项目(No:11171042)
辽宁省"百千万人才工程"项目(No:2012921060)
辽宁省高等学校创新团队(No:LT2014024)
辽宁省食品安全重点实验室开放课题(No:LNSAKF2011034)