期刊文献+

基于RNA-Seq技术的连翘转录组组装与分析及SSR分子标记的开发 被引量:50

Assembly and Characterization of the Transcriptome and Development of SSR Markers in Forsythia suspensa Based on RNA-Seq Technology
原文传递
导出
摘要 连翘既是传统的中药材,又是优良的城市绿化树种,具有重要的经济价值和生态价值.但是连翘基因资料非常匮乏,限制了其分子生物学和基因功能的研究.本研究以连翘根、茎、叶、花和果实等器官的混合样品作为材料,利用Illumina Hi SeqTM 2500测序平台对其进行转录组测序.共获得23164327条干净数据(clean reads),总碱基数为4678791021 bp.Clean reads经de novo组装后获得45112条unigenes.进一步利用五大公共数据库进行同源比对,注释了28699条unigenes.其中,473个基因参与了连翘次生物质的合成和代谢,包括81个与苯丙氨酸和苯丙烷代谢相关的基因.对这81个基因的分析表明,有4个基因编码苯丙氨酸脱氨酶,1个基因编码肉桂酸4-羟化酶,2个基因编码4-香豆酰:辅酶A连接酶.这3个酶催化了连翘中主要药用活性物质苯乙醇苷和木脂素前体肉桂酸衍生物的生物合成.此外,还发现了2个松脂醇-落叶松树脂醇还原酶和1个开环异落叶松脂醇脱氢酶编码基因,这2个酶是木脂素合成的关键酶.最后,分析了长度在1 kb以上的12721个unigenes的基因结构,检测到3199个SSR位点,并对其中40个位点进行了验证.本研究不仅为连翘基因克隆和分子生物学研究提供了丰富的基础数据信息,而且为连翘遗传多样性研究、指纹图谱构建和分子标记辅助选育奠定了分子基础. F. suspense is not only an important traditional Chinese medicine and but also an excellent urban greening tree species, thus has important economic and ecologic values. However, the lack of gene resources limits the molecular biology and gene functional studies in F. suspensa. In this study, the transcriptome of the mixed samples from root, stem, leaf, flower and fruit of F. suspensa was sequenced using the Illumina HiSeqTM 2500 sequencing platform. After high through-put sequencing, a total of 23164327 clean reads with 4678791021 bp were obtained. The clean reads were then de novo assembled into 45112 unigenes, and 28699 unigenes were annotated by a similarity search against five public databases. Of these annotated unigenes, 473 genes were assigned to second metabolic pathway, including 81 genes associated with phenylalanine and phenylpropanoid metabolism. Further analysis of these 81 genes reveled that there were 4 genes encoding phenylalanine ammonia-lyase, 1 gene encoding cinnamate 4-hydroxylase and 2 genes encoding 4-coumarate: CoA ligase, respectively. These enzymes are involved in the biosynthesis of cinnamic acid derivatives, which are precursors of the mainly pharmaceutically active substances phenylethanoid glycosides and lignan in F. suspensa. In addition, we also identified 2 pinoresinol/lariciresinol reductase encoding genes and 1 secoisolariciresinol dehydrogenase encoding gene, which are the key genes in the lignan biosynthesis. Finally, a total of 3199 SSRs were identified from 12721 unigenes longer than 1 kb, and 40 of them were verified. This work not only provides many valuable basal data for gene cloning and molecular biology research, but also lays the foundation for genetic diversity analysis, finger printing construction and molecular marker assisted breeding in F. suspensa.
出处 《中国科学:生命科学》 CSCD 北大核心 2015年第3期301-310,共10页 Scientia Sinica(Vitae)
基金 山西省科技攻关项目(批准号:20120313015-9) 山西省自然科学基金(批准号:2013011028-1) 山西省卫生厅中药现代化科技产业基地建设项目资助
关键词 RNA-SEQ 转录组 次生代谢 苯乙醇苷 木脂素 SSR分子标记 连翘 RNA-Seq, transcriptome, second metabolism, phenylethanoid glycoside, lignan, SSR molecular marker,Forsythia suspensa
  • 相关文献

参考文献13

二级参考文献101

共引文献573

同被引文献635

引证文献50

二级引证文献319

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部