
Details

Microsatellite characteristics of the Eucommia ulmoides transcriptome based on RNA-Seq     Cited by: 2

Microsatellites characteristics of transcriptomic sequences from Eucommia ulmoides Oliv. based on RNA-Seq

Document type: Journal article

Chinese title: 基于RNA-Seq的杜仲转录组微卫星特征分析

English title: Microsatellites characteristics of transcriptomic sequences from Eucommia ulmoides Oliv. based on RNA-Seq

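Authors: Feng Yanzhi (冯延芝)[1,2]; Li Fangdong (李芳东)[1,2]; Wei Qiqi (魏琦琦)[3]; Mo Wenjuan (莫文娟)[4]; Wang Lu (王璐)[1,2]; Huang Dige (黄地歌)[5]; Fu Jianmin (傅建敏)[1,2]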

First author: Feng Yanzhi (冯延芝)

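Affiliations: [1] Non-timber Forestry Research and Development Center, Chinese Academy of Forestry; [2] Paulownia Research and Development Center, State Forestry Administration; [3] Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees (Ministry of Education), Central South University of Forestry and Technology; [4] Experimental Center of Forestry in North China, Chinese Academy of Forestry; [5] College of Forestry, Central South University of Forestry and Technology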

Year: 2016

Volume: 21

Issue: 9

Pages: 68-79

Journal name (Chinese): 中国农业大学学报

Journal name (English): Journal of China Agricultural University

Indexed in: CSTPCD; Peking University Core Journals (2014 edition); CSCD (2015-2016)

Funding: National Natural Science Foundation of China, General Program (31370682)

Language: Chinese

Chinese keywords: 杜仲 (Eucommia ulmoides); 转录组 (transcriptome); 转录组测序 (transcriptome sequencing); 单基因簇 (unigene); SSR

English keywords: Eucommia ulmoides Oliv.; transcriptome; RNA-Seq; unigene; SSR

Classification code: S949.751

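Abstract (translated from the Chinese): Transcriptome sequencing was performed on four kernel samples of Eucommia ulmoides, collected 70 and 160 d after flowering from the nationally approved elite cultivars 'Huazhong 6' and 'Huazhong 10'. The sequencing data were assembled, functionally annotated and classified, and the microsatellite characteristics of the resulting unigenes were analyzed. The samples were sequenced on the Illumina HiSeq™ 2000 next-generation high-throughput platform and assembled with Trinity; the unigene sequences were compared against the Nr, GO, COG and KEGG databases using BLAST; and the 96,469 unigenes of the transcriptome were searched for SSRs with MISA. Sequencing yielded 72,791,399 high-quality clean reads containing 14,702,548,161 bp of sequence. Assembly of the reads produced 96,469 unigenes with an average length of 690 bp, totaling 66.56 Mb. Homology analysis annotated 49,856 unigenes with homologs in other species, or 51.68% of All-unigene. Comparison with the GO database placed the 38,983 annotated unigenes into 3 main categories (cellular component, molecular function and biological process) and 56 branches; the 14,796 unigenes annotated by COG fell into 25 functional categories; and, with KEGG as the reference, the 11,260 annotated unigenes were mapped to 117 metabolic pathway branches. The SSR search found 9,621 complete (perfect) SSR loci in the 96,469 unigenes, accounting for 84.14% of all SSR loci. The complete SSR loci comprised 55 repeat motif types; the most frequent was the mononucleotide repeat A/T (4,597), followed by AG/CT (2,597) and AT/AT (439).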
The transcriptomes of Eucommia ulmoides Oliv. kernels sampled 70 and 160 d after flowering from the varieties 'Huazhong 6' and 'Huazhong 10' were sequenced. The transcriptome data were assembled and classified by function, and the microsatellite characteristics of the resulting unigenes were analyzed. The Illumina HiSeq™ 2000, a next-generation high-throughput sequencing platform, was used to sequence the kernel transcriptomes, and the reads were assembled with Trinity. The unigenes were annotated against the Nr, GO, COG and KEGG databases by BLAST searches. A total of 72,791,399 clean reads containing 14,702,548,161 bp were generated, and de novo assembly produced 96,469 unigenes with an average length of 690 bp, containing 66.56 Mb of sequence. Of these, 49,856 unigenes (51.68%) were annotated by BLAST searches. The 38,983 unigenes annotated by GO were divided into three categories (cellular component, molecular function and biological process) with 56 branches; the 14,796 unigenes annotated by COG were grouped into 25 functional categories; and KEGG pathway analysis assigned the 11,260 annotated unigenes to 117 pathways. There were 9,621 complete SSRs in the 96,469 unigenes, accounting for 84.14% of all SSRs. The complete SSRs included 55 motif types; the most frequent was A/T (4,597), followed by AG/CT (2,597) and AT/AT (439). These SSR characteristics can provide useful information for the analysis of genetic polymorphism and for genetic mapping in E. ulmoides.
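To illustrate the kind of search the abstract describes, here is a minimal sketch of finding uninterrupted SSRs in a sequence and grouping motifs into classes such as A/T and AG/CT. It is not MISA and not the authors' pipeline: the function names are hypothetical, the minimum repeat counts are assumed rather than taken from the paper, and overlapping hits from different motif lengths are not resolved.

```python
# Assumed minimum repeat counts per motif length (not stated in the abstract).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'AA')."""
    k = len(motif)
    return not any(k % d == 0 and motif == motif[:d] * (k // d)
                   for d in range(1, k))


def canonical_motif(motif):
    """Group a motif with its rotations and reverse complement, e.g. 'GA' -> 'AG/CT'."""
    def smallest_rotation(s):
        return min(s[i:] + s[:i] for i in range(len(s)))

    forward = smallest_rotation(motif)
    reverse = smallest_rotation(motif.translate(_COMPLEMENT)[::-1])
    return "/".join(sorted([forward, reverse]))


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each uninterrupted repeat run."""
    seq = seq.upper()
    hits = []
    for k, min_rep in sorted(min_repeats.items()):
        i = 0
        while i + k <= len(seq):
            motif = seq[i:i + k]
            # Skip ambiguous bases and motifs already covered by a shorter unit.
            if not set(motif) <= set("ACGT") or not is_primitive(motif):
                i += 1
                continue
            n = 1
            while seq[i + n * k:i + (n + 1) * k] == motif:
                n += 1
            if n >= min_rep:
                hits.append((i, motif, n))
                i += n * k  # jump past the whole repeat run
            else:
                i += 1
    return hits


if __name__ == "__main__":
    example = "CC" + "A" * 12 + "GTC" + "GA" * 7 + "TTC"
    for start, motif, count in find_perfect_ssrs(example):
        print(start, motif, count, canonical_motif(motif))
```

Counting the `canonical_motif` labels over all unigenes would give a motif-frequency tally of the kind reported in the abstract, although the actual counts depend on the thresholds and the tool used.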
