登录    注册    忘记密码

详细信息

IsoSplitter: identification and characterization of alternative splicing sites without a reference genome  ( SCI-EXPANDED收录)   被引量:3

文献类型:期刊文献

英文题名:IsoSplitter: identification and characterization of alternative splicing sites without a reference genome

作者:Wang, Yupeng[1,2] Hu, Zhikang[1,2] Ye, Ning[1] Yin, Hengfu[2,3]

第一作者:王亚萍;Wang, Yupeng

通信作者:Yin, HF[1];Yin, HF[2]

机构:[1]Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing 210037, Peoples R China;[2]Chinese Acad Forestry, Res Inst Subtrop Forestry, State Key Lab Tree Genet & Breeding, Hangzhou 311400, Zhejiang, Peoples R China;[3]Chinese Acad Forestry, Res Inst Subtrop Forestry, Key Lab Forest Genet & Breeding, Hangzhou 311400, Zhejiang, Peoples R China

年份:2021

卷号:27

期号:8

起止页码:868-875

外文期刊名:RNA

收录:;Scopus(收录号:2-s2.0-85110609516);WOS:【SCI-EXPANDED(收录号:WOS:000692111900002)】;

基金:We would like to thank Dr. Y.-J. Liu of the Chinese Academy of Forestry for helpful comments. We are grateful to the anonymous reviewers for critical comments and suggestions that greatly improved this work. This work is supported by Nonprofit Research Projects (CAFYBB2018ZY001-1) of the Chinese Academy of Forestry and the National Key R&D Program of China (grant no. 2019YFD1000400). We also thank the Postgraduate Research & Practice Innovation Program of Jiangsu Province for funding support.

语种:英文

外文关键词:gene structure; gene expression; alternative splicing; sequence alignment

摘要:Long-read transcriptome sequencing is designed to sequence full-length RNA molecules and advantageous for identifying alternative splice isoforms; however, in the absence of a reference genome, it is difficult to accurately locate splice sites because of the diversity of patterns of alternative splicing (AS). Based on long-read transcriptome data, we developed a versatile tool, IsoSplitter, to reverse-trace and validate AS gene "split sites" with the following features: (i) IsoSplitter initially invokes a modified SIM4 program to find transcript split sites; (ii) each split site is then quantified, to reveal transcript diversity, and putative isoforms are grouped into gene clusters; (iii) an optional step for aligning short reads is provided, to validate split sites by identifying unique junction reads, and revealing and quantifying tissue-specific alternative splice isoforms. We tested IsoSplitter AS prediction using data sets from multiple model and nonmodel plant species and showed that the IsoSplitter pipeline is efficient to handle different transcriptomes with high accuracy. Furthermore, we evaluated the IsoSplitter pipeline compared with that of the splice junction identification tools, Program to Assemble Spliced Alignments (PASA software needs a reference genome for AS identification) and AStrap, using data from the model plant Arabidopsis thaliana. We found that IsoSplitter determined more than twice as many AS events than AStrap analysis, and 94.13% of the IsoSplitter predicted AS events were also identified by the PASA analysis. Starting from a simple sequence file, IsoSplitter is an assembly-free tool for identification and characterization of AS. IsoSplitter is developed and implemented in Python 3.5 using the Linux platform and is freely available at https://github.com/Hengfu-Yin/IsoSplitter.

参考文献:

正在载入数据...

版权所有©中国林业科学研究院 重庆维普资讯有限公司 渝B2-20050021-8 
渝公网安备 50019002500408号 违法和不良信息举报中心