详细信息
一种二倍体片段测序中SNP检测系统的构建
Development of an automatic system for SNP detection in diploid fragment sequencing
文献类型:期刊文献
中文题名:一种二倍体片段测序中SNP检测系统的构建
英文题名:Development of an automatic system for SNP detection in diploid fragment sequencing
作者:邓继忠[1] 林伟森[1] 甘四明[2] 黄华盛[1,3] 李梅[2] 金济[1] 何明昊[1]
第一作者:邓继忠
机构:[1]华南农业大学工程学院;[2]中国林业科学研究院热带林业研究所;[3]广东科技学院
年份:2016
卷号:37
期号:3
起止页码:115-120
中文期刊名:华南农业大学学报
外文期刊名:Journal of South China Agricultural University
收录:CSTPCD;;Scopus;北大核心:【北大核心2014】;CSCD:【CSCD2015_2016】;
基金:863计划(2013AA102705);国家自然科学基金(31270702;31070592)
语种:中文
中文关键词:二倍体;测序;SNP检测;模式识别;尾叶桉
外文关键词:diploid ; sequencing ; SNP detection ; pattern recognition ; Eucalyptus urophylla
分类号:TP391.4;Q523.8
摘要:【目的】开发基于模式识别方法的二倍体片段测序中单核苷酸多态性(Single nucleotide polymorphism,SNP)自动检测系统,提高检测的准确性。【方法】采用Lab Windows/CVI 9.0开发平台,结合Matlab函数库编程,以二倍体PCR片段测序的.ab1或.scf格式文件作为源数据,首先分离出碱基G、A、T和C,进行一维离散小波滤波,再对各碱基处的波形进行典型特征提取,最后运用基于反向传播神经网络的分类器完成SNP识别和判断。【结果】系统界面友好、运行稳定。SNP等级分为6级,允许用户对可疑的SNP进行人工修正,对尾叶桉Eucalyptus urophylla的26个测序序列143个SNP的测试中检测准确率、假阳性率和假阴性率均明显优于之前的类似软件。【结论】本文所构建的SNP自动检测系统准确性高,不需参考序列,可用于二倍体PCR片段测序中SNP的高效检测。
[ Objective ] This study aims to develop a pattern-recognition based system for automatic single nucleotide polymorphism (SNP)detection in diploid fragment sequencing and improve the detection accuracy. [ Method ] The LabWindows/CVI 9.0 platform and Matlab environment were combined for ana- lyzing . abl or . scf files generated in diploid PCR fragment sequencing. Firstly, four bases G, A, T and C were separated for eliminating noise through one-dimensional discrete wavelet filtering, following with extraction of typical features of each base position (peak) from a fluorescence curve. A classifier based on back-propagation neural network was then used for SNP recognition and diagnosis. [ Result] This es- tablished system was characterized by friendly interface, stable operation and manual modification acces- sibility. It classified the SNP reliability into six grades. Performance test with 143 SNPs of 26 sequencing fragments from Eucalyptus urophyUa demonstrated that our system outperformed three previously reported software packages in detecting accuracy, false positive and false negative rates. [ Conclusion ] Our system has a high rate of accuracy without the need for a reference sequence. It could be used for efficient SNP detection in diploid PCR fragment sequencing.
参考文献:
正在载入数据...