详细信息
Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood ( SCI-EXPANDED收录) 被引量:26
文献类型:期刊文献
英文题名:Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood
作者:He, Tuo[1,2,3,4] Jiao, Lichao[1,2] Wiedenhoeft, Alex C.[3,4,5,6] Yin, Yafang[1,2]
第一作者:He, Tuo;何拓
通信作者:Yin, YF[1];Yin, YF[2]
机构:[1]Chinese Acad Forestry, Chinese Res Inst Wood Ind, Dept Wood Anat & Utilizat, Beijing 100091, Peoples R China;[2]Chinese Acad Forestry, Wood Collect WOODPEDIA, Beijing 100091, Peoples R China;[3]US Forest Serv, Forest Prod Lab, Ctr Wood Anat Res, USDA, Madison, WI 53726 USA;[4]Univ Wisconsin, Dept Bot, Madison, WI 53706 USA;[5]Purdue Univ, Dept Forestry & Natl Resources, W Lafayette, IN 47907 USA;[6]Univ Estadual Paulista, Ciencias Biol Bot, Botucatu, SP, Brazil
年份:2019
卷号:249
期号:5
起止页码:1617-1625
外文期刊名:PLANTA
收录:;Scopus(收录号:2-s2.0-85062706221);WOS:【SCI-EXPANDED(收录号:WOS:000464898700025)】;
基金:This work was financially supported by National Natural Science Foundation of China (Grant No. 31600451), the National High-level Talent for Special Support Program of China (Grant No. W02020331), and the China Scholarship Council (Grant No. 2017-3109). We express our gratitude to Professor Xiaomei Jiang, Dr. Min Yu, Dr. Bo Liu and Dr. Prabu Ravindran for their assistance and suggestions on this study. We thank Sarah Friedrich for her help with the figure works.
语种:英文
外文关键词:DNA barcoding; Forensic wood identification; Identification accuracy; Machine learning approaches (MLAs); Pterocarpus; SMO classifier
摘要:Main conclusion Machine-learning approaches (MLAs) for DNA barcoding outperform distance- and tree-based methods on identification accuracy and cost-effectiveness to arrive at species-level identification of wood. DNA barcoding is a promising tool to combat illegal logging and associated trade, and the development of reliable and efficient analytical methods is essential for its extensive application in the trade of wood and in the forensics of natural materials more broadly. In this study, 120 DNA sequences of four barcodes (ITS2, matK, ndhF-rp132, and rbcL) generated in our previous study and 85 downloaded from National Center for Biotechnology Information (NCBI) were collected to establish a reference data set for six commercial Pterocarpus woods. MLAs (BLOG, BP-neural network, SMO and J48) were compared with distance- (TaxonDNA) and tree-based (NJ tree) methods based on identification accuracy and cost-effectiveness across these six species, and also were applied to discriminate the CITES-listed species Pterocarpus santalinus from its anatomically similar species P. tinctorius for forensic identification. MLAs provided higher identification accuracy (30.8-100%) than distance- (15.1-97.4%) and tree-based methods (11.1-87.5%), with SMO performing the best among the machine learning classifiers. The two-locus combination ITS2 + matK when using SMO classifier exhibited the highest resolution (100%) with the fewest barcodes for discriminating the six Pterocarpus species. The CITES-listed species P. santalinus was discriminated successfully from P. tinctorius using MLAs with a single barcode, ndhF-rp132. This study shows that MLAs provided higher identification accuracy and cost-effectiveness for forensic application over other analytical methods in DNA barcoding of Pterocarpus wood.
参考文献:
正在载入数据...