详细信息
Multi-view features fusion for birdsong classification ( SCI-EXPANDED收录) 被引量:12
文献类型:期刊文献
英文题名:Multi-view features fusion for birdsong classification
作者:Xie, Shanshan[1] Lu, Jing[1] Liu, Jiang[3] Zhang, Yan[2] Lv, Danjv[1] Chen, Xu[1] Zhao, Youjie[1]
第一作者:Xie, Shanshan
通信作者:Lv, D[1];Zhang, Y[2]
机构:[1]Southwest Forestry Univ, Coll Big Data & Intelligent Engn, Kunming 650224, Peoples R China;[2]Southwest Forestry Univ, Coll Math & Phys, Kunming 650224, Peoples R China;[3]Chinese Acad Forestry, Res Inst Forestry Policy & Informat, Beijing 100091, Peoples R China
年份:2022
卷号:72
外文期刊名:ECOLOGICAL INFORMATICS
收录:;Scopus(收录号:2-s2.0-85141510704);WOS:【SCI-EXPANDED(收录号:WOS:000885975000008)】;
基金:Funding This study was supported by the Yunnan Provincial Science and Technology Department under Grant no: 202002AA10007, the National Natural Science Foundation of China under Grant no: 61462078 and under Grant no: 31860332, and the Yunnan Provincial Department of Education under Grant no: 2022Y558.
语种:英文
外文关键词:Birdsong recognition; Deep features; Handcrafted features; mRMR; Feature selection
摘要:As important members of the ecosystem, birds are good monitors of the ecological environment. Bird recogni-tion, especially birdsong recognition, has attracted more and more attention in the field of artificial intelligence. At present, traditional machine learning and deep learning are widely used in birdsong recognition. Deep learning can not only classify and recognize the spectrums of birdsong, but also be used as a feature extractor. Machine learning is often used to classify and recognize the extracted birdsong handcrafted feature parameters. As the data samples of the classifier, the feature of birdsong directly determines the performance of the classifier. Multi-view features from different methods of feature extraction can obtain more perfect information of bird -song. Therefore, aiming at enriching the representational capacity of single feature and getting a better way to combine features, this paper proposes a birdsong classification model based multi-view features, which combines the deep features extracted by convolutional neural network (CNN) and handcrafted features. Firstly, four kinds of handcrafted features are extracted. Those are wavelet transform (WT) spectrum, Hilbert-Huang transform (HHT) spectrum, short-time Fourier transform (STFT) spectrum and Mel-frequency cepstral coefficients (MFCC). Then CNN is used to extract the deep features from WT, HHT and STFT spectrum, and the minimal-redundancy -maximal-relevance (mRMR) to select optimal features. Finally, three classification models (random forest, support vector machine and multi-layer perceptron) are built with the deep features and handcrafted features, and the probability of classification results of the two types of features are fused as the new features to recognize birdsong. Taking sixteen species of birds as research objects, the experimental results show that the three classifiers obtain the accuracy of 95.49%, 96.25% and 96.16% respectively for the features of the proposed method, which are better than the seven single features and three fused features involved in the experiment. This proposed method effectively combines the deep features and handcrafted features from the perspectives of signal. The fused features can more comprehensively express the information of the bird audio itself, and have higher classification accuracy and lower dimension, which can effectively improve the performance of bird audio classification.
参考文献:
正在载入数据...