Butterfly recognition based on Local-Global-VIT fine-grained classification algorithm

Acta Entomologica Sinica, 2024, Vol. 67, Issue 9: 1251-1261. doi: 10.16380/j.kcxb.2024.09.009

LI Jian-Xiang¹, LI Xiao-Lin¹, WANG Rong², ZHANG Yuan-Zi¹, CHEN Shu-Wu¹, ZHANG Fei-Ping²,³, HUANG Shi-Guo¹,³,*

(1. College of Computer and Information Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China; 2. College of Forestry, Fujian Agriculture and Forestry University, Fuzhou 350002, China; 3. Key Laboratory of Integrated Pest Management in Ecological Forests, Fujian Province University, Fuzhou 350002, China)

Online: 2024-09-20 Published: 2024-10-22

Abstract

Abstract: 【Aim】 Accurately identifying butterfly species and dynamically monitoring changes in butterfly community diversity are important for habitat quality assessment and ecological restoration. Existing butterfly recognition methods rely solely on global features and overlook local features, which limits their ability to recognize images taken in ecological settings. This study aims to develop a butterfly recognition method based on a Local-Global-VIT fine-grained classification algorithm to address this limitation. 【Methods】 A total of 25 279 butterfly images of 200 species from five families were used as recognition objects, and several data augmentation techniques were applied to expand the image data. Using the hierarchical structure and self-attention mechanism of the vision transformer (VIT), local tokens were selected layer by layer and retained through to the final layer to learn the discriminative local parts of butterflies. High-level global tokens were aggregated to eliminate interference from complex backgrounds. A contrastive loss was used to widen the inter-class distance and improve discrimination. In addition, a suitable learning rate adjustment strategy and transfer learning were applied to optimize model convergence, improving performance without increasing the number of parameters. 【Results】 The Local-Global-VIT algorithm reached a recognition accuracy of 91.20% on Butterfly-200, a large-scale public fine-grained dataset, which was 1.15% higher than that of the model before improvement. Compared with EfficientNet_b0, the best-performing general pest recognition algorithm, and TransFG, the best-performing fine-grained classification algorithm, its accuracy was higher by 1.83% and 0.64%, respectively, and its F1-score was higher by 1.89% and 0.88%, respectively. 【Conclusion】 Through fine-grained recognition, the Local-Global-VIT algorithm effectively addresses the classification challenge posed by large intra-class variation and small inter-class differences in butterflies. It can accurately identify butterfly species and thus contributes to the efficient assessment of habitat quality.

Key words: Butterfly, image recognition, fine-grained classification, vision transformer, local token selection, global token aggregation
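
The abstract names two mechanisms without giving formulas: selecting local tokens by self-attention, and a contrastive loss that widens the inter-class distance. The following is a minimal sketch of what such components could look like, not the authors' implementation. It assumes that tokens are ranked by the attention the class token pays to each patch, and that the loss is a margin-based cosine contrastive loss of the kind used in fine-grained transformers. The function names `select_local_tokens` and `contrastive_loss`, the input layout, and the margin value 0.4 are all hypothetical choices made for illustration.

```python
import math


def select_local_tokens(cls_attention, k):
    """Return the indices of the k patch tokens with the highest attention.

    cls_attention is an assumed input: one weight per patch token, giving
    the attention paid by the class token to that patch in one layer.
    """
    ranked = sorted(range(len(cls_attention)),
                    key=lambda i: cls_attention[i], reverse=True)
    return sorted(ranked[:k])  # restore the original patch order


def cosine(u, v):
    """Cosine similarity between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(features, labels, margin=0.4):
    """Margin-based contrastive loss averaged over all pairs in a batch.

    Same-class pairs are penalised by (1 - similarity), pulling them
    together. Different-class pairs are penalised only when their
    similarity exceeds the margin, pushing them apart.
    The margin of 0.4 is an assumed default, not a value from the paper.
    """
    n = len(features)
    total = 0.0
    for i in range(n):
        for j in range(n):
            sim = cosine(features[i], features[j])
            if labels[i] == labels[j]:
                total += 1.0 - sim
            else:
                total += max(sim - margin, 0.0)
    return total / (n * n)


if __name__ == "__main__":
    # Patches 1 and 3 receive the most attention, so they are kept.
    print(select_local_tokens([0.1, 0.5, 0.2, 0.9], k=2))  # -> [1, 3]

    # Two identical features with different labels are penalised.
    print(contrastive_loss([[1.0, 0.0], [1.0, 0.0]], [0, 1]))
```

In this sketch, different-class pairs whose similarity is already below the margin contribute nothing, so training effort concentrates on the pairs that are hardest to tell apart, which is the situation the abstract describes for species with small inter-class differences.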

LI Jian-Xiang, LI Xiao-Lin, WANG Rong, ZHANG Yuan-Zi, CHEN Shu-Wu, ZHANG Fei-Ping, HUANG Shi-Guo. Butterfly recognition based on Local-Global-VIT fine-grained classification algorithm[J]. Acta Entomologica Sinica, 2024, 67(9): 1251-1261.
