昆虫学报 ›› 2025, Vol. 68 ›› Issue (8): 1164-1174.doi: 10.16380/j.kcxb.2025.08.014

• 研究论文 • 上一篇    下一篇

基于机器学习的贵州省中华蜜蜂工蜂形态分化分析#br#

王胤晨1, 袁扬1, 任昌仕1, 赵恬1, 邓梦青1, 任荣清1王华1, 方小明2, 廖艳1, 王海龙3, 房宇2,*   

  1. (1. 贵州省畜牧兽医研究所, 贵阳 550002; 2.中国农业科学院蜜蜂研究所, 北京 100093; 3. 武川县农牧技术推广中心, 呼和浩特 011700)
  • 出版日期:2025-08-20 发布日期:2025-09-30

Analysis of the morphological differentiation of Apis cerana cerana (Hymenoptera: Apidae) workers in Guizhou Province, Southwest China based on machine learning

WANG Yin-Chen1, YUAN Yang1, REN Chang-Shi1, ZHAO Tian1, DENG Meng-Qing1, REN Rong-Qing1, WANG Hua1, FANG Xiao-Ming2, LIAO Yan1, WANG Hai-Long3, FANG Yu2,*   

  1. (1. Guizhou Institute of Animal Husbandry and Veterinary Science, Guiyang 550002, China; 2. Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing 100093, China; 3. Wuchuan County Agricultural and Animal Husbandry Technology Promotion Center, Hohhot 011700, China)
  • Online:2025-08-20 Published:2025-09-30

摘要: 【目的】中华蜜蜂Apis cerana cerana形态受地理及生态差异影响。本研究旨在探究贵州省不同地区中华蜜蜂的形态差异,以期为其形态在不同地区之间的分化提供参考。【方法】选取贵州省内33个地区云贵高原型中华蜜蜂工蜂样本共7 524头,解剖后测量喙、足、背板、腹板和前翅相关形态参数值,再采用随机森林(random forest,RF)、神经网络(neural network,NN)和支持向量机(support vector machine,SVM)方法经参数调优后确定最佳形态分析模型和筛选变量重要性;根据变量重要性进行Kmeans无监督聚类分析;将重要变量与气温、降水量和植被值进行Pearson相关性分析。【结果】机器学习结果显示,当mtry为5,随机树为550时参数调优结果最佳; 3个模型中SVM在精确率(precision)、召回率(recall)及F1分值上优于RF和NN;第3腹板蜡镜间距、第3腹板蜡镜长、第3腹板蜡镜宽和肘脉2、前翅长重要性值较高。聚类分析结果显示,织金、贞丰、赤水等共12个地区的中华蜜蜂工蜂样本聚为一类;纳雍、赫章和册亨等共8个地区的中华蜜蜂工蜂样本聚为另一类;息烽、平塘和罗甸等共13个地区的中华蜜蜂工蜂样本被聚在2类内。相关性分析结果显示,第3腹板蜡镜长与第3腹板蜡镜宽和前翅宽呈极显著正相关;第3腹板蜡镜宽与前翅宽呈极显著正相关;第3腹板蜡镜间距与3年春冬温差累积值呈显著负相关;翅脉2与3年春冬温差累积值呈显著负相关。【结论】基于机器学习的结果表明中华蜜蜂在贵州省内不同地区出现形态上的分化。SVM方法可为蜜蜂的分类提供新的思路,有助于理解中华蜜蜂形态在贵州省内的演化趋势,对推动保护地方中华蜜蜂种质资源具有指导意义。

关键词: 蜜蜂, 中华蜜蜂, 工蜂, 形态, 协同演化, 机器学习

Abstract: 【Aim】The morphology of Apis cerana cerana is influenced by geographical and ecological differences. This study aims to explore the morphological differences of A. c. cerana in different regions of Guizhou Province, Southwest China, so as to provide a reference for understanding its morphological differentiation across regions. 【Methods】A total of 7 524 worker samples of the A. c. cerana Yunnan-Guizhou Plateau ecotype were collected from 33 regions across Guizhou Province. The morphological parameters related to proboscis, legs, tergites, sternites and forewings were measured post-dissection. The random forest (RF), neural network (NN) and support vector machine (SVM) methods were employed for parameter tuning to identify the optimal morphological analysis model and screen the importance of variables. The K-means unsupervised cluster analysis was conducted based on the importance of variables. The Pearson correlation analysis was performed between important variables and temperature, precipitation and vegetation values. 【Results】 The machine learning results indicated that the optimal parameter tuning was achieved with the mtry set at 5 and random tree at 550. Among the three models, SVM outperformed RF and NN in terms of precision, recall and F1 score. The importance values for the wax mirror interval on sternite Ⅲ, wax mirror length on sternite Ⅲ, wax mirror width on sternite Ⅲ, cubital2 and forewing length were notably high. The cluster analysis results revealed that the worker samples of A. c. cerana from 12 regions such as Zhijin, Zhenfeng and Chishui clustered into one group, those from 8 regions such as Nayong, Hezhang and Ceheng clustered into another, while those from 13 regions such as Xifeng, Pingtang and Luodian were distributed across both groups. The correlation analysis result showed a highly significant positive correlation between the wax mirror length on sternite Ⅲ and the wax mirror width on sternite Ⅲ and forewing width, and the wax mirror width on sternite Ⅲ was also highly positively correlated with the forewing width. The wax mirror interval on sternite Ⅲ was significantly negatively correlated with the cumulative spring-winter temperature difference over three years, and cubital2 exhibited a significant negative correlation with the cumulative spring-winter temperature difference over three years. 【Conclusion】 The machine learning results indicate morphological differentiation of A. c. cerana across different regions in Guizhou Province. This SVM method offers new insights into bee classification and aids in understanding the evolutionary trends of A. c. cerana morphology within Guizhou Province. It also has implications for the conservation of local A. c. cerana germplasm resources in Guizhou Province.

Key words: Honey bee; Apis cerana cerana, worker bee, morphology, coevolution, machine learning