昆虫学报 ›› 2022, Vol. 65 ›› Issue (1): 31-43.doi: 10.16380/j.kcxb.2022.01.004

• 研究论文 • 上一篇    下一篇

中华大仰蝽转录组学研究及SSR新标记开发

李敏1, 张丹丽1, 李荣荣1, 雷廷1, 宋鲜梅1, 卜文俊2,*   

  1.  (1. 太原师范学院生物系, 山西晋中 030619; 2. 南开大学生命科学学院昆虫学研究所, 天津 300071)
  • 出版日期:2022-01-20 发布日期:2022-01-17

Analysis of the transcriptome and development of novel SSR markers in Notonecta chinensis (Hemiptera: Notonectidae)

LI Min1, ZHANG Dan-Li1, LI Rong-Rong1, LEI Ting1, SONG Xian-Mei1, BU Wen-Jun2,*   

  1.  (1. Department of Biology, Taiyuan Normal University, Jinzhong, Shanxi 030619, China; 2. Institute of Entomology, College of Life Sciences, Nankai University, Tianjin 300071, China)
  • Online:2022-01-20 Published:2022-01-17

摘要:

 【目的】中华大仰蝽Notonecta chinensis为中国和日本冲绳分布的重要水生天敌昆虫,可用于蚊虫的生物防治。本研究旨在建立中华大仰蝽转录组数据库,挖掘其基因信息。【方法】采用高通量测序平台Illumina NextSeq500对中华大仰蝽进行转录组测序、de novo组装及生物信息学分析;利用MISA软件基于转录组unigenes数据进行SSR新分子标记筛选。毛细管电泳检测SSR多态性。【结果】总计获得34 782 282条clean reads(NCBI SRA数据库登录号: SRR13259254),组装成37 801条unigenes,N50为913 bp。将unigenes与已知数据库比对进行基因功能注释,分别有36 474, 32 470, 27 781, 35 079和5 638条序列注释到nr, Swiss-Prot, GO, eggNOG和KEGG数据库。通过GO数据库注释,unigenes的功能可分为生物学过程、细胞组分和分子功能三大类,其中参与细胞、细胞部分及结合功能的unigenes比例较大。eggNOG数据库注释结果显示,37 801条unigenes归到25个基因家族,注释到未知功能的最多。KEGG代谢通路富集分析显示,5 638条unigenes注释到245个代谢通路,注释到核糖体的数目最多。此外,用MISA软件在转录组测序数据中的37 801条unigenes中搜索到3 124个SSR位点(占总unigenes的8.26%),发生频率为707%。通过PCR筛选出16个SSR位点。7个中华大仰蝽地理种群3个位点NcCF/NcCR, NcKF/NcKR和NcLF/NcLR的多态信息含量(PIC)分别为0.870, 0.902和0.857,具高度多态性。【结论】本研究成功获得了中华大仰蝽转录组数据,为其基因功能分析提供了分子理论基础;SSR新标记的开发为中华大仰蝽遗传多样性分析、隐存种鉴定及基因图谱构建提供了更丰富的候选分子标记。

关键词: 中华大仰蝽, 转录组, SSR, 基因注释, Illumina NextSeq

Abstract: 【Aim】Notonecta chinensis, distributed in China and Okinawa of Japan, is an important aquatic natural enemy insect which can be used for biological control of mosquitoes. The purpose of this study is to establish the transcriptome database of N. chinensis, and to mine its genetic information. 【Methods】The transcriptome of N. chinensis was sequenced, de novo assembled and subjected to bioinformatics analysis by using the Illumina NextSeq500 high-throughput sequencing platform. New SSR molecular markers were selected by using MISA software based on the transcriptome unigenes data. Polymorphism of SSRs was detected by capillary electrophoresis. 【Results】A total of 34 782 282 clean reads (NCBI SRA accession number: SRR13259254) were obtained, and assembled into 37 801 unigenes with the N50 length of 913 bp. All unigenes were aligned against the known databases for gene function annotation, and 36 474, 32 470, 27 781, 35 079 and 5 638 sequences were annotated in nr, SwissProt, GO, eggNOG and KEGG databases, respectively. According to GO database annotation, the function of the unigenes can be divided into three categories including biological process, cellular component and molecular function, among which unigenes involved in cell, cell part and binding were enriched to a higher degree. eggNOG database annotation results showed that 37 801 unigenes were classified into 25 gene families, with the most annotated to function unknown. Enrichment analysis of KEGG metabolic pathways showed that 5 638 unigenes were annotated to 245 metabolic pathways, and the number of annotations to ribosome was the largest. Moreover, a large-scale SSR search was performed for 37 801 unigenes on the transcriptome sequencing data using the software MISA, and a total of 3 124 SSR loci (accounting for 8.26%) were identified, with a frequency of 7.07%. A total of 16 SSR loci were screened by PCR. The polymorphic information content (PIC) of the three loci NcCF/NcCR, NcKF/NcKR and NcLF/NcLR in seven geographical populations of N. chinensis were 0.870, 0.902 and 0.857, respectively, indicating high polymorphism. 【Conclusion】In this study, we successfully obtained the reference transcriptome of N. chinensis, which provides a molecular theoretical basis for its gene function analysis. The development of these potential SSR markers could provide a set of candidate molecular markers for genetic diversity analysis, cryptic species identification and genetic mapping construction of N. chinensis.

Key words: Notonecta chinensis, transcriptome, SSR, gene annotation, Illumina NextSeq