Acta Entomologica Sinica, 2021, Vol. 64, Issue (11): 1244-1251. doi: 10.16380/j.kcxb.2021.11.002

• Research Article •

Optimizing the genome annotation of the Chinese oak silkworm using full-length transcripts (in English)

李莹1,#, 雷煜宇1,#, 梁世梅1, 章贤1, 杜杰1,  杨新峰2, 李闪闪1, 段建平1,*   

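  (1. Nanyang Normal University, Henan Key Laboratory of Insect Biology in Funiu Mountain, Henan Provincial Engineering Laboratory of Insects Bio-reactor, Nanyang, Henan 473061; 2. Henan Institute of Sericulture Science, Zhengzhou 450008)
  • Publication date: 2021-11-20 Release date: 2021-11-03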

Improvement of the annotation of Antheraea pernyi (Lepidoptera: Saturniidae) genome using full-length transcripts (In English)

LI Ying1,#, LEI Yu-Yu1,#, LIANG Shi-Mei1, ZHANG Xian1, DU Jie1,  YANG Xin-Feng2, LI Shan-Shan1, DUAN Jian-Ping1,*    

  (1. Henan Key Laboratory of Insect Biology in Funiu Mountain, Henan Provincial Engineering Laboratory of Insects Bio-reactor, Nanyang Normal University, Nanyang, Henan 473061, China; 2. Henan Institute of Sericulture Science, Zhengzhou 450008, China)
  • Online: 2021-11-20 Published: 2021-11-03

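Abstract (translated from the Chinese):

【Aim】 To improve the annotation of the genome of the Chinese oak silkworm, Antheraea pernyi, and to better extend its use in comparative genomics and breed improvement research. 【Methods】 The full-length transcriptome of A. pernyi was sequenced and analyzed. The full-length transcripts were aligned to the reference genome to identify novel genes and novel transcripts, which were then functionally annotated and used to predict long non-coding RNAs (lncRNAs). Gene structures in the A. pernyi genome were revised using a large number of protein-coding transcripts and lncRNAs. Finally, a corrected gene annotation of the A. pernyi genome was created. 【Results】 A total of 1 997 novel protein-coding genes and 3 399 novel lncRNA genes were discovered, supported by 2 402 and 3 574 full-length transcripts, respectively. The A. pernyi genome was found to contain 25 021 genes, of which 19 825 are protein-coding genes, including seven juvenile hormone acid methyltransferase genes. 【Conclusion】 This study advances the understanding of gene annotation in the A. pernyi genome and provides a useful data resource for functional and comparative genomics research on A. pernyi and related species.

Key words (translated from the Chinese): Chinese oak silkworm; genome annotation; full-length transcriptome; protein-coding genes; lncRNA; juvenile hormone acid methyltransferase genes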

Abstract:

【Aim】 This study aims to improve the annotation of the genome of the Chinese oak silkworm, Antheraea pernyi, so as to expand its application in comparative genomics and breed improvement. 【Methods】 The full-length transcriptome of A. pernyi was sequenced and analyzed. The full-length transcripts were aligned to the reference genome to identify novel genes and novel transcripts, which were then functionally annotated and used to predict long non-coding RNAs (lncRNAs). The gene models in the genome of A. pernyi were revised using thousands of novel protein-coding transcripts and lncRNAs. Finally, a corrected and merged annotation of the genome of A. pernyi was created. 【Results】 A total of 1 997 novel protein-coding genes and 3 399 novel lncRNA genes were discovered, supported by 2 402 and 3 574 full-length transcripts, respectively. The genome of A. pernyi contains 25 021 genes, including 19 825 protein-coding genes, among which seven juvenile hormone acid O-methyltransferase genes were identified. 【Conclusion】 This study improves our knowledge of the genes in the genome of A. pernyi and provides valuable resources for comparative and functional genomic studies in A. pernyi and its relatives.

Key words: Antheraea pernyi, genome annotation, full-length transcriptome, protein-coding genes, lncRNA, juvenile hormone acid O-methyltransferase genes
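
The counts reported in the Results can be cross-checked with a little arithmetic. The sketch below is only an illustration: the six input numbers are copied from the abstract, while the derived figures (the remainder of genes not counted as protein-coding, the protein-coding share, and the average number of supporting full-length transcripts per novel gene) are computed here and are not values stated by the authors. The function name `summarize_annotation` and the dictionary keys are hypothetical.

```python
# Counts copied from the Results paragraph of the abstract.
TOTAL_GENES = 25_021
PROTEIN_CODING_GENES = 19_825
NOVEL_PROTEIN_CODING_GENES = 1_997
NOVEL_LNCRNA_GENES = 3_399
TRANSCRIPTS_NOVEL_PROTEIN_CODING = 2_402
TRANSCRIPTS_NOVEL_LNCRNA = 3_574


def summarize_annotation():
    """Return derived figures; these are illustrative, not reported by the authors."""
    return {
        # Genes in the merged annotation that are not counted as protein-coding.
        "other_genes": TOTAL_GENES - PROTEIN_CODING_GENES,
        # All novel genes (protein-coding plus lncRNA).
        "novel_genes": NOVEL_PROTEIN_CODING_GENES + NOVEL_LNCRNA_GENES,
        # Fraction of all annotated genes that are protein-coding.
        "protein_coding_share": PROTEIN_CODING_GENES / TOTAL_GENES,
        # Average supporting full-length transcripts per novel gene.
        "transcripts_per_novel_pc_gene":
            TRANSCRIPTS_NOVEL_PROTEIN_CODING / NOVEL_PROTEIN_CODING_GENES,
        "transcripts_per_novel_lncrna_gene":
            TRANSCRIPTS_NOVEL_LNCRNA / NOVEL_LNCRNA_GENES,
    }


summary = summarize_annotation()
for name, value in summary.items():
    # Integers print as-is; ratios are rounded for readability.
    print(name, value if isinstance(value, int) else round(value, 3))
```

On these inputs, the novel protein-coding genes average slightly more supporting transcripts per gene than the novel lncRNA genes, which is all the abstract's numbers allow one to say without the underlying annotation files.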