Scientific Data ( IF 5.8 ) Pub Date : 2023-07-19 , DOI: 10.1038/s41597-023-02380-z Bei Lu 1, 2 , Tao Shi 1 , Jinming Chen 1
Watershield (Brasenia schreberi) is an aquatic plant that belongs to the basal angiosperm family Cabombaceae. This species has been cultivated as an aquatic vegetable for more than 3000 years in East Asia, but the natural populations have greatly declined in recent decades and have become endangered in several countries of East Asia. In this study, by using PacBio long reads, Illumina short reads, and Hi-C sequencing data, we assembled the genome of B. schreberi, which was approximately 1170.4 Mb in size with a contig N50 of 7.1 Mb. Of the total assembled sequences, 93.6% were anchored to 36 pseudochromosomes with a scaffold N50 of 28.9 Mb. A total of 74,699 protein-coding genes were predicted in the B. schreberi genome, and 558 Mb of repetitive elements occupying 47.69% of the genome were identified. BUSCO analysis yielded a completeness score of 95.8%. The assembled high-quality genome of B. schreberi will be a valuable reference for the study of conservation, evolution and molecular breeding in this species.
中文翻译:
水莼(Brasenia schreberi)的染色体水平基因组组装
莼菜(Brasenia schreberi)是属于基生被子植物卡邦科(Cabombaceae)的水生植物。该物种作为水生蔬菜在东亚已有3000多年的栽培历史,但近几十年来自然种群数量大幅下降,在东亚多个国家已濒临灭绝。在本研究中,我们利用 PacBio 长读长、Illumina 短读长和 Hi-C 测序数据组装了B的基因组。schreberi,大小约为 1170.4 Mb,重叠群 N50 为 7.1 Mb。在总组装序列中,93.6% 锚定到 36 条假染色体,支架 N50 为 28.9 Mb。在B. schreberi基因组中预测出总共 74,699 个蛋白质编码基因,并鉴定出 558 Mb 的重复元件,占基因组的 47.69%。BUSCO 分析得出的完整性得分为 95.8%。组装的北蝾螈高质量基因组将为该物种的保护、进化和分子育种研究提供有价值的参考。