当前位置:
X-MOL 学术
›
Proc. Natl. Acad. Sci. U.S.A.
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Targeted whole-genome recovery of single viral species in a complex environmental sample
Proceedings of the National Academy of Sciences of the United States of America ( IF 9.4 ) Pub Date : 2024-07-25 , DOI: 10.1073/pnas.2404727121 Liyin Chen 1 , Anqi Chen 1 , Xinge Diana Zhang 1 , Maria Teresa Saenz Robles 2 , Hee-Sun Han 3, 4 , Yi Xiao 1 , Gao Xiao 1 , James M Pipas 2 , David A Weitz 1, 5
Proceedings of the National Academy of Sciences of the United States of America ( IF 9.4 ) Pub Date : 2024-07-25 , DOI: 10.1073/pnas.2404727121 Liyin Chen 1 , Anqi Chen 1 , Xinge Diana Zhang 1 , Maria Teresa Saenz Robles 2 , Hee-Sun Han 3, 4 , Yi Xiao 1 , Gao Xiao 1 , James M Pipas 2 , David A Weitz 1, 5
Affiliation
Characterizing unknown viruses is essential for understanding viral ecology and preparing against viral outbreaks. Recovering complete genome sequences from environmental samples remains computationally challenging using metagenomics, especially for low-abundance species with uneven coverage. We present an experimental method for reliably recovering complete viral genomes from complex environmental samples. Individual genomes are encapsulated into droplets and amplified using multiple displacement amplification. A unique gene detection assay, which employs an RNA-based probe and an exonuclease, selectively identifies droplets containing the target viral genome. Labeled droplets are sorted using a microfluidic sorter, and genomes are extracted for sequencing. We demonstrate this method’s efficacy by spiking two known viral genomes, Simian virus 40 (SV40, 5,243 bp) and Human Adenovirus 5 (HAd5, 35,938 bp), into a sewage sample with a final abundance in the droplets of around 0.1% and 0.015%, respectively. We achieve 100% recovery of the complete sequence of the spiked-in SV40 genome with uniform coverage distribution. For the larger HAd5 genome, we cover approximately 99.4% of its sequence. Notably, genome recovery is achieved with as few as one sorted droplet, which enables the recovery of any desired genomes in complex environmental samples, regardless of their abundance. This method enables single-genome whole-genome amplification and targeting characterizations of rare viral species and will facilitate our ability to access the mutational profile in single-virus genomes and contribute to an improved understanding of viral ecology.
中文翻译:
复杂环境样本中单一病毒物种的靶向全基因组恢复
表征未知病毒对于了解病毒生态学和预防病毒爆发至关重要。使用宏基因组学从环境样本中恢复完整的基因组序列在计算上仍然具有挑战性,特别是对于覆盖度不均匀的低丰度物种。我们提出了一种从复杂环境样本中可靠地恢复完整病毒基因组的实验方法。单个基因组被封装到液滴中并使用多重位移扩增进行扩增。独特的基因检测分析采用基于 RNA 的探针和核酸外切酶,选择性地识别含有目标病毒基因组的液滴。使用微流体分选器对标记的液滴进行分选,并提取基因组进行测序。我们通过将两种已知病毒基因组——猿猴病毒 40(SV40,5,243 bp)和人类腺病毒 5(HAd5,35,938 bp)掺入污水样本中来证明该方法的功效,液滴中的最终丰度约为 0.1% 和 0.015% , 分别。我们实现了 SV40 基因组完整序列的 100% 恢复,且覆盖度分布均匀。对于较大的 HAd5 基因组,我们覆盖了大约 99.4% 的序列。值得注意的是,基因组恢复只需一个分选的液滴即可实现,这使得能够在复杂的环境样本中恢复任何所需的基因组,无论其丰度如何。该方法能够实现单基因组全基因组扩增和稀有病毒物种的靶向表征,并将促进我们获得单病毒基因组突变谱的能力,并有助于更好地了解病毒生态学。
更新日期:2024-07-25
中文翻译:
复杂环境样本中单一病毒物种的靶向全基因组恢复
表征未知病毒对于了解病毒生态学和预防病毒爆发至关重要。使用宏基因组学从环境样本中恢复完整的基因组序列在计算上仍然具有挑战性,特别是对于覆盖度不均匀的低丰度物种。我们提出了一种从复杂环境样本中可靠地恢复完整病毒基因组的实验方法。单个基因组被封装到液滴中并使用多重位移扩增进行扩增。独特的基因检测分析采用基于 RNA 的探针和核酸外切酶,选择性地识别含有目标病毒基因组的液滴。使用微流体分选器对标记的液滴进行分选,并提取基因组进行测序。我们通过将两种已知病毒基因组——猿猴病毒 40(SV40,5,243 bp)和人类腺病毒 5(HAd5,35,938 bp)掺入污水样本中来证明该方法的功效,液滴中的最终丰度约为 0.1% 和 0.015% , 分别。我们实现了 SV40 基因组完整序列的 100% 恢复,且覆盖度分布均匀。对于较大的 HAd5 基因组,我们覆盖了大约 99.4% 的序列。值得注意的是,基因组恢复只需一个分选的液滴即可实现,这使得能够在复杂的环境样本中恢复任何所需的基因组,无论其丰度如何。该方法能够实现单基因组全基因组扩增和稀有病毒物种的靶向表征,并将促进我们获得单病毒基因组突变谱的能力,并有助于更好地了解病毒生态学。