当前位置: X-MOL 学术Proc. Natl. Acad. Sci. U.S.A. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Joint trajectory inference for single-cell genomics using deep learning with a mixture prior
Proceedings of the National Academy of Sciences of the United States of America ( IF 9.4 ) Pub Date : 2024-09-03 , DOI: 10.1073/pnas.2316256121
Jin-Hong Du 1, 2 , Tianyu Chen 3 , Ming Gao 4 , Jingshu Wang 5
Affiliation  

Trajectory inference methods are essential for analyzing the developmental paths of cells in single-cell sequencing datasets. It provides insights into cellular differentiation, transitions, and lineage hierarchies, helping unravel the dynamic processes underlying development and disease progression. However, many existing tools lack a coherent statistical model and reliable uncertainty quantification, limiting their utility and robustness. In this paper, we introduce VITAE (Variational Inference for Trajectory by AutoEncoder), a statistical approach that integrates a latent hierarchical mixture model with variational autoencoders to infer trajectories. The statistical hierarchical model enhances the interpretability of our framework, while the posterior approximations generated by our variational autoencoder ensure computational efficiency and provide uncertainty quantification of cell projections along trajectories. Specifically, VITAE enables simultaneous trajectory inference and data integration, improving the accuracy of learning a joint trajectory structure in the presence of biological and technical heterogeneity across datasets. We show that VITAE outperforms other state-of-the-art trajectory inference methods on both real and synthetic data under various trajectory topologies. Furthermore, we apply VITAE to jointly analyze three distinct single-cell RNA sequencing datasets of the mouse neocortex, unveiling comprehensive developmental lineages of projection neurons. VITAE effectively reduces batch effects within and across datasets and uncovers finer structures that might be overlooked in individual datasets. Additionally, we showcase VITAE’s efficacy in integrative analyses of multiomic datasets with continuous cell population structures.

中文翻译:


使用深度学习和混合先验进行单细胞基因组学的联合轨迹推断



轨迹推理方法对于分析单细胞测序数据集中细胞的发育路径至关重要。它提供了对细胞分化、转换和谱系层次结构的见解,有助于揭示发育和疾病进展背后的动态过程。然而,许多现有工具缺乏连贯的统计模型和可靠的不确定性量化,限制了它们的实用性和稳健性。在本文中,我们介绍了 VITAE (Variational Inference for Trajectory by AutoEncoder),这是一种统计方法,它将潜在的分层混合模型与变分自动编码器集成在一起来推断轨迹。统计分层模型增强了我们框架的可解释性,而我们的变分自动编码器生成的后验近似确保了计算效率,并提供了沿轨迹的细胞投影的不确定性量化。具体来说,VITAE 支持同步轨迹推理和数据集成,提高了在数据集之间存在生物和技术异质性的情况下学习联合轨迹结构的准确性。我们表明,在各种轨迹拓扑下,VITAE 在真实和合成数据上都优于其他最先进的轨迹推理方法。此外,我们应用 VITAE 联合分析小鼠新皮层的三个不同的单细胞 RNA 测序数据集,揭示了投射神经元的全面发育谱系。VITAE 有效地减少了数据集内部和数据集之间的批次效应,并揭示了在单个数据集中可能被忽视的更精细的结构。此外,我们展示了 VITAE 在具有连续细胞群结构的多组学数据集综合分析中的功效。
更新日期:2024-09-03
down
wechat
bug