Presented By: DCMB Seminar Series
DCMB / CCMB Weekly Seminar
Jingyi Jessica Li, PhD, "Permutation Enhances the Rigor of Single-Cell Data Analysis”
Abstract:
Ensuring the reliability and accuracy of single-cell data analysis is critical, particularly in visualizing complex biological structures and addressing data sparsity. This talk introduces two novel statistical methods—scDEED and mcRigor—that leverage permutation-based techniques to enhance the rigor of these analyses.
scDEED ([Xia et al., 2024, Nature Communications](https://www.nature.com/articles/s41467-024-45891-y)) addresses the challenge of evaluating the reliability of two-dimensional (2D) embeddings produced by visualization methods like t-SNE and UMAP, which are commonly used to visualize cell clusters. These methods, however, can sometimes misrepresent data structure, leading to erroneous interpretations. scDEED calculates a reliability score for each cell embedding, comparing the consistency between a cell's neighbors in the 2D embedding space and its pre-embedding neighbors. Cells with low reliability scores are flagged as dubious, while those with high scores are deemed trustworthy. Additionally, scDEED provides guidance for optimizing t-SNE and UMAP hyperparameters by minimizing the number of dubious embeddings, significantly improving visualization reliability across multiple datasets.
mcRigor focuses on enhancing metacell partitioning in single-cell RNA-seq and ATAC-seq data analysis, a common strategy to address data sparsity by aggregating similar single cells into metacells. Existing algorithms often fail to verify metacell homogeneity, risking bias and spurious findings. mcRigor introduces a feature-correlation-based statistic to measure heterogeneity within a metacell, identifying dubious metacells composed of heterogeneous single cells. By optimizing metacell partitioning algorithm hyperparameters, mcRigor enhances the reliability of downstream analyses. Moreover, mcRigor allows for benchmarking and selecting the most suitable partitioning algorithm for a dataset, ensuring more robust discoveries.
scDEED and mcRigor demonstrate the power of permutation-based approaches in refining single-cell data analysis, providing researchers with tools to achieve more accurate and reproducible insights into complex cellular processes.
Ensuring the reliability and accuracy of single-cell data analysis is critical, particularly in visualizing complex biological structures and addressing data sparsity. This talk introduces two novel statistical methods—scDEED and mcRigor—that leverage permutation-based techniques to enhance the rigor of these analyses.
scDEED ([Xia et al., 2024, Nature Communications](https://www.nature.com/articles/s41467-024-45891-y)) addresses the challenge of evaluating the reliability of two-dimensional (2D) embeddings produced by visualization methods like t-SNE and UMAP, which are commonly used to visualize cell clusters. These methods, however, can sometimes misrepresent data structure, leading to erroneous interpretations. scDEED calculates a reliability score for each cell embedding, comparing the consistency between a cell's neighbors in the 2D embedding space and its pre-embedding neighbors. Cells with low reliability scores are flagged as dubious, while those with high scores are deemed trustworthy. Additionally, scDEED provides guidance for optimizing t-SNE and UMAP hyperparameters by minimizing the number of dubious embeddings, significantly improving visualization reliability across multiple datasets.
mcRigor focuses on enhancing metacell partitioning in single-cell RNA-seq and ATAC-seq data analysis, a common strategy to address data sparsity by aggregating similar single cells into metacells. Existing algorithms often fail to verify metacell homogeneity, risking bias and spurious findings. mcRigor introduces a feature-correlation-based statistic to measure heterogeneity within a metacell, identifying dubious metacells composed of heterogeneous single cells. By optimizing metacell partitioning algorithm hyperparameters, mcRigor enhances the reliability of downstream analyses. Moreover, mcRigor allows for benchmarking and selecting the most suitable partitioning algorithm for a dataset, ensuring more robust discoveries.
scDEED and mcRigor demonstrate the power of permutation-based approaches in refining single-cell data analysis, providing researchers with tools to achieve more accurate and reproducible insights into complex cellular processes.
Related Links
Explore Similar Events
-
Loading Similar Events...