University calendar

Data/Computational Seminar Series with Dr. Pan Liu on recent Nature Communications work

Wednesday, October 29, 2025 at 1:00pm to 2:00pm

In single-cell data analysis, addressing sparsity often involves aggregating the profiles of homogeneous single cells into metacells. However, existing metacell partitioning methods lack checks on the homogeneity assumption and may aggregate heterogeneous single cells, potentially biasing downstream analysis and leading to spurious discoveries. To fill this gap, we introduce mcRigor, a statistical method to detect dubious metacells, which are composed of heterogeneous single cells, and optimize the hyperparameter(s) of a metacell partitioning method. The core of mcRigor is a feature-correlation-based statistic that measures the heterogeneity of a metacell, with its null distribution derived from a double permutation scheme. As an optimizer for existing metacell partitioning methods, mcRigor has been shown to improve the reliability of discoveries in single-cell RNA-seq and multiome (RNA + ATAC) data analyses, such as uncovering differential gene co-expression modules, enhancer-gene associations, and gene temporal expression. Moreover, mcRigor enables benchmarking and selection of the most suitable metacell partitioning method with optimized hyperparameter(s) tailored to a specific dataset, ensuring reliable downstream analysis. Our results indicate that among existing metacell partitioning methods, MetaCell and SEACells consistently outperform MetaCell2 and SuperCell, albeit with the trade-off of longer runtimes.

This is joint work with Jingyi Jessica Li of UCLA and Fred Hutchinson Cancer Research Center.

DOI: 10.1038/s41467-025-63626-5

Textile 105 : Zoom (linked to be disclosed)
Donghui Yan
dyan@umassd.edu
https://www.cscdr.umassd.edu/seminars

Back to top of screen