MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data

Justin Feigelman, Fabian J. Theis, Carsten Marr

Background: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable.

Results: We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations.

Conclusions: MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation.

BioVis 2014 Information

Interactive Exploration of Spatial Distribution in Mass Spectrometry ImagingNeXO Web: An integrated ontology visualization application for modern web platformsTreemap Visualization of Personal Genomic ReportsHitWalker2: An interactive and queryable web-based framework for variant prioritization in precision medicineGWAS Viewer - a fast and interactive visualization for GWAS resultsConTour: Data-Driven Exploration of Multi-Relational Datasets for Drug DiscoveryAn interactive visualisation tool for the hierarchical clustering of large data setsMCAweb: an interactive graphical tool for Multiresolution Correlation Analysis in single-cell dataGingr: Interactive visualization of large-scale phylogenies and multi-alignments.NeuroLines: A Subway Map Metaphor for Visualizing Nanoscale Neuronal ConnectivityLONGEVITY: A novel visualization platform for interpreting multidimensional gene expressionExploration of metagenome assemblies with an interactive visualization tool.