publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2024
- Spotiflow: accurate and efficient spot detection for fluorescence microscopy with deep stereographic flow regressionAlbert Dominguez Mantes , Antonio Herrera , Irina Khven , and 14 more authorsbioRxiv, 2024
Identifying spot-like structures in large and noisy microscopy images is a crucial step to produce high quality results in various life-science applications. Imaging-based spatial transcriptomics (iST) methods, in particular, critically depend on the precise detection of millions of transcripts in images with low signal-to-noise ratio. Despite advances in computer vision that have revolutionized many biological imaging tasks, currently adopted spot detection techniques are mostly still based on classical signal processing methods that are fragile and require tedious manual tuning per dataset. In this work, we introduce Spotiflow, a deep learning method that achieves subpixel-accurate localizations by formulating the spot detection task as a multiscale heatmap and stereographic flow regression problem. Spotiflow can be used for 2D images and 3D volumetric stacks and can be trained to generalize across different imaging conditions, tissue types and chemical preparations, while being substantially more time- and memory-e cient than existing methods. We show the eficacy of Spotiflow via extensive quantitative experiments on a variety of diverse datasets and demonstrate that the enhanced accuracy of Spotiflow leads to meaningful improvements in the biological insights obtained from iST and live imaging experiments. Spotiflow is available as an easy-to-use Python library as well as a napari plugin at https://github.com/weigertlab/spotiflow.
- Statistical inference with a manifold-constrained RNA velocity model uncovers cell cycle speed modulationsAlex R. Lederer , Maxine Leonardi , Lorenzo Talamanca , and 10 more authorsbioRxiv, 2024
Across a range of biological processes, cells undergo coordinated changes in gene expression, resulting in transcriptome dynamics that unfold within a low-dimensional manifold. Single-cell RNA-sequencing (scRNA-seq) only measures temporal snapshots of gene expression. However, information on the underlying low-dimensional dynamics can be extracted using RNA velocity, which models unspliced and spliced RNA abundances to estimate the rate of change of gene expression. Available RNA velocity algorithms can be fragile and rely on heuristics that lack statistical control. Moreover, the estimated vector field is not dynamically consistent with the traversed gene expression manifold. Here, we develop a generative model of RNA velocity and a Bayesian inference approach that solves these problems. Our model couples velocity field and manifold estimation in a reformulated, unified framework, so as to coherently identify the parameters of an autonomous dynamical system. Focusing on the cell cycle, we implemented VeloCycle to study gene regulation dynamics on one-dimensional periodic manifolds and validated using live-imaging its ability to infer actual cell cycle periods. We benchmarked RNA velocity inference with sensitivity analyses and demonstrated one- and multiple-sample testing. We also conducted Markov chain Monte Carlo inference on the model, uncovering key relationships between gene-specific kinetics and our gene-independent velocity estimate. Finally, we applied VeloCycle to in vivo samples and in vitro genome-wide Perturb-seq, revealing regionally-defined proliferation modes in neural progenitors and the effect of gene knockdowns on cell cycle speed. Ultimately, VeloCycle expands the scRNA-seq analysis toolkit with a modular and statistically rigorous RNA velocity inference framework.
- Specialized signaling centers direct cell fate and spatial organization in a limb organoid modelEvangelia Skoufa , Jixing Zhong , Oliver Kahre , and 12 more authorsbioRxiv, 2024
Specialized signaling centers orchestrate robust development and regeneration. Limb morphogenesis, for instance, requires interactions between the mesoderm and the signaling center apical-ectodermal ridge (AER), whose properties and role in cell fate decisions have remained challenging to dissect. To tackle this, we developed mouse embryonic stem cells (mESCs)-based heterogeneous cultures and a limb organoid model, termed budoids, comprising cells with AER, surface ectoderm, and mesoderm properties. mESCs were first induced into heterogeneous cultures that self-organized into domes in 2D. Aggregating these cultures resulted in formation of limb bud-like structures in 3D, exhibiting chondrogenesis-based symmetry breaking and elongation. Using our organoids and quantitative in situ expression profiling, we uncovered that AER-like cells support nearby limb mesoderm and fibroblast identities while enhancing tissue polarization that permits distant cartilage formation. Together, our findings provide a powerful model to study aspects of limb morphogenesis, and reveal the ability of signaling center AER cells to concurrently modulate cell fate and spatial organization.
2023
- Neural ADMIXTURE for rapid genomic clusteringAlbert Dominguez Mantes , Daniel Mas Montserrat , Carlos D. Bustamante , and 2 more authorsNature Computational Science, 2023
Characterizing the genetic structure of large cohorts has become increasingly important as genetic studies extend to massive, increasingly diverse biobanks. Popular methods decompose individual genomes into fractional cluster assignments with each cluster representing a vector of DNA variant frequencies. However, with rapidly increasing biobank sizes, these methods have become computationally intractable. Here we present Neural ADMIXTURE, a neural network autoencoder that follows the same modeling assumptions as the current standard algorithm, ADMIXTURE, while reducing the compute time by orders of magnitude surpassing even the fastest alternatives. One month of continuous compute using ADMIXTURE can be reduced to just hours with Neural ADMIXTURE. A multi-head approach allows Neural ADMIXTURE to offer even further acceleration by computing multiple cluster numbers in a single run. Furthermore, the models can be stored, allowing cluster assignment to be performed on new data in linear time without needing to share the training samples.
2022
- Archetypal Analysis for population geneticsJulia Gimbernat-Mayol , Albert Dominguez Mantes , Carlos D. Bustamante , and 2 more authorsPLOS Computational Biology, Aug 2022
The estimation of genetic clusters using genomic data has application from genome-wide association studies (GWAS) to demographic history to polygenic risk scores (PRS) and is expected to play an important role in the analyses of increasingly diverse, large-scale cohorts. However, existing methods are computationally-intensive, prohibitively so in the case of nationwide biobanks. Here we explore Archetypal Analysis as an efficient, unsupervised approach for identifying genetic clusters and for associating individuals with them. Such unsupervised approaches help avoid conflating socially constructed ethnic labels with genetic clusters by eliminating the need for exogenous training labels. We show that Archetypal Analysis yields similar cluster structure to existing unsupervised methods such as ADMIXTURE and provides interpretative advantages. More importantly, we show that since Archetypal Analysis can be used with lower-dimensional representations of genetic data, significant reductions in computational time and memory requirements are possible. When Archetypal Analysis is run in such a fashion, it takes several orders of magnitude less compute time than the current standard, ADMIXTURE. Finally, we demonstrate uses ranging across datasets from humans to canids.