Publications
Publications by categories in reversed chronological order. This list was generated by jekyll-scholar.
2025
- Spotiflow: accurate and efficient spot detection for fluorescence microscopy with deep stereographic flow regressionAlbert Dominguez Mantes , Antonio Herrera , Irina Khven , and 14 more authorsNature Methods, 2025
Identification of spot-like structures in large, noisy microscopy images is a crucial step for many life-science applications. Imaging-based spatial transcriptomics (iST), in particular, relies on the precise detection of millions of transcripts in low signal-to-noise images. Despite recent advances in computer vision, most of the currently used spot detection techniques are still based on classical signal processing and require tedious manual tuning per dataset. Here we introduce Spotiflow, a deep learning method for subpixel-accurate spot detection that formulates spot detection as a multiscale heatmap and stereographic flow regression problem. Spotiflow supports 2D and 3D images, generalizes across different imaging conditions and is more time and memory efficient than existing methods. We show the efficacy of Spotiflow by extensive quantitative experiments on diverse datasets and demonstrate that its increased accuracy leads to meaningful improvements in biological insights obtained from iST and live imaging experiments. Spotiflow is available as an easy-to-use Python library as well as a napari plugin at https://github.com/weigertlab/spotiflow.
2024
- Unified Mass Imaging Maps the Lipidome of Vertebrate DevelopmentHalima Hannah Schede , Leila Haj Abdullah Alieh , Laurel Ann Rohde , and 12 more authorsbioRxiv, 2024
Embryo development entails the formation of anatomical structures with distinct biochemical compositions. Compared with the wealth of knowledge on gene regulation, our understanding of metabolic programs operating during embryogenesis is limited. Mass spectrometry imaging (MSI) has the potential to map the distribution of metabolites across embryo development. Here, we established an analytical framework for the joint analysis of large MSI datasets that allows for the construction of multi-dimensional metabolomic atlases. Employing this framework, we mapped the 4D distribution of over a hundred lipids at quasi-single-cell resolution in Danio rerio embryos. We discovered metabolic trajectories that unfold in concert with morphogenesis and revealed spatially organized biochemical coordination overlooked by bulk measurements. Interestingly, lipid mapping revealed unexpected distributions of sphingolipid and triglyceride species, suggesting their involvement in pattern establishment and organ development. Our approach empowers a new generation of whole-organism metabolomic atlases and enables the discovery of spatially organized metabolic circuits.Competing Interest StatementThe authors have declared no competing interest.
- Statistical inference with a manifold-constrained RNA velocity model uncovers cell cycle speed modulationsAlex R. Lederer , Maxine Leonardi , Lorenzo Talamanca , and 11 more authorsNature Methods, 2024
Across biological systems, cells undergo coordinated changes in gene expression, resulting in transcriptome dynamics that unfold within a low-dimensional manifold. While low-dimensional dynamics can be extracted using RNA velocity, these algorithms can be fragile and rely on heuristics lacking statistical control. Moreover, the estimated vector field is not dynamically consistent with the traversed gene expression manifold. To address these challenges, we introduce a Bayesian model of RNA velocity that couples velocity field and manifold estimation in a reformulated, unified framework, identifying the parameters of an explicit dynamical system. Focusing on the cell cycle, we implement VeloCycle to study gene regulation dynamics on one-dimensional periodic manifolds and validate its ability to infer cell cycle periods using live imaging. We also apply VeloCycle to reveal speed differences in regionally defined progenitors and Perturb-seq gene knockdowns. Overall, VeloCycle expands the single-cell RNA sequencing analysis toolkit with a modular and statistically consistent RNA velocity inference framework.
- Specialized signaling centers direct cell fate and spatial organization in a limb organoid modelEvangelia Skoufa , Jixing Zhong , Oliver Kahre , and 12 more authorsbioRxiv, 2024
Specialized signaling centers orchestrate robust development and regeneration. Limb morphogenesis, for instance, requires interactions between the mesoderm and the signaling center apical-ectodermal ridge (AER), whose properties and role in cell fate decisions have remained challenging to dissect. To tackle this, we developed mouse embryonic stem cells (mESCs)-based heterogeneous cultures and a limb organoid model, termed budoids, comprising cells with AER, surface ectoderm, and mesoderm properties. mESCs were first induced into heterogeneous cultures that self-organized into domes in 2D. Aggregating these cultures resulted in formation of limb bud-like structures in 3D, exhibiting chondrogenesis-based symmetry breaking and elongation. Using our organoids and quantitative in situ expression profiling, we uncovered that AER-like cells support nearby limb mesoderm and fibroblast identities while enhancing tissue polarization that permits distant cartilage formation. Together, our findings provide a powerful model to study aspects of limb morphogenesis, and reveal the ability of signaling center AER cells to concurrently modulate cell fate and spatial organization.
2023
- Neural ADMIXTURE for rapid genomic clusteringAlbert Dominguez Mantes , Daniel Mas Montserrat , Carlos D. Bustamante , and 2 more authorsNature Computational Science, 2023
Characterizing the genetic structure of large cohorts has become increasingly important as genetic studies extend to massive, increasingly diverse biobanks. Popular methods decompose individual genomes into fractional cluster assignments with each cluster representing a vector of DNA variant frequencies. However, with rapidly increasing biobank sizes, these methods have become computationally intractable. Here we present Neural ADMIXTURE, a neural network autoencoder that follows the same modeling assumptions as the current standard algorithm, ADMIXTURE, while reducing the compute time by orders of magnitude surpassing even the fastest alternatives. One month of continuous compute using ADMIXTURE can be reduced to just hours with Neural ADMIXTURE. A multi-head approach allows Neural ADMIXTURE to offer even further acceleration by computing multiple cluster numbers in a single run. Furthermore, the models can be stored, allowing cluster assignment to be performed on new data in linear time without needing to share the training samples.
2022
- Archetypal Analysis for population geneticsJulia Gimbernat-Mayol , Albert Dominguez Mantes , Carlos D. Bustamante , and 2 more authorsPLOS Computational Biology, Aug 2022
The estimation of genetic clusters using genomic data has application from genome-wide association studies (GWAS) to demographic history to polygenic risk scores (PRS) and is expected to play an important role in the analyses of increasingly diverse, large-scale cohorts. However, existing methods are computationally-intensive, prohibitively so in the case of nationwide biobanks. Here we explore Archetypal Analysis as an efficient, unsupervised approach for identifying genetic clusters and for associating individuals with them. Such unsupervised approaches help avoid conflating socially constructed ethnic labels with genetic clusters by eliminating the need for exogenous training labels. We show that Archetypal Analysis yields similar cluster structure to existing unsupervised methods such as ADMIXTURE and provides interpretative advantages. More importantly, we show that since Archetypal Analysis can be used with lower-dimensional representations of genetic data, significant reductions in computational time and memory requirements are possible. When Archetypal Analysis is run in such a fashion, it takes several orders of magnitude less compute time than the current standard, ADMIXTURE. Finally, we demonstrate uses ranging across datasets from humans to canids.