Student Theses and Dissertations

Date of Award

2025

Document Type

Thesis

Degree Name

Doctor of Philosophy (PhD)

Thesis Advisor

A. James Hudspeth

Keywords

gene regulation, pseudotime, deep learning, single-cell RNA sequencing (scRNA-seq), hair cell differentiation, regeneration

Abstract

Identifying the causal interactions between genes and their proteins during the differentiation of specialized cells such as mechanosensory hair cells in vertebrates' inner ears and fishes' lateral lines requires an accurate description of the time-lagged relationships between transcription factors and their target genes. Here I describe Depicting Lagged Causality (DELAY), a convolutional neural network for the inference of gene-regulatory relationships across pseudotime-ordered single-cell trajectories. I first show that combining supervised deep learning with joint probability matrices of pseudotime-lagged trajectories allows the neural network to overcome important limitations of ordinary Granger causality-based methods, for example, the inability to infer cyclic relationships such as feedback loops. The algorithm outperforms several common methods for inferring gene regulation and, when given partial ground-truth labels, predicts novel gene-regulatory networks from single-cell RNA sequencing and single-cell ATAC sequencing data sets. To validate this approach, I use DELAY to identify important genes and modules in the regulatory network for auditory hair cell development in the murine inner ear, as well as likely DNA-binding partners for two hair cell cofactors (Hist1h1c and Ccnd1) and a novel DNA-binding sequence for the transcription factor Fiz1. In zebrafish, lateral-line neuromasts can regenerate damaged hair cells by expressing genes such as atoh1a—the master regulator of hair cell fate—in progenitors known as supporting cells. To identify adaptations that promote the rapid regeneration of hair cells in larval zebrafish, I also use DELAY to infer regenerating neuromasts' early gene-regulatory network. The central hub in the network, Y-box binding protein 1 (ybx1), is highly expressed in hair cell progenitors and young hair cells and its protein can recognize binding sites in the candidate regeneration-responsive promoter element for atoh1a. I show that neuromasts from ybx1 mutant zebrafish larvae display consistent, regeneration-specific deficits in hair cell number and initiate both hair cell regeneration and atoh1a expression 20% slower than in siblings. By demonstrating that ybx1 promotes rapid hair cell regeneration in neuromasts through early atoh1a upregulation, these results strongly support DELAY's ability to identify key regulators of gene expression dynamics. I provide a user-friendly implementation of DELAY under an open-source license at https://github.com/calebclayreagor/DELAY.

Comments

A Thesis Presented to the Faculty of The Rockefeller University in Partial Fulfillment of the Requirements for the degree of Doctor of Philosophy

License and Reuse Information

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 International License.

Included in

Life Sciences Commons

Share

COinS