Sneha Goenka

I am thrilled to announce that starting Jan 2025, I will be joining Princeton University as a tenure-track Assistant Professor in the ECE department! I am actively looking for graduate and undergraduate students to join my lab.

I am a Postdoctoral Research Scholar at Stanford Medicine with Prof. Euan Ashley. My research interests span computer systems architecture and computational genomics. I am interested in developing systems and methods for accelerated and more efficient genomic pipelines aimed towards clinical and research applications. I completed my Ph.D. (Specialized hardware-software systems for high-performance evolutionary and clinical genomics) in the Electrical Engineering department at Stanford University where I was advised by Prof. Mark Horowitz and collaborated with Prof. Euan Ashley and Prof. Benedict Paten.

I have led the computational team for the world's fastest genome diagnosis technique. I worked as an intern with the Architecture Research Group at NVIDIA Research in the summer of 2022 and the Hardware engineering group at D. E. Shaw Research in the summer of 2018.

I am a 2023 Forbes 30 Under 30 honoree in the Science category, a recipient of the 2022 NVIDIA graduate fellowship and the 2017 Barratt and Oakley Family fellowship. I am a 2021 Cadence Women in Technology scholar, 2019 AnitaBorg Grace Hopper student scholar, and part of the 2021 CRA-WP Grad Cohort for Women.

Previously, I received a Dual Degree (B. Tech. and M. Tech.) in Electrical Engineering from the Indian Institute of Technology, Bombay in 2017, along with the Akshay Dhoke Memorial Award. At IIT Bombay, I worked on my Master's thesis in the High-Performance Computing lab, advised by Prof. Sachin Patkar. I contributed to the Pratham project, the first student satellite project of IIT Bombay, which was launched by the Indian Space Research Organization (ISRO) on September 26, 2016. I was also fortunate to participate in a semester exchange program at the Cooper Union for the Advancement of Science and Art, New York.

I am a trained classical dancer with a Master's in Dance in Bharat Natyam from the Art Society, Mumbai.

goenka[at]princeton[dot]edu

gsneha[at]stanford[dot]edu

Current Research

Nature Biotechnology paper

New England Journal of Medicine paper

Circulation: Genomic and Precision Medicine paper

Code

Guinness World Record

World's fastest DNA sequencing-based genome diagnosis pipeline

Ultra-rapid nanopore sequencing in a critical care setting

Genetic disease is a major contributor to critical care hospitalization, especially in younger patients. While early genetic diagnosis can guide clinical management, the turnaround time for whole genome based diagnostic testing has traditionally been measured in months. Recent programs in neonatal populations have reduced turnaround time into the range of days and shown that rapid genetic diagnosis enhances patient care and reduces healthcare costs. Yet, most decisions in critical care need to be made on hourly timescales.

We developed a whole genome sequencing approach designed to provide a genetic diagnosis within 8 hours. Optimized, highly parallel nanopore sequencing was coupled to a high-performance cloud compute system to implement near real-time basecalling and alignment, followed by variant calling accelerated on both central and graphics processing units (CPUs and GPUs). A custom scheme for variant prioritization took only minutes to rank the variants most likely to be deleterious, allowing efficient manual review and classification according to American College of Medical Genetics and Genomics guidelines. We performed whole genome sequencing on 12 patients from the critical care units of Stanford hospitals. A pathogenic or likely pathogenic variant was identified in five out of 12 patients (42%).

SC'20 paper

Code

A Scalable GPU-based whole genome aligner

Pairwise Whole Genome Alignment (WGA) is a crucial first step to understanding evolution at the DNA sequence level. Pairwise WGA of the thousands of currently available species genomes could help make biological discoveries; however, computing them for even a fraction of the millions of possible pairs is prohibitive: WGA of a single pair of vertebrate genomes (human-mouse) takes 11 hours on a 96-core Amazon Web Services (AWS) instance (c5.24xlarge). We present SegAlign, a scalable, GPU-accelerated system for computing pairwise WGA. SegAlign is based on the standard seed-filter-extend heuristic, in which the filtering stage dominates the runtime (e.g. 98% for human-mouse WGA) and is accelerated using GPU(s). Using three vertebrate genome pairs, we show that SegAlign provides a speedup of up to 14x on an 8-GPU, 64-core AWS instance (p3.16xlarge) for WGA and a nearly 2.3x reduction in dollar cost. SegAlign also allows parallelization over multiple GPU nodes and scales efficiently.
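For readers unfamiliar with the seed-filter-extend heuristic mentioned above, here is a minimal CPU-only sketch of its seed and filter stages. It is not SegAlign's implementation: the function names, the exact-match k-mer seeds, the scoring values, the X-drop rule, and the threshold are all assumptions chosen for illustration, and the final gapped extend stage is omitted.

```python
from collections import defaultdict


def build_seed_index(target, k):
    """Map every k-mer in the target to the positions where it starts."""
    index = defaultdict(list)
    for i in range(len(target) - k + 1):
        index[target[i:i + k]].append(i)
    return index


def best_gain(pairs, match, mismatch, xdrop):
    """Best cumulative score over a run of base pairs, with X-drop termination."""
    best = cur = 0
    for a, b in pairs:
        cur += match if a == b else mismatch
        if cur > best:
            best = cur
        elif best - cur > xdrop:
            break  # score fell too far below the best seen so far
    return best


def ungapped_filter_score(query, target, q, t, k, match=1, mismatch=-1, xdrop=3):
    """Score a seed hit at (q, t) by extending it along its diagonal, no gaps."""
    right = best_gain(zip(query[q + k:], target[t + k:]), match, mismatch, xdrop)
    left = best_gain(zip(reversed(query[:q]), reversed(target[:t])),
                     match, mismatch, xdrop)
    return k * match + left + right


def filter_seed_hits(query, target, k, threshold):
    """Seed stage followed by filter stage: keep hits scoring >= threshold."""
    index = build_seed_index(target, k)
    survivors = []
    for q in range(len(query) - k + 1):
        for t in index.get(query[q:q + k], []):
            score = ungapped_filter_score(query, target, q, t, k)
            if score >= threshold:
                survivors.append((q, t, score))
    return survivors


# Toy sequences (hypothetical): both contain ACGTACGT starting at position 3.
target = "GGGACGTACGTCCC"
query = "TTTACGTACGTAAA"
hits = filter_seed_hits(query, target, k=4, threshold=8)
```

In this toy input every seed hit is scored independently of the others, which is the property that makes the filter stage a natural fit for parallel hardware; only the hits that survive the threshold would be passed on to the more expensive extend stage.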

HPCA '19 paper

Code

A more-accurate co-processor for whole genome alignments with high speedup

Whole genome alignment (WGA) is an indispensable tool in comparative genomics to study how different life forms have been shaped by evolution at the molecular level. Existing software whole genome aligners require several CPU weeks to compare a pair of mammalian genomes and still miss several biologically-meaningful, high-scoring alignment regions. These aligners are based on the seed-filter-and-extend paradigm with an ungapped filtering stage. Ungapped filtering is responsible for the low sensitivity of these aligners but is used because it is 200x faster than performing gapped alignment, using dynamic programming, in software. We show that replacing ungapped filtering with gapped filtering increases the number of matching base pairs in alignments by up to 3x. Our accelerator, Darwin-WGA, is the first hardware accelerator for whole genome alignment and accelerates the gapped filtering stage. Implemented on an FPGA, Darwin-WGA provides up to 24x improvement (performance/$) in WGA over iso-sensitive software. An ASIC implementation of the proposed architecture on TSMC 40nm technology achieves up to 10x performance/watt improvement on whole genome alignments over state-of-the-art software at higher sensitivity, and up to 1,500x performance/watt improvement compared to iso-sensitive software.
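The sensitivity gap between ungapped and gapped filtering described above can be illustrated with a small sketch. This is not the Darwin-WGA design: plain Smith-Waterman local alignment with a linear gap penalty is used here as a stand-in for gapped filtering, and the function names, scoring values, and toy sequences are assumptions made for illustration.

```python
def ungapped_score(a, b, match=1, mismatch=-1):
    """Best-scoring contiguous run along a single diagonal (no gaps allowed)."""
    best = cur = 0
    for x, y in zip(a, b):
        cur = max(0, cur + (match if x == y else mismatch))
        best = max(best, cur)
    return best


def gapped_score(a, b, match=1, mismatch=-1, gap=-2):
    """Smith-Waterman local alignment score with a linear gap penalty."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: never let a cell drop below zero.
            cur[j] = max(0, diag, prev[j] + gap, cur[j - 1] + gap)
            best = max(best, cur[j])
        prev = cur
    return best


# Toy example (hypothetical): seq_b is seq_a with a single base deleted.
seq_a = "ACGGTCATTGCA"
seq_b = seq_a[:6] + seq_a[7:]

ungapped = ungapped_score(seq_a, seq_b)  # stalls at the deletion
gapped = gapped_score(seq_a, seq_b)      # pays one gap and keeps matching
```

A single deleted base shifts the rest of the sequence onto a neighboring diagonal, so the ungapped score stops growing at that point, while the gapped score pays one gap penalty and continues. With a fixed score threshold, the first filter could discard a region that the second would keep, at the cost of the quadratic dynamic-programming work that motivates hardware acceleration.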