GPU-accelerated generalized linear mixed models for biobank-scale association studies
김선우
27동 220호
0
156
03.30 17:48
| 구분 | 기타 |
|---|---|
| 일정 | 2026-05-08(금) 10:30~12:30 |
| 세미나실 | 27동 220호 |
| 강연자 | 김영대 (UNIST) |
| 담당교수 | 이다빈 |
| 기타 | ACM세미나 |
※ 일시 : 5월 8일(금요일) 10:30AM - 12:00PM
Abstract : Generalized linear mixed models are the statistical foundation of rigorous biobank-scale genetic association studies, enabling null model fitting that accounts for sample relatedness and population stratification across binary and quantitative traits. While their computational cost is substantial for a single genome-wide association study, phenome-wide analysis amplifies this burden further by requiring independent null model fits across thousands of phenotypes, making each individual model solve as fast as possible essential for scientific discovery at scale.
We present a GPU-accelerated framework targeting the dominant computational bottlenecks across the full genome-wide association pipeline: streaming GPU kernels for packed genomic preprocessing, block conjugate gradient for stochastic trace estimation exploiting shared matrix structure, and blocked GPU association testing with hybrid CPU-GPU routing that preserves full statistical validity.
Our contributions yield more than 10x end-to-end speedup on the Million Veteran Program dataset, one of the largest biobank datasets available, with portability validated across multiple GPU architectures.