Friday, November 13, 2020 from Noon to 12:50pm
Dr. Michael Wu, Ph.D.
Public Health Sciences Division
Fred Hutchinson Cancer Research Center
Microbiome profiling studies of hundreds to thousands of individuals are being conducted within existing epidemiologic cohorts. Analysis of data from these studies offer comprehensive identification of bacterial taxa related to a plethora of health outcomes. However, the key characteristics of these studies (e.g. large sample size) also induces serious statistical challenges, particularly in combination with the difficulties inherent to microbiome data (e.g. high-dimensionality, sparsity, compositionality). Some challenges include accommodating batch effects and robustly identifying taxa related to outcomes. To address these problems, we propose novel batch correction and individual-taxon analysis frameworks. Our work is based on using two-part zero-inflated quantile regression which make minimal distributional assumptions while accommodating the zero-inflated nature of the data. We illustrate our work through simulations and application to data from a number of large-scale microbiome studies including the CARDIA cohort.