Statistics Seminar
Department of Mathematical Sciences

DATE:Thursday, Month 24, 2022
TIME:1:15pm – 2:15pm
LOCATION:Zoom meeting
SPEAKER:Wenshu Dai, Binghamton University
TITLE:Variational Inference of FMR and FMRC Models for Clustering Microbiome Data


Abstract

Discrete data such as the microbiome taxa count data resulting from 16S rRNA sequencing are routinely encountered in bioinformatics. Taxa count data in microbiome studies are typically high-dimensional, overdispersed, and can only reveal relative abundance, therefore are treated as compositional. Analyzing these data presents challenges because they are restricted on a simplex. Additionally, these microbiome taxa counts are affected by other biological and/or environmental covariates such as age, gender, diet etc. Here, we develop regression-based mixtures of logistic normal multinomial models for clustering microbiome data. These models partition samples into homogeneous subpopulations and allow for investigation of the relationship between bacterial abundance and biological and/or environmental covariates within each inferred group. In this project, we utilize an efficient framework for parameter estimation using variational Gaussian approximations (VGA). The proposed method is illustrated on simulated datasets.