Data Science Seminar
Hosted by the Department of Mathematics and Statistics
Abstract
Biological and medical data increasingly have high dimensionality and complicated
structures. Cluster analysis is a common exploratory analysis technique for identifying
the underlying latent classes in a dataset. For many diseases, the underlying
pathophysiology is complex, and heterogeneity in the natural history of the disease is
common. Clustering the trajectories of disease progression can be helpful in
understanding observed variability. In this talk, I will discuss the applications and
evaluation of cluster analysis for identifying heterogeneity in longitudinal trajectories
using a latent class mixed effect model (LCMM) approach. I will start with two examples
of applying LCMM to identify latent patterns in trajectories: a) We have successfully
applied LCMM and identified clinically meaningful subgroups in the disease progression
of Duchenne muscular dystrophy; b) I will present preliminary results from our current
study on the heterogeneity in trajectories of cognitive status, measured by a more
complex outcome set, in participants from the community-based Framingham Heart
Study. Then, I will show how to use simulation studies to evaluate the ability of LCMMs
to accurately classify individuals for a wide range of noise and variability in trajectories
and to provide guidelines for model specification when using this technique.
Biography of the speaker: Dr. Fang joined the Department of Pharmaceutical Sciences at Binghamton University in 2023. She received her Ph.D. in Mathematics and Statistics with a focus on Statistics from Binghamton University Department of Mathematical Sciences under the guidance of Dr. Sanjeena Dang. Her Ph.D. research mainly focused on developing novel unsupervised learning algorithms for non-standard data types that are usually encountered in biomedical research. Following her graduate studies, Yuan joined the group of Drs. Kathryn Lunetta and Joanne Murabito at Boston University School of Public Health as a postdoctoral associate. Her postdoc research focused on applying statistical tools to investigate the association between circulating immune cell phenotypes in the pro-inflammatory and regulatory pathways with cognitive decline, dementia, and Alzheimer’s disease. She has also been working on quantifying the heterogeneity in decline trajectories of cognitive functions in the Framingham Heart Study participants. Yuan’s current research focuses on studying lipid profiles for ceramide pathways in boys with Duchenne Muscular Dystrophy using multi-omics statistical and bioinformatics approaches. She is also interested in extending existing models and statistical approaches to clustering omics data and longitudinal data.