These days I like to discuss
- Analytic Design Theory
- Statistical Communication
- The Casual Inference Podcast
- Large-scale medical data
- Italian
- Co-founding R-Ladies Nashville
- Disney World
over coffee
Lucy D’Agostino McGowan
Lucy D’Agostino McGowan is an assistant professor in the Department of Statistical Sciences at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public Health. Her research focuses on analytic design theory, statistical communication, causal inference, and data science pedagogy. Dr. D’Agostino McGowan is the 2023 chair of the American Statistical Association’s Section on Statistical Graphics and can be found blogging at livefreeordichotomize.com, on Twitter @LucyStats, and podcasting on the American Journal of Epidemiology partner podcast, Casual Inference.
Awards
Lucy was selected for the Teaching in the Health Sciences Young Investigator Award for her paper Design Principles for Data Analysis. She was also selected as an ASA StatsForward Fellow.
Listen to the Casual Inference Podcast
Recent & Upcoming Talks
Evaluating the Alignment of a Data Analysis between Analyst and Audience
A challenge that all data analysts face is building a data analysis that is useful for a given audience. In this talk, we will begin by proposing a set of principles for describing data analyses. We will then introduce a concept that we call the alignment of a data analysis between the data analyst and audience. We define a successfully aligned data analysis as the matching of principles between the analyst and the audience for whom the analysis is developed. We will propose a statistical model and general framework for evaluating the alignment of a data analysis. This framework can be used as a guide for practicing data scientists and students in data science courses for how to build better data analyses.
Read moreCausal Inference in R
This workshop will use the NHANES Epidemiologic Follow-up Study (NHEFS) data. In this workshop, we’ll teach the essential elements of answering causal questions in R through causal diagrams, and causal modeling techniques such as propensity scores and inverse probability weighting.
Read moreWhy You Must Include the Outcome in Your Imputation Model (and Why It's Not Double Dipping)
Handling missing data is a frequent challenge in analyses of health data, and imputation techniques are often employed to address this issue. This talk focuses on scenarios where a covariate with missing values is to be imputed and examines the prevailing recommendation to include the outcome variable in the imputation model. Specifically, we delve into stochastic imputation methods and their effects on accurately estimating the relationship between the imputed covariate and the outcome. Through mathematical proofs and a series of simulations, we demonstrate that incorporating the outcome variable in imputation models is essential for achieving unbiased results with stochastic imputation. Furthermore, we address the concern that this practice constitutes “double dipping” or data dredging. By providing both theoretical and empirical evidence, we show why including the outcome variable is a legitimate and necessary approach rather than a source of bias.
Read moreTeaching
STA 363 -- WFU Spring 2023
Statistical learning. Learn the theory behind cutting edge statistical and machine learning techniques. Gain hands on experience with real data from a variety of disciplines. The course will focus on the statistical computing language R.
Read moreSTA 112 -- WFU Fall 2022
Statistical models. Learn to explore, visualize, model, evaluate, and communicate data in a reproducible manner. Gain hands on experience with real data from a variety of disciplines. The course will focus on the statistical computing language R.
Read moreSTA 379/679 -- WFU Spring 2022
Causal Inference. From Correlation to Causation. The goal of this course is to give students the skills needed to conduct analyses and communicate results when causality is the goal. Students will learn how to implement causal inference techniques including matching and weighting, evaluate assumptions, and conduct sensitivity analyses.
Read moreWriting
Partnering with Authors to Enhance Reproducibility at JASA
The 'Why' behind including 'Y' in your imputation model
Missing data is a common challenge when analyzing epidemiological data, and imputation is often used to address this issue. Here, we investigate the scenario where a covariate used in an analysis has missingness and will be imputed. There are recommendations to include the outcome from the analysis model in the imputation model for missing covariates, but it is not necessarily clear if this recommendation always holds and why this is sometimes true.
Read morePower and sample size calculations for testing the ratio of reproductive values in phylogenetic samples
The quality of the inferences we make from pathogen sequence data is determined by the number and composition of pathogen sequences that make up the sample used to drive that inference. However, there remains limited guidance on how to best structure and power studies when the end goal is phylogenetic inference. One question that we can attempt to answer with molecular data is whether some people are more likely to transmit a pathogen than others.
Read more