These days I like to discuss
- Analytic Design Theory
- Statistical Communication
- The Casual Inference Podcast
- Large-scale medical data
- Italian
- Co-founding R-Ladies Nashville
- Disney World
over coffee
Lucy D’Agostino McGowan
Lucy D’Agostino McGowan is an associate professor in the Department of Statistical Sciences at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public Health. Her research focuses on causal inference, statistical communication, analytic design theory, and data science pedagogy. Dr. D’Agostino McGowan was the 2023 chair of the American Statistical Association’s Section on Statistical Graphics and can be found blogging at livefreeordichotomize.com, on Twitter @LucyStats, and podcasting on the American Journal of Epidemiology partner podcast, Casual Inference.
Recent Awards
- In 2025, Lucy received the Emerging Leader Award from the Committee of Presidents of Statistical Societies
- In 2023, Lucy was selected for the Teaching in the Health Sciences Young Investigator Award for her paper Design Principles for Data Analysis
- In 2023, Lucy was selected as an ASA StatsForward Fellow
Listen to the Casual Inference Podcast
Recent & Upcoming Talks
The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation
This talk provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we show that when a confounder has missing data the corresponding imputation model must include all variables used in either the propensity score model or the outcome model, and that these variables must appear in the same functional form as in the final analysis. Violating these conditions can lead to biased treatment effect estimates, even when both components of the doubly robust estimator are correctly specified. We present a mathematical framework for doubly robust estimation combined with multiple imputation, establish the theoretical requirements for proper imputation in this setting, and demonstrate the consequences of misspecification through simulation. Based on these findings, we offer concrete recommendations to ensure valid inference when using multiple imputation with doubly robust methods in applied causal analyses.
Read moreThe Art of Data Refinement: Severance Analyses
This talk demonstrates data extraction from multiple sources using the popular television series Severance as an example. For example, we collected and analyzed elevator sounds predict narrative events, voice recordings underwent cepstral analysis to estimate fundamental frequencies and characterize speaker- specific distributions, with k-nearest neighbors used for classification, and text mining was performed on episode scripts to quantify dialogue patterns. These analyses illustrate how statistical methods can be applied to unconventional data sources from entertainment media.
Read moreExploring the Potential of Large Language Models in Generating Saturated DAGs for Causal Inference
This talk investigates whether large language models (LLMs) could potentially assist in the creation of “saturated DAGs”, graphical representations that exhaustively map all possible causal pathways in a system. We’ll critically examine if and how LLMs might help identify the full space of plausible causal relationships that traditional approaches may overlook. The presentation will assess the strengths and limitations of prompting LLMs to generate comprehensive causal structures, identify backdoor paths, and navigate complex causal systems.
Read moreTeaching
STA 112 -- WFU Spring 2024
Introduction to Regression and Data Science. Learn to explore, visualize, model, evaluate, and communicate data in a reproducible manner. Gain hands on experience with real data from a variety of disciplines. The course will focus on the statistical computing language R.
Read moreBEM 392 -- WFU Spring 2024
Seminar in Mathematical Business Analysis. The main purpose of this seminar is to develop the capability to apply quantitative knowledge to real and ill-defined problems. It tries to bridge the gap between the theory of quantitative decision approaches such as management science/operations research, information systems, and statistics (now mainly collected in the Business Analytics field), with the application of these approaches to the solution of actual business problems.
Read moreSTA 779 -- WFU Fall 2023
Causal Inference. From Correlation to Causation. The goal of this course is to give students the skills needed to conduct analyses and communicate results when causality is the goal. Students will learn how to implement causal inference techniques including matching and weighting, evaluate assumptions, and conduct sensitivity analyses.
Read moreWriting
The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation
This paper provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we demonstrate that if a confounder has missing data, the corresponding imputation model must include all variables appearing in either the propensity score model or the outcome model, in addition to both the exposure and the outcome, and that these variables must enter the imputation model in the same functional form as in the final analysis.
Read morePartitioned Local Depth (PaLD) Community Analyses in R
Partitioned Local Depth (PaLD) is a framework for holistic consideration of community structure for distance-based data. This paper describes an R package, pald, for calculating Partitioned Local Depth (PaLD) probabilities, implementing community analyses, determining community clusters, and creating data visualizations to display community structure. We present essentials of the PaLD approach, describe how to use the pald package, walk through several examples, and discuss the method in relation to commonly used techniques.
Read more