About | Lucy D'Agostino McGowan

Lucy D’Agostino McGowan

Lucy D’Agostino McGowan is an associate professor in the Department of Statistical Sciences at Wake Forest University. She received her PhD in Biostatistics from Vanderbilt University and completed her postdoctoral training at Johns Hopkins University Bloomberg School of Public Health. Her research focuses on causal inference, statistical communication, analytic design theory, and data science pedagogy. Dr. D’Agostino McGowan was the 2023 chair of the American Statistical Association’s Section on Statistical Graphics and can be found blogging at livefreeordichotomize.com, on Twitter @LucyStats, and podcasting on the American Journal of Epidemiology partner podcast, Casual Inference.

Recent Awards

In 2025, Lucy received the Emerging Leader Award from the Committee of Presidents of Statistical Societies
In 2023, Lucy was selected for the Teaching in the Health Sciences Young Investigator Award for her paper Design Principles for Data Analysis
In 2023, Lucy was selected as an ASA StatsForward Fellow

Listen to the Casual Inference Podcast

Recent & Upcoming Talks

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

This talk provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we show that when a confounder has missing data the corresponding imputation model must include all variables used in either the propensity score model or the outcome model, and that these variables must appear in the same functional form as in the final analysis. Violating these conditions can lead to biased treatment eﬀect estimates, even when both components of the doubly robust estimator are correctly specified. We present a mathematical framework for doubly robust estimation combined with multiple imputation, establish the theoretical requirements for proper imputation in this setting, and demonstrate the consequences of misspecification through simulation. Based on these findings, we oﬀer concrete recommendations to ensure valid inference when using multiple imputation with doubly robust methods in applied causal analyses.

This talk demonstrates data extraction from multiple sources using the popular television series Severance as an example. For example, we collected and analyzed elevator sounds predict narrative events, voice recordings underwent cepstral analysis to estimate fundamental frequencies and characterize speaker- specific distributions, with k-nearest neighbors used for classification, and text mining was performed on episode scripts to quantify dialogue patterns. These analyses illustrate how statistical methods can be applied to unconventional data sources from entertainment media.

This talk investigates whether large language models (LLMs) could potentially assist in the creation of “saturated DAGs”, graphical representations that exhaustively map all possible causal pathways in a system. We’ll critically examine if and how LLMs might help identify the full space of plausible causal relationships that traditional approaches may overlook. The presentation will assess the strengths and limitations of prompting LLMs to generate comprehensive causal structures, identify backdoor paths, and navigate complex causal systems.

See all →

Teaching

STA 112 -- WFU Spring 2024

Introduction to Regression and Data Science. Learn to explore, visualize, model, evaluate, and communicate data in a reproducible manner. Gain hands on experience with real data from a variety of disciplines. The course will focus on the statistical computing language R.

Seminar in Mathematical Business Analysis. The main purpose of this seminar is to develop the capability to apply quantitative knowledge to real and ill-defined problems. It tries to bridge the gap between the theory of quantitative decision approaches such as management science/operations research, information systems, and statistics (now mainly collected in the Business Analytics field), with the application of these approaches to the solution of actual business problems.

Causal Inference. From Correlation to Causation. The goal of this course is to give students the skills needed to conduct analyses and communicate results when causality is the goal. Students will learn how to implement causal inference techniques including matching and weighting, evaluate assumptions, and conduct sensitivity analyses.

See all →

Writing

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

This paper provides clear and practical guidance on the specification of imputation models when multiple imputation is used in conjunction with doubly robust estimation methods for causal inference. Through theoretical arguments and targeted simulations, we demonstrate that if a confounder has missing data, the corresponding imputation model must include all variables appearing in either the propensity score model or the outcome model, in addition to both the exposure and the outcome, and that these variables must enter the imputation model in the same functional form as in the final analysis.

Partitioned Local Depth (PaLD) Community Analyses in R

Partitioned Local Depth (PaLD) is a framework for holistic consideration of community structure for distance-based data. This paper describes an R package, pald, for calculating Partitioned Local Depth (PaLD) probabilities, implementing community analyses, determining community clusters, and creating data visualizations to display community structure. We present essentials of the PaLD approach, describe how to use the pald package, walk through several examples, and discuss the method in relation to commonly used techniques.

Quantifying the Alignment of a Data Analysis between Analyst and Audience

See all →

Featured categories

Invited (92) Oral Presentation (60) Peer Reviewed Article (35)

These days I like to discuss

over coffee

Lucy D’Agostino McGowan

Recent Awards

Listen to the Casual Inference Podcast

Recent & Upcoming Talks

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

The Art of Data Refinement: Severance Analyses

Exploring the Potential of Large Language Models in Generating Saturated DAGs for Causal Inference

Teaching

STA 112 -- WFU Spring 2024

BEM 392 -- WFU Spring 2024

STA 779 -- WFU Fall 2023

Writing

The Role of Congeniality in Multiple Imputation for Doubly Robust Causal Estimation

Partitioned Local Depth (PaLD) Community Analyses in R

Quantifying the Alignment of a Data Analysis between Analyst and Audience

Featured categories

Let's Chat

The best way to reach me for a casual chat is via bsky // Media inquiries can be directed here

Learn More