I am a Professor in the Department of Biostatistics and Medical Informatics and the Department of Population Health Sciences in the School of Medicine and Public Health at the University of Wisconsin-Madison. I have an affiliate appointment in the Department of Statistics.
I grew up in Duluth, Minnesota. I graduated from East High School in 1988. I received a BA in Mathematics and Economics in 1992 from the University of Minnesota-Duluth and an MS in Statistics in 1994 and a PhD in Statistics with emphasis in Biostatistics in 1998 from the University of Wisconsin-Madison. My PhD advisor was Murray Clayton.
I was a research scientist in the Statistical Data Analysis Center(SDAC) in the Department of Biostatistics and Medical Informatics at the University of Wisconsin-Madison, 1998-2005. I joined the faculty with a joint appointment in the Department of Biostatistics and Medical Informatics and the Department of Population Health Sciences in 2005.
I am an applied biostatistician focusing on problems in clinical and epidemiologic research. Current methodologic areas of interest include (1) multi-state models for incidence, progression and regression of ocular (and other) diseases, (2) small area estimation problems, particularly ranking, (3) spatial and spatio-temporal modeling, particularly cluster detection and high-dimensional variable selection and (4) age-period-cohort modeling.
Outside of the office, I’m an avid cyclist. You can check out my recent rides on Strava. I’m also a big movie fan. You’ll definitely see me at the Wisconsin Film Festival, one of my favorite events every year, and you can see lists of my favorite films and what I’ve been watching recently on Letterboxd.
PhD in Statistics (emphasis in Biostatistics), 1998
University of Wisconsin-Madison
MS in Statistics, 1994
University of Wisconsin-Madison
BA in Mathematics and Economics, 1992
University of Minnesota-Duluth
Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatiotemporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.
Introduction: There are significant disparities in the rates of maternal and infant morbidity and mortality in the United States – a discrepancy of particular importance in Wisconsin, where Non-Hispanic Black women experience the highest mortality rates in the country. The adverse effects of neighborhood socioeconomic status and geographical distance to obstetrical care outcomes have been demonstrated previously, with poor neighborhood socioeconomic status having been linked to higher rates of preterm births and low birth weight infants, which both increase the risk of neonatal morbidity and mortality. The objective of this study was to investigate the contributions of Area Deprivation Index and geographic location on age-matched birth weight z-scores. Methods: We conducted a retrospective cohort study of all singleton births >22 weeks’ gestation in Dane County, Wisconsin, from January 2016 through June 2018. Generalized additive models were adjusted for race/ethnicity, cigarette use, delivery route, pregnancy-related or chronic hypertension, pregestational and gestational diabetes, number of prenatal visits, maternal age, total weight gain, and pre-pregnancy body mass index. Results: There is evidence of an association between birth weight z-score and spatial location (median P value 0.006). With area deprivation, we found no evidence of an association with birth weight z-score (-0.01; 95% CI, -0.03 to 0.01; P = 0.109). Mean birth weight z-scores were lowest (-0.72) in the urban center of Madison, while mean birth weight z-score was highest (0.18) in rural areas near the northeast, southeast, and southwest county borders. We found an effect of race/ethnicity on birth weight. Conclusions: We identified geographic variations in birth weight at a granular level using census block groups and a holistic measure of deprivation, which can inform targeted public health interventions. The lack of evidence of area deprivation on birth outcomes but significant spatial trends demonstrated continued geographic disparities in our health care systems.
Context: Population health rankings can be a catalyst for the improvement of health by drawing attention to areas in need of relative improvement and summarizing complex information in a manner understood by almost everyone. However, ranks also have unintended consequences, such as being interpreted as “hard truths”, where variations may not be significant. There is a need to improve communication about uncertainty in ranks, with accurate interpretation. The most common solutions discussed in the literature have included modeling approaches to minimize statistical noise or borrow strength from covariates. However, the use of complex models can limit communication and implementation, especially for broad audiences. Objectives: Explore data-informed grouping (cluster analysis) as an easier-to-understand, empirical technique to account for rank imprecision that can be effectively communicated both numerically and visually. Design: Cluster analysis, specifically k-means clustering with Wasserstein (earth mover’s) distance, was explored as an approach to identify natural and meaningful groupings and gaps in the data distribution for the County Health Rankings’ (CHR) health outcomes ranks. Setting: County-level health outcomes from the 2022 CHR Participants: 3,082 counties that were ranked in the 2022 CHR Main Outcome Measure: Data-informed health groups Results: Cluster analysis identified 30 health groupings among counties nationwide, with cluster size ranging from nine to 184 counties. On average, states had 16 identified clusters, ranging from 3 in Delaware and Hawaii to 27 in Virginia. Number of clusters per state was associated with number of counties per state and population of the state. The method helped address many of the issues that arise from providing rank estimates alone. Conclusions: Public health practitioners can use this information to understand uncertainty in ranks, visualize distances between county ranks, have context around which counties are not meaningfully different from one another, and compare county performance to peer counties.
Prior research has shown that cancer risk varies by geography, but scan statistic methods for identifying cancer clusters in case-control studies have been limited in their ability to identify multiple clusters and adjust for participant-level risk factors. We develop a method to identify geographic patterns of breast cancer odds using the Wisconsin Women’s Health Study, a series of 5 population-based case-control studies of female Wisconsin residents aged 20-79 enrolled in 1988-2004 (cases=16,076, controls=16,795). We create sets of potential clusters by overlaying a 1 km grid over each county-neighborhood and enumerating a series of overlapping circles. Using a two-step approach, we fi=t a penalized binomial regression model to the number of cases and trials in each grid cell, penalizing all potential clusters by the least absolute shrinkage and selection operator (Lasso). We use BIC to select the number of clusters, which are included in a participant-level logistic regression model. We identify 15 geographic clusters, resulting in 23 areas of unique geographic odds ratios. After adjustment for known risk factors, confidence intervals narrowed but breast cancer odds ratios did not meaningfully change; one additional hotspot was identified. By considering multiple overlapping spatial clusters simultaneously, we discern gradients of spatial odds across Wisconsin.
Spatial and spatio-temporal cluster detection are important tools in public health and many other areas of application. Cluster detection can be approached as a multiple testing problem, typically using a space and time scan statistic. We recast the spatial and spatio-temporal cluster detection problem in a high-dimensional data analytical framework with Poisson or quasi-Poisson regression with the Lasso penalty. We adopt a fast and computationally-efficient method using a novel sparse matrix representation of the effects of potential clusters. The number of clusters and tuning parameters are selected based on (quasi-)information criteria. We evaluate the performance of our proposed method including the false positive detection rate and power using a simulation study. Application of the method is illustrated using breast cancer incidence data from three prefectures in Japan.
Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatiotemporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.
Comprehensive plans can promote healthy eating and active living (HEAL). Using a validated scorecard, we assessed HEAL-promoting components in 116 Wisconsin comprehensive plans. Few plans explicitly address healthy food access or public health. Higher HEAL scores are positively associated with population size, recent plan adoption, a consultant plan author, Democratic voting, and whether “housing and transit” is a designated local health priority. Our findings show that, in Wisconsin, municipal comprehensive plans promote HEAL in a limited and aspirational way, often without actionable policies. Strategies to improve HEAL-oriented planning practice include partnering with public health departments and additional training for planners.
Purpose For cancer survivors, self-efficacy is needed to manage the disease and the effects of treatment. The COVID-19 pandemic disrupted cancer-related healthcare, which may have impacted self-management self-efficacy. We investigated self-efficacy reported by cancer survivors during COVID-19, including associations with healthcare disruptions, distress, and general health. Methods Between 2020 and 2021, 1902 individuals aged 18–80 years with a recent cancer diagnosis completed a survey regarding the effects of COVID-19 on healthcare, self-efficacy for managing cancer and social interactions, cancer-related distress, and perceived general health. Linear and logistic models estimated odds ratios and 95% confidence intervals (CIs) between self-efficacy scores, healthcare disruptions, significant distress, and general health. Results Mean self-efficacy for managing cancer was 7.58 out of 10. Greater self-efficacy was associated with lower odds for distress (OR 0.18 [95% CI 0.13–0.26], quartile 4 vs. 1) and for worse general health (0.05 [0.03–0.09]). Participants with disruptions to cancer-related healthcare had lower self-efficacy for managing cancer compared to those without (6.62 vs. 7.09, respectively, P < 0.001) and higher odds for distress (1.70 [1.36–2.14]), but not worse general health (1.13 [0.39–1.44]). Lower self-efficacy mediated 27% of the association between healthcare disruptions and increased distress (15–47%). Associations with self-efficacy for managing social interactions trended in the same direction. Conclusions During COVID-19, disruptions to cancer-related healthcare were associated with lower self-efficacy, increased distress, and worse general health. Psychosocial interventions designed to overcome barriers and target self-efficacy may be important for enhancing outcomes among cancer survivors experiencing disruptions in healthcare access.
Background Many older adults rely on private vehicles for their mobility and may continue to drive when they are advised to stop. Walking and public transit can fulfill mobility needs in some contexts, but in the U.S. these options may not adequately substitute for driving when older adults reduce or stop driving. We examined whether baseline walking or taking public transit was associated with reductions in older adults’ driving after a three-year period in the United States. Methods We analyzed National Health and Aging Trends Study data from community-dwelling older drivers in 2015 (n = 4574). We used weighted logistic regression to estimate associations between older drivers’ walking and use of public transit in 2015 and changes in their driving behavior three years later—avoiding more driving conditions, driving less often, or not driving at all. We also examined associations between neighborhood walkability and driving behavior change three years later. Results There were no statistically significant associations between walking or taking public transit in 2015 and the adjusted odds of driving behavior change three years later. However, older drivers living in the most walkable neighborhoods in 2015 had greater adjusted odds of avoiding more driving conditions compared to those in the least walkable neighborhoods (adjusted Odds Ratio (aOR) = 1.66; 95 % Confidence Interval (95 % CI): 1.23-2.25). Living in the most walkable neighborhoods compared to the least walkable neighborhoods was also associated with an increased odds of no longer driving in 2018 (aOR = 1.56; 95 % CI: 1.04–2.36). Discussion The walkability of one’s neighborhood area—shorter distances between blocks, diverse land uses, and proximity to transit stops—is associated with driving behavior changes over time for older drivers. This work can inform programs and policies designed to connect older adults with alternative transportation options to driving.