Skip to main content

Association between county-level risk groups and COVID-19 outcomes in the United States: a socioecological study



Geographic heterogeneity in COVID-19 outcomes in the United States is well-documented and has been linked with factors at the county level, including sociodemographic and health factors. Whether an integrated measure of place-based risk can classify counties at high risk for COVID-19 outcomes is not known.


We conducted an ecological nationwide analysis of 2,701 US counties from 1/21/20 to 2/17/21. County-level characteristics across multiple domains, including demographic, socioeconomic, healthcare access, physical environment, and health factor prevalence were harmonized and linked from a variety of sources. We performed latent class analysis to identify distinct groups of counties based on multiple sociodemographic, health, and environmental domains and examined the association with COVID-19 cases and deaths per 100,000 population.


Analysis of 25.9 million COVID-19 cases and 481,238 COVID-19 deaths revealed large between-county differences with widespread geographic dispersion, with the gap in cumulative cases and death rates between counties in the 90th and 10th percentile of 6,581 and 291 per 100,000, respectively. Counties from rural areas tended to cluster together compared with urban areas and were further stratified by social determinants of health factors that reflected high and low social vulnerability. Highest rates of cumulative COVID-19 cases (9,557 [2,520]) and deaths (210 [97]) per 100,000 occurred in the cluster comprised of rural disadvantaged counties.


County-level COVID-19 cases and deaths had substantial disparities with heterogeneous geographic spread across the US. The approach to county-level risk characterization used in this study has the potential to provide novel insights into communicable disease patterns and disparities at the local level.

Peer Review reports


Since the first case of the coronavirus disease 2019 (COVID-19) was confirmed in the United States (US) in Snohomish county, Washington, on January 21st, 2020 [1, 2], the US has experienced more cases (99,467 cases per million population) and deaths (585,708) than any other country in the world (as of May 15, 2021) [3]. Clinical and epidemiologic data have identified several modifiable health factors (e.g., obesity [4], hypertension [5], cardiovascular disease [6]) as key risk factors of adverse outcomes such as hospitalization and mortality related to severe COVID-19 infection [7, 8]. However, assessment of individual-level health factors incompletely characterizes risk related to underlying social determinants of health that may contribute to observed differences in COVID-19 outcomes [9, 10].

Understanding how place-based disadvantage has influenced COVID-19 case and death rates is critical for developing strategies to address long-term sequalae of COVID-19 [11] and to target vaccination efforts [12] at the local level. This is especially important given the significant geographic heterogeneity observed in the burden of COVID-19 with large differences in morbidity and mortality among counties within the US [13, 14]. Several studies have focused on single social and environmental factors (e.g., housing overcrowding [15], air pollution [16]) or grouped factors based on existing county-level rankings (e.g., social vulnerability index [17]) to examine differences at the county-level in COVID-19 outcomes. However, susceptibility to COVID-19 infection and mortality is likely due to interrelated biological, social, and environmental factors; to date, no studies have developed a comprehensive measure that integrates place-based factors across multiple domains to classify county-level risk for COVID-19 outcomes.

This study sought to use a novel method for clustering, latent class analysis, to identify meaningful groups of counties based on county-level characteristics across multiple sociodemographic, health, and environmental domains and to examine associations with COVID-19 cases and deaths.


Data Sources

We harmonized and linked a comprehensive set of county-level measures from a variety of resources including the U.S. Census Bureau, County Health Rankings (CHR), Centers for Disease Control and Prevention (CDC) Wide-ranging Online Data for Epidemiologic Research (WONDER) mortality database, Area Health Resources Files (AHRF), and the United States Department of Agriculture (USDA) across all available counties coded by Federal Information Processing Standard county codes for a total of 30 variables (Supplemental Table 1). First, from these various data sources, we obtained relevant measures to characterize counties’ profiles according to the following categories based on pre-pandemic data on demographics, socioeconomic status, health status, healthcare access, and environmental factors. Next, we queried county-level COVID-19 data on 2/18/21 to include all deaths and cases from 1/21/20 to 2/17/21 from The New York Times GitHub where information on cases and fatalities is updated daily ( to link with our integrative county-level index. For a subset of counties, vaccination data (number of first dose and two doses administered) were manually abstracted from county and state department of public health websites by SSK, AEK, LRP, and NBA. All data were deidentified and publicly available, and therefore deemed exempt from Northwestern University’s Institutional Review Board.


Counties were described using metrics relating to residents’ demographics, socioeconomic status, health status, healthcare access, as well as environmental factors (Supplemental Table 1). Demographic factors included proportion of racial/ethnic minority and older residents in addition to population density and urbanization status. Socioeconomic metrics included rates of varying educational levels, poverty, and housing. Health status included quality of life and morbidity and mortality rates. Healthcare access reflected insurance, hospital beds per capita, and primary care physician density. Finally, the environmental factor was a measure of air pollution. The primary outcomes were cumulative deaths and cases per 100,000 population attributed to COVID-19 at the county-level. As an exploratory outcome, COVID-19 vaccine doses administered in a subset of counties was examined.

Statistical Analysis

We included all counties for which we had complete ecological data across county-level metrics that span demographic, socioeconomic status, health status, healthcare access, and environmental factors and at least 5 COVID-19 deaths. We used latent class analysis (LCA) [18] to classify US counties with others who have similar socioecological profiles based on these included metrics. These groupings represent meaningful clusters defined by a combination of all the observed variables by using model-based predicted posterior probabilities. The LCA model incorporated binary and ordinal variables as-is, and continuous variables were categorized as tertiles before inclusion. The model was fit using R package poLCA, which estimates the latent class model by maximizing a log-likelihood function using the EM algorithm. We set the max iterations to 10000. To select the optimal number of classes (between 1-5) for the final LCA model, we imposed the following criteria based on previously published methods: (1) qualitative evaluation of distinct predicted classes or groups, (2) ensuring no group had fewer than 5% of counties, and (3) minimizing the Bayesian information criterion. Model fit statistics for models with 1-5 latent classes (Log-likelihood, Residual DF, BIC, aBIC, cAIC, and Likelihood-Ratio) in Supplemental Table 2 [19]. Conditional probabilities and a classification table for each of the county-level variables for the final model are shown in Supplemental Tables 3 and 4.

For the final identified groups, we described county-level characteristics using means, medians, and proportions, as appropriate, and described crude rates of cumulative COVID-19 cases and deaths per 100,000 population. We additionally identified the ten counties in each group with the highest deaths per 100,000 population to describe dynamic trends in cumulative deaths since the beginning of the pandemic and cumulative cases, deaths, and vaccination rates per 100,000.

In the primary analysis, we calculated the association between each county-level risk group and each outcome (COVID-19 deaths and cases) using linear regression models. We fit two sets of models: 1) unadjusted and 2) adjusted for the time since the first confirmed case in the county, to allow comparison between counties at similar points in their outbreak. Point estimates were interpreted in context of their magnitude and Wald-style 95% confidence interval. We completed a sensitivity analysis including all counties reporting at least one COVID-19 death in order to examine the robustness of our estimates (N=2694 counties). All analyses were completed using R version 3.6.3 or newer.


County-Level Risk Groups

Data on the 30 county-level measures were available for 2,701 US counties. We identified four distinct groups of counties based on demographic, socioeconomic, health status, healthcare access, and environmental profiles (Table 1, Supplemental Figure 1): 1) diverse urban counties with greater social and health assets; 2) rural counties with social and health vulnerabilities; 3) older rural counties with greater social assets; and 4) urban counties with social vulnerabilities. Diverse urban counties had the highest population density (757 [2244]), greatest proportion of at least some college education (71% [7]), lowest rates of unemployment (3% [1]) and percent of children in poverty (11% [4]), and fewest absolute years of potential life lost (5756 [1127] years). By contrast, vulnerable rural counties had the highest proportion of non-Hispanic black residents (17% [19]), lowest proportion of some college education (48% [8]), highest rates of unemployment (5% [1]) and percent of children in poverty (30% [7]), and greatest absolute years of potential life lost (10,768 [2062] years).

Table 1 County-level demographic, socioeconomic, health status, healthcare access, and other factors in each county-level risk group in the United States (N=2701 counties)

Individual county-level metrics included in the latent class modeling were highly correlated with each other (Supplemental Figure 2). For example, favorable social and health assets tracked in counties with lower poverty metrics and higher median income; these counties also had more favorable health metrics such as proportions of smokers, people with obesity, and physically inactive people, as well as fewer years of potential life lost. Conversely, adverse social and health factors and a higher proportion of minority residents were observed in vulnerable counties. These counties also had, on average, lower median income, greater proportion of households with overcrowding, and poorer air pollution rates. Counties within the four groups have widespread geographic dispersion with some regional clustering (Figure 1A). For example, rural counties with social and health vulnerabilities are predominantly, but not exclusively, localized in the southern US.

Fig. 1
figure 1

Nationwide Distribution of (A) County-Level Risk Groups (B) COVID-19 deaths per 100,000 and (C) COVID-19 cases per 100,00 from 1/21/20 to 2/17/21

Nationwide COVID-19 Deaths and Cases

As of February 17th, 2021, 2,621 counties had reported at least 1 confirmed COVID-19 death and 2,701 counties had at least 1 confirmed COVID-19 case to be included in the publicly available dataset hosted by the NY Times. A total of 481,238 COVID-19 deaths and 25.9 million COVID-19 cases were included from 2/29/20 to 2/17/21 in 2701 counties in the US. Large between-county differences were evident with widespread geographic dispersion, with a significant gap in cumulative COVID-19 cases and death rates between counties in the 90th and 10th percentile of 6,581 and 291 per 100,000, respectively. Across US census regions, counties in the Midwest had the greatest cumulative rate of COVID-19 cases (8825) and counties in the Northeast had the greatest cumulative rate of COVID-19 deaths (190) (Figures 1B and C). Yet, there was wide geographic dispersion with counties with high burden of COVID-19 distributed across the US. Qualitatively, counties with high COVID-19 cases also had high COVID-19 deaths.

Association Between County-Level Risk Groups and COVID-19 Outcomes

A significant association was observed between county-level risk groups and COVID-19 deaths and cases per 100,000, even after adjusting for time since first death and case, respectively, within each county (Figure 2, Table 2, Supplemental Table 5). Highest rates of COVID-19 cases (9557 [2520]) and deaths (210 [97]) per 100,000 population occurred in the cluster comprised of rural disadvantaged counties (p<0.05). Rates of COVID-19 deaths were nearly 2-fold higher in the rural disadvantaged cluster compared with the diverse urban counties with greater social assets.

Fig. 2
figure 2

Association between county-level risk group and cumulative COVID-19 (A) deaths and (B) cases per 100,000 population from 1/21/20 to 2/17/21*

Table 2 Summary of Covid-19 cases and deaths stratified by county-level risk groups from January 21st 2020 to February 17th, 2021

Within each county-level risk group, we next identified the ten counties with the highest cumulative COVID-19 deaths per 100,00 population and visualized the trajectory of COVID-19 mortality as a function of time since the 5th reported COVID-19 death in each county (Figure 3). Hancock county, Georgia, in the vulnerable rural county-level risk group experienced the highest burden of COVID-19 deaths (622 per 100,000) while Norton County, Kansas, in the older rural county-level risk group had the highest level of cases (22,357 per 100,000) across the US (Supplemental Table 6). Yet, both county-level risk groups that predominantly included rural counties had the lowest rate of vaccination per 100,00 population compared with the two county-level risk groups predominantly comprised of urban counties (most data available as of 2/23/21). Within and across groups, there was no correlation between COVID-19 case and death rate and vaccination rate (Supplemental Figure 3).

Fig. 3
figure 3

Spatiotemporal patterns of COVID-19 spread in the ten counties with the highest cumulative COVID-19 deaths per 100,000 from 1/21/20 to 2/17/21 within each county-level risk grouping


This study describes a novel, data-driven, classification scheme that identified four distinct county-level risk groups, which were significantly associated with COVID-19 deaths and cases per 100,000 population from January 21, 2020 to February 17, 2021. The county-level risk group clustered by rural setting and social vulnerabilities had the greatest burden of COVID-19 deaths by nearly 2-fold, but the lowest documented rate of vaccination, to-date. This approach identified county-level vulnerability to COVID-19 deaths and cases by integrating characteristics across multiple key domains (sociodemographic, health-related, and environmental) implicated in the risk for COVID-19 infection and severity.

On a population level, targeting prevention of COVID-19 through vaccination, especially with resurgence in areas with novel variants, as well as directing resources to address long-term sequelae or disability due to COVID-19 infection requires identification of communities who have experienced greatest burden of COVID-19 infection and mortality [20, 21]. While there may be multiple mechanisms by which county-level factors may influence COVID-19 outcomes, classifying counties with similar socioecological profiles, which are implicated in COVID-19 infection rate and illness severity, will provide the ability for public health responses targeted to the communities with the greatest need. This is especially important at this stage in the pandemic when novel variants are emerging that may be more infectious or be associated with greater infection severity [22]. Other tools, such as the social vulnerability index (SVI) have been applied and demonstrated significant association between greater social vulnerability and both COVID-19 and non-COVID-19 health outcomes [23]. However, the SVI [24] relies only on demographic variables from the US census and does not integrate health status and healthcare access for county’s residents or environmental measures at the county-level, such as air pollution. Therefore, this current study extends prior reports by developing a comprehensive and data-driven classification scheme to reflect social, health, and environmental vulnerability at the county-level.

These data build upon prior studies that have documented substantial county-level disparities in COVID-19 as well as a large body of literature demonstrating similar patterns in geographic disparities across the US in non-communicable diseases, such as cardiovascular disease [25, 26]. Indeed, COVID-19 infection susceptibility and mortality has further amplified pre-existing place-based health disadvantage due to interrelated sociodemographic [27], health [28], and environmental factors [29] at the county-level and expand growing awareness of a “rural mortality penalty” [30] in the US. Recent investigations have demonstrated widening disparities in rural mortality rates due to noncommunicable diseases, such as chronic lung disease [31] and cardiovascular disease [32], compared with urban mortality rates. This likely reflects convergence of multiple ecological factors related to social determinants of overall health, such as economic prosperity [33] and education [27, 34]. Vulnerability as a result of underlying comorbidities may be one important factor in the observed place-based disparities, but does not account for intrinsic systemic and structural racism within and outside our healthcare system that may also be contributing [35]. Our study incorporates both sociodemographic factors and burden of chronic diseases at the county-level and provides a means to estimate susceptibility and burden of COVID-19 at the population level in hopes of mitigating avoidable health inequalities, especially in counties with limited resources.

Strengths of our study include use of innovative statistical methods to integrate 30 county-level characteristics that may be implicated in COVID-19 transmission and mortality with a latent-class modeling approach [18] that identified groups of counties with similar profiles of ecological risk factors. To accomplish this task, we harmonized and linked numerous publicly available county-level datasets to create a comprehensive classification scheme leveraged available data on county-level COVID-19 outcomes that represent cumulative cases and deaths from >12 months since the beginning of the pandemic.

This analysis has several limitations. First, there is the potential misclassification of COVID-19 cases and mortality due to under-ascertainment of COVID-19. Under-recognition of COVID-19 may have occurred due to limited availability of testing, especially earlier in the pandemic. Prior studies have demonstrated a strong correlation between testing and cases per 100,000 population and seroprevalence studies suggest that up to 10 times more COVID-19 infections occurred than number of cases [36, 37], which may reflect mild illness or no symptoms in addition to under-testing. Therefore, COVID-19 cases and deaths are unlikely to reflect the total burden of COVID-19. This has been demonstrated in analyses of excess deaths at the national level [38]. However, our analysis relies on the most comprehensive estimates of cumulative COVID-19 mortality and cases available at the county-level. Second, use of LCA assigns group membership for each county based on the highest probability, but there may be counties that may be equally likely to belong to another county-level risk group. However, examination of posterior probabilities revealed that no county had a probability <0.7 based on their assigned group membership and LCA facilitated the integration of 30 highly interrelated variables in one classification scheme in an unbiased data-driven approach. Third, our analysis does not account for individual-level characteristics that are associated with COVID-19 infection or disease severity. However, the current study design is ecological, and it is not meant to infer causation; rather it is meant to inform large-scale comparisons at the population-level to communicate with local health departments and leaders to inform decisions at the county-level [39]. Fourth, analysis at a smaller geographic scale, such as the neighborhood (census-tract) level was not possible in this nationwide study. Prior work in New York City [40], Milwaukee [41], and Chicago [42] have identified the neighborhood-level variation within a city in COVID-19 deaths that highlight structural inequities as root causes of these disparities and warrant further investigation. This analysis at the county-level is a meaningful geographic scale for changes in policy and public health response at the state- and national-level that can complement efforts within counties.

In conclusion, this study identified clustering of counties based on a variety of county-level factors that incorporate sociodemographic, health status and access, and environmental characteristics. The county-level risk index was significantly associated with COVID-19 cases and deaths, with the greatest burden for both occurring in the group with rural disadvantaged counties. Place-based health disadvantage for COVID-19 outcomes will require targeted and equitable allocation of resources for vaccination as well as management of long-term sequelae and disability due to COVID-19. The findings from this current study provide unique insights into county-level characteristics that may inform the lack of reserve or resilience at a population-level when challenged by an unexpected stress of a highly infectious communicable disease that carries a significant risk of morbidity and mortality, such as COVID-19.

Availability of data and materials

The datasets used and/or analyzed during the current study are available in the New York Times GitHub repository ( Additional datasets generated for analysis are included in this published article and its supplementary information files.


  1. McMichael TM, Currie DW, Clark S, et al. Epidemiology of Covid-19 in a Long-Term Care Facility in King County, Washington. N Engl J Med. 2020;382(21):2005–11.

    Article  CAS  Google Scholar 

  2. Yang BY, Barnard LM, Emert JM, et al. Clinical Characteristics of Patients With Coronavirus Disease 2019 (COVID-19) Receiving Emergency Medical Services in King County, Washington. JAMA Netw Open. 2020;3(7):e2014549.

    Article  Google Scholar 

  3. Our World in Data. 2020; Accessed 2/28/21, 2021.

  4. Tartof SY, Qian L, Hong V, et al. Obesity and Mortality Among Patients Diagnosed With COVID-19: Results From an Integrated Health Care Organization. Ann Intern Med. 2020;173(10):773–81.

    Article  Google Scholar 

  5. Richardson S, Hirsch JS, Narasimhan M, et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. JAMA. 2020;323(20):2052–9.

    Article  CAS  Google Scholar 

  6. Guo T, Fan Y, Chen M, et al. Cardiovascular Implications of Fatal Outcomes of Patients With Coronavirus Disease 2019 (COVID-19). JAMA Cardiol. 2020;5(7):811–8.

    Article  Google Scholar 

  7. Fisman DN, Bogoch I, Lapointe-Shaw L, McCready J, Tuite AR. Risk Factors Associated With Mortality Among Residents With Coronavirus Disease 2019 (COVID-19) in Long-term Care Facilities in Ontario, Canada. JAMA Netw Open. 2020;3(7):e2015957.

    Article  Google Scholar 

  8. Rosenthal N, Cao Z, Gundrum J, Sianis J, Safo S. Risk Factors Associated With In-Hospital Mortality in a US National Sample of Patients With COVID-19. JAMA Netw Open. 2020;3(12):e2029058.

    Article  Google Scholar 

  9. Adhikari S, Pantaleo NP, Feldman JM, Ogedegbe O, Thorpe L, Troxel AB. Assessment of Community-Level Disparities in Coronavirus Disease 2019 (COVID-19) Infections and Deaths in Large US Metropolitan Areas. JAMA Netw Open. 2020;3(7):e2016938.

    Article  Google Scholar 

  10. Khazanchi R, Evans CT, Marcelin JR. Racism, Not Race, Drives Inequity Across the COVID-19 Continuum. JAMA Netw Open. 2020;3(9):e2019933.

    Article  Google Scholar 

  11. Carfi A, Bernabei R, Landi F. Gemelli Against C-P-ACSG. Persistent Symptoms in Patients After Acute COVID-19. JAMA. 2020;324(6):603–5.

    Article  CAS  Google Scholar 

  12. Moore JP. Approaches for Optimal Use of Different COVID-19 Vaccines: Issues of Viral Variants and Vaccine Efficacy. JAMA. 2021;325(13):1251–2.

  13. Fielding-Miller RK, Sundaram ME, Brouwer K. Social determinants of COVID-19 mortality at the county level. PLoS One. 2020;15(10):e0240151.

    Article  CAS  Google Scholar 

  14. Vahabi N, Salehi M, Duarte JD, Mollalo A, Michailidis G. County-level longitudinal clustering of COVID-19 mortality to incidence ratio in the United States. Sci Rep. 2021;11(1):3088.

    Article  CAS  Google Scholar 

  15. Ahmad K, Erqou S, Shah N, et al. Association of poor housing conditions with COVID-19 incidence and mortality across US counties. PLoS One. 2020;15(11):e0241327.

    Article  CAS  Google Scholar 

  16. Wu X, Nethery RC, Sabath MB, Braun D, Dominici F. Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis. Sci Adv. 2020;6(45):eabd4049.

  17. Karmakar M, Lantz PM, Tipirneni R. Association of Social and Demographic Factors With COVID-19 Incidence and Death Rates in the US. JAMA Netw Open. 2021;4(1):e2036462.

    Article  Google Scholar 

  18. Mori M, Krumholz HM, Allore HG. Using Latent Class Analysis to Identify Hidden Clinical Phenotypes. JAMA. 2020;324(7):700–1.

    Article  Google Scholar 

  19. Weller BE, Bowen NK, Faubert SJ. Latent class analysis: a guide to best practice. J Black Psychol. 2020;46(4):287–311.

    Article  Google Scholar 

  20. Pearce N, Vandenbroucke JP, VanderWeele TJ, Greenland S. Accurate Statistics on COVID-19 Are Essential for Policy Guidance and Decisions. Am J Public Health. 2020:e1–3.

  21. Smith N, Fraser M. Straining the System: Novel Coronavirus (COVID-19) and Preparedness for Concomitant Disasters. Am J Public Health. 2020;110(5):648–9.

    Article  Google Scholar 

  22. Chopra V, Toner E, Waldhorn R, Washer L. How Should U.S. Hospitals Prepare for Coronavirus Disease 2019 (COVID-19)? Ann Intern Med. 2020;172(9):621–2. Epub 2020 Mar 11.

  23. Khazanchi R, Beiter ER, Gondi S, Beckman AL, Bilinski A, Ganguli I. County-Level Association of Social Vulnerability with COVID-19 Cases and Deaths in the USA. J Gen Intern Med. 2020;35(9):2784–7.

    Article  Google Scholar 

  24. Wolkin A, Patterson JR, Harris S, et al. Reducing Public Health Risk During Disasters: Identifying Social Vulnerabilities. J Homel Secur Emerg Manag. 2015;12(4):809–22.

    Article  Google Scholar 

  25. Roth GA, Dwyer-Lindgren L, Bertozzi-Villa A, et al. Trends and Patterns of Geographic Variation in Cardiovascular Mortality Among US Counties, 1980-2014. JAMA. 2017;317(19):1976–92.

    Article  Google Scholar 

  26. Woolf SH, Schoomaker H. Life Expectancy and Mortality Rates in the United States, 1959-2017. JAMA. 2019;322(20):1996–2016.

    Article  Google Scholar 

  27. Chetty R, Stepner M, Abraham S, et al. The Association Between Income and Life Expectancy in the United States, 2001-2014. JAMA. 2016;315(16):1750–66.

    Article  CAS  Google Scholar 

  28. Peto R, Whitlock G, Jha P. Effects of obesity and smoking on U.S. life expectancy. N Engl J Med. 2010;362(9):855–6 author reply 856-857.

    Article  CAS  Google Scholar 

  29. Pope CA 3rd, Ezzati M, Dockery DW. Fine-particulate air pollution and life expectancy in the United States. N Engl J Med. 2009;360(4):376–86.

    Article  CAS  Google Scholar 

  30. Ferdows NB, Aranda MP, Baldwin JA, Baghban Ferdows S, Ahluwalia JS, Kumar A. Assessment of Racial Disparities in Mortality Rates Among Older Adults Living in US Rural vs Urban Counties From 1968 to 2016. JAMA Netw Open. 2020;3(8):e2012241.

    Article  Google Scholar 

  31. Reyfman PA, Khuder B, Pierce JB, et al. Urban-Rural Mortality Disparities from Chronic Lower Respiratory Diseases in the US, 1999-2019. Am J Respir Crit Care Med. 2021;203(11):1435–7.

  32. Shah NS, Carnethon M, Lloyd-Jones DM, Khan SS. Widening Rural-Urban Cardiometabolic Mortality Gap in the United States, 1999 to 2017. J Am Coll Cardiol. 2020;75(25):3187–8.

    Article  Google Scholar 

  33. Khatana SAM, Venkataramani AS, Nathan AS, et al. Association Between County-Level Change in Economic Prosperity and Change in Cardiovascular Mortality Among Middle-aged US Adults. JAMA. 2021;325(5):445–53.

    Article  Google Scholar 

  34. Sasson I, Hayward MD. Association Between Educational Attainment and Causes of Death Among White and Black US Adults, 2010-2017. JAMA. 2019;322(8):756–63.

    Article  Google Scholar 

  35. Yancy CW. COVID-19 and African Americans. JAMA. 2020;323(19):1891–2.

  36. Moscola J, Sembajwe G, Jarrett M, et al. Prevalence of SARS-CoV-2 Antibodies in Health Care Personnel in the New York City Area. JAMA. 2020;324(9):893–5.

    Article  CAS  Google Scholar 

  37. Sood N, Simon P, Ebner P, et al. Seroprevalence of SARS-CoV-2-Specific Antibodies Among Adults in Los Angeles County, California, on April 10-11, 2020. JAMA. 2020;323(23):2425–7.

    Article  Google Scholar 

  38. Stokes AC, Lundberg DJ, Elo IT, Hempstead K, Bor J, Preston SH. COVID-19 and excess mortality in the United States: A county-level analysis. PLoS Med. 2021;18(5):e1003571.

    Article  CAS  Google Scholar 

  39. Sedgwick P. Ecological studies: advantages and disadvantages. BMJ. 2014;348:g2979.

    Article  Google Scholar 

  40. Wadhera RK, Wadhera P, Gaba P, et al. Variation in COVID-19 Hospitalizations and Deaths Across New York City Boroughs. JAMA. 2020;323(21):2192–5.

    Article  CAS  Google Scholar 

  41. Munoz-Price LS, Nattinger AB, Rivera F, et al. Racial Disparities in Incidence and Outcomes Among Patients With COVID-19. JAMA Netw Open. 2020;3(9):e2021892.

    Article  Google Scholar 

  42. Pierce JB, Harrington K, McCabe ME, et al. Racial/ethnic minority and neighborhood disadvantage leads to disproportionate mortality burden and years of potential life lost due to COVID-19 in Chicago, Illinois. Health Place. 2021;68:102540.

    Article  Google Scholar 

Download references


Not applicable


Research reported in this publication was supported, in part, by the National Institutes of Health's National Institute on Aging, Grant Number P30AG059988, to SSK. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The funding sponsor did not contribute to design and conduct of the study, collection, management, analysis, or interpretation of the data or preparation, review, or approval of the manuscript.

Author information

Authors and Affiliations



SK, LP, KK, LP, and NA designed the research study. AK, MM, LP, XY performed data analysis and interpretation. SK drafted the manuscript. AK, MM, LP, XY, KK, LP and NA provided critical revisions to the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Sadiya S. Khan.

Ethics declarations

Ethics approval and consent to participate

All data were deidentified and publicly available, and therefore deemed exempt from Northwestern University’s Institutional Review Board.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khan, S.S., Krefman, A.E., McCabe, M.E. et al. Association between county-level risk groups and COVID-19 outcomes in the United States: a socioecological study. BMC Public Health 22, 81 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: