Skip to main content

Relative predictive value of sociodemographic factors for chronic diseases among All of Us participants: a descriptive analysis

Abstract

Background

Although sociodemographic characteristics are associated with health disparities, the relative predictive value of different social and demographic factors remains largely unknown. This study aimed to describe the sociodemographic characteristics of All of Us participants and evaluate the predictive value of each factor for chronic diseases associated with high morbidity and mortality.

Methods

We performed a cross-sectional analysis using de-identified survey data from the All of Us Research Program, which has collected social, demographic, and health information from adults living in the United States since May 2018. Sociodemographic data included self-reported age, sex, gender, sexual orientation, race/ethnicity, income, education, health insurance, primary care provider (PCP) status, and health literacy scores. We analyzed the self-reported prevalence of hypertension, coronary artery disease, any cancer, skin cancer, lung disease, diabetes, obesity, and chronic kidney disease. Finally, we assessed the relative importance of each sociodemographic factor for predicting each chronic disease using the adequacy index for each predictor from logistic regression.

Results

Among the 372,050 participants in this analysis, the median age was 53 years, 59.8% reported female sex, and the most common racial/ethnic categories were White (54.0%), Black (19.9%), and Hispanic/Latino (16.7%). Participants who identified as Asian, Middle Eastern/North African, and White were the most likely to report annual incomes greater than $200,000, advanced degrees, and employer or union insurance, while participants who identified as Black, Hispanic, and Native Hawaiian/Pacific Islander were the most likely to report annual incomes less than $10,000, less than a high school education, and Medicaid insurance. We found that age was most predictive of hypertension, coronary artery disease, any cancer, skin cancer, diabetes, obesity, and chronic kidney disease. Insurance type was most predictive of lung disease. Notably, no two health conditions had the same order of importance for sociodemographic factors.

Conclusions

Age was the best predictor for the assessed chronic diseases, but the relative predictive value of income, education, health insurance, PCP status, race/ethnicity, and sexual orientation was highly variable across health conditions. Identifying the sociodemographic groups with the largest disparities in a specific disease can guide future interventions to promote health equity.

Peer Review reports

Background

The social determinants of health encompass a wide range of non-medical factors that can influence health outcomes, such as social, economic, cultural, structural, and environmental conditions [1,2,3]. Studies suggest that clinical care only accounts for 20% of health outcomes, whereas social determinants contribute up to 50% [3, 4]. Identifying risk factors for specific diseases is critical for guiding preventative efforts and advancing health equity. Different sociodemographic groups experience significant disparities in the morbidity and mortality of chronic diseases including hypertension [5,6,7], coronary artery disease [5, 8,9,10], cancer [5, 11,12,13,14,15,16], lung disease [5, 17,18,19], diabetes [5, 20,21,22], obesity [5, 23, 24], and chronic kidney disease [25,26,27,28,29]. Despite advances in detection and treatment of chronic diseases, health disparities have widened over time among racial/ethnic groups [5].

The consequences of social determinants are complex and dynamic, such that each may have a unique contribution to different diseases. For example, an analysis of patients from 169 community health centers across the United States found that physical environments (i.e. rural status and Census region) were strongly predictive of hypertension, but less predictive of cardiovascular disease [30]. This study identified health disparities among different sociodemographic groups in a low-income, uninsured population. However, no studies to date have examined the relative contribution of different social factors across health conditions in a nationally representative population.

In this study, we analyze social determinants of health that fall into priority domains established by Healthy People 2023: economic stability (annual income), education access and quality (educational attainment), and healthcare access and quality (insurance type, health literacy, and primary care history) [4]. Information related to the environment [4] and community support [4] domains were unavailable in the All of Us dataset at the time of analysis. We also assess sociodemographic factors commonly associated with health disparities, including age, sex, gender, sexual orientation, race, and ethnicity.

First, we describe how various sociodemographic factors (age, sex, gender, sexual orientation, annual income, educational attainment, insurance type, and health literacy) are distributed within self-identified racial and ethnic categories among participants in the All of Us Research Program. We then assess the relative contributions of each of these variables on the likelihood of self-reporting hypertension, coronary artery disease, cancer, lung disease, diabetes, obesity, and chronic kidney disease. We hypothesize that the strongest predictors will vary by disease and that a “one-size-fits-all” approach is insufficient for identifying and preventing health disparities.

Methods

Study population and data collection

All of Usaims to collect health information from over one million people in the United States, with the goal of advancing personalized health care [31]. All of Us is therefore uniquely poised to examine social diversity within self-identified racial and ethnic categories and the interactions between different sociodemographic factors and health conditions. The full All of Us Research Program protocol has been published previously [32]. Briefly, adults aged 18 years or older living in the United States were consented and enrolled in All of Us through a healthcare provider organization or directly through the website (www.allofus.nih.gov). Participants answered baseline demographic questions and had the option to provide additional information on their medical history. Surveys were completed in either English or Spanish, and resulting data were pooled as per the All of Us protocol [32].

For this study, we included sociodemographic and health information from collected from All of Us participants who were enrolled from May 2017 to January 2022 and completed the Basics Survey. Sociodemographic data and health history were selected based on the health disparities literature [5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29] and the data available in version 6 of All of Us. Age, sex, gender, sexual orientation, race, ethnicity, annual income, educational attainment, insurance type, health literacy, and history of seeing a primary care practitioner (PCP) within the last twelve months were included in the analysis. Self-reported health history included hypertension, coronary artery disease, any cancer, skin cancer, lung disease, diabetes, obesity, and chronic kidney disease. Health literacy was assessed via the Brief Health Literacy Screen, and participants were scored from 3–15 using the three questions, with higher scores indicating higher health literacy [33, 34]. Supplemental Table 1 provides details on each question and how each characteristic was coded.

Data analysis

All data were analyzed directly in the All of Us Researcher Workbench with R Statistical Software(version 4.2.2) [35]. After data cleaning (Supplemental Tables 1.1 and 1.2), we compared the distributions of age, sex, gender, sexual orientation, annual income, educational attainment, insurance type, health literacy, and recent PCP visit by self-defined racial and ethnic category. We characterized sociodemographic disparities across racial and ethnic categories as these social constructs are tied to important determinants of health that cannot be directly measured in this study, such as racism. We also included participants who skipped or declined to answer a question. As described in Supplemental Table 1.1, participants were asked, “Which categories describe you? Select all that apply. Note, you may select more than one group.” Participants who selected more than one group were coded as “Multiple.” Sub-categories with 20 or fewer participants were excluded for participant privacy [36].

To assess the importance of each social determinant on each health condition, we compared the relative importance of each predictor (ranked from 1–11) based on each predictor’s adequacy index (-2*log-likelihood of each predictor in the model divided by -2*log-likelihood of the full model) in participants who provided baseline self-reported information on a personal history of hypertension, coronary artery disease, any cancer, skin cancer, lung disease, diabetes, obesity, and chronic kidney disease with logistic regression (Table 1) [37, 38]. We used inverse probability weighting to weight each participant who reported clinical conditions (n = 141,878) such that the weighted sample (n = 378,811) was representative of the full 372,050 participants in All of Us version 6 based on the measured social and demographic variables, excluding PCP status due to a large number of version 6 participants who did not fill out the All of UsHealthcare Access and Utilization survey [39, 40]. In all models, age and health literacy were modeled as restricted cubic splines with five knots [37, 38]. As health literacy mediates some racial and ethnic disparities in health outcomes and behaviors [41,42,43,44,45,46,47], we analyzed the interaction between health literacy and race/ethnicity. As we were interested in the variance explained by each sociodemographic variable, not the specific impact of each category within each social variable, “Skip” and “Prefer Not To Answer” were included as subcategories for categorical variables. Participants with missing or unknown health outcomes in the weighted sample were excluded from the model for that chronic disease. Code is available at www.github.com/ansleykunnath/allofus.

Table 1 All of Us (version 6) Participant Information

Results

All of Us (version 6) included social and demographic data from 372,050 individuals, summarized in Table 1. The participants had a median age of 53 years, 59.8% were female, and 86.6% identified as straight. There were ten self-identified racial/ethnic categories, with the largest categories being White (54.0%), Black (19.9%), and Hispanic/Latino (16.7%). Incomes ranged from less than $10,000 (14.2%) to over $200,000 (6.2%), with 13% preferring not to answer. Most participants completed at least a high school degree or equivalent, with 43% of participants completing a college or advanced degree. Thirty percent of participants were insured by their employer, 17.9% by Medicaid and 15.8% by Medicare. Most participants (62.8%) reported feeling extremely confident when completing medical forms. Similarly, 60% and 66% of participants reported never needing assistance with reading health-related materials and never having difficulty understanding written health information, respectively. Nearly all participants (94.5%) who completed the Healthcare Access and Utilization instrument reported seeing a PCP within the last 12 months. The prevalence of health conditions ranged from 2.4% (chronic kidney disease) to 31.3% (hypertension) among participants who reported their health history.

Participants who identified as Asian (Middle Eastern/North African, and White were the most likely to report incomes greater than $200,000 per year (11.9%, 9.4%, 9.5%), advanced degrees (40.3%, 36.5%, 29.7%), and employer/union insurance (50.6%, 38.6%, 38.0%), whereas participants who identified as Black, Hispanic, and Native Hawaiian/Pacific Islander were the most likely to report incomes less than $10,000 per year (32.9%, 17.2%, 16.9%), less than a high school education (15.1%, 25.8%, 6.4%), and Medicaid insurance (32.3%, 31.8%, 28.1%). Participants who identified as Black, Hispanic, or Native Hawaiian/Pacific Islander were also more likely to mark “prefer not to answer”. There was substantial income, educational, and insurance diversity within each racial/ethnic category as well (Fig. 1 and Supplemental Table 2). All racial/ethnic categories self-reported high health literacy (Supplemental Fig. 1). Those who skipped the race/ethnicity question were also more likely to skip all other demographic questions. Annual household income was skipped (7.8%) or preferred not to answer (13.0%) most frequently. Racial/ethnic groups demonstrated large differences in self-reported disease prevalence (Supplemental Table 3). For example, among participants who reported clinical histories, Black participants were the most likely to report hypertension (45.1%), and Asian participants were the least likely to report obesity (9.0%).

Fig. 1
figure 1

Relationship between race/ethnicity category and age, gender, annual income, health insurance, education, and PCP status. “Skip” indicates participants who filled out the corresponding All of Us instruments (Supplemental Tables 1.1 and 1.2) but skipped a question on the variable, and “Missing” indicates participants who did not see the instrument for any reason

Inverse probability weighting generated a pseudo-population very similar to the full All of Us (version 6) population (Table 2 and Supplemental Fig. 2). In general, age, income, education, insurance, and race/ethnicity were the most important predictors across all assessed self-reported health conditions (Fig. 2). However, each health condition has a different order of the relative importance of each sociodemographic factor. For most health conditions, age was the most important predictor. However, age was the 5th most important predictor for a self-reported history of lung disease, which was relatively more impacted by health insurance type. We also found that income was a better predictor of self-reporting obesity compared to the other health conditions. Across most diseases, health literacy was the least predictive variable. Overall, the relative predictive value of sociodemographic factors varied greatly among chronic health conditions.

Table 2 Predictive model characteristics
Fig. 2
figure 2

Relative importance of selected social and demographic variables on health conditions based on adequacy index. The values reflect the proportion of health condition variance (column) explained by each sociodemographic factor (row)

Discussion

In this study, we analyzed the relative contribution of sociodemographic factors to chronic diseases in a large, diverse national sample. Age was the most predictive for self-reporting each health condition. Despite evidence that health literacy is a strong predictor of chronic diseases and health care utilization [48,49,50,51,52], we found that health literacy was the overall weakest predictor of chronic diseases among All of Us participants. This may be due, in part, to the lack of response variability within this cohort or response bias inherent to self-reporting. Previous studies also suggest that the Brief Health Literacy Screen is more sensitive to identifying patients with inadequate health literacy than marginal health literacy [34]. To our knowledge, this study is the largest population in which the Brief Health Literacy Screen has been used; further studies may be necessary to understand its predictive value for chronic diseases. Overall, the differences in the relative contribution of social and demographic factors to each chronic disease underscore the importance of carefully selecting covariates when assessing disease risk and prevention. Furthermore, identifying the strongest predictors for diseases will be crucial for developing targeted interventions to prevent health disparities.

The All of Usparticipants encompass a wide range of demographic factors, social identities, and health conditions. Previous databases have been limited by a lack of diversity among study participants, leading to the exclusion of many marginalized groups in research. The UK Biobank contains data from over 500,000 individuals, 94% of whom identify as White [53, 54]. Similarly, the original Framingham Heart Study was 100% White and, despite the addition of new cohorts, 94% of current participants are White [55]. Furthermore, despite the presence of individuals with diverse backgrounds within these databases, many researchers exclude non-White races in their studies due to low sampling. All of Us, on the other hand, seeks to improve health research diversity by actively including participants from groups historically excluded from research, thereby strengthening its use for health disparities research [31]. Based on 2021 United States Census Bureau data, a representative sample of Americans should be about 59% non-Hispanic White [56]. In our study, we found that 54% of All of Us participants identified as White, which is significantly more reflective of the national population than other databases. The rich diversity and scale of All of Us makes it a powerful tool for studying health conditions across various social variables, including race/ethnicity.

There are several limitations to this study. The data were self-reported, which introduces measurement error that likely differs by baseline sociodemographic characteristics, and cross-sectional, which prevents any causal interpretations from our predictive models. Participants in the All of Us Research Program were enrolled through their health provider organization or online, which may explain the disproportionately high rate of health literacy (89.9%) and PCP status (94.5%). Furthermore, to appropriately compare predictive value of sociodemographic factors across health conditions, we did not change each model for each health condition (e.g., changing the interaction terms to fit any particular condition) or account for factors such as family history or health-related behaviors (i.e., smoking, diet, exercise) specific to each health condition. Despite the large dataset, there were a small number of some health conditions, which limited which interactions we could model. For example, while we thought the interaction between income and self-reported race/ethnicity is likely pertinent in several of the available health conditions, the fact that both were categorical variables with many categories prevented model convergence. Several of the assessed sociodemographic factors likely interact with each other, and the cumulative effect of multiple sociodemographic factors may be greater than the sum of individual sociodemographic factors. Finally, the All of Usdatabase is, unfortunately, missing important covariates that likely impact health outcomes, including, but not limited to: experiences with discrimination and racism [57, 58], psychosocial stress [59, 60], environmental exposures [61], food security [62], exposures to gentrification [63, 64], and interactions with the justice system [65, 66].

Conclusion

In this study, we characterize the differences in sociodemographic factors, and chronic diseases among racial and ethnic groups, as well as the relative predictive value of sociodemographic factors for chronic diseases, using the All of Us database. Our findings demonstrate that the All of Us Research Program is well-poised to expand the diversity of population-level health outcome research in the United States. Finally, our predictive models, although missing factors that measure structural drivers of health, highlight that social and demographic factors are differentially predictive of individual health conditions and, therefore, the importance of thoughtful model generation that considers each health condition individually. Identifying the strongest predictors for each of these diseases can also guide strategies to eliminate health disparities.

Availability of data and materials

The data that support the findings of this study can be accessed via the All of Us Researcher Workbench (https://workbench.researchallofus.org/). The data are not publicly available and cannot be made available by the corresponding author on reasonable request due to privacy or ethical restrictions. At the time of publication, access to the All of Us Researcher Workbench is restricted to researchers whose institution has signed a data use agreement with All of Us (https://www.researchallofus.org/register/).

Abbreviations

PCP:

Primary care provider

IQR:

Interquartile range

References

  1. Centers for Disease Control and Prevntion. Chronic diseases in America. https://www.cdc.gov/chronicdisease/resources/infographic/chronic-diseases.htm. Accessed 15 Oct 2023.

  2. Centers for Disease Control and Prevention. Social determinants of health. https://www.cdc.gov/about/sdoh/index.html.

    Google Scholar 

  3. Commission on Social Determinants of Health. Closing the gap in a generation: health equity through action on the social determinants of health. Final report of the Commission on Social Determinants of Health. Geneva: World Health Organization; 2008.

    Google Scholar 

  4. U.S. Department of Health and Human Services. Social determinants of health. https://health.gov/healthypeople/priority-areas/social-determinants-health.

  5. Singh PGK, Daus MAGP, Allender MSBSNRNM, et al. Social determinants of health in the United States: Addressing major health inequality trends for the nation, 1935–2016. Int J Mch Aids. 2018;6(2):139–64. https://doi.org/10.21106/ijma.236.

    Article  CAS  Google Scholar 

  6. Aggarwal R, Chiu N, Wadhera RK, et al. Racial/ethnic disparities in hypertension prevalence, awareness, treatment, and control in the United States, 2013 to 2018. Hypertension. 2021;78(6):1719–26. https://doi.org/10.1161/HYPERTENSIONAHA.121.17570.

    Article  CAS  PubMed  Google Scholar 

  7. Lackland DT. Racial differences in hypertension: implications for high blood pressure management. Am J Med Sci. 2014;348(2):135–8. https://doi.org/10.1097/MAJ.0000000000000308.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Puckrein GA, Egan BM, Howard G. Social and medical determinants of cardiometabolic health: The big picture. Ethn Dis. 2015;25(4):521–4. https://doi.org/10.18865/ed.25.4.521.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Mital R, Bayne J, Rodriguez F, Ovbiagele B, Bhatt DL, Albert MA. Race and ethnicity considerations in patients with coronary artery disease and stroke: JACC focus seminar 3/9. J Am Coll Cardiol. 2021;78(24):2483–92. https://doi.org/10.1016/j.jacc.2021.05.051.

    Article  PubMed  Google Scholar 

  10. Đoàn LN, Takata Y, Hooker K, Mendez-Luck C, Irvin VL. Trends in cardiovascular disease by Asian American, Native Hawaiian, and pacific islander ethnicity, Medicare Health Outcomes Survey 2011–2015. J Gerontol A Biol Sci Med Sci. 2022;77(2):299–309. https://doi.org/10.1093/gerona/glab262.

    Article  PubMed  Google Scholar 

  11. Brunsgaard E, Wu YP, Grossman D. Melanoma in Skin of Color: Part I. Epidemiology and clinical presentation. J Am Acad Dermatol. 2022. https://doi.org/10.1016/j.jaad.2022.04.056.

  12. Brunsgaard E, Jensen J, Grossman D. Melanoma in Skin of Color: Part II. Racial disparities, role of UV, and interventions for earlier detection. J Am Acad Dermatol. 2022. https://doi.org/10.1016/j.jaad.2022.04.057.

  13. Shao K, Feng H. Racial and Ethnic Healthcare Disparities in Skin Cancer in the United States: A Review of Existing Inequities, Contributing Factors, and Potential Solutions. J Clin Aesthet Dermatol. 2022;15(7):16–22.

    PubMed  PubMed Central  Google Scholar 

  14. Özdemir BC, Dotto GP. Racial Differences in Cancer Susceptibility and Survival: More Than the Color of the Skin? Trends Cancer. 2017;3(3):181–97. https://doi.org/10.1016/j.trecan.2017.02.002.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Coughlin SS. Social determinants of breast cancer risk, stage, and survival. Breast Cancer Res Treat. 2019;177(3):537–48. https://doi.org/10.1007/s10549-019-05340-7.

    Article  PubMed  Google Scholar 

  16. Coughlin SS. Social determinants of colorectal cancer risk, stage, and survival: a systematic review. Int J Colorectal Dis. 2020;35(6):985–95. https://doi.org/10.1007/s00384-020-03585-z.

    Article  PubMed  Google Scholar 

  17. Adie Y, Kats DJ, Tlimat A, et al. Neighborhood disadvantage and lung cancer incidence in ever-smokers at a safety net health-care system: A retrospective study. Chest. 2020;157(4):1021–9. https://doi.org/10.1016/j.chest.2019.11.033.

    Article  PubMed  Google Scholar 

  18. Pleasants RA, Riley IL, Mannino DM. Defining and targeting health disparities in chronic obstructive pulmonary disease. Int J Chron Obstruct Pulmon Dis. 2016;11:2475–96. https://doi.org/10.2147/COPD.S79077.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Pinkerton KE, Harbaugh M, Han MK, et al. Women and lung disease. Sex differences and global health disparities. Am J Respir Crit Care Med. 2015;192(1):11–6. https://doi.org/10.1164/rccm.201409-1740PP.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Peek ME, Cargill A, Huang ES. Diabetes health disparities: a systematic review of health care interventions. Med Care Res Rev. 2007;64(5 Suppl):101S-56S. https://doi.org/10.1177/1077558707305409.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Hill-Briggs F, Adler NE, Berkowitz SA, et al. Social determinants of health and diabetes: A scientific review. Diabetes Care. 2020;44(1):258–79. https://doi.org/10.2337/dci20-0053.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Clark ML, Utz SW. Social determinants of type 2 diabetes and health in the United States. World J Diabetes. 2014;5(3):296–304. https://doi.org/10.4239/wjd.v5.i3.296.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Krueger PM, Reither EN. Mind the gap: race/ethnic and socioeconomic disparities in obesity. Curr Diab Rep. 2015;15(11):95. https://doi.org/10.1007/s11892-015-0666-6.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Rossen LM, Schoendorf KC. Measuring health disparities: trends in racial-ethnic and socioeconomic disparities in obesity among 2- to 18-year old youth in the United States, 2001–2010. Ann Epidemiol. 2012;22(10):698–704. https://doi.org/10.1016/j.annepidem.2012.07.005.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Chu CD, Powe NR, McCulloch CE, et al. Trends in chronic kidney disease care in the US by race and ethnicity, 2012–2019. JAMA Netw Open. 2021;4(9):e2127014. https://doi.org/10.1001/jamanetworkopen.2021.27014.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Norton JM, Moxey-Mims MM, Eggers PW, et al. Social determinants of racial disparities in CKD. J Am Soc Nephrol. 2016;27(9):2576–95. https://doi.org/10.1681/ASN.2016010027.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Nicholas S, Agodoa L, Norris K. Ethnic disparities in the prevalence and treatment of kidney disease. Nephrol News Issues. 2004;18(12):29–30, 32, 34 passim.

    PubMed  Google Scholar 

  28. Nicholas SB, Kalantar-Zadeh K, Norris KC. Racial disparities in kidney disease outcomes. Semin Nephrol. 2013;33(5):409–15. https://doi.org/10.1016/j.semnephrol.2013.07.002.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Nicholas SB, Kalantar-Zadeh K, Norris KC. Socioeconomic disparities in chronic kidney disease. Adv Chronic Kidney Dis. 2015;22(1):6–15. https://doi.org/10.1053/j.ackd.2014.07.002.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Pourat N, Chen X, Lu C, et al. The Relative Contribution of Social Determinants of Health Among Health Resources and Services Administration-Funded Health Centers. Population Health Manage. 2022;25(2):199–208. https://doi.org/10.1089/pop.2021.0293.

    Article  Google Scholar 

  31. National, Program IoHAoUR. FAQ. https://allofus.nih.gov/about/faq. Accessed 15 Oct 2023.

  32. All of Us Research Program I, Denny JC, Rutter JL, et al. The “All of Us” Research Program. N Engl J Med. 2019;381(7):668–76. https://doi.org/10.1056/NEJMsr1809937.

    Article  Google Scholar 

  33. Willens DE, Kripalani S, Schildcrout JS, et al. Association of brief health literacy screening and blood pressure in primary care. J Health Commun. 2013;18 Suppl 1(sup1):129–42. https://doi.org/10.1080/10810730.2013.825663.

    Article  PubMed  Google Scholar 

  34. Chew LD, Bradley KA, Boyko EJ. Brief questions to identify patients with inadequate health literacy. Fam Med. 2004;36(8):588–94.

    PubMed  Google Scholar 

  35. Team RC. R: A Language and Environment for Statistical Computing. https://www.R-project.org/. Accessed 15 Oct 2023.

  36. Research NIoHAoU, Program. Precision Medicine Initiative: Privacy and Trust Principles. https://allofus.nih.gov/protecting-data-and-privacy/precision-medicine-initiative-privacy-and-trust-principles. Accessed 15 Oct 2023.

  37. Harrell FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. New York: Springer International Publishing; 2015.

  38. Harrell Jr FE. Rms: Regression Modeling Strategies. https://CRAN.R-project.org/package=rms. Accessed 15 Oct 2023.

  39. Lumley T. Survey: Analysis of Complex Survey Samples. https://CRAN.R-project.org/package=survey. Accessed 15 Oct 2023.

  40. Reig J. (Very) Basic Steps to Weight a Survey Sample. https://bookdown.org/jespasareig/Book_How_to_weight_a_survey/. Accessed 15 Oct 2023.

  41. Schillinger D. Social Determinants, Health Literacy, and Disparities: Intersections and Controversies. Health Lit Res Pract. 2021;5(3):e234–43. https://doi.org/10.3928/24748307-20210712-01.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Osborn CY, Cavanaugh K, Wallston KA, et al. Health literacy explains racial disparities in diabetes medication adherence. J Health Commun. 2011;16 Suppl 3(Suppl 3):268–78. https://doi.org/10.1080/10810730.2011.604388.

    Article  PubMed  Google Scholar 

  43. Eneanya ND, Winter M, Cabral H, et al. Health Literacy and Education as Mediators of Racial Disparities in Patient Activation Within an Elderly Patient Cohort. J Health Care Poor Underserved. 2016;27(3):1427–40. https://doi.org/10.1353/hpu.2016.0133.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Berkman ND, Sheridan SL, Donahue KE, Halpern DJ, Crotty K. Low health literacy and health outcomes: an updated systematic review. Ann Intern Med. 2011;155(2):97–107. https://doi.org/10.7326/0003-4819-155-2-201107190-00005.

    Article  PubMed  Google Scholar 

  45. Bennett IM, Chen J, Soroui JS, White S. The contribution of health literacy to disparities in self-rated health status and preventive health behaviors in older adults. Ann Fam Med. 2009;7(3):204–11. https://doi.org/10.1370/afm.940.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Mantwill S, Monestel-Umaña S, Schulz PJ. The Relationship between Health Literacy and Health Disparities: A Systematic Review. PLoS ONE. 2015;10(12):e0145455. https://doi.org/10.1371/journal.pone.0145455.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Stormacq C, Van den Broucke S, Wosinski J. Does health literacy mediate the relationship between socioeconomic status and health disparities? Integrative review. Health Promot Int. 2019;34(5):e1–17. https://doi.org/10.1093/heapro/day062.

    Article  PubMed  Google Scholar 

  48. Rasu RS, Bawa WA, Suminski R, Snella K, Warady B. Health literacy impact on national healthcare utilization and expenditure. Int J Health Policy Manag. 2015;4(11):747–55. https://doi.org/10.15171/ijhpm.2015.151.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Mackey LM, Doody C, Werner EL, Fullen B. Self-management skills in chronic disease management: What role does health literacy have? Med Decis Making. 2016;36(6):741–59. https://doi.org/10.1177/0272989X16638330.

    Article  PubMed  Google Scholar 

  50. Cajita MI, Cajita TR, Han H-R. Health literacy and heart failure. J Cardiovasc Nurs. 2016;31(2):121–30. https://doi.org/10.1097/jcn.0000000000000229.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Puente-Maestu L, Calle M, Rodríguez-Hermosa JL, et al. Health literacy and health outcomes in chronic obstructive pulmonary disease. Respir Med. 2016;115:78–82. https://doi.org/10.1016/j.rmed.2016.04.016.

    Article  PubMed  Google Scholar 

  52. Liu L, Qian X, Chen Z, He T. Health literacy and its effect on chronic disease prevention: evidence from China’s data. BMC Public Health. 2020;20(1):690. https://doi.org/10.1186/s12889-020-08804-4.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Sun Q, Graff M, Rowland B, et al. Analyses of biomarker traits in diverse UK biobank participants identify associations missed by European-centric analysis strategies. J Hum Genet. 2022;67(2):87–93. https://doi.org/10.1038/s10038-021-00968-0.

    Article  PubMed  Google Scholar 

  54. Manolio TA. Using the Data We Have: Improving Diversity in Genomic Research. Am J Hum Genet. 2019;105(2):233–6. https://doi.org/10.1016/j.ajhg.2019.07.008.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Tsao CW, Vasan RS. Cohort Profile: The Framingham Heart Study (FHS): overview of milestones in cardiovascular epidemiology. Int J Epidemiol. 2015;44(6):1800–13. https://doi.org/10.1093/ije/dyv337.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Bureau USC. U.S. Census Bureau QuickFacts. https://www.census.gov/quickfacts/fact/table/US/RHI825221. Accessed 15 Oct 2023.

  57. Hackett RA, Ronaldson A, Bhui K, Steptoe A, Jackson SE. Racial discrimination and health: a prospective study of ethnic minorities in the United Kingdom. BMC Public Health. 2020;20(1):1652. https://doi.org/10.1186/s12889-020-09792-1.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Mays VM, Cochran SD, Barnes NW. Race, race-based discrimination, and health outcomes among African Americans. Annu Rev Psychol. 2007;58:201–25. https://doi.org/10.1146/annurev.psych.57.102904.190212.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Yaribeygi H, Panahi Y, Sahraei H, Johnston TP, Sahebkar A. The impact of stress on body function: A review. EXCLI J. 2017;16:1057–72. https://doi.org/10.17179/excli2017-480.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Mariotti A. The effects of chronic stress on health: new insights into the molecular mechanisms of brain-body communication. Future Sci OA. 2015;1(3):FSO23. https://doi.org/10.4155/fso.15.21.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Rappaport SM, Smith MT. Epidemiology. Environment and disease risks Science. 2010;330(6003):460–1. https://doi.org/10.1126/science.1192603.

    Article  CAS  PubMed  Google Scholar 

  62. Seligman HK, Laraia BA, Kushel MB. Food insecurity is associated with chronic disease among low-income NHANES participants. J Nutr. 2010;140(2):304–10. https://doi.org/10.3945/jn.109.112573.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Smith GS, Breakstone H, Dean LT, Thorpe RJ. Impacts of Gentrification on Health in the US: a Systematic Review of the Literature. J Urban Health. 2020;97(6):845–56. https://doi.org/10.1007/s11524-020-00448-4.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Smith GS, McCleary RR, Thorpe RJ. Racial Disparities in Hypertension Prevalence within US Gentrifying Neighborhoods. Int J Environ Res Public Health. 2020;17(21). https://doi.org/10.3390/ijerph17217889.

  65. Alang S. Police Brutality and the Institutional Patterning of Stressors. Am J Public Health. 2020;110(11):1597–8. https://doi.org/10.2105/AJPH.2020.305937.

    Article  PubMed  PubMed Central  Google Scholar 

  66. Hawkins DS. “After Philando, I Had to Take a Sick Day to Recover”: Psychological Distress, Trauma and Police Brutality in the Black Community. Health Commun. 2022;37(9):1113–22. https://doi.org/10.1080/10410236.2021.1913838.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge the All of Us study participants, without whom this research would not be possible.

Funding

A.J.K. and D.E.S. are funded by the National Institute of General Medical Sciences (T32GM007347). Additionally, A.J.K. is funded by the National Institute on Deafness and Other Communication Disorders (F30DC020917), D.E.S is funded by the National Institute of Mental Health (F30MH123219), and C.H.W. is funded by the National Institutes of Health (P50MD017347 and UL1TR002243). This content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Authors

Contributions

A.J.K. contributed to study design, data analysis, manuscript writing, and manuscript editing. D.E.S. contributed to study design, data analysis, figure generation, and manuscript editing. C.H.W. contributed to study design and manuscript editing. All authors reviewed the manuscript.

Corresponding author

Correspondence to Consuelo H. Wilkins.

Ethics declarations

Ethics approval and consent to participate

The All of Us Operational Protocol was approved by the National Institutes of Health Institutional Review Board. Informed consent was obtained from all participants.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplemental Figure 1.

Relationship between race/ethnicity category and sex, sexual orientation, medical form confidence, health material assistance, health information difficulty, and health literacy. Supplemental Figure 2. Probability of completing the Personal Medical History Instrument (propensity score) used to generate inverse probability weights. Purple bars represent participants who completed the Personal Medical History Instrument, and blue bars represent participants who did not complete this survey. Supplemental Table 1.1. Survey Questions and Responses. Supplemental Table 1.2. Health Conditions (Personal Medical History Instrument). Supplemental Table 2. Demographic and Social Variables Stratified by Self-Identified Race/Ethnicity. Supplemental Table 3. Health Conditions Stratified by Self-Identified Race/Ethnicity.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kunnath, A.J., Sack, D.E. & Wilkins, C.H. Relative predictive value of sociodemographic factors for chronic diseases among All of Us participants: a descriptive analysis. BMC Public Health 24, 405 (2024). https://doi.org/10.1186/s12889-024-17834-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12889-024-17834-1

Keywords