Skip to main content
  • Research article
  • Open access
  • Published:

Area-level deprivation and preterm birth: results from a national, commercially-insured population



Area-level deprivation is associated with multiple adverse birth outcomes. Few studies have examined the mediating pathways through which area-level deprivation affects these outcomes. The objective of this study was to investigate the association between area-level deprivation and preterm birth, and examine the mediating effects of maternal medical, behavioural, and psychosocial factors.


We conducted a retrospective cohort study using national, commercial health insurance claims data from 2011, obtained from the Health Care Cost Institute. Area-level deprivation was derived from principal components methods using ZIP code-level data. Multilevel structural equation modeling was used to examine mediating effects.


In total, 138,487 women with a live singleton birth residing in 14,577 ZIP codes throughout the United States were included. Overall, 5.7% of women had a preterm birth. In fully adjusted generalized estimation equation models, compared to women in the lowest quartile of area-level deprivation, odds of preterm birth increased by 9.6% among women in the second highest quartile (odds ratio (OR) 1.096; 95% confidence interval (CI) 1.021, 1.176), by 11.3% in the third highest quartile (OR 1.113; 95% CI 1.035, 1.195), and by 24.9% in the highest quartile (OR 1.249; 95% CI 1.165, 1.339). Hypertension and infection moderately mediated this association.


Even among commercially-insured women, area-level deprivation was associated with increased risk of preterm birth. Similar to individual socioeconomic status, area-level deprivation does not have a threshold effect. Implementation of policies to reduce area-level deprivation, and the screening and treatment of maternal mediators may be associated with a lower risk of preterm birth.

Peer Review reports


Ten percent of the four million births in the United States are preterm (birth prior to 37 weeks gestation): a rate relatively intransigent over the past four decades [1]. Preterm birth is a leading cause of infant mortality [2], and preterm infants that survive are susceptible to complications throughout the lifecourse including, respiratory, cardiovascular, and neurological disorders [3]. The consequent social, economic, and human burden is profound.

Both individual- and area-level factors are associated with an increased risk of preterm birth [3]. At the individual level these include sociodemographic characteristics, anthropometrics, infections, and behavioural and psychosocial factors. At the area level these include socioeconomic disadvantage and crime. Although studies have shown an association between area-level socioeconomic disadvantage and poorer birth outcomes, an important limitation of this research is that disadvantage has been measured with numerous indicators, thus making comparisons difficult [4]. In an attempt to create a standardized index, named the neighbourhood deprivation index, Messer and colleagues used principal components analysis to obtain one neighbourhood deprivation factor from 20 variables across seven sociodemographic domains including, education, employment, housing, occupation, poverty, racial  composition, and residential stability [5].

Likewise, the neighbourhood deprivation index has been shown to be associated with poorer birth outcomes [6,7,8,9,10]. However, these studies were limited in scope and scale by being conducted in a small number of geographic regions, and by using vital statistics data which do not contain reliable information or information on potentially important covariates. For example, hypertension, sexually transmitted infections, and mental health conditions are underreported or not included on birth certificates [11, 12]. These limitations may affect the generalizability or conservativeness of study findings. Moreover, studies testing the mechanisms of these associations have been limited [13,14,15,16]. Proposed mediators of the association between area-level socioeconomic disadvantage and adverse birth outcomes include biologic, behavioural, psychosocial, and socioeconomic factors at the individual level, and access to goods and services at the area level [14, 17].

The objectives of this study were to examine the association between area-level deprivation and preterm birth, and to assess maternal mediators of this association in a national sample of women. We hypothesize that increasing levels of area-level deprivation will be associated with increased odds of preterm birth, and maternal factors will mediate this association. Using national, commercial health insurance claims data, this study extends the state-of-the-science in the following ways. First, no studies to date have examined the association between area-level deprivation and preterm birth among those commercially insured, despite the fact that commercial health insurance is the most common source of payment for deliveries in the United States, accounting for almost one-half of deliveries [18]. Other sources of payment include Medicaid (43.0%), self-pay or uninsured (4.1%), and other insurance (i.e., other government-funded sources or charity) (3.8%). Second, use of a large national sample of women enables us to explore heterogeneity in the association between area-level deprivation and adverse birth outcomes by geographic region of residence to inform policy making. And third, identifying mediating pathways may guide the development of interventions to prevent adverse birth outcomes.


Study design

This retrospective cohort study used national, commercial health insurance claims data obtained from the Health Care Cost Institute. A convenience sample of de-identified health care services data are provided by three of the largest health insurers (Aetna, Humana, and UnitedHealthcare) from all 50 states. Health Care Cost Institute data represent a quarter of the population with commercial insurance data [19]. Data are compliant with the Health Insurance Portability and Accountability Act, therefore this study was exempt from review by Institutional Review Boards.


The study sample was derived and variables defined using information available in claims data, that is Diagnosis Related Group (DRG) payment codes and International Classification of Diseases, Ninth Revision (ICD-9) procedure codes. Following previous research [20], we restricted analyses to women, ages 18 through 44, with a live birth that occurred in an inpatient hospital setting in 2011. These women were covered by employer-sponsored insurance for 10 months prior to childbirth, had a DRG code indicating a vaginal or cesarean delivery (765, 766, 774, or 775), and continued coverage for at least 6 weeks after delivery. To ensure a reliable analytic sample, we excluded patients with any of the following: delivery cannot be clearly identified in inpatient claims due to multiple admission records; non-continuous enrollment (i.e., missing membership records); multiple locations reported during the observation period (i.e., change in residence referenced by change in ZIP code); or event service dates that cross periods (e.g., antepartum outpatient visit with an initiation date prior to admission and an end date following discharge for the index event); multiple gestation (ICD-9 codes 651.00–651.23, 651.80–65.93, V27.2, V27.5); or medical indications justifying preterm delivery including, placenta previa (ICD-9 code 641.XX), and vasa previa (ICD-9 codes 663.51 and 663.53). Less than 5% of claims were excluded [20].


Individual-level variables

Birth outcome and maternal factors were obtained from the Health Care Cost Institute. Births were classified as preterm using the ICD-9 code 644.2, which indicates deliveries before 37 completed weeks of gestation. Age (18–24, 25–34, 35–44 years), and geographic location (state of residential address converted to one of nine United States Census Bureau divisions [21]), were considered as possible covariates, since preterm birth risk and the neighbourhood deprivation index vary by these factors [5, 7,8,9]. Specifically, preterm birth risk is higher at younger (less than 25 years) and older (35 years and older) ages [3]. Preterm birth risk differs by race [1], however data on race were not available. Therefore our findings may be overestimated and should be interpreted with caution. Reliable data were not available for other potential confounders such as body mass index (BMI), however BMI is a more important causal determinant of intrauterine growth restriction rather than preterm birth [22], and BMI is often controlled for in studies on low birth weight rather than preterm birth [7].

Maternal factors such as health, health behaviours, psychosocial stress, and individual socioeconomic resources have been conceptualized as mechanisms linking area-level socioeconomic disadvantage and preterm birth [13, 17, 22]. We considered maternal health (medical complications, infection, prior obstetric history, obstetric and fetal complications), health behaviours (substance use), and psychosocial stress (mental health conditions) as potential mediators. Data were not available to test mediating pathways through other maternal factors such as chronic stressors, social support, and individual socioeconomic resources. ICD-9 codes of potential mediators were based on prior research using health insurance claims data [23], which underwent clinical review by a maternal fetal medicine specialist (UM) (see Appendix for detailed list).

Area-level variables

We measured exposure to area-level deprivation using the neighbourhood deprivation index, reproducing the methods of Messer and colleagues [5]. This study used ZIP codes of women during pregnancy, which were converted to ZIP Code Tabulation Areas (ZCTA). Unlike many health insurance claims datasets which only include core-based statistical areas (county or counties associated with an urban core [24]), or 3-digit ZIP codes, our data includes 5-digit zip codes. Advantages of using ZIP codes include a large enough scale to protect personal privacy [25], and produce reliable estimates, yet a small enough size to effectively allocate health resources [26].

To compare area-level deprivation across varied geographic locations, preliminary principal component analyses were conducted separately for the nine census divisions. Census divisions are groupings of three to eight adjacent states (and the District of Columbia) that have similar socioeconomics, population characteristics, and historical development [21] (see Table 2 footnotes for the list of states in each division). Principal component analyses were conducted using data from all ZCTAs within a census division. ZCTA-level data were obtained from the 2011 five-year estimates from the American Community Survey [27]. An oblique rotation (promax) was used as the derived factors were correlated (> 0.32), however only the first factor was retained as it accounted for the largest proportion of variability.

Variables were considered for inclusion in the index using a priori criteria: (1) variables loaded above 0.25 in any census division; and (2) lower 95% confidence limit of the variable loading was not below the median lower 95% confidence limit factor loading (0.17 in our analyses) [5]. Ninety-two variables loaded above 0.25 across the census divisions. Bootstrap analyses of 500 randomly drawn samples for each division were used to determine the 95% confidence interval of variable loadings. Forty-two of 92 lower 95% confidence limits did not extend below the absolute value of 0.17. Variables were considered for inclusion if they met these criteria in at least one division. Fourteen variables met all criteria (Table 1), and were retained for final principal component analysis, used to obtain final loadings. ZCTA-level data from all divisions were pooled in the final principal components analysis. Area-level deprivation was standardized to have a mean of zero and a standard deviation of one by dividing the index by the square of the eigenvalue. Higher deprivation values indicate higher levels of deprivation.

Table 1 Fourteen variables retained for the final principal component analysis for the area-level deprivation index, 2011

Urbanization was measured using rural-urban commuting area (RUCA) codes [28]. RUCA codes are useful in public health studies because they may account for access to health care services [29]. ZIP code approximations of RUCA codes were used [30]. Primary RUCA codes of 10 were considered rural, remaining codes were considered non-rural.

Statistical methods

The final study sample included women with data on area-level deprivation and preterm birth. Data structure for the analysis was hierarchical: women at level one, areas at level two. Comparisons between unadjusted and adjusted generalized estimating equation models were used to assess for confounding. Only maternal factors diagnosed in at least 1% of the sample were considered for adjustment. We fit generalized estimating equation models to examine the association between area-level deprivation and preterm birth, accounting for similarities among women from the same ZCTA. Model one was the unadjusted model, model two was adjusted by covariates, and model three was adjusted by covariates and potential mediators. Interactions were assessed by adding a cross-product to the model. We allowed for non-linearity by categorizing area-level deprivation into quartiles, similar to previous studies [7, 8]. Other variables were used as categorical variables. A sensitivity analysis for unmeasured confounding was computed using the E-value [31]. Mediation was assessed using multilevel structural equation modeling [32]. Significance level of 0.05 was used. All P values were two-sided. Principal component analyses and generalized estimating equation models were performed with SAS software (Version 9.4, The SAS Institute Inc., Cary, NC, 2013). Multilevel structural equation modeling was conducted using Mplus (Version 7, Muthén & Muthén, Los Angeles, CA, 2017).

To minimize potential selection bias and endogeneity due to Health Care Cost Institute data not being a random sample, we assessed the effect of controlling for percentage of individuals in a state covered by commercial health insurance whose data was held by the Health Care Cost Institute in 2015, and ZIP code-level urbanization.


A total of 138,494 women from 50 states and the District of Columbia met inclusion criteria for the study. Seven women were missing data on area-level deprivation and were excluded. Overall, 5.7% of women had a preterm birth (Table 2). Preterm births were more common among the youngest and oldest women, and among those residing in the Southeast. Preterm births were less common among women residing along the West and Northeastern coasts. A higher proportion of women with medical complications, infections, poor obstetric history, obstetric and fetal complications, substance use, and mental health conditions had a preterm birth compared to women without these risk factors. Preterm births were more common among women who had a cesarean section compared to a vaginal delivery.

Table 2 Characteristics of commercially-insured women by preterm birth status, United States, 2011

In final analyses, 138,487 women in 14,577 areas were included. The standardized area-level deprivation index ranged from − 1.67 to 5.63. Distribution of area-level deprivation was fairly homogeneous across census divisions (Fig. 1). Overall, cut-off values were − 0.61, − 0.16 and 0.48 for the lowest (least deprived) to highest (most deprived) quartiles. Overall, the mean was 0.03, skewness was 1.3, and kurtosis was 2.3. By census division, quartile cut-off values were higher in the East South Central, South Atlantic, Pacific, and West South Central divisions. A high proportion of women in the southern, western, and Middle Atlantic census divisions resided in the most deprived areas (Appendix). Area-level deprivation was generally higher in central urban and rural areas (data not shown).

Fig. 1
figure 1

Distribution of area-level deprivation index by census division. + indicates the mean. Dots indicate outliers, which are 1.5 times the interquartile range above the 75th percentile. WN Central, West North Central (Iowa, Kansas, Minnesota, Missouri, Nebraska, North Dakota, South Dakota); New Engl, New England (Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, Vermont); EN Central, East North Central (Illinois, Indiana, Michigan, Ohio, Wisconsin); Mid Atl, Middle Atlantic (New Jersey, New York, Pennsylvania); Mountain (Arizona, Colorado, Idaho, Montana, Nevada, New Mexico, Wyoming, Utah); WS Central, West South Central (Arkansas, Louisiana, Oklahoma, Texas); Pacific (Alaska, California, Hawaii, Oregon, Washington); South Atl, South Atlantic (Delaware, District of Columbia, Florida, Georgia, Maryland, North Carolina, South Carolina, Virginia, West Virginia); ES Central, East South Central (Alabama, Kentucky, Mississippi, Tennessee)

Results of generalized estimating equation models are presented in Table 3. Each increasing quartile of area-level deprivation was associated with population-averaged higher odds of preterm birth. Compared to women in the lowest quartile of area-level deprivation, women in the highest quartile of area-level deprivation had an increased odds of preterm birth in unadjusted (OR 1.33; 95% CI 1.24, 1.42), and fully adjusted analyses (OR 1.25; 95% CI 1.17, 1.34). However, the strength of associations may be overestimated due to our inability to account for race. In sensitivity analyses, the E-value for the highest compared to the lowest quartile of area-level deprivation was 1.807 (or 1.603 for the lower confidence limit). When area-level deprivation was modeled as a continuous variable, the P value for trend was < 0.001. State-level coverage of commercial health insurance at the Health Care Cost Institute and urbanization were not included in model three as these variables were not significant predictors. Analyses of effect modification of census division and urbanization were non-significant.

Table 3 Unadjusted and adjusted results for preterm birth among commercially-insured women, United States, 2011

The direct association between area-level deprivation (continuous) and preterm birth was confirmed in the multilevel structural equation model (b = 0.036; P = 0.001). There was a significant indirect effect of area-level deprivation on preterm birth through hypertension (b = 0.013; P = 0.001) and infection (b = 0.010; P = 0.006) (Fig. 2 and Appendix). We did not find significant indirect effects through diabetes, substance use, mental health conditions, or other factors. Of the total effect of area-level deprivation on preterm birth, 39% was mediated (i.e., all indirect effects/all effects or 0.023/0.059). Model fit statistics indicated a reasonable fit of the model: root mean square error of approximation (RMSEA) was low (0.025), comparative fit index (CFI) was adequate (0.86), and standardized root mean square residual (SRMR) for within (0.03) and between (0.04) were low.

Fig. 2
figure 2

Final structural equation model of area-level deprivation in relation to maternal factors and preterm birth. Note: Parameter estimates are nonstandardized (all P values were significant). Rectangular shapes indicate observed variables, circular shapes indicate latent variables. Paths marked with a W subscript denote within-cluster paths, and paths marked with a B subscript denote between-cluster paths. Variables marked with a j subscript are observed at the area level, variables marked with an ij subscript are observed at the individual level within the area level. Arrows from between-cluster variables or within-cluster variables to observed variables indicate the decomposition of observed effects into between and within-cluster effects, respectively. Only the between indirect effect exists because the exposure, mediators, and outcome have between-cluster variation, whereas only the mediators and outcome (but not the exposure) have within-cluster variation. Short arrows entering endogenous variables indicate errors. DPR: area-level deprivation; HTN: hypertension; INF: infection; PTB: preterm birth


Area-level deprivation is associated with an increased risk of morbidity and mortality in adulthood [33, 34]. Importantly during pregnancy, when health trajectories are often set for life, area-level deprivation is associated with an increased risk of adverse birth outcomes [35]. In this study, we observed that even among women with commercial health insurance, those that are potentially the most economically advantaged, higher area-level deprivation was associated with an increased risk of preterm birth.

Preterm birth prevalence in this population of commercially-insured women was 5.7%, almost half the national preterm birth rate in 2011 (11.7%) [1]. This was consistent with other studies showing that women with commercial health insurance were less likely to have a preterm birth than women with Medicaid [3]. The range of area-level deprivation index values in this national study were wider (− 1.67 to 5.63) than that found in eight geographic areas in the United States (− 1.85 to 3.72) [5]. However, quartile cut-off values were less extreme in this study (− 0.61, − 0.16 and 0.48 for quartiles one, two and three, respectively) compared to those found by others (− 0.81, − 0.173, and 0.76 for quartiles one, two and three, respectively) [5].

Among commercially-insured women, we observed an increasing trend in the odds of preterm birth as area-level deprivation increased. Note that our findings should be interpreted with caution due to our inability to account for race. Similar to studies examining the association between individual socioeconomic status and health [36], a threshold effect for area-level deprivation was not observed. On average, there was a 25% increase in the odds of preterm birth among women residing in the highest compared to the lowest quartile of area-level deprivation. This estimate may be conservative as we controlled for potential mediators. The effect size of area-level deprivation on preterm birth in our study was similar in magnitude to other studies [7, 8]. Furthermore, in a meta-analysis of seven studies on neighbourhood deprivation measured as a single measure or an index, odds ratios were significantly increased for preterm birth in the most compared to the least deprived neighbourhood quintile (OR 1.23) [35].

Our research adds to these findings by assessing if the association between area-level deprivation and preterm birth is mediated by maternal factors. We found that only hypertension and infection moderately mediated this association. Meng and colleagues showed that approximately 25% of the association between area-level socioeconomic disadvantage and preterm birth and low birthweight was mediated by individual-level factors, with socioeconomic status-related support having the highest level of mediation followed by biologic, behavioural, and psychosocial factors [13]. Schempf and colleagues found that psychosocial mediators reduced the effect of neighbourhood risk on birthweight by 12%, and behavioural mediators reduced the effect by an additional 30% to the point where neighbourhood risk was no longer statistically significant [16]. Meanwhile, area-level factors such as the density of retail outlets for alcohol, tobacco, and unhealthy and healthy foods were unlikely to mediate the association between area-level socioeconomic disadvantage and adverse birth outcomes [14]. However, area-level socioeconomic disadvantage may be associated with other adverse area-level conditions such as inadequate housing and transportation, and violence, which may influence pregnancy through psychological stress [37].

Strengths and limitations

Strengths of this study included a large sample of commercially-insured women from all 50 states and the District of Columbia, and the availability of potential mediators of the association between area-level deprivation and preterm birth. There is limited research on this association among women with commercial health insurance, who account for almost half of all births [18]. The proportion of births covered by commercial health insurance, particularly after the implementation of the Affordable Care Act (ACA), is growing [38, 39]. This study showed that despite access to commercial health insurance coverage, women who resided in areas with higher deprivation were at greater odds of preterm birth.

Limitations of the study include lack of data on important risk factors for preterm birth (e.g., race and individual socioeconomic status), operationalization of area, area and covariate/mediator misclassification, selection bias, and potential inability to generalize to women without commercial health insurance or residential stability, or women whose commercial health insurers are not included in the Health Care Cost Institute data.

Race is an important predictor of preterm birth, with non-Hispanic black women having a 60% higher prevalence of preterm birth compared to non-Hispanic white women [1]. Other studies show that among women with commercial health insurance, 66.1% are non-Hispanic white, 12.2% are non-Hispanic black, 13.3% are Hispanic, and 8.4% are non-Hispanic other race [40]. Our study population may likewise have a high composition of non-Hispanic white women which may in part explain the low prevalence of preterm birth that was observed. Differences by race/ethnicity were not able to be assessed, therefore our findings may be overestimated. In sensitivity analyses, the minimum strength of an association that an unmeasured confounder would need to have, conditional on measured covariates, to explain away the observed association of the highest compared to the lowest quartile of area-level deprivation was 1.807 (or 1.603 for the lower confidence limit). In a previous study examining neighbourhood deprivation, the adjusted relative risk of preterm birth among non-Hispanic black women compared to non-Hispanic white women was 1.4 (95% CI 1.3–1.6) [41]. This relative risk is less than our E-value which suggests that controlling for race in our analyses would likely not explain away this observed association. However, it may explain away the association for other quartile comparisons. Additionally, results from other studies indicate that adjusting for individual confounders other than race may lead to attenuation of area-level associations [7].

We cannot rule out the possibility that the associations observed in this study could be explained by residual confounding from area-level factors. Although, as indicated above an unmeasured confounder would need to have a relative risk of 1.807 or a lower confidence limit of 1.603 to explain away our observed results. The association between area-level deprivation and preterm birth may be confounded by differences in availability and quality of health care services within an area, which was beyond the scope of the data available. Insurance coverage does not indicate equal access and use of medical care [42]. Furthermore, racial differences in adverse birth outcomes are related to the quality of care at the site of delivery [43].

Areas were operationalized using ZIP codes. ZIP codes are established by the United States Postal Service as mail routes for efficient mail delivery. ZIP codes have a large population size with an average of 30,000 residents and may be socioeconomically heterogeneous [44]. In contrast, census tracts, a common operationalization of neighbourhoods used in public health research, are relatively socioeconomically homogenous statistical subdivisions of counties derived by the United States Census Bureau that have an average of 4000 residents and approximately 1000 to 3000 housing units [21]. Krieger and colleagues compared the associations between socioeconomic measures at the census block group (aggregated census blocks (subdivisions of census tracts) that have approximately 250 to 550 housing units [21]), census tract, and ZIP code level, and birthweight [45]. Measures at the census block group and census tract level showed similar associations, while ZIP-code level measures detected smaller associations. These effect sizes were similar to the effect sizes observed in our study (OR range from 1.27 to 2.17). Therefore, the magnitude of effect size using ZIP code-level data may be smaller, nonetheless significant associations still were detected. Misclassification of areas may arise as ZCTAs (aggregated census blocks that approximate ZIP Codes [46]) and ZIP codes sharing the same code may not cover exactly the same geographic area [44].

Maternal factors and outcomes were obtained from commercial health insurance claims data and their accuracy is dependent on institutional and individual coding. The positive predictor value of chronic conditions and birth outcomes for administrative health plans range from 87 to 95% [47]. Non-differential misclassification of exposures and outcomes would bias our results towards the null, therefore our observed associations could be conservative estimates.

Our study included women with commercial health insurance and residential stability. While the majority of women (53%) in the United States have stable commercial health insurance coverage during pregnancy, women who are less likely to have stable commercial health insurance are younger, non-Hispanic white, have lower socioeconomic status, and initiate prenatal care late [48]. Many of these same factors are associated with residential mobility during pregnancy [49], and preterm birth [3]. Selection bias may have led to an underselection of women with low socioeconomic status who may have had a higher risk of preterm birth. Individual-level socioeconomic status data were not available in this study, thus the direction of bias is unknown.

Our results may or may not generalize to the populations of other insurers, as we were unable to determine how individuals included in the Health Care Cost Institute data compare to individuals covered by other insurers. However, our results are internally valid. Whether the results of this study generalize to all pregnant women, particularly after the implementation of the ACA, is unknown. However, studies that take insurance status into account, still find an association between area-level deprivation and adverse birth outcomes [6].

Implications for practice and/or policy

We found that area-level deprivation was associated with increased risk of preterm birth among commercially insured women and medical factors mediated this association. The implementation of the ACA was associated with a decrease in being uninsured, and an increase in being commercially insured and receiving timely prenatal care among young women [50]. Thus, efforts to repeal the ACA may result in poorer access to prenatal care and poorer management of medical factors that mediate the association between area-level deprivation and preterm birth, particularly among young women who are at increased risk of preterm birth [3].

Our findings also suggest potential avenues for policy solutions, such as implementing policies to reduce area-level deprivation, and enabling insurers, employers, and health care providers to work towards designing care paths to address medical mediating factors, particularly hypertension and infection.


Our study showed that in a national sample of commercially-insured women there was notable variation in the area-level deprivation index. Even among this commercially-insured population, there were area-level socioeconomic differences that were related to the health and well-being of mothers and their infants. To reduce the risk of adverse birth outcomes such as preterm birth, it is important to identify both modifiable risk factors and mediators that are potential targets for intervention. Future research should continue to test for mediating pathways between area-level deprivation and preterm birth to determine if hypertension and infection are consistently found to mediate this association, or if there are other important mediating factors. Additional mediating factors to explore include maternal factors such as stress, and area-level conditions such as crime or material and social resources. Longitudinal study designs will enable researchers to consider causal inferences on the effects of area-level factors on preterm birth and other adverse maternal and child health outcomes.



Affordable Care Act


Body mass index


Comparative fit index


Confidence interval


Area-level deprivation


Diagnosis Related Group

EN Central:

East North Central

ES Central:

East South Central




International Classification of Diseases, Ninth Revision



Mid Atl:

Middle Atlantic

New Engl:

New England


Odds ratio


Preterm birth


Root mean square error of approximation


Rural-urban commuting area

South Atl:

South Atlantic


Standardized root mean square residual

WN Central:

West North Central

WS Central:

West South Central


ZIP Code Tabulation Area


  1. Martin JA, Hamilton BE, Osterman MJ, Curtin SC, Matthews TJ. Births: final data for 2013. Natl Vital Stat Rep. 2015;64(1):1–65.

    PubMed  Google Scholar 

  2. Mathews TJ, MacDorman MF, Thoma ME. Infant mortality statistics from the 2013 period linked birth/infant death data set. Natl Vital Stat Rep. 2015;64(9):1–30.

    Google Scholar 

  3. Institute of Medicine. Preterm birth: causes, consequences, and prevention. Washington, DC: The National Academies Press; 2007.

    Google Scholar 

  4. Kim D, Saada A. The social determinants of infant mortality and birth outcomes in Western developed nations: a cross-country systematic review. Int J Environ Res Public Health. 2013;10(6):2296–335

    Article  Google Scholar 

  5. Messer LC, Laraia BA, Kaufman JS, Eyster J, Holzman C, Culhane J, et al. The development of a standardized neighborhood deprivation index. J Urban Health. 2006;83(6):1041–62

    Article  Google Scholar 

  6. Wentz AE, Messer LC, Nguyen T, Boone-Heinonen J. Small and large size for gestational age and neighborhood deprivation measured within increasing proximity to homes. Health Place. 2014;30c:98–106

    Article  Google Scholar 

  7. Janevic T, Stein CR, Savitz DA, Kaufman JS, Mason SM, Herring AH. Neighborhood deprivation and adverse birth outcomes among diverse ethnic groups. Ann Epidemiol. 2010;20(6):445–51

    Article  CAS  Google Scholar 

  8. Ma XG, Fleischer NL, Liu JH, Hardin JW, Zhao G, Liese AD. Neighborhood deprivation and preterm birth: an application of propensity score matching. Ann Epidemiol. 2015;25(2):120–5

    Article  Google Scholar 

  9. O'Campo P, Burke JG, Culhane J, Elo IT, Eyster J, Holzman C, et al. Neighborhood deprivation and preterm birth among non-Hispanic black and white women in eight geographic areas in the United States. Am J Epidemiol. 2008;167(2):155–63

    Article  Google Scholar 

  10. Elo IT, Culhane JF, Kohler IV, O'Campo P, Burke JG, Messer LC, et al. Neighbourhood deprivation and small-for-gestational-age term births in the United States. Paediatr Perinat Epidemiol. 2009;23(1):87–96

    Article  Google Scholar 

  11. Lydon-Rochelle MT, Holt VL, Cardenas V, Nelson JC, Easterling TR, Gardella C, et al. The reporting of pre-existing maternal medical conditions and complications of pregnancy on birth certificates and in hospital discharge data. Am J Obstet Gynecol. 2005;193(1):125–34

    Article  Google Scholar 

  12. Roohan PJ, Josberger RE, Acar J, Dabir P, Feder HM, Gagliano PJ. Validation of birth certificate data in New York State. J Community Health. 2003;28(5):335–46.

    Article  Google Scholar 

  13. Meng G, Thompson ME, Hall GB. Pathways of neighbourhood-level socio-economic determinants of adverse birth outcomes. Int J Health Geogr. 2013;12:32

    Article  Google Scholar 

  14. Farley TA, Mason K, Rice J, Habel JD, Scribner R, Cohen DA. The relationship between the neighbourhood environment and adverse birth outcomes. Paediatr Perinat Epidemiol. 2006;20(3):188–200

    Article  Google Scholar 

  15. Clayborne ZM, Giesbrecht GF, Bell RC, Tomfohr-Madsen LM. Relations between neighbourhood socioeconomic status and birth outcomes are mediated by maternal weight. Soc Sci Med. 2017;175:143–51

    Article  Google Scholar 

  16. Schempf A, Strobino D, O'Campo P. Neighborhood effects on birthweight: an exploration of psychosocial and behavioral pathways in Baltimore, 1995-1996. Soc Sci Med. 2009;68(1):100–10

    Article  Google Scholar 

  17. Culhane JF, Elo IT. Neighborhood context and reproductive health. Am J Obstet Gynecol. 2005;192(5 Suppl):S22–9

    Article  Google Scholar 

  18. Martin JA, Hamilton BE, Osterman MJK, Driscoll AK, Drake P. Births: final data for 2017. Natl Vital Stat Rep. 2018;67(8):1–50.

    PubMed  Google Scholar 

  19. Health Care Cost Institute. Health Care Cost and Utilization Report: 2011. 2012. Accessed 16 Jan 2019.

  20. Cunningham SD, Herrera C, Udo IE, Kozhimannil KB, Barrette E, Magriples U, et al. Maternal medical complexity: impact on prenatal health care spending among women at low risk for cesarean section. Womens Health Issues. 2017;27(5):551–8

    Article  Google Scholar 

  21. Bureau of the Census. Geographic areas reference manual. 1994. Accessed 27 Jan 2016.

  22. Kramer MS, Seguin L, Lydon J, Goulet L. Socio-economic disparities in pregnancy outcome: why do the poor fare so poorly? Paediatr Perinat Epidemiol. 2000;14(3):194–210.

    Article  CAS  Google Scholar 

  23. Xu X, Gariepy A, Lundsberg LS, Sheth SS, Pettker CM, Krumholz HM, et al. Wide variation found in hospital facility costs for maternity stays involving low-risk childbirth. Health Aff (Millwood). 2015;34(7):1212–9

    Article  Google Scholar 

  24. United States Census Bureau. Geographic terms and concepts - core based statistical areas and related statistical areas. Accessed 6 Jan 2019.

  25. Jacquez GM. Current practices in the spatial analysis of cancer: flies in the ointment. Int J Health Geogr. 2004;3(1):22

    Article  Google Scholar 

  26. Tian N, Goovaerts P, Zhan FB, Wilson JG. Identification of racial disparities in breast cancer mortality: does scale matter? Int J Health Geogr. 2010;9:35

    Article  Google Scholar 

  27. United States Census Bureau. ACS 2011 (5-year estimates). Prepared by Social Explorer. Accessed 27 Jan 2016.

  28. United States Department of Agriculture. Documentation: 2010 Rural-Urban Commuting Area (RUCA) codes. Accessed 12 Feb 2018.

  29. Hall SA, Kaufman JS, Ricketts TC. Defining urban and rural areas in U.S. epidemiologic studies. J Urban Health. 2006;83(2):162–75

    Article  Google Scholar 

  30. Center for Rural Health. Temporary ZIP RUCA 3.10 file access. Accessed 12 Feb 2018.

  31. VanderWeele TJ, Ding P. Sensitivity analysis in observational research: introducing the e-value. Ann Intern Med. 2017;167(4):268–74

    Article  Google Scholar 

  32. Preacher KJ, Zyphur MJ, Zhang Z. A general multilevel SEM framework for assessing multilevel mediation. Psychol Methods. 2010;15(3):209–33

    Article  Google Scholar 

  33. White JS, Hamad R, Li X, Basu S, Ohlsson H, Sundquist J, et al. Long-term effects of neighbourhood deprivation on diabetes risk: quasi-experimental evidence from a refugee dispersal policy in Sweden. Lancet Diabetes Endocrinol. 2016;4(6):517–24

    Article  Google Scholar 

  34. Meijer M, Rohl J, Bloomfield K, Grittner U. Do neighborhoods affect individual mortality? A systematic review and meta-analysis of multilevel studies. Soc Sci Med. 2012;74(8):1204–12

    Article  Google Scholar 

  35. Vos AA, Posthumus AG, Bonsel GJ, Steegers EA, Denktas S. Deprived neighborhoods and adverse perinatal outcome: a systematic review and meta-analysis. Acta Obstet Gynecol Scand. 2014;93(8):727–40

    Article  Google Scholar 

  36. Marmot M. The health gap: the challenge of an unequal world. London: Bloomsbury Publishing; 2015.

    Google Scholar 

  37. Messer LC, Maxson P, Miranda ML. The urban built environment and associations with women's psychosocial health. J Urban Health. 2013;90(5):857–71

    Article  Google Scholar 

  38. Akosa Antwi Y, Ma J, Simon K, Carroll A. Dependent coverage under the ACA and Medicaid coverage for childbirth. N Engl J Med. 2016;374(2):194–6

    Article  Google Scholar 

  39. Daw JR, Sommers BD. Association of the Affordable Care act dependent coverage provision with prenatal care use and birth outcomes. JAMA. 2018;319(6):579–87

    Article  Google Scholar 

  40. Markus AR, Krohe S, Garro N, Gerstein M, Pellegrini C. Examining the association between Medicaid coverage and preterm births using 2010-2013 National Vital Statistics Birth Data. J Child Poverty. 2016:1–16

  41. Schempf AH, Kaufman JS, Messer LC, Mendola P. The neighborhood contribution to black-white perinatal disparities: an example from two North Carolina counties, 1999-2001. Am J Epidemiol. 2011;174(6):744–52

    Article  Google Scholar 

  42. Adler NE, Stewart J. Health disparities across the lifespan: meaning, methods, and mechanisms. Ann N Y Acad Sci. 2010;1186:5–23

    Article  Google Scholar 

  43. Howell EA, Egorova NN, Janevic T, Balbierz A, Zeitlin J, Hebert PL. Severe maternal morbidity among Hispanic women in New York City: investigation of health disparities. Obstet Gynecol. 2017;129(2):285–94

    Article  Google Scholar 

  44. Krieger N, Waterman P, Chen JT, Soobader MJ, Subramanian SV, Carson R. Zip code caveat: bias due to spatiotemporal mismatches between zip codes and US census-defined geographic areas--the public health disparities geocoding project. Am J Public Health. 2002;92(7):1100–2.

    Article  Google Scholar 

  45. Krieger N, Chen JT, Waterman PD, Soobader MJ, Subramanian SV, Carson R. Choosing area based socioeconomic measures to monitor social inequalities in low birth weight and childhood lead poisoning: the public health disparities geocoding project (US). J Epidemiol Community Health. 2003;57(3):186–99.

    Article  CAS  Google Scholar 

  46. US Bureau of the Census. ZIP Code™ Tabulation Areas (ZCTAs™). Accessed 4 Jan 2019.

  47. Andrade SE, Scott PE, Davis RL, Li DK, Getahun D, Cheetham TC, et al. Validity of health plan and birth certificate data for pregnancy research. Pharmacoepidemiol Drug Saf. 2013;22(1):7–15

    Article  Google Scholar 

  48. D'Angelo DV, Le B, O'Neil ME, Williams L, Ahluwalia IB, Harrison LL, et al. Patterns of health insurance coverage around the time of pregnancy among women with live-born infants--pregnancy risk assessment monitoring system, 29 states, 2009. MMWR Surveill Summ. 2015;64(4):1–19.

    PubMed  Google Scholar 

  49. Bell ML, Belanger K. Review of research on residential mobility during pregnancy: consequences for assessment of prenatal environmental exposures. J Expo Sci Environ Epidemiol. 2012;22(5):429–38

    Article  Google Scholar 

  50. Li R, Bauman B, D'Angelo DV, Harrison LL, Warner L, Barfield W, Cox S. Affordable care act-dependent insurance coverage and access to care among young adult women with a recent live birth. Med Care. 2019;57(2):109–14

    PubMed  Google Scholar 

Download references


The authors acknowledge the assistance of the Health Care Cost Institute and its data contributors, Aetna, Humana, and UnitedHealthcare in providing the claims data analysed in this study.


This work was supported by the United States Health Resources & Services Administration (grant number R40MC28308); RM was supported by the National Institutes of Health (grant number 5T32HS017589–10) and the Herman Fellowship, Yale School of Public Health; CH was supported by the National Institutes of Health (grant number 2T32HS022242–06).

Availability of data and materials

The data that support the findings of this study are available from the Health Care Cost Institute but restrictions apply to the availability of these data, which were used under license for the current study. The data are available from the Health Care Cost Institute for a licensing fee per its data use terms and conditions.

Author information

Authors and Affiliations



RM, SDC, and JRI conceived the study. RM conducted the literature search and wrote the first and successive drafts of the manuscript. EB and CH contributed to preparation of the data set. RM, FMS, SDC, UM, KK, and JRI contributed to study design. RM conducted the principal components analysis with input from FMS. FMS modelled and analysed the data with input from RM. All authors contributed to the interpretation of data and revised the manuscript critically for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Renee Mehra.

Ethics declarations

Ethics approval and consent to participate

Data are compliant with the Health Insurance Portability and Accountability Act, therefore this study was exempt from review by Institutional Review Boards.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.



Table 4 International Classification of Disease and Diagnosis Related Group payment codes used to identify maternal health conditions
Table 5 Distribution of commercially-insured women by quartile of area-level deprivation and census division, United States, 2011
Table 6 Multilevel structural equation model parameter estimates for the association between area-level deprivation and preterm birth

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mehra, R., Shebl, F.M., Cunningham, S.D. et al. Area-level deprivation and preterm birth: results from a national, commercially-insured population. BMC Public Health 19, 236 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: