Skip to main content


Springer Nature is making SARS-CoV-2 and COVID-19 research free. View research | View latest news | Sign up for updates

Validation of food store environment secondary data source and the role of neighborhood deprivation in Appalachia, Kentucky

  • 4898 Accesses

  • 31 Citations



Based on the need for better measurement of the retail food environment in rural settings and to examine how deprivation may be unique in rural settings, the aims of this study were: 1) to validate one commercially available data source with direct field observations of food retailers; and 2) to examine the association between modified neighborhood deprivation and the modified retail food environment score (mRFEI).


Secondary data were obtained from a commercial database, InfoUSA in 2011, on all retail food outlets for each census tract. In 2011, direct observation identifying all listed food retailers was conducted in 14 counties in Kentucky. Sensitivity and positive predictive values (PPV) were compared. Neighborhood deprivation index was derived from American Community Survey data. Multinomial regression was used to examine associations between neighborhood deprivation and the mRFEI score (indicator of retailers selling healthy foods such as low-fat foods and fruits and vegetables relative to retailers selling more energy dense foods).


The sensitivity of the commercial database was high for traditional food retailers (grocery stores, supermarkets, convenience stores), with a range of 0.96-1.00, but lower for non-traditional food retailers; dollar stores (0.20) and Farmer’s Markets (0.50). For traditional food outlets, the PPV for smaller non-chain grocery stores was 38%, and large chain supermarkets was 87%. Compared to those with no stores in their neighborhoods, those with a supercenter [OR 0.50 (95% CI 0.27. 0.97)] or convenience store [OR 0.67 (95% CI 0.51, 0.89)] in their neighborhood have lower odds of living in a low deprivation neighborhood relative to a high deprivation neighborhood.


The secondary commercial database used in this study was insufficient to characterize the rural retail food environment. Our findings suggest that neighborhoods with high neighborhood deprivation are associated with having certain store types that may promote less healthy food options.

Peer Review reports


Obesity prevalence differs significantly among U.S. counties, particularly in the rural, Southern Appalachian region of the U.S. [1, 2]. In the 14 counties being studied within this manuscript the range of obesity is between 31% and 37%, compared to the national average of 33% in 2010 [3]. While at the same time, the Appalachian region has been marked by geographic isolation [4] which in turn may influence the health disparities experienced by residents relative to those living in more urban settings [46]. There is some evidence to suggest that those living in isolation from resources experience worse health outcomes such as certain cancers [7, 8], diabetes prevalence and obesity rates [9] relative to those with greater proximity to health care [10], food stores [11], and physical activity resources [12, 13]. What causes these vast differences may be attributed to societal influences, such as the neighborhood food environment and resources, or through effect of social selection [14, 15], such as individuals who lead a healthy lifestyle may choose to locate in neighborhoods with healthy food outlets. In order to disentangle the effects the environment has on individual choice, there has been increased attention on measuring the community and consumer food environment as a determinant of health [16].

There have been several international [1720] and U.S. based [2123] studies examining the validity of secondary data sources examining the retail food environment at the macro level. However, there are few studies examining the validity of data sources currently used to define and measure the rural community food environment [11, 21, 24, 25]. The lack of consistency between methods and data sources suggests that approaches for measuring the macro-level food environment in rural and remote areas may require different techniques relative to studies conducted in urban settings. To date, one study in Chicago found a positive predictive value (PPV) between commercial data sources and direct field observation of 80% [26]. Most recently in rural South Carolina, the positive predictive value (PPV) was 66% [21] between commercial data source and direct field observation. The results from these two studies suggest that commercial data sources may perhaps have greater validity in urban settings relative to rural areas. One potential reason for the difference between rural and urban settings is that in urban settings the rate of store closings is lower than in rural areas, with 1 in 4 stores closing in 2007 in rural areas compared to 1 in 6 in urban settings. Added to this issue is that a population of 3,252 is needed to support a grocery store in 2010, whereas in 2000 the population needed was only 2,843 [27]. In many of these small census tracts the population is not sufficient to support a store and therefore there may be higher rate of store closings which are not captured with a commercial data source.

Parallel to using valid methods to measure the rural community food environment, especially with higher rates of store closings, is the need to spatially measure access to various food outlets in rural areas to understand the deprivation amplification prevalent in rural and disadvantaged areas [28]. Deprivation amplification suggests that individual or household deprivation (for example, low income) is amplified by area level deprivation (for example, lack of affordable nutritious food or facilities for physical activity in the neighborhood) [29]. Although neighborhood deprivation theory is under much debate [10], in terms of the food environment, findings from the UK suggest that those in remote or disadvantaged areas tend to have adequate access to healthy food resources such as supermarkets [28]. Additionally, other studies conducted in Denmark [30], and Australia [31] corroborates findings from the UK. However, most of these studies were conducted in semi-rural or urban environments or in other countries outside the United States [32], whereas in the Appalachia region there are limited food resources overall, which may suggest that neighborhood deprivation is context specific [28].

Several studies have found that in rural areas supermarket availability does not necessarily indicate an abundant resource of healthy, high quality foods [33, 34]. Food environment researchers need to move beyond the assumption that having a supermarket is equivalent to a less deprived neighborhood. This assumption suggests that the presence of supermarkets or healthy food outlets supersedes the effect of fast-food restaurants and less healthy food outlets on health outcomes. Research has recently documented that people with access to supermarket do not necessarily consume more fruits and vegetables or a better body mass index [35, 36]. These findings highlight the need to also understand the role of individual choice in store type and in food selection within stores beyond just proximity or access to certain stores. Yet, prior studies show that proximity to fast-food restaurants is associated with more meals consumed at these locations [35]. To explain neighborhood deprivation of food resources in rural areas what might be more meaningful is to examine the coverage area of all food resources in rural settings [25, 37, 38], which may more accurately depict the overabundance of less healthy food items which nullifies the effect of healthy food outlets [32, 38].

Based on the need for better measurement of the food environment in rural settings and to examine how deprivation may be unique in remote, rural settings, the aims of this study were the following: 1) to validate commercially available data source with direct field observations of several food outlet types and 2) to examine the association between neighborhood deprivation and retail food environment.


Study region

The spatial area under analysis consisted of 14 counties in the Appalachia region with a total population of 345,000 people [39]. The study was reviewed and determined exempt from Internal Review Board, as it was secondary data analyses not involving human subjects.

Census tracts characteristics

Outlets were categorized within their U.S. census tract and a corresponding level of rurality based on the United States Department of Agriculture rural–urban codes ( We conducted analyses in 14 (25% of the 54-county Appalachia region) contiguous counties within the 54-county Kentucky Appalachian Regions (; Owsley, Jackson, Clay, Lee, Estill, Powell, Lincoln, Pulaski, Garrard, Madison, Robertson, Fleming, Montgomery, and Bath. These counties were selected based on location to each other as well as having a diverse sample of counties with different rural codes, a range of 7–9. Descriptive characteristics of counties are shown in Table 1.

Table 1 Descriptive of Census Tracts aggregated at County level in rural Appalachia Kentucky, 2011

Identifying food outlets via commercial database

Food outlet addresses were purchased from InfoUSA database in July 2011. In most studies to date secondary data sources have been either purchased from InfoUSA or Dunn & Bradstreet as a means to gather large sets of addresses [40, 41]. Addresses were categorized based on North American Industry Classification System (NAICS) codes. The categories reflected supercenters (452990), supermarket/grocery stores (Group 445100), convenience stores (446110), gas stations with food marts (447110), fast-casual restaurants (722212), and fast-food restaurants (722213), respectively. Farmers’ markets and produce stands were identified through the health departments’ listing of such vendors. Farmers’ markets were verified through the Kentucky Department of Agriculture. Small grocery stores were categorized based on number of cash registers, less than 5, which has been used as a standard measure for small size stores [16]. Additional criteria for small grocery store was not having a second listing or a known chain within the same county or in another county as used in previous studies [42]. The trained graduate student went into each store to count cash registers as part of the validation efforts described below. Store type was dichotomized has ‘one’ for having any store type and ‘none’ for having zero store type, based on distribution of the data.

Identifying food outlets via ground-truthing

We conducted ground-truthing [43] to verify the presence of each food outlet in the commercial database and to identify any new outlets (Table 2) in summer and fall of 2011. Ground-truthing is defined as a wind shield audit to verify if the store is located in the same address as InfoUSA has provided and if the location is open. One graduate student was trained in ground-truthing methods and conducted 12 trips, averaging 2 trips per week. Training consisted of both the student and PI driving within the communities with the address to verify location and open status of all stores listed. After one county was jointly performed the graduate student conducted all other assessments. The principal investigator of the study verified addresses by randomly selecting counties and conducted ground-truth verification on 25% of the stores. The field work began in September 2011 and ended in November 2011. Facilities were classified as 1) “located and open” (facility was open and found in the database); 2) “closed” (outlet listed in database and located but permanently closed); 3) “not found” (outlet not found during ground-truthing but was listed in database); or 4) “ineligible” (outlet located but not was not within definition of NAICS code assigned) [21]. The original list of stores was obtained from InfoUSA. Outlet name, type, address were recorded for new outlets identified which were not in the Info USA database.

Table 2 Comparison of ground-truth to secondary data source listing among 14 counties in rural Appalachia Kentucky 2011

Neighborhood deprivation

The Neighborhood Deprivation Index (NDI) was calculated using the method described by Messer et al. [44]. The Index is an empirical score of socioeconomic deprivation based on eight census variables collected from American Community Survey 5-year estimates 2005–2009 [39]: percentage of individuals with income in 2009 below poverty level; percentage of families with female headed households with no husband present and children under age 18 y; percentage of households with incomes under $30,000/year; percentage of households with public assistance income; percentage of people age 16 or older in civilian labor force currently unemployed; percentage of males in management; percentage of all persons age 25 or older with less than a high school degree; and, percentage of households with more than one person per room. We fit a principal component analysis (PCA) to obtain the item loadings, which were used to weight each census variable's contribution to the first principal component. The component was then applied for each census tract within the study area. Neighborhood deprivation retained its linear shape after diagnostic testing of the variable addressing normality. The range of values for NDI was −4.07 – 4.34 (see Table 1 and Figure 1).

Figure 1

Neighborhood Deprivation Scores in Appalachia, KY.

Modified retail food environment index

Coverage represents the number of purchasing opportunities within a given neighborhood [25] or the number of food outlets within a census tract. We calculated a modified retail food environment index [24] ( (mRFEI) for each census tract in the Appalachia region. The mRFEI is an indicator of access to retailers that sell healthy foods, like fresh fruits and vegetables. The mRFEI is based on a range from zero (no food retailers that typically sell healthy food) to 100 (only food retailers that sell healthy food).

The mRFEI is constructed for each census tract, by using the following formula:

mRFE = 100 * # Healthy Food Retailers # Healthy Food Retailers + # Less Healthy Food Retailers

The definitions for healthy and less healthy food retailers are based on the Centers for Disease Control and Prevention definition [45]. Healthy food retailers consist of: grocery stores, supermarkets, supercenters, and produce vendors (produce stores and farmers markets). Less healthy food retailers consist of: fast-food restaurants, gas stations with food marts, and convenience stores, and dollar stores. To date the mRFEI is an environmental indicator of food access or the proportion of healthy stores within a defined neighborhood relative to all the stores accessible [46]. It is not however an indicator of availability of healthy food within the store or availability of unhealthy food items.

Due to the high frequency of food shopping conducted at dollar stores among rural residents [47] but the lack of fresh produce options within this type of store [48], dollar stores were included in the denominator. Dollar stores were tested in the numerator and denominator but results did not significantly change and therefore dollar stores were retained in the denominator. The mRFEI was split into 3 categories based on the distribution of the data (category one = 0; category two = 1–27; category three = 28–100). We conducted sensitivity test for various cut-points and retained high, medium, and low categories based on the results.

Validation of food outlets in the commercial data base

To characterize the validity of the commercial food venue address database against the ground-truthing field observations, we conducted a sensitivity analysis [21]. The sensitivity analyses consisted of calculating the fraction of food outlets that were listed and found to be open (i.e., “located and open”/(“located and open” + “found, not listed”)). The positive predictive value (PPV) was calculated as the fraction of all listed food outlets that were “located and open” during the field census (i.e., “located and open”/(“located and open” + “closed but listed in database” + “not found during ground-truthing but listed in database”)). The final categories consisted of 1) located and open; 2) located and closed; 3) not found; 4) not listed but found. Because of structural zeroes, chance-adjusted kappa statistics could not be computed. We calculated an exact binomial confidence interval for each proportion. Fisher’s exact tests were used to evaluate accuracy due to small cell size (Table 3).

Table 3 Validity of secondary data source as compared to ground-truth effort among 14 counties in rural Appalachia Kentucky 2011

The sensitivity percentage can be interpreted as the ability of the InfoUSA data base to accurately capture the food outlets that are listed. A sensitivity of 100% is deemed to be highly sensitive, while 50- 70% is moderate, and less than 50% is low [21]. The PPV can be interpreted as the likelihood that an establishment is open and found. We used cut-points of below 0.30 as poor, 0.31-0.50 as fair, 0.51-0.70 as moderate, from 0.71-.90 as good, and over 0.91 as excellent [18, 49].

Statistical and geospatial analysis

All analyses were conducted using Stata 11.0 version [50]. To test for differences between census tracts within counties on neighborhood deprivation scores, t-tests were used with a Type I error rate of 0.05. The hypothesis originally proposed asked whether neighborhoods with more deprivation would have less healthy stores or a lower mRFEI. To test this hypothesis multinomial logistic regression was used to model the association between neighborhood deprivation and mRFEI. Our secondary hypothesis asked whether neighborhoods with a specific type of store would have more or less deprivation. To test neighborhood deprivation for each store type (super center, super market) logistic regression was used. A measurement error correction factor was added for all models due to the sensitivity and positive predictive value results [51] and to improve retail food environment estimates. The measurement error correction was set at .3 using C + simex commands. We used census tracts with zero for store values as the reference group for models testing the association between neighborhood deprivation and mRFEI. We used no store in census tract relative to having a store for models testing the association between neighborhood deprivation and each store type (e.g. Super Center). Additionally, research thus far has used zero as the referent to look at neighborhood deprivation in food deserts relative to neighborhoods with adequate variability of store types [52].


Figure 1 depicts the spatial distribution of the census tracts within the counties for each level of neighborhood deprivation. For ease of graphical representation various levels of deprivation have been shown. The two extreme levels of deprivation are indicated by light gray and dark gray. Low neighborhood deprivation is depicted by light gray with a range in scores of −4.07 - -1.51 while high neighborhood deprivation is depicted by dark gray with a range in scores of 2.19 – 4.34. Figure 1 graphically indicates there are many census tracts with high deprivation clustered together within counties. Additionally several census tracts with high deprivation are next to census tracts in other counties with high deprivation.

Figure 2 depicts the spatial distribution of the census tracts within the counties for each level of the modified retail food environment index (mRFEI). Each level of the mRFEI is shown by shades of gray and with line or dot mark patterns. The census tracts that are shaded light gray with cross hatch marks indicate no stores or zero. The census tracts with a mRFEI score of 1–27 that have diagonal lines indicate a low ratio of healthy stores relative to all stores within the census tract. The census tracts with a mRFEI score of 28–100 that are dark gray with dots indicate a high ratio of healthy stores relative to all stores within the census tract. Similar to the neighborhood deprivation clustering pattern, those census tracts with no stores tend to cluster within the same county. However, graphically there are different patterns between counties that have a low mRFEI adjacent to census tracts with high mRFEI. While some counties have all census tracts with high mRFEI scores, other counties have one census tract with low mRFEI scores next to census tracts with high mRFEI scores.

Figure 2

Retail Food Environment Index Scores in Appalachia, KY.

Table 1 shows the descriptive statistics for each county in the rural Appalachian area of the variables the are used to create the neighborhood deprivation score. Most of the counties experience high rates of poverty and unemployment. The mean percentage of poverty among all counties is 26.07% with a range of 5.1-69.6%. The mean percentage of unemployment among those 16 years of age and older is 10.38% with a range of 0–52.7%. The mean neighborhood deprivation score for all counties was 0.17 with a range of −4.07 – 4.34. There are also significant differences between census tracts within counties for neighborhood deprivation scores but in fewer counties; 4 of the 14.

Table 2 compares findings from direct observation (ground-truthing) to the secondary commercial database for all stores and for each store type. Of all the stores found in the commercial database, (n = 540), there were a total of 378 open and located (70%), 27 located and closed (5%), and 135 not found (25%). Additionally, there were 15 stores not on the commercial database list but that were located. Of the 540 stores on the original InfoUSA list, the most common type of stores are fast-food and fast-casual restaurants (28%, 151/540), and grocery stores (26%, 140/540). The type of food outlet with the greatest likelihood of being on the original list and located and open was supercenters (100%), followed by fast-food and fast-casual restaurants (94%, 142/151). The type of food outlet with the lowest likelihood of being on the list and located and open was small non-chain grocery stores (38% 53/140). Additionally, the lowest percentage of stores not listed and found was dollar stores (53% 8/15).

Table 3 highlights the validation results of the commercial database versus direct observation (ground-truthing) (% agreement, sensitivity, PPV). Results indicate that the sensitivity of the commercial database was very high for traditional food outlets (grocery stores, supermarkets, convenience stores), with a range of 0.96-1.00. These results indicate that InfoUSA commercial database is highly sensitive for traditional food retailers overall. If a traditional store type was listed and open close to 100% of the time InfoUSA listed this store on their list of addresses. However, the sensitivity for non-traditional food outlets was low compared to traditional food venues with a range of 0.2-0.5. These results indicate the InfoUSA commercial database is not sensitive to non-traditional food venues.

Specifically, the sensitivity result for supermarkets, a traditional food venue, was 0.96. This result indicates that 96% of the time if a store was on the InfoUSA database it was located and open per direct observation. A similar result was found for supercenters; 100% of the time the supercenter was located and open per direct observation relative to the InfoUSA data base. However, the sensitivity was much lower for non-traditional food venues (Dollar stores (0.2) and Farmer’s Markets (0.5)). Dollar stores and Farmer’s Markets were found through the ground-truthing approach but were not listed on the InfoUSA commercial list.

Similar to the sensitivity analyses the PPV was excellent for supercenters with a PPV of 1.0. The PPV for super markets was a bit lower with a PPV of 0.87 indicating the InfoUSA was good compared to direct observation. However, the PPV was much lower for small grocery stores, with a score of 0.38 indicating InfoUSA was a poor measure for assessing if stores are actually open compared to direct observation. There were a low percentage of stores open when they were located through direct observation. Although the store was found, a small percentage of the stores were actually open; only 38%. Lastly, the PPV was high for dollar stores and Farmer’s Markets, at 100% was excellent, as we found stores listed on the commercial database 100% of the time.

Table 4 shows the results of the association between neighborhood deprivation and the mRFEI. There was no association between neighborhood deprivation and the mRFEI. However, when stratified by store type the results indicate that neighborhoods with low deprivation have lower odds of having at least one super center [OR 0.50 (95% CI 0.27. 0.97)] and convenience store [OR 0.67 (95% CI 0.51, 0.89)] compared to those with no store types and higher deprivation. Such that, neighborhoods with low percentages of poverty, unemployment, and other economic indicators have a low probability of having super centers and convenience stores in their neighborhood compared to those with high deprivation Additionally, we did not find that neighborhoods with low deprivation have more grocery stores or super markets.

Table 4 Neighborhood Deprivation and the association with Modified Retail Food Environment Index and Stratified Store Type, Appalachia Kentucky, 2011


Research regarding the role of the macro-level food environment has experienced a surge in publications in recent years [53, 54], with many studies using secondary data sources as a way to classify neighborhoods with regard to access and availability of food stores [41, 55, 56]. Reliance upon secondary data sources has led to substantial measurement error [17, 21, 48]. Our findings provide further evidence to support conducting direct observation or ground-truthing in rural settings to verify the presence of food venues in the retail food environment [21, 48] obtained from commercial data sources. Previous studies assessing the macro-level food environment, such as number and type of food outlets in a neighborhood, may have introduced bias by not conducting validation studies. This may explain why results of such studies examining association and between the retail food environment and neighborhood characteristics have been conflicting [34, 40, 5759].

To date, there are few studies using several approaches to characterize the macro-level food environment, with fewer studies reporting on validation efforts [21, 43] among rural settings. Our results are similar to a previous study conducted in some rural locations in South Carolina [21] such that there were low positive predictive values for non-traditional food outlets, such as dollar stores and pharmacies. Our secondary data source had greater sensitivity which may be due to the separation of grocery stores from supermarkets, geographic difference between the studies, and the high percentage of establishments not being found in the South Carolina study. The South Carolina study found many establishments, yet they were closed. Additionally, the previous study validated locational presence from several secondary data sources, whereas in this study we only validated one secondary data source with direct observation. However, we conducted our analyses in 14 rural counties to specifically address accuracy of food venues in rural areas. As previous research has shown, rural residents shop for food in non-traditional food outlets such as dollar stores, farmer’s markets, and gas stations with marts [47, 60]. Relying solely on one secondary data source to determine location of key establishments would introduce a biased measure of the retail food environment. Future studies should consider employing direct observation for measuring the retail food store environment, especially in rural areas.

We did not find an association between neighborhood deprivation and the retail food environment for census tracts with no retail stores. This is consistent with several studies, both in the U.S. [61, 62] and internationally [32, 63]. However, in our study those neighborhoods with lower deprivation were less likely to have a super center or a convenience store in their neighborhood. This result is consistent with studies conducted in rural settings [11, 28]. The dynamic between deprivation and the retail food environment is complex. Given that neighborhoods with high deprivation generally have less population, and lower income per individual, there is less incentive for chain food outlets to open stores [64]. With less opportunities to purchase food individuals in remote areas and with more deprivation face greater odds of having access to stores in general [11] and especially stores that sell affordable healthy options [24, 34]. Added to this dynamic is the difficulty that individuals in remote areas face with regard to travel time to certain locations [28]. Taken together, there are limited opportunities for economic development in these areas, especially for large supermarkets or grocery stores, which tend to sell the highest percentage of healthy items at the best prices [65]. These findings suggest that the degree of neighborhood deprivation may play a role in access and availability of healthy food options in rural areas [66].

There are several limitations to our study. We do not have data on consumer shopping patterns and behaviors. It is highly likely that residents living in neighborhoods with no stores shop for food in adjacent neighborhoods with retail food stores. The actual food environment individuals are exposed to are adjacent to where they reside [67]. The results did show that several of the census tracts with zero stores are in point of fact adjacent to census tracts with stores (Figure 2). However, in some cases the proportion of stores favored healthy options while in other cases the proportion of stores favored unhealthy options. Suggesting that individuals are able to access food outlets, yet those outlets may or may not have an abundance of healthy items. The mRFEI is a measure of proportion and does provide the context of availability where individuals shop. Therefore a strong limitation to our study is the lack of both consumer food environment measures and macro level measures such as number and type of stores within a neighborhood. Availability of food within the stores may be more relevant in regards to purchasing behaviors and dietary intake [68] which has not been captured in this study. Future research should examine how living in a neighborhood with no retail food outlets influences food purchasing habits and travel patterns over time, while also assessing the consumer food environment within the stores where individuals shop.

Lastly, we only used one source of secondary data and therefore our sensitivity and positive predictive values might have been higher or lower had more secondary data been collected and validated. Previous studies using more than one secondary data source have found lower values overall [21].

Strengths of this study are the rather large effort at conducting ground-truthing across a rural and remote area. Few studies have been able to verify food venue location in a rural remote setting [48]. Additionally, this study has provided further evidence between store type and deprivation in rural areas of the U.S.


This study provides further support for the need to conduct direct observation of retail food stores when characterizing the food store environment, especially in rural areas, due to the low sensitivity and positive predictive values for certain types of food retailers. This study also suggests that in rural areas, neighborhood deprivation is associated with having certain store types which may or may not sell healthy food items. It is suggested that policies and development aimed at improving healthy food access and availability in rural areas is a promising public health strategy for those most in need.

Figure 1 76 census tract neighborhoods within 14 counties in the Appalachia region of Kentucky. The shaded census tracts represent neighborhood deprivation score for that census tract within the county. The various shades of gray represent 4 different categories of neighborhood deprivation. The most extreme ends of the spectrum for neighborhood deprivation are indicated with dark gray on one end and light gray on the other end. Dark gray is low neighborhood deprivation (i.e. low rates of unemployment; low rates of poverty a range of −4.07–−1.51). Light gray is high deprivation (i.e. high rates of unemployment; high rates of poverty a range of 2.19–4.34).

Figure 2 76 census tract neighborhoods within 14 counties in the Appalachia region of Kentucky. The shaded and patterned census tracts represent the modified retail food environment index score within the county. The census tracts that are shaded light gray with cross hatch marks indicate no stores or zero. The census tracts with a mRFEI score of 1–27 that have diagonal lines indicate a low ratio of healthy stores relative to all stores within the census tract. The census tracts with a mRFEI score of 28–100 that are dark gray with dots indicate a high ratio of healthy stores relative to all stores within the census tract.


Funding was provided from the University of Kentucky Research Foundation.


  1. 1.

    Lee SK, Sobal J, Frongillo EA, Olson CM, Wolfe WS: Parity and body weight in the United States: differences by race and size of place of residence. Obes Res. 2005, 13 (7): 1263-1269. 10.1038/oby.2005.150. Epub 2005/08/04

  2. 2.

    Jackson JE, Doescher MP, Jerant AF, Hart LG: A national study of obesity prevalence and trends by type of rural county. The Journal of rural health: official journal of the American Rural Health Association and the National Rural Health Care Association. 2005, 21 (2): 140-148. Epub 2005/04/30

  3. 3.

    Centers for Disease Control and Prevention: Behavioral Risk Factor Surveillance Survey County Prevalence Data. 2010 [2/12/2012]. Available from: CDC/BRFSS/Obesity trends,

  4. 4.

    U.S Department of Health and Human Services Centers for Disease Control and Prevention: Estimated county-level prevalence of diabetes and obesity - United States, 2007. MMWR Morb Mortal Wkly Rep. 2009, 58 (45): 1259-1263. Epub 2009/11/27

  5. 5.

    Barker L, Crespo R, Gerzoff RB, Denham S, Shrewsberry M, Cornelius-Averhart D: Residence in a distressed county in Appalachia as a risk factor for diabetes, Behavioral Risk Factor Surveillance System, 2006–2007. Prev Chronic Dis. 2010, 7 (5): A104-Epub 2010/08/18

  6. 6.

    Hendryx M, Zullig KJ: Higher coronary heart disease and heart attack morbidity in Appalachian coal mining regions. Prev Med. 2009, 49 (5): 355-359. 10.1016/j.ypmed.2009.09.011. Epub 2009/09/19

  7. 7.

    Hutson SP, Dorgan KA, Duvall KL, Garrett LH: Human papillomavirus infection, vaccination, and cervical cancer communication: the protection dilemma faced by women in southern Appalachia. Women Health. 2011, 51 (8): 795-810. 10.1080/03630242.2011.635245. Epub 2011/12/22

  8. 8.

    Blackley D, Behringer B, Zheng S: Cancer mortality rates in appalachia: descriptive epidemiology and an approach to explaining differences in outcomes. J Community Health. 2012, 37 (4): 804-13. 10.1007/s10900-011-9514-z.

  9. 9.

    Pancoska P, Buch S, Cecchetti A, Parmanto B, Vecchio M, Groark S: Family networks of obesity and type 2 diabetes in rural Appalachia. Clin Transl Sci. 2009, 2 (6): 413-21. 10.1111/j.1752-8062.2009.00162.x. Epub 2010/05/07

  10. 10.

    Pearce J, Witten K, Hiscock R, Blakely T: Are socially disadvantaged neighbourhoods deprived of health-related community resources?. Int J Epidemiol. 2007, 36 (2): 348-55. 10.1093/ije/dyl267. Epub 2006/12/22

  11. 11.

    Sharkey JR, Horel S, Han D, Huber JC: Association between neighborhood need and spatial access to food stores and fast food restaurants in neighborhoods of colonias. Int J Health Geogr. 2009, 8: 9-10.1186/1476-072X-8-9. Epub 2009/02/18

  12. 12.

    Popkin BM, Duffey K, Gordon-Larsen P: Environmental influences on food choice, physical activity and energy balance. Physiol Behav. 2005, 86 (5): 603-13. 10.1016/j.physbeh.2005.08.051. Epub 2005/10/26

  13. 13.

    Gordon-Larsen P, Nelson MC, Page P, Popkin BM: Inequality in the built environment underlies key health disparities in physical activity and obesity. Pediatrics. 2006, 117 (2): 417-24. 10.1542/peds.2005-0058. Epub 2006/02/03

  14. 14.

    Jokela M, Kivimaki M, Elovainio M, Viikari J, Raitakari OT, Keltikangas-Jarvinen L: Urban/rural differences in body weight: evidence for social selection and causation hypotheses in Finland. Soc Sci Med. 2009, 68 (5): 867-75. 10.1016/j.socscimed.2008.12.022. Epub 2009/01/17

  15. 15.

    Liu J, Bennett KJ, Harun N, Probst JC: Urban–rural differences in overweight status and physical inactivity among US children aged 10–17 years. The Journal of rural health : official journal of the American Rural Health Association and the National Rural Health Care Association. 2008, 24 (4): 407-15. Epub 2008/11/15

  16. 16.

    Glanz K, Sallis JF, Saelens BE, Frank LD: Healthy nutrition environments: concepts and measures. Am J Health Promot. 2005, 19 (5): 330-3. 10.4278/0890-1171-19.5.330. ii. Epub 2005/05/18

  17. 17.

    Cummins S, Macintyre S: Are secondary data sources on the neighbourhood food environment accurate? Case-study in Glasgow, UK. Prev Med. 2009, 49 (6): 527-8. 10.1016/j.ypmed.2009.10.007. Epub 2009/10/24.

  18. 18.

    Paquet C, Daniel M, Kestens Y, Leger K, Gauvin L: Field validation of listings of food stores and commercial physical activity establishments from secondary data. Int J Behav Nutr Phys Act. 2008, 5: 58-10.1186/1479-5868-5-58. Epub 2008/11/13.

  19. 19.

    Lake AA, Burgoine T, Greenhalgh F, Stamp E, Tyrrell R: The foodscape: classification and field validation of secondary data sources. Health Place. 2010, 16 (4): 666-73. 10.1016/j.healthplace.2010.02.004. Epub 2010/03/09.

  20. 20.

    Svastisalee CM, Holstein BE, Due P: Validation of presence of supermarkets and fast-food outlets in Copenhagen: case study comparison of multiple sources of secondary data. Public Health Nutr. 2012, 1-4. Epub 2012/03/24, in press

  21. 21.

    Liese AD, Colabianchi N, Lamichhane AP, Barnes TL, Hibbert JD, Porter DE: Validation of 3 food outlet databases: completeness and geospatial accuracy in rural and urban food environments. Am J Epidemiol. 2010, 172 (11): 1324-33. 10.1093/aje/kwq292. Epub 2010/10/22

  22. 22.

    Powell LM, Auld MC, Chaloupka FJ, O'Malley PM, Johnston LD: Associations between access to food stores and adolescent body mass index. Am J Prev Med. 2007, 33 (4): S301-7. 10.1016/j.amepre.2007.07.007. Epub 2007/10/27

  23. 23.

    Powell LM, Han E, Zenk SN, Khan T, Quinn CM, Gibbs KP: Field validation of secondary commercial data sources on the retail food outlet environment in the U.S. Health Place. 2011, 17 (5): 1122-31. 10.1016/j.healthplace.2011.05.010. Epub 2011/07/12

  24. 24.

    Jilcott SB, McGuirt JT, Imai S, Evenson KR: Measuring the retail food environment in rural and urban North Carolina counties. Journal of public health management and practice : JPHMP. 2010, 16 (5): 432-40. Epub 2010/08/07

  25. 25.

    Sharkey JR, Johnson CM, Dean WR, Horel SA: Association between proximity to and coverage of traditional fast-food restaurants and non-traditional fast-food outlets and fast-food consumption among rural adults. Int J Health Geogr. 2011, 10 (1): 37-10.1186/1476-072X-10-37. Epub 2011/05/24

  26. 26.

    Bader MD, Purciel M, Yousefzadeh P, Neckerman KM: Disparities in neighborhood food environments: implications of measurement strategies. Econ Geogr. 2010, 86 (4): 409-30. 10.1111/j.1944-8287.2010.01084.x. Epub 2010/12/02

  27. 27.

    Company N: Good Value is the Top Influencer of U.S. Grocery Store Choice, Nielsen Reports. 2007, IL: Nielson Company, Schaumburg

  28. 28.

    Smith DM, Cummins S, Taylor M, Dawson J, Marshall D, Sparks L: Neighbourhood food environment and area deprivation: spatial accessibility to grocery stores selling fresh fruit and vegetables in urban and rural settings. Int J Epidemiol. 2010, 39 (1): 277-84. 10.1093/ije/dyp221. Epub 2009/06/06

  29. 29.

    Macintyre S: Deprivation amplification revisited; or, is it always true that poorer places have poorer access to resources for healthy diets and physical activity?. Int J Behav Nutr Phys Act. 2007, 4: 32-10.1186/1479-5868-4-32. Epub 2007/08/09

  30. 30.

    Svastisalee CM, Nordahl H, Glumer C, Holstein BE, Powell LM, Due P: Supermarket and fast-food outlet exposure in Copenhagen: associations with socio-economic and demographic characteristics. Public Health Nutr. 2011, 14 (9): 1618-26. 10.1017/S1368980011000759. Epub 2011/05/12

  31. 31.

    Winkler E, Turrell G, Patterson C: Does living in a disadvantaged area mean fewer opportunities to purchase fresh fruit and vegetables in the area? Findings from the Brisbane food study. Health Place. 2006, 12 (3): 306-19. 10.1016/j.healthplace.2004.08.013. Epub 2006/03/21

  32. 32.

    Pearce J, Blakely T, Witten K, Bartie P: Neighborhood deprivation and access to fast-food retailing: a national study. Am J Prev Med. 2007, 32 (5): 375-82. 10.1016/j.amepre.2007.01.009. Epub 2007/05/05

  33. 33.

    Jilcott SB, Liu H, Moore JB, Bethel JW, Wilson J, Ammerman AS: Commute times, food retail gaps, and body mass index in North Carolina counties. Prev Chronic Dis. 2010, 7 (5): A107-Epub 2010/08/18

  34. 34.

    Liese AD, Weis KE, Pluto D, Smith E, Lawson A: Food store types, availability, and cost of foods in a rural environment. J Am Diet Assoc. 2007, 107 (11): 1916-23. 10.1016/j.jada.2007.08.012. Epub 2007/10/30

  35. 35.

    Boone-Heinonen J, Gordon-Larsen P, Kiefe CI, Shikany JM, Lewis CE, Popkin BM: Fast Food Restaurants and Food Stores: Longitudinal Associations With Diet in Young to Middle-aged Adults: The CARDIA Study. Arch Intern Med. 2011, 171 (13): 1162-70. 10.1001/archinternmed.2011.283. Epub 2011/07/13

  36. 36.

    Block JP, Christakis NA, O'Malley AJ, Subramanian SV: Proximity to food establishments and body mass index in the Framingham Heart Study offspring cohort over 30 years. Am J Epidemiol. 2011, 174 (10): 1108-14. 10.1093/aje/kwr244. Epub 2011/10/04

  37. 37.

    Sharkey JR, Horel S, Han D, Huber JC: Association between neighborhood need and spatial access to food stores and fast food restaurants in neighborhoods of colonias. Int J Health Geogr. 2009, 8: 9-10.1186/1476-072X-8-9. Epub 2009/02/18

  38. 38.

    Sharkey JR, Johnson CM, Dean WR, Horel SA: Focusing on fast food restaurants alone underestimates the relationship between neighborhood deprivation and exposure to fast food in a large rural area. Nutr J. 2011, 10: 10-10.1186/1475-2891-10-10. Epub 2011/01/27

  39. 39.

    United States Department of Commerce: United States Census Bureau. 2010, [cited 2012 January 2012]

  40. 40.

    Morland K, Diez Roux AV, Wing S: Supermarkets, other food stores, and obesity: the atherosclerosis risk in communities study. Am J Prev Med. 2006, 30 (4): 333-9. 10.1016/j.amepre.2005.11.003. Epub 2006/03/15

  41. 41.

    Zenk SN, Schulz AJ, Israel BA, James SA, Bao S, Wilson ML: Neighborhood racial composition, neighborhood poverty, and the spatial accessibility of supermarkets in metropolitan Detroit. Am J Public Health. 2005, 95 (4): 660-7. 10.2105/AJPH.2004.042150. Epub 2005/03/31

  42. 42.

    Zenk SN, Schulz AJ, Israel BA, James SA, Bao S, Wilson ML: Fruit and vegetable access differs by community racial composition and socioeconomic position in Detroit, Michigan. Ethn Dis. 2006, 16 (1): 275-80. Epub 2006/04/08

  43. 43.

    Sharkey JR, Horel S: Neighborhood socioeconomic deprivation and minority composition are associated with better potential spatial access to the ground-truthed food environment in a large rural area. J Nutr. 2008, 138 (3): 620-7. Epub 2008/02/22

  44. 44.

    Messer LC, Laraia BA, Kaufman JS, Eyster J, Holzman C, Culhane J: The development of a standardized neighborhood deprivation index. J Urban Health. 2006, 83 (6): 1041-62. 10.1007/s11524-006-9094-x. Epub 2006/10/13

  45. 45.

    Centers for Disease Control and Prevention: Children's Food Environment State Indicator Report 2011. 2011, Washington D.C

  46. 46.

    Jilcott SB, Keyserling T, Crawford T, McGuirt JT, Ammerman AS: Examining associations among obesity and per capita farmers' markets, grocery stores/supermarkets, and supercenters in US counties. J Am Diet Assoc. 2011, 111 (4): 567-72. 10.1016/j.jada.2011.01.010. Epub 2011/03/30

  47. 47.

    Gustafson AA, Sharkey J, Samuel-Hodge CD, Jones-Smith J, Folds MC, Cai J: Perceived and objective measures of the food store environment and the association with weight and diet among low-income women in North Carolina. Public Health Nutr. 2011, 14 (6): 1032-8. 10.1017/S1368980011000115. Epub 2011/02/18

  48. 48.

    Sharkey JR: Measuring potential access to food stores and food-service places in rural areas in the U.S. Am J Prev Med. 2009, 36 (4): S151-5. 10.1016/j.amepre.2009.01.004. Epub 2009/04/16

  49. 49.

    Janse AJ, Gemke RJ, Uiterwaal CS, van der Tweel I, Kimpen JL, Sinnema G: Quality of life: patients and doctors don't always agree: a meta-analysis. J Clin Epidemiol. 2004, 57 (7): 653-61. 10.1016/j.jclinepi.2003.11.013. Epub 2004/09/11

  50. 50.

    Stata 11.0: Stata. 2009, College Station, TX

  51. 51.

    Spiegelman D, Schneeweiss S, McDermott A: Measurement error correction for logistic regression models with an "alloyed gold standard". Am J Epidemiol. 1997, 145 (2): 184-96. 10.1093/oxfordjournals.aje.a009089. Epub 1997/01/15

  52. 52.

    Spence JC, Cutumisu N, Edwards J, Raine KD, Smoyer-Tomic K: Relation between local food environments and obesity among adults. BMC Public Health. 2009, 9: 192-10.1186/1471-2458-9-192. Epub 2009/06/23

  53. 53.

    McKinnon RA, Reedy J, Morrissette MA, Lytle LA, Yaroch AL: Measures of the food environment: a compilation of the literature, 1990–2007. American journal of preventive medicine. Am J Prev Med. 2009, 36 (4): S124-33. 10.1016/j.amepre.2009.01.012. Epub 2009/04/16

  54. 54.

    Beaulac J, Kristjansson E, Cummins S: A systematic review of food deserts, 1966–2007. Prev Chronic Dis. 2009, 6 (3): A105-Epub 2009/06/17

  55. 55.

    Morland K, Filomena S, Morland K, Filomena S: Disparities in the availability of fruits and vegetables between racially segregated urban neighbourhoods. Public Health Nutr. 2007, 10 (12): 1481-9. Epub 2007/06/22

  56. 56.

    Lee RE, Heinrich KM, Medina AV, Regan GR, Reese-Smith JY, Jokura Y: A picture of the healthful food environment in two diverse urban cities. Environ Health Insights. 2010, 4: 49-60. Epub 2010/08/14

  57. 57.

    Zenk SN, Schulz AJ, Israel BA, James SA, Bao S, Wilson ML, Zenk SN, Schulz AJ, Israel BA, James SA, Bao S, Wilson ML: Fruit and vegetable access differs by community racial composition and socioeconomic position in Detroit, Michigan. Ethn Dis. 2006, 16 (1): 275-80. Epub 2006/04/08

  58. 58.

    Cummins S, Macintyre S: Food environments and obesity--neighbourhood or nation?. Int J Epidemiol. 2006, 35 (1): 100-4. Epub 2005/12/13

  59. 59.

    Dean WR, Sharkey JR: Rural and Urban Differences in the Associations between Characteristics of the Community Food Environment and Fruit and Vegetable Intake. J Nutr Educ Behav. 2011, Epub 2011/05/28

  60. 60.

    Dean WR, Sharkey JR: Rural and Urban Differences in the Associations between Characteristics of the Community Food Environment and Fruit and Vegetable Intake. J Nutr Educ Behav. 2011, 43 (6): 426-33. 10.1016/j.jneb.2010.07.001.

  61. 61.

    Ford PB, Dzewaltowski DA: Neighborhood Deprivation, Supermarket Availability, and BMI in Low-Income Women: A Multilevel Analysis. J Community Health. 2011, 36 (5): 785-96. 10.1007/s10900-011-9377-3.

  62. 62.

    Harris DE, Blum JW, Bampton M, O'Brien LM, Beaudoin CM, Polacsek M: Location of food stores near schools does not predict the weight status of maine high school students. J Nutr Educ Behav. 2011, 43 (4): 274-8. 10.1016/j.jneb.2010.08.008. Epub 2011/06/21

  63. 63.

    Cummins S, Stafford M, Macintyre S, Marmot M, Ellaway A: Neighbourhood environment and its association with self rated health: evidence from Scotland and England. J Epidemiol Community Health. 2005, 59 (3): 207-13. 10.1136/jech.2003.016147. Epub 2005/02/15

  64. 64.

    Leibtag E: Where You Shop Matters: Store Formats Drive Variation in Retail Food Prices. 2005,  -[cited 2008 8/10/2008];]

  65. 65.

    Courtemanche C, Carden A: Supersizing Supercenters? The Impact of Wal-Mart Supercenters on Body Mass Index and Obesity. 2010, SSRN eLibrary

  66. 66.

    Walker RE, Keane CR, Burke JG: Disparities and access to healthy food in the United States: A review of food deserts literature. Health Place. 2010, 16 (5): 876-84. 10.1016/j.healthplace.2010.04.013. Epub 2010/05/14

  67. 67.

    Morland KB, Evenson KR: Obesity prevalence and the local food environment. Health Place. 2009, 15 (2): 491-5. 10.1016/j.healthplace.2008.09.004. Epub 2008/11/22

  68. 68.

    Gustafson A, Hankins S, Jilcott S: Measures of the Consumer Food Store Environment: A Systematic Review of the Evidence 2000-2011. J Community Health. 2012, 37 (4): 897-911. 10.1007/s10900-011-9524-x.

Pre-publication history

  1. The pre-publication history for this paper can be accessed here:

Download references


The authors would like to thank those who contributed to the conceptual design and data collection efforts.

Author information

Correspondence to Alison A Gustafson.

Additional information

Competing interests

The authors have no conflict of interest to declare.

Authors’ contributions

SL assisted with data collection. CW assisted with development of the maps. SJ-P assisted with writing and revising of the manuscript. AG conducted data analysis, interpretation of the data, writing, revising, and decision of publication. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Gustafson, A.A., Lewis, S., Wilson, C. et al. Validation of food store environment secondary data source and the role of neighborhood deprivation in Appalachia, Kentucky. BMC Public Health 12, 688 (2012).

Download citation


  • Positive Predictive Value
  • Census Tract
  • Food Environment
  • Food Outlet
  • Store Type