Skip to main content

An uncertainty estimate of the prevalence of stunting in national surveys: the need for better precision



Stunting is determined by using the World Health Organization (WHO) child growth standard which was developed using precise measurements. However, it is unlikely that large scale surveys maintain the same level of rigour and precision when measuring the height of children. The population measure of stunting in children is sensitive to over-dispersion, and the high prevalence of stunting observed in surveys in low and middle-income countries (LMIC) could partly be due to lower measurement precison.


To quantify the incongruence in the dispersion of height-for-age in national surveys of < 5 y children, in relation to the standard WHO Multicenter Growth Reference Study (MGRS), and propose a measure of uncertainty in population measures of stunting.


An uncertainty factor was proposed and measured from the observed incongruence in dispersion of the height-for-age of < 5 y children in the MGRS against carefully matched populations from the Demographic Health Survey of 17 countries (‘test datasets’, based on the availability of data). This also allowed for the determination of uncertainty-corrected prevalence of stunting (height-for-age Z score < − 2) in < 5 y children.


The uncertainty factor was estimated for 17 LMICs. This ranged from 0.9 to 2.1 for Peru and Egypt respectively (reference value 1). As an explicit country example, the dispersion of height-for-age in the Indian National Family Health Survey-4 test dataset was 39% higher than the MGRS study, with an uncertainty factor of 1.39. From this, the uncertainty-adjusted Indian national stunting prevalence estimate reduced to 18.7% from the unadjusted estimate of 36.2%.


This study proposes a robust statistical method to estimate uncertainty in stunting prevalence estimates due to incongruent dispersions of height measured in national surveys for children < 5 years in relation to the WHO height-for-age standard. The uncertainty is partly due to population heterogeneity, but also due to measurement precision, and calls for better quality in these measurements.

Peer Review reports


Stunting is the most frequently used indicator for chronic undernutrition in children and in setting priorities for interventions based on the Sustainable Development Goals [1] or the Global Hunger Index [2]. Though, the prevalence of stunting in some of the Latin American countries like Brazil, Peru and Bolivia has markedly reduced in last three decades [3], it continues to be high in other Low and Middle Income Countries (LMIC), especially in South Asian and sub-Saharan African countries [4]. For example, India reported a national average of 36% in < 5 y children in National Family Health Survey (NFHS-4) [5] and 35% in 0–4 y children in the recent Comprehensive National Nutrition Survey (CNNS) [6]. This measure of chronic underunitrition is related to poor outcomes in health, cognitive development, educational and economic attainment later in life [7].

Stunting is defined as a deficit in height relative to a child’s age, that is, 2 standard deviations (SD) below the median height-for-age derived from the WHO (World Health Organization) child growth standards (hereon referred to as the WHO standard) [8]. This growth standard is based on the WHO Multicentre Growth Reference Study (MGRS) [8], of the anthropometric indices of children living in what were considered to be the ‘best case’ for socioeconomic circumstances and nutritional access [8]. In the MGRS, the very low (~ 3%) between-country variation of age-specific mean height, and a strong similarity in the mean and SD of linear growth from birth to 5 y in different country samples provided sufficient justification for pooling data across countries to obtain a global standard [8].

The population estimate of stunting is sensitive to the dispersion of the height distribution in the population survey to which the standard is applied [9]. Maintaining a desired level of precision in length or height measurement is a major challenge in any large scale survey. Low precision will cause over-dispersion and thus overestimate a dispersion-sensitive measure such as stunting [10]. Simulation exercises showed that even modest, random errors in height measurements can increase the measure of dispersion and the increase in error can exponentially increase the dispersion [10]. It was also clearly demonstrated that this overdisperson could inturn inflate estimates of standard deviation cut-off based undernutrition indicators. In addition to the measurement error induced over-dispersion [10,11,12], population heterogeinity due to biological variability, socioeconomic inequalities and adverse environmental exposure would independently contribute to higher dispersion compared to that of the standard population.

Prompted by the concerns on data quality and its impact on nutrition indicators, standard methods such as the Standardized Monitoring and Assessment of Relief and Transitions (SMART) [13] have been developed for quality assessment of anthropometric measures for large scale surveys, at the time of data collection, reporting and interpretation [10, 11]. However, there is no method that provides a quantitative measure of the potential measurement errors and a measure such as this could further be used re-estimate the prevalence of stunting, after accounting for an uncertainty factor. The aim of the study is to identify an estimate of uncertainty that can be obtained by comparing the dispersion of height-for-age in a sample of healthy children at any given age living in growth-favorable environments in the national survey, with the dispersion of the MGRS. This estimate of uncertainity can be used to correct for the incongruence in dispersion and obtain a rough estimate of the stunting in the absence of measurement error in height. Since the prevalence of stunting is a commonly used population level indicator to monitor and assess the effectiveness of public health and nutrition policy/programs, [14] it is worth quantifying an uncertainty factor to interpret these data and estimate a dispersion corrected prevalence.

In the current paper, we first explored the possibility of overdispersion in height for age measurement in NFHS-4 survey data. Then we proposed a statistical method to quantify overdispersion or uncertainty of the estimate along with a statistical test. A method of dispersion corrected prevalence estimation of stunting was proposed and was applied to DHS and NFHS-4 survey data sets.


The first part of this study explicitly explored a specific national survey to identify over-dispersion in height-for-age as a potential source of uncertainty. For this exercise, the Indian NFHS-4 survey [5] conducted in 2015–16 was used, which provides a nation-wide detailed database on socioeconomic status and anthropometric measurements (237,136 valid height measurements) of < 5 y children (details of survey methods given in Additional file 1). The biologically implausible data was defined as height-for-age z-score (HAZ) > 6 or < − 6 of the sample mean and were cleaned as per the defined norms of WHO [15]. Normality of the HAZ distribution were examined for skewness and kurtosis [16]. The height-for-age data in < 5 y children from NFHS-4 [5] were classified into percentiles under each wealth quintile (computed based on the reported possessions and other household characteristics) and then compared to the WHO standard. Generalized Additive Models for Location, Scale and Shape (GAMLSS) [17] was used to derived smoothed percentiles across the quintiles.

In the second part of this study, the impact of over-dispersion of height-for-age on global country estimates of stunting was examined in the DHS survey data [18] of < 5 y children residing in LMIC (the method of data extraction method from DHS is provided in Online Additional Material 1).

Statistical method for obtaining an uncertainty factor due to measurement error

Let yt be the height-for-age t in a given healthy population with growth favourable environment and yt is subjected to measurement error due to low precision. Let Yt be the true measurement of the height-for-age t, then observed yt can written as

$$ {y}_t={Y}_t+{e}_t;{Y}_t\sim N\left({\mu}_t,{\sigma}_t^2\right)\&{e}_t\sim N\left(0,{\sigma}_e^2\right) $$

where et is random measurement error possibly due to low precision in the measurement and the variance is assumed to be age invariant, as only a trivial difference was observed between that of children below 24 months and children above 24 months. Hence observed height (yt) will have normal distribution with mean μt and variance

$$ {\sigma}_t^2+{\sigma}_e^2={\sigma}_t^2\left(1+\frac{\sigma_e^2}{\sigma_t^2}\right)={\sigma}_t^2{\delta}_t^2. $$

For sake of simplicity, we assume \( {\delta}_t^2 \), the over-dispersion measure (uncertainty factor) to be age invariant [\( {\delta}_t^2={\delta}^2\in \left(0,\infty \right)\Big] \).

As the WHO growth standard is applicable globally, the true mean and variance of height of the healthy population assumed above is assumed to be exactly equal to the WHO standard height for the age t. Hence \( {\mu}_t={\mu}_t^{WHO};{\sigma}_t={\sigma}_t^{WHO} \) and

$$ Z=\frac{y_t-{\mu}_t^{WHO}}{\sigma_t^{WHO}}\sim N\left(0,{\delta}^2\right) $$

which is the Z-score as per WHO standard height-for-age t. Let {Z1, Z2, …, Zn} be the HAZ for a sample of size n drawn from the healthy populaution identified above. An obvious estimate of δ2 would be

$$ {\hat{\delta}}^2=\frac{1}{n}\sum \limits_{i=1}^n{Z}_i^2 $$

The uncertainty factor δ lies between (0, ∞); δ = 1 indicates no over-dispersion; δ > 1 or δ < 1 indicates over-dispersion or under-dispersion against WHO standard height-for-age.

A χ2 statistical test (H0 : δ = 1 vs. H1 : δ ≠ 1), can also be performed in the sample to test for significant over-dispersion or under-dispersion. The details of testing procedures are provided in Additional file 2.

Finally, estimated δ, or the over-dispersion measure, can be used to adjust the Z-score of the survey data prior to deriving stunting as follows

$$ {Z}_j^{RS}=\frac{Z_j}{\hat{\delta}}\ for\ all\ j=1,2,\dots, m\left( survey\ size\right) $$

A child whose \( {z}_j^{RS}<-2;j=1,2,..,m \) would be defined as stunted adjusted for dispersion by the survey tool.

For computing this uncertainity factor for any survey, the requirement is to identify a population that has a overdispersion compared to WHO MGRS, solely due to measurement error of height and not due to other biological variability, socioeconomic inequalities and adverse environmental exposure. A sample from such a population would be considered as a ‘test dataset’.

The extraction of a ‘test dataset’ of healthy children with a favourable growth environment for comparison of dispersion with that of the WHO MGRS

A true ‘test dataset’ should be a sample from a healthy sub-population with a favourable growth environment for the given population. If not available, an attempt to obtain a suitable subsample could be made from the same survey data, identifying a healthy subset that represents the population of healthy children as per the standard definition. As WHO growth standard was developed on a sample of healthy children, the inclusion and exclusion criteria of MGRS [8] data set were used to indentify a subset from NFHS-4 survey that approximate desired healthy sample of children below 5 years. Hence, the test dataset was extracted by matching children by ‘locality’, ‘socio-economic status’, ‘mother’s education’, ‘non-smoking mothers’, ‘exclusive breast feeding for 4 months’, ‘partial breast feeding for 12 months’ with MGRS [8]. This test dataset (n = 3732) consisted of children in the richest wealth quintile of an urban locality, born with normal birthweight, breastfed till 4–23 months of age, and whose mothers were educated to graduate level and above. Outliers for height in this sample, below the 5th and above the 95th percentile of HAZ, as derived by the WHO standard, were excluded.

Several steps were taken to evaluate the validity of the above measure of uncertainty factor (δ). First, the variance in the height-for-age of the NFHS-4 test dataset was compared with other Indian studies [12,13,14]. Reports published between January 2004 and March 2019 (since MGRS completed in December 2003) in PubMed, identified with search terms: ((((Infant) OR Children) AND Height) OR Anthropo*) AND India, yielded 3 studies [19,20,21] on children aged 24–59 months. Single site studies on >5y children, clinical complications, and those reporting on children from middle or lower socio-economic background were excluded. Second, a sensitivity analysis was conducted with different choices of the matching variables on the prevalence of uncertainty adjusted stunting, since the test dataset was selected by matching socio-demographic characteristics of the Indian sub-sample of the MGRS [8]. Third, using the same criteria as described above to extract NFHS-4 test dataset, statistical matching was applied to earlier Indian NFHS-3 data [22], which was conducted between 2005 and 2006, and the estimated uncertainty factor then compared between the two successive Indian NFHS surveys, along with the calculated uncertainty-adjusted stunting prevalence.

Estimation of the uncertainty factor in demographic health survey (DHS) data

In the second part of this analysis, the uncertainty factor was estimated for surveys from the 17 DHS countries that had socio-demographic data available to allow matching with the WHO MGRS data, and had a sufficient sample size for the estimation of the uncertainty factor (details of computation provided in Additional Figure 1). Standard method and measurement protocols were used to collect DHS survey data [23]. The unadjusted stunting prevalence in all the selected countries was estimated using the WHO standard [8]. The estimates of uncertainty-adjusted stunting were then calculated for those countries with sufficient evidence for δ being different from the Null value of 1, ie: \( {\chi}_{0.05.;n}^2<{\chi}_{stat}^2<{\chi}_{0.95.;n}^2 \).

All data analyses were performed using R-version 3.6.1 [24].


Uncertainty factor for the Indian NFHS-4 survey

A consistent over-dispersion of the height-for-age distribution was observed in the NFHS-4 data when compared to the MGRS (Fig. 1a). The height-for-age percentile curves (2.5th, 50th and 97.5th) constructed from NFHS-4 data, classified by wealth quintiles, are shown in comparison to the MGRS, in Fig. 1b. The 2.5th percentile curve (Fig. 1b) remained consistently lower and 97.5th percentile curve remained consistently higher upto ~ 3 years before it started showing little downward trend for NFHS-4 data in comparison to the same curve for the MGRS across age groups and wealth quintiles (Fig. 1b), indicating greater dispersion of NFHS-4 height. However, median shows gradually decreasing trend as age pogressess compared to MGRS.

Fig. 1

Distribution of MGRS and NFHS-4 height-for-age; a Median and 95% CI of height-for-age from MGRS and NFHS-4 (b) Height-for-age centiles of < 5 y children from MGRS and NFHS-4 in different wealth quintiles; NFHS-4 (n = 237,136), richest (n = 32,152), richer (n = 39,607), middle (n = 47,360), poorer (n = 55,916), poorest (n = 62,101). CI: Confidence Interval; MGRS: WHO Multicentre Growth Reference Study [6]; NFHS-4: National Family Health Survey-4 [4]

The estimated value of the \( \hat{\delta} \) (uncertainty factor) was 1.39 (95% CI: 1.36–1.43), which meant that the dispersion in height-for-age in the NFHS-4 test dataset was 39% higher (P < 0.001) than that observed in the MGRS data [8]. Using the uncertainty adjusted Z-score for the WHO standard, the prevalence of stunting could be as low as 18.7%, since the dispersion was much higher than MGRS. This value is substantially lower than the current estimated prevalence of 36.2% (calculated from the raw data of NFHS-4) using the unaltered WHO standard. The estimates of \( \hat{\delta} \) for younger children (< 24 months) was 1.40 (95% CI: 1.36–1.45) and was comparable to that of children aged 24–59 montsh, \( \hat{\delta} \) =1.38 (95% CI: 1.33–1.44).

The dispersion of the height-for-age distribution in the test dataset of NFHS-4 was comparable to other nationally representative datasets but was much higher than that of the MGRS, which defines the WHO standard, as shown in Fig. 2. The sensitivity analysis of the determination of δ, with different choices of matching variables between NFHS-4 data and the Indian sub-sample of the MGRS [8] is reported in Table 1. Maternal education and wealth quintiles were sensitive, but to a small extent. The uncertainty factor varied from 1.39 to 1.43 when maternal education (up to graduate level and above) was replaced with education up to matriculation and above. When replacing the richest quintile with the top two quintiles of wealth, the uncertainty factor increased from from 1.39 to 1.50 (Table 1). These slight changes in uncertainty could result in small changes in the uncertainty adjusted prevalence of stunting.

Fig. 2

Comparison of height-for-age SD in MGRS with NFHS-4 test dataset and other Indian studies; Khadilkar 2010 (n = 1493), [13] Khadilkar 2014 (n = 494), [12] Marwaha 2011 (n = 106,843 children of 3–18 y), [14] NFHS-4 test data (n = 3732), [4] MGRS (n = 8440) [6]; SD: Standard deviation; MGRS: WHO Multicentre Growth Reference Study [6]; NFHS-4: National Family Health Survey-4 [4]

Table 1 Sensitivity analysis of uncertainty factor based on socio-demographic characteristics

As an additional test of validity, a consistent over-dispersion of the height-for-age distribution was observed in the earlier NFHS-3 survey as well, when compared to WHO standard (Additional Figure 2A). Applying the same matching criteria (as performed for the NFHS-4 survey) to the NFHS-3 survey data to obtain a similar test dataset (Additional Figure 2B) of the healthiest children (n = 828), the δ (dispersion based uncertainty factor) was found to be 1.31 (95% CI: 1.24–1.38,), which was not different from the δ value estimated for the NFHS-4 survey (Additional Figure 3). The uncertainty adjusted prevalence of stunting obtained in the NFHS-3 survey was 32.3% (95% CI: 31.9–32.8), compared to 48% (95% CI: 47.6–48.5) when estimated by the unaltered WHO standard.

Uncertainty factor in selected DHS country data

The uncertainty factor δ could be calculated for 17 selected DHS countries, where data were available, and ranged from 0.80 (95% CI: 0.67–1.00) for Nicaragua to 2.11 (95% CI: 1.91–2.37) for Egypt (Fig. 3). The δ was lower than 1 for Latin American countries, with Peru having 13% lower dispersion for height-for-age in their test dataset (δ =0.87, 95% CI: 0.78–0.98). Most of the African and all Asian countries selected from the DHS had δ > 1 indicating over-dispersion. The δ value for India (1.39) was higher than most DHS countries, but lower than that of Egypt which had the highest estimate (δ = 2.1; 95% CI:1.9–2.4). As a result, the uncertainty adjustment lowered the stunting prevalence in Egypt to 3% compared to the 21% unadjusted stunting prevalence estimated with the WHO MGRS standard [8].

Fig. 3

Estimated value of uncertainty factor for variance of height-for-age using DHS data in LMIC countries; DHS: Demographic Health Survey [11]; LMIC: low- and middle-income countries


The debate about stunting as a primary growth problem, rather than an indicator of social deprivation, poverty and environment, and the consequent diversion for attention from the underlying causes has been pointed out before [25, 26]. This has led to debates about the application of global standards to Indian, and by extension, all LMIC children. In some instances, such as in India, these debates have focused on the potential genetic predisposition to short stature, with counterpoints linked to the high variation in height-for-age [25, 26]. In a population living under ideal conditions for child growth, 2.5% of the children would be stunted, where the HAZ would be normally distributed with a mean value of zero and SD of 1 [8]. In large scale surveys [27, 28], it is often observed that the SD of the HAZ distribution is > 1, suggesting that the prevalence of stunting (HAZ < -2) may be > 2.5% even in the healthiest subset of the population due to higher variablility in the anthrometric data in most of the large surveys [29]. This is problematic as the HAZ metric is based on 2.5th percentile position of the height-for-age and distribution in a standard healthy population as defined in the WHO MGRS [8].

As a first step, a quantitative approach to measure and correct for incongruencies in dispersion is required as stunting is a commonly used indicator to evaluate policy and programs and therefore it is worthwhile to dispersion corrected prevalence for more precise estimates of change or progress. The present study makes the first attempt to offer an appropriate, simple, but robust method to examine the uncertainty in the application of the WHO standard [8] in accordance to the variation observed in a relatively healthy population (test dataset of the NFHS-4 survey in this instance), and thereby offers a measure of an over-dispersion based uncertainty factor. The sensitivity analysis (Table 1) performed on the uncertainty factor \( \hat{\delta} \) showed that matching criteria for the test dataset were stringent. Using this estimate of the factor to adjust the stunting prevalence estimates gives values that are, in some cases, far removed from the unadjusted prevalence estimate (for example, Egypt and India). Therefore, it is important to be cautious that the estimation of the uncertainty factor is indicative, but may not result in a ‘true, corrected’ stunting prevalence. There is simply no replacement for better quality and precision in measurements performed in national surveys, which can be achived with available guidance during conduct of large surveys [16].

However, for a survey with optimal measurement error with respect to MGRS study, a deviation of this factor from ‘1’ would indicate true population variation and imply the requirement for population adjustment in dispersion while using standard data. A true higher dispersion could occur due to biological and environmental reasons. These could be related to intergenerational and maternal diet linked epigenetics, parental height, intra-uterine growth restriction, and variation in gestational age at birth [30], or an unsanitary and polluted environment arising from poverty. The sampling method for the MGRS was careful to reduce this variation, by drawing samples from a well-defined affluent population in a single city of the selected country. However, since 85–90% of total genetic variation is found in a collection of individuals within country, multisite sampling within-country is probably a requirement for a true representation of the country population [31]. The study attempted to quantify over-dispersion in the distribution of height-for-age using a ‘best’ subset of the survey itself, and importantly, does not differentiate over-dispersion due to true variability or measurement error.

Thus, in the test dataset of the national surveys studied here, the over-dispersion of height-for-age could be first, due to random errors introduced by poor precision in the measurement of height, recall bias related age-ascertainment errors, terminal digit preferences, incorrect data recording and entry [10, 11]. Second, there is always the possibility that the true biological variance of the theoretically healthiest children could be higher than that of the MGRS. Third, the study design and sampling techniques followed in these large scale national surveys [5, 18] and the MGRS were different [8]. Fourth, secular trends in the variance of height may be operative. However, this was tested in Indian surveys, where a similar over-dispersion in height-for-age was observed in the earlier NFHS-3 (Additional Figure 2), as well as in other research studies in India (Fig. 2); thus it is unlikely that this was responsible for the over-dispersion observed in the later NFHS-4. The overdispersion was considered to be age invariant and this assumption was tested in the NFHS-4 data set, and the uncertainty estimate may vary by age if the measuremenr error varies by age, which appears unlikely.

A recent analysis of height distribution from 179 Demographic and Health Surveys in 64 low and middle-income countries, reported the mean SD of HAZ to be 1.68 (range: 1.65–2.11), indicating the possibility of greater uncertainty in these countries when using the WHO standard [29]. The same analysis showed that if all children were exposed to the same growth limiting factors, the mean height-for-age would decline without an increase in the SD [29]. This may explain the observed lower dispersion but higher prevalence of stunting in sub-saharan African and Latin American countries (Fig. 3).

Measuring the height or length of children is not trivial, defined precision is an important part of survey reporting [10, 16]. The reported precision of the height measurement in the MGRS was excellent, at < 1%, and procedures were adopted to maintain precision throughout the study keeping minimal inter-observer variability [8]. Rigorous training, periodic standardization sessions and assessments, frequent monitoring and regular equipment calibration were among the quality measures of the MGRS [8]. The DHS and the NFHS-4 do not report the precision of the length or height measurements that were made, although the measurements were made using standard instruments [5, 18]. Other cross-sectional studies from India, refered in the present analysis, have reported a precision for height measurement of < 1% at the beginning of the survey, [19,20,21] but it is not clear how precision was maintained throughout the survey period. A comparative analysis of the HAZ distributions for large scale population based surveys such as DHS and Multiple Indicator Cluster Surveys (MICS) showed greater SD of HAZ (1.82 and 1.80 respectively) compared to National Nutrition Surveys (NNS) (SD – 1.36), which followed the standardised SMART method to conduct the survey [11] suggesting the importance of quality assessment in athropometric measures. However, the uncertainty adjustement will not affect a trend analysis of stunting prevalence between NHFS-3 and NFHS-4 as the adjustment factor is very similar, but a comparison of the uncertainty adjusted prevalence estimates between different countries will be preferable given the adjustment factor is different by country. A height distribution analysis using Joint Malnutrition Estimates database (included data from 422 surveys) showed that SDs of HAZ progressively decreased with age, ranging from 1.59 in 6–11 months to 1.28 in 48–59 months, indicating the complexity of measuring length compared to height in children [16]. Higher variation could also be observed if inconsistent measurement techniques were followed; for instance, one versus two leg recumbent length measurement (0.02 to 2 cm difference) have shown a random variation which decreases with increasing age [32]. The diurnal variation in height measurement [26] and errors in maternal recall of birth dates [33] are other important aspects that could lead to imprecise measurements of height. A random error of 2 cm in height measurement can increase the SD of HAZ distribution to 1.17 which may in turn overestimate the prevalence of moderate and severe stunting by 3.5 and 2.2% respectively. The same analysis showed that every 0.1 SD reduction in over-dispersion can reduce the prevalence of stunting by 2% [10]. Further, the random errors associated with accurate age determination also have shown to impact over-dispersion of height-for-age distribution. This is critical when age is approximated using a calendar of local events and mother’s recall ability, in the absence of accurate birth records. An error of 3 months in a 12 month old child can increase the SD of height distribution up to 1.97 and the prevalence of severe stunting can be overestimated by 7.3% [10].

In addition, the purposive sampling and longitudinal study design (0–2 y), followed in the MGRS were not comparable to the cross-sectional sampling design of the DHS. The sampling variation with random sampling in a cross-sectional study design will always be higher than purposive sampling in a longitudinal study design. On the other hand, with a higher replication, the approximation of sampling distribution of any sample statistic is likely to be better. It is possible that having only 6 samples from 6 study centers in the MGRS study [8] may still be insufficient to accurately estimate the true sampling distribution of the sample variance of the population used to derive the growth standard. Effectively, the homogeneity across samples from different countries, or within a country, needs to be established with more replication, under the same conditions. Quantification of uncertainty due measurement error and a novel method of its correction in prevalence of stunting are the most important strengths of the study. A limitation was matching for partial number of inclusion and exclusion criteria (gestational age, history of perinatal morbidity and time of complementary feeding introduction were not available) with the MGRS Indian subsample [6] to extract the DHS test dataset. An additional limitation is the inability to differentiate true population heterogeneity from measurement imprecision in the test dataset.


In conclusion, the uncertainty factor, derived by adjusting for over-dispersion from a test dataset (extracted from a national survey) in comparison to the dispersion in the WHO standard, provides a comprehensive approach that corrects for both precision errors in a large-scale survey and true biological variation of a population. Although, true population heterogeneity could not be differentiated from measurement imprecision in the test dataset, the aim here was to provide a level of uncertainty to be used while interpreting stunting prevalence estimates. A similar over-dispersion could occur with the measurement of weight as well and the present attempt needs to be expanded for other measures of undernutrition such as weight-for-age and weight-for-height. This analysis emphasises the critical need for maintaining precision in anthropometric measurements, especially in large surveys used to derive nutritional status indicators that in turn inform policy.

Availability of data and materials

The datasets analysed during the current study are available in the DHS data repository, [].



Confidence Interval


Comprehensive National Nutrition Survey


Demographic Health Survey


Generalized Additive Models for Location, Scale and Shape


Height-for-age z-score


Low and Middle Income Countries


Multicentre Growth Reference Study


Multiple Indicator Cluster Surveys


National Family Health Survey


National Nutrition Surveys


Standard Deviations


Standardized Monitoring and Assessment of Relief and Transitions


World Health Organization


  1. 1.

    United Nations Development Programme (UNDP). Sustainable development goals, 2016. Available from: Accessed 5 Mar 2019.

    Google Scholar 

  2. 2.

    Wiesmann, D, Biesalski HK, Grebmer K, et al. Methodological Review and Revision of the Global Hunger Index. ZEF Working Paper-139, 2015. Available from: Accessed 8 Mar 2019.

    Google Scholar 

  3. 3.

    World Health Organization (WHO). Global Nutrition Targets 2025. Stunting policy brief: WHO; 2012. Available from: Accessed 14 Feb 2020.

  4. 4.

    World Health Organization (WHO). World health statistics 2018: monitoring health for the SDGs, sustainable development goals. WHO Rep; 2018. Available from: Accessed 6 Mar 2019.

    Google Scholar 

  5. 5.

    International Institute of Population Science (IIPS). National Family Health Survey India (NFHS-4) 2015-2016 Report, 2016. Available from: Accessed 27 Jan 2019.

    Google Scholar 

  6. 6.

    Ministry of Health and Family Welfare (MoHFW), Government of India, UNICEF and Population Council. Comprehensive National Nutrition Survey (CNNS) Report. New Delhi; 2019.

  7. 7.

    Danaei G, Andrews KG, Sudfeld CR, Fink G, McCoy DC, Peet E, Sania A, Fawzi MC, Ezzati M, Fawzi WW. Risk factors for childhood stunting in 137 developing countries: a comparative risk assessment analysis at global, regional, and country levels. PLoS Med. 2016;13(11).

  8. 8.

    WHO Multicentre Growth Reference Study Group. Assessment of differences in linear growth among populations in the WHO multicentre growth reference study. Acta Paediatr. 2006;95:56–65.

    Article  Google Scholar 

  9. 9.

    Perumal N, Bassani DG, Roth DE. Use and misuse of stunting as a measure of child health. J Nutr. 2018;148:311–5.

    Article  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Grellety E, Golden MH. The effect of random error on diagnostic accuracy illustrated with the anthropometric diagnosis of malnutrition. PloS one. 2016;11(12).

  11. 11.

    Corsi DJ, Perkins JM, Subramanian SV. Child anthropometry data quality from demographic and health surveys, multiple Indicator cluster surveys, and National Nutrition Surveys in the west Central Africa region: are we comparing apples and oranges? Glob Health Action. 2017;10(1):1328185.

    Article  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Mejía-Guevara I, Corsi DJ, Perkins JM, et al. Variation in anthropometric status and growth failure in low-and middle-income countries. Pediatrics. 2018;141(3):e20172183.

    Article  PubMed  Google Scholar 

  13. 13.

    Erhardt J, Golden MH, Seaman J, Bilukha O. ENA for SMART 2016. Available from: Accessed 14 Feb 2020.

    Google Scholar 

  14. 14.

    Cumming O, Cairncross S. Can water, sanitation and hygiene help eliminate stunting? Current evidence and policy implications. Matern Child Nutr. 2016;12:91–105.

    Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    World Health Organisation. In: World Health Organisation, editor. Available from: Accessed 14 Febr 2020 WHO Child Growth Standards: Methods and development: Length/heightfor-age, weight-for-age, weight-for-length, weight-for-height and body mass index-for-age. Geneva; 2006.

  16. 16.

    World Health Organization. Recommendations for data collection, analysis and reporting on anthropometric indicators in children under 5 years old. Geneva; 2019. Available from: Accessed 14 Feb 2020.

  17. 17.

    Rigby RA, Stasinopoulos DM. Generalized additive models for location scale and shape. J Royal Stat Soc. 2005;54:507–54.

    Article  Google Scholar 

  18. 18.

    The Demographic and Health Surveys Program DHS overview. Available from: Accessed 1 Apr 2019.

  19. 19.

    Khadilkar VV, Khadilkar AV, Chiplonkar SA. Growth performance of affluent Indian preschool children: a comparison with the new WHO growth standard. Indian J Pediatr. 2010;47:869–72.

    CAS  Article  Google Scholar 

  20. 20.

    Khadilkar A, Ekbote V, Chiplonkar S, et al. Waist circumference percentiles in 2-18 year old Indian children. J Pediatr. 2014;164:1358–62.

    Article  PubMed  Google Scholar 

  21. 21.

    Marwaha RK, Tandon N, Ganie MA, et al. Nationwide reference data for height, weight and body mass index of Indian school children. Natl Med J India. 2011;24:269–77.

    PubMed  Google Scholar 

  22. 22.

    International Institute of Population Science (IIPS). National Family Health Survey India (NFHS-3) 2005-2006 Report, 2006. Available from: Accessed 2 Feb 2019.

    Google Scholar 

  23. 23.

    Demographic and health survey supervisor’s and editor’s manual. MEASURE DHS basic documentation. 2011. Available from: Accessed 2 Feb 2019.

    Google Scholar 

  24. 24.

    R Core Team. R: A language and environment for statistical computing: R Foundation for Statistical Computing; 2018. Available from: URL

  25. 25.

    Subramanian SV, Mejía-Guevara I, Krishna A. Rethinking policy perspectives on childhood stunting: time to formulate a structural and multifactorial strategy. Matern Child Nutr. 2016;12:219–36.

    Article  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Panagariya A. Does India really suffer from worse child malnutrition than sub-Saharan Africa? Econ Polit Wkly. 2013;4:98–111.

    Google Scholar 

  27. 27.

    Mei Z, Grummer-Strawn LM. Standard deviation of anthropometric Z-scores as a data quality assessment tool using the 2006 WHO growth standards: a cross country analysis. Bull World Health Organ. 2007;85:441–8.

    Article  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Assaf S, Kothari MT, Pullum TW. An assessment of the quality of DHS anthropometric data, 2005–2014. ICF International; 2015.

  29. 29.

    Roth DE, Krishna A, Leung M, Shi J, et al. Early childhood linear growth faltering in low-income and middle-income countries as a whole-population condition: analysis of 179 demographic and health surveys from 64 countries (1993–2015). Lancet Glob Health. 2017;5:e1249–57.

    Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Prendergast AJ, Humphrey JH. The stunting syndrome in developing countries. Paediatr Int Child Health. 2014;34:250–65.

    Article  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Jorde LB, Wooding SP. Genetic variation, classification and ‘race’. Nature Genet. 2004;36:S28.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Ismail LC, Puglia FA, Ohuma EO, et al. Precision of recumbent crown-heel length when using an infantometer. BMC Pediatr. 2016;16:186.

    Article  Google Scholar 

  33. 33.

    Siklar Z, Sanli E, Dallar Y, et al. Diurnal variation of height in children. Pediatr Int. 2005;47:645–8.

    Article  PubMed  Google Scholar 

Download references


We acknowledge the contribution of Mr. Satvik Kishore and Mr. Jeswin Baby in computing the dispersion correction factors for the different DHS country data sets.


The contribution of AVIK for this work was supported by the Wellcome Trust/DBT India Alliance through the Margdarshi fellowship [IA/M/14/1/M/501681]. The funding agency had no role in the analysis or interpretation of data.

Author information




The author’s responsibilities were as follows - SG and TT were involved in the data acquisition, analysis, drafting the work and final approval of the version to be published. AVK, SG, TT, NS, SB were involved in the conception and design of the work, interpretation of data, drafting and final approval the manuscript. HPS was involved in revising the manuscript critically and final approval. TT and AVK will be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Tinku Thomas.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

All authors report no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

National Family Health Survey-4 (NFHS-4) methodology; This file provides details of the NFH 4 survey methodology

Additional file 2.

Derivation of test statistic based on Z-score; Describes the derivation of test statistic over or under dispersion

Additional file 3:

Additional Figure 1. Flow diagram of country selection from DHS data to calculate the prevalence of relative stunting; DHS: Demographic Health Survey

Additional file 4:

Additional Figure 2. Distribution of height-for-age in NFHS-3 and MGRS

Additional file 5:

Additional Figure 3. Distribution of height-for-age in NFHS-4 test data set and MGRS

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Ghosh, S., Shivakumar, N., Bandyopadhyay, S. et al. An uncertainty estimate of the prevalence of stunting in national surveys: the need for better precision. BMC Public Health 20, 1634 (2020).

Download citation


  • Demographic health survey (DHS)
  • Height-for-age
  • National Family Health Survey-4 (NFHS-4)
  • WHO multicentre growth reference study (MGRS)
  • Stunting
  • Overdisperion
  • WHO child growth standard