Bayesian modeling of quantiles of body mass index among under-five children in Ethiopia

Mekuriaw, Daniel M.; Mitku, Aweke A.; Zeru, Melkamu A.

doi:10.1186/s12889-024-18602-x

Research
Open access
Published: 24 April 2024

Bayesian modeling of quantiles of body mass index among under-five children in Ethiopia

Daniel M. Mekuriaw¹,
Aweke A. Mitku^1,2 &
Melkamu A. Zeru¹

BMC Public Health volume 24, Article number: 1144 (2024) Cite this article

227 Accesses
1 Altmetric
Metrics details

Abstract

Background

Body Mass Index (BMI) is a measurement of nutritional status, which is a vital pre-condition for good health. The prevalence of childhood malnutrition and the potential long-term health risks associated with obesity in Ethiopia have recently increased globally. The main objective of this study was to investigate the factors associated with the quantiles of under-five children’s BMI in Ethiopia.

Methods

Data on 5,323 children, aged between 0-59 months from March 21, 2019, to June 28, 2019, were obtained from the Ethiopian Mini Demographic Health Survey (EMDHS, 2019), based on the standards set by the World Health Organization. The study used a Bayesian quantile regression model to investigate the association of factors with the quantiles of under-five children’s body mass index. Markov Chain Monte Carlo (MCMC) with Gibbs sampling was used to estimate the country-specific marginal posterior distribution estimates of model parameters, using the Brq R package.

Results

Out of a total of 5323 children included in this study, 5.09% were underweight (less than 12.92 BMI), 10.05% were overweight (BMI: 17.06 – 18.27), and 5.02% were obese (greater than or equal to 18.27 BMI) children’s. The result of the Bayesian quantile regression model, including marginal posterior credible intervals (CIs), showed that for the prediction of the 0.05 quantile of BMI, the current age of children [$\upbeta$= -0.007, 95% CI :(-0.01, -0.004)], the region Afar [$\upbeta$ = - 0.32, 95% CI: (-0.57, -0.08)] and Somalia[$\upbeta$ = -0.72, 95% CI: (-0.96, -0.49)] were negatively associated with body mass index while maternal age [$\upbeta$ = 0.01, 95% CI: (0.005, 0.02)], mothers primary education [$\upbeta$= 0.19, 95% CI: (0.08, 0.29)], secondary and above [$\upbeta$ = 0.44, 95% CI: (0.29, 0.58)], and family follows protestant [$\upbeta$ = 0.22, 95% CI: (0.07, 0.37)] were positively associated with body mass index. In the prediction of the 0.95 (or 0.85?) quantile of BMI, in the upper quantile, still breastfeeding [$\upbeta$ = -0.25, 95% CI: (-0.41, -0.10)], being female [$\upbeta$ = -0.13, 95% CI: (-0.23, -0.03)] were negatively related while wealth index [$\upbeta$ = 0.436, 95% CI: (0.25, 0.62)] was positively associated with under-five children’s BMI.

Conclusions

In conclusion, the research findings indicate that the percentage of lower and higher BMI for under-five children in Ethiopia is high. Factors such as the current age of children, sex of children, maternal age, religion of the family, region and wealth index were found to have a significant impact on the BMI of under-five children both at lower and upper quantile levels. Thus, these findings highlight the need for administrators and policymakers to devise and implement strategies aimed at enhancing the normal or healthy weight status among under-five children in Ethiopia.

Peer Review reports

Background

Health is a positive, multifaceted concept that can encompass a multitude of elements, including capability, judgment, enjoyment, and well-being. The Body Mass Index (BMI) is a metric used to assess nutritional status. Additionally, BMI is used to evaluate a person's weight status in both adults and children. However, while BMI cut points for obesity and overweight are the same for both sexes and age groups in adults, they alter for growing children based on their age and gender [1].

The BMI of a person can be used as a screening tool to determine whether or not they are obese, overweight, underweight, or at a healthy weight for their height. The BMI is a weight measurement that takes height into consideration. It is calculated by dividing weight in kilograms by height in meters squared (kg/m²) [2, 3]. Obese, overweight, and normal (healthy weight) were defined as children's BMIs for under-five children that were at or above the 95^th percentile, between the 85^th and 95^th percentile, and between the 5^th and 85^th percentile, respectively [4]. For children, BMI is dependent on age and sex and is often referred to as BMI-for-age. A person's risk of disease or death may rise dramatically if their BMI is higher than the acceptable limit [5]. Both being underweight and having a large amount of body fat increase the risk of developing disorders linked to weight and other health problems in adults and children [6,7,8]. BMI is significantly associated with relative fatness in childhood and adolescence and is the most convenient way of measuring relative adiposity [9].

Particularly in Ethiopia, a nation with a low income where childhood malnutrition is still a major problem, pediatric obesity (BMI above the 95^th percentile) is not yet seen as a serious health concern and is given little attention. The prevalence of overweight (BMI between the 85^th and 95^th percentile) children in Ethiopia has increased overall, from 1.7 to 3.6%, according to the United Nations Children's Fund (UNICEF 2017) annual report [10]. Despite the high prevalence of childhood malnutrition in Ethiopia, there is limited understanding of the factors influencing the distribution of body mass index faced by specific groups of under-five children [11].

Being overweight and/or obese during puberty increases the risk of contracting non-communicable diseases and contributes to overweight, obesity, cardiovascular disease, metabolic and other diseases in adulthood. Therefore, primary prevention requires information about the lower, and upper-level, classification and underlying factors of BMI in developing countries. Consequently, new insights into the data sets can be obtained by applying quantile regression as an alternative to the conventional techniques of linear or logistic regression models [12, 13]. However, the interest lies in the lower and upper spectrum of BMI, these regression models are based on mean BMI. Quantile regression, a natural extension of classical mean regression is a method that is used to model a relationship between the quantile of variable response and one or more variable predictors [14].

Quintile regression seems to provide a better fit than traditional generalized linear models (GLMs) for estimating risk factors based on BMI data. Quantile regression is recommended in situations where the data are heterogeneous, meaning that the centers and tails of the conditional distributions fluctuate differentially with the covariates [15]. Quantile regression offers a thorough understanding of the interactions between independent and dependent variables (i.e., not just in the center but also in the tails of the dependent variable's conditional distribution) [16].

Quantile regression models (QRM) the impact of predictors on different specific quantiles (or percentiles) of the response distribution, and thus provide a more comprehensive picture of the effect of predictor variables on the spectrum of the response variable [17, 18]. An additional advantage of the quantile regression approach is that its parameter estimates are not affected by changes in the conditional distribution of the dependent variable, which is the BMI of the children, on a location-scale [19]. In the health sciences, quantile regression has become popular concerning studies of BMI [12, 20, 21].

Bayesian methods provide parameter estimates with good statistical properties, parsimonious descriptions of observed data, predictions for missing data and forecasts of future data, and a computational framework for model estimation, selection, and validation [22]. Bayesian techniques use prior distribution to describe sample data and population characteristics. The posterior distribution can be obtained by combining sample data with the prior distribution on the model parameters. In order to estimate a quantile regression parameter using the Bayesian technique, one must ascertain the posterior distribution, which is proportional to the sum of the likelihood function and the prior distribution The computation of posterior distribution can be difficult and time-consuming to calculate analytically if more parameters are to be estimated. Therefore, estimating parameters has been used as a computational method.

Since the mean regression only provides for the description of the distribution's mean response, BMI employing the Bayesian technique quantile regression is more pertinent due to its flexibility in estimating conditional quantiles of interest of a given distribution. In order to model big data sets, we employed quantile regression techniques and an estimation of Bayesian methodologies for this work [23, 24]. So far, there have not been many detailed studies conducted to explore all aspects of BMI in Ethiopia using a quantile regression model rather they only focused on fixed effects. The current study adopted a Bayesian quantile regression model to analyze the BMI of under-five children by including the regional variation.

Data and methods

The section emphasizes the study population, data sources, data analysis approaches, and proposed quantile estimation approach.

Data and sampling procedure

The data was secondary data obtained from the Ethiopia Mini Demographic and Health Survey (mini EDHS) (2019). The 2019 mini EDHS) was implemented by the Ethiopian Public Health Institute, in partnership with the Central Statistical Agency and the Federal Ministry of Health, under the overall guidance of the Technical Working Group. Data collection took place from March 21, 2019, to June 28, 2019. The data are openly available from https://dhsprogram.com and can be accessed following the protocols. To incorporate the geographical covariates, most of the data usually includes global positioning system coordinates [25].

The Ethiopian Demographic and Health Survey used a two-stage stratified cluster sampling technique selected from a population and housing census frame for the 2019 mini EDHS. In the first stage, a total of 305 Enumeration Area EAs (93 in urban areas and 212 in rural areas) were selected with probability proportional to EA size and with independent selection in each sampling stratum. In the second stage of selection, a fixed number of 30 households per cluster were selected with an equal probability of systematic selection from the newly created household listing. A total of 9,150 households were selected for the sample, of which 8,794 were occupied. Of the occupied households, 8,663(99% response rate) were successfully interviewed. The women were interviewed by distributing questionnaires and information on their birth history and 5,323 under-five children were considered for this study [26].

Variables

Variables considered in the study were based on some previous studies and those that are expected to be factors or determinants of under-five age of children BMI. We have considered under-five children’s BMI as the response variable. BMI (in a standardized form) was used as a continuous variable and computed as:

$${\mathrm{child^{\prime}}{\text{s}}}_{{\text{BMI}}}=\frac{\mathrm{child^{\prime}}\mathrm{s\;weight }\;(\mathrm{in\;kilogram}) }{{(\mathrm{child^{\prime}}\mathrm{s\;height }\;(\mathrm{in\;meters}))}^{2}}$$

The covariates were the variables that are expected to affect the response variable. From many kinds of literature, the following are those that affect the BMI of under-five children (Table 1).

Table 1 Description of the independent variable

Full size table

Statistical methods

Quantile regression

Quantile regression is a regression method that models a relationship between the quantile of variable response and one or more variable predictors. Quantile regression is robust to outliers and can model data with a heteroscedasticity effect because it offers the opportunity for a more complete view of the response variable and the relationships among predictor variables. The QRM estimates the potential differential effect of a covariate on various quantiles in the conditional distribution, therefore, we are interested in estimating quantiles of the response distribution as a function of potential Predictor variables. When the conditional densities of the response are heterogeneous, it is natural to consider whether weighted quantile regression might lead to efficiency improvements [14, 17, 18]. An alternative method for dealing with outliers is quantile regression. Quantile is defined as a particular location of some distribution, where τ^th quantile is the value of y when ${P}_{r}\left({\text{Y}}\le {\text{y}}\right)=\uptau$ where τ has a value between 0 and 1.

A useful property of the conditional quantile function is its invariance to any monotone transformation of the response variable that is for any monotone function h(.), We have ${Q}_{{\text{h}}}(\mathrm{ Y })|{\text{X}}(\uptau ) =\mathrm{ h}({Q}_{{\text{Y}}} |{\text{X}}(\uptau ))$.

The quantile regression model is described by the conditional τ ^th quantiles of the response Y for given values of predictors ${x}_{1},{x}_{2},\dots ,{x}_{k}$. The linear quantile regression model for a set of covariates,

X, is given by

$${{\varvec{Y}} ={{\varvec{X}}^{\prime}}_{{\varvec{i}}}{\varvec{\beta}}(\tau ) + {u}_{i}}$$

(1)

where ${{\varvec{X}}}_{{\varvec{i}}}$ is a set of covariates, the ${u}_{i}$ , is a vector of independent errors which are independent and satisfy $P({u}_{i}<0|{{\varvec{X}}}_{{\varvec{i}}})=\tau$. It is a natural extension of the traditional mean model.

$${\text{Qy}}\left(\uptau |{x}_{1},{x}_{2},\dots ,{x}_{k}\right)={{\varvec{\upbeta}}\left(\uptau \right)}_{0}+{{\varvec{\upbeta}}\left(\uptau \right)}_{1}{{\text{x}}}_{1}+... +{{\varvec{\upbeta}} (\uptau )}_{{\varvec{k}}}{{\text{x}}}_{k}, 0<\tau < 1$$

(2)

where ${{\varvec{\upbeta}}({\varvec{\uptau}}) = (\upbeta \left(\uptau \right)}_{0},{\upbeta \left(\uptau \right)}_{1},... ,{\upbeta (\uptau )}_{k}$) is the unknown parameter vector.

Equation (2) gives the changes in the conditional quantiles. Because any τ ^th quantile can be used, any predetermined situation of the distribution can be modeled [27]. This is useful to obtain a more complete understanding of how the outcome distribution can be affected by the predictors.

Bayesian quantile regression

Bayesian quantile regression is a regression method that models a relationship between the quantile of variable response and one or more variable predictors with parameter estimation used in the Bayesian method. A Bayesian quantile regression model with “k” independent variables is:

$$y={\upbeta (\uptau ) }_{0}+{\upbeta (\uptau ) }_{1}{x}_{1}+\dots + {\upbeta \left(\uptau \right)}_{k}{x}_{k}+ \varepsilon$$

(3)

where y is a response variable, ${x}_{k}$ is a k^th predictor variable, ${\upbeta \left(\uptau \right)}_{k}$ is a k^th regression parameter for τ^th quantile, and $\varepsilon \sim \mathrm{Asymmetric\ Laplace\ Distribution }\left({\text{ALD}}\right)\uptau$ is the error term for Bayesian quantile regression. Bayesian quantile regression parameters can be estimated with sample data. Suppose that p>k observations are available and let y_i denote the i^th observed response, and x_ij denote i^th observation or level regressor of x_j. Actually, n is a more standard notation for the sample size (number of observations), instead of p.

For the linear quantile regression, no specific assumptions regarding the error term are made except that given a fixed and known quantile τ ∈ (0, 1), it is assumed that the τ ^th quantile of the error term is zero, i.e. F⁻¹(τ |$\pi$) = 0 and that ε_i and ε_j are independent for i ≠ j. With these assumptions, the quantile-specific regression coefficients β_(τ) are estimated by minimizing an asymmetrically weighted sum of absolute deviations.

$$\widehat{{\varvec{\upbeta}}}(\uptau ) = {\vphantom{\sum}}_{\beta (\tau )}^{min}{\sum }_{i=1}^{p}\rho ({{\text{y}}}_{{\text{i}}}- {\mathbf{x}\mathrm{^{\prime}}}_{{\text{i}}}{\varvec{\upbeta}}(\uptau ))$$

(4)

$$\widehat{{\varvec{\upbeta}}}\left(\uptau \right)= {\vphantom{\sum}}_{\beta (\tau )}^{min}\left\{\uptau {\Sigma }_{{\text{i}}:{\text{yi}}\ge \mathrm{x{\prime}}{\text{i}}}\left|{{\text{y}}}_{{\text{i}}}-{\mathbf{x}\mathrm{^{\prime}}}_{{\text{i}}}{\varvec{\upbeta}}(\uptau )\right|+\left(1-\uptau \right){\Sigma }_{{\text{i}}:{\text{yi}}\ge \mathrm{x{\prime}}{\text{i}}}\left|{{\text{y}}}_{{\text{i}}}-{\mathbf{x}\mathrm{^{\prime}}}_{{\text{i}}}{\varvec{\upbeta}}(\uptau )\right|\right\}$$

(5)

where $\rho$(w) is a loss function defined by:

$$\begin{array}{c}\rho ({\text{w}})=\{\uptau -{\text{I}}({\text{w}}<0)\}{\text{w}}\\ \rho ({\text{w}})={\{}_{\mathrm{w\tau },{\text{w}}\ge 0}^{{\text{w}}\left(\uptau -1\right),{\text{w}}<0}\end{array}$$

where I(w < 0) is the indicator function of w. However, the check function in Eqs. (4) and (5) is not differentiable at zero when y_i = x′_iβ(τ), resulting in the explicit solution of minimization can’t be solved analytically. Therefore, linear programming methods are commonly applied to obtain quantile regression estimates of β(τ) such as the simplex method, interior point, and heuristic method [28, 29].

Bayesian quantile regression by demonstrating that minimizing in Eq. (5) is equivalent to maximizing probability function based on an error distributed ALD. However, it has the same issue as minimizing in Eq. (5) since the check function is not differentiable at zero when y_i= x_iβ(τ), hence a different technique must be used to estimate the Bayesian quantile regression parameter [30]. According to ALD can be represented as a combination of exponential and Normal distribution. It can be written as:

$${\upvarepsilon }_{\mathrm{i }}=\upgamma {l}_{i}+{{\text{h}}m}_{{\text{i}}}\sqrt{{l}_{i}}$$

Where, l_i ~ exp (1), m_i ~ N(0,1), $\upgamma$ = $\frac{(1-2\uptau ) }{\uptau (1-\uptau )}$, h = $\sqrt{\frac{2 }{\uptau (1-\uptau )}}$, i = 1,..,p and l_i and m_i are mutually independent. From this result, the Bayesian quantile regression model for sample data can be rewritten as:

$${{\text{y}}}_{{\text{i}}}= \mathbf{X}{\varvec{\upbeta}}(\uptau ) + {\upgamma l}_{i} +{{\text{h}}m}_{{\text{i}}}\sqrt{{l}_{i}}\mathrm{ i }= 1,\dots ,{\text{p}}, {l}_{{\text{i}}}\sim \mathrm{ exp }(1), {m}_{{\text{i}}} \sim {\varvec{N}}(\mathrm{0,1})$$

(6)

The likelihood function of y given l is:

$$f(\mathbf{y} | {\varvec{l}},{\varvec{\upbeta}}(\uptau )) =\prod\nolimits_{i=1}^{p}\frac{1}{\sqrt{2\pi }\sqrt{{l}_{i}}h}{exp}^{\left(-{\frac{\left({\text{yi}}-\mathbf{X}{\varvec{\upbeta}}\left(\uptau \right)-\upgamma {l}_{i}\right)}{{2 h}^{2}{l}_{i}}}^{2}\right)}$$

where y= (y₁, y₂, …,y_p)’,l = (l₁,l₂,…, lp)’, and β(τ) and y₁|l₁,y₂|l₂,..,y_p|l_p are independent

The prior distribution for β(τ) is a Multivariate Normal With β(τ) ~ N(β(τ) ₀, ${\varvec{\omega}}$(τ)₀) and its Probability Density Function (pdf) is:

$${\text{P}}({\varvec{\upbeta}}(\uptau ))=\frac{1}{\sqrt{2\pi }{|{{\varvec{\omega}}\left(\uptau \right)}_{0}|}^{-\frac{1}{2}}}{e}^{-\frac{1}{2}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0}){\prime}{{{\varvec{\omega}}\left(\uptau \right)}_{0}}^{-1}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0})}{\text{exp}}\left(-\frac{1}{2}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0}){\prime}{{{\varvec{\omega}}\left(\uptau \right)}_{0}}^{-1}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0})\right)$$

(7)

where ${{\varvec{\upbeta}}(\uptau )}_{0}$ is a vector mean of β(τ) and ${{\varvec{\omega}}\left(\uptau \right)}_{0}$ is a covariance matrix of β(τ). The reason for multivariate normal usage is to simplify Gibbs sampling calculation and form posterior distribution to rationalize with likelihood function.

The posterior distribution of β(τ) is given by:

$${\text{P}}({\varvec{\upbeta}}(\uptau )|\mathbf{y},{\varvec{l}})\propto f(\mathbf{y} | {\varvec{l}},{\varvec{\upbeta}}(\uptau ))\mathrm{ p}( {\varvec{\upbeta}}(\uptau ))$$

(8)

$$\mathrm{P }({\varvec{\upbeta}}(\uptau ) | \mathbf{y}, {\varvec{l}})\propto \prod_{i=1}^{p}\frac{1}{\sqrt{2\pi }\sqrt{{l}_{i}}h}{e}^{\left(-{\frac{\left({\text{yi}}-\mathbf{X}{\varvec{\upbeta}}\left(\uptau \right)-\upgamma li\right)}{{2 h}^{2}{l}_{i}}}^{2}\right)}\times \frac{1}{\sqrt{2\pi }{|{{\varvec{\omega}}\left(\uptau \right)}_{0}|}^{-\frac{1}{2}}}{e}^{\left(-\frac{1}{2}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0}){\prime}{{{\varvec{\omega}}\left(\uptau \right)}_{0}}^{-1}({\varvec{\upbeta}}\left(\uptau \right)-{{\varvec{\upbeta}}(\uptau )}_{0})\right)}$$

Prior distribution of ${l}_{i}$ is used to fulfill Gibbs's sampling need and tune β(τ) to get good acceptance rates. Prior distribution of ${l}_{i}$ is an exponential distribution with ${l}_{i}$ ~ exp(1) and its pdf is:P(l_i)=exp(l_i)

The joint distribution of l₁, l₂,.., l_p which is a prior distribution of l is:

$${\text{P}}({\varvec{l}})={e}^{\left(-\sum_{i=1}^{p}{l}_{i}\right)}$$

(9)

Posterior distribution of l is:

$$\begin{array}{c}\mathrm{P }({\varvec{l}}| \mathbf{y}, {\varvec{\upbeta}}(\uptau ) ) \propto f (\mathbf{y} | {\varvec{l}},{\varvec{\upbeta}}(\uptau ))\mathrm{ p}( {\varvec{l}})\\ {\text{P}}({\varvec{l}}|\mathbf{y},{\varvec{\upbeta}}(\uptau ))\propto \prod_{i=1}^{p}{{l}_{i}}^{-\frac{1}{2}}{exp}^{\left( -\frac{1}{2}\{{{\delta }_{i}}^{2}{{l}_{i}}^{-1}+{{{\varphi }_{i}}^{2}l}_{i}\}\right)}\end{array}$$

(10)

where ${{\delta }_{i}}^{2}$ = ${\frac{\left({\text{yi}}-\mathbf{X}\mathrm{i }{\varvec{\upbeta}}\left(\uptau \right)\right)}{{ h}^{2}}}^{2}$ and ${{\varphi }_{i}}^{2}=\frac{{\upgamma }^{2}}{{h}^{2}}+2$ [29].

Since Eq. (10) is the kernel of a generalized inverse Gaussian $(\mathcal{G}I\mathcal{G})$ distribution, we have

$${\varvec{l}}| \mathbf{y}, {\varvec{\upbeta}}(\uptau ) \sim \mathcal{G}I\mathcal{G}\left(\frac{1}{2},{\delta }_{i},{\varphi }_{i}\right)$$

(11)

where the pdf of $\mathcal{G}I\mathcal{G}$($v,\alpha ,b$) is given by

$f\left(x|v,\alpha ,b\right)=\frac{{\left({}^{b}\!\left/ \!{}_{a}\right.\right)}^{v}}{2{k}_{v}\left(ab\right)}{x}^{v-1}{exp}^{\left\{-\frac{1}{2}\left({a}^{2}{x}^{-1}+{b}^{2}x\right)\right\}}$, x>0, $-\infty <v<\infty$, $\alpha ,b\ge 0$ and ${k}_{v}\left(.\right)$ is a modified Bessel function of the third kind [29].

MCMC simulation using the Gibbs-sampling algorithm was employed to draw samples from the posterior from which posterior means could be obtained. The posterior inference was implemented using Gibbs sampling this algorithm implements the Bayesian quantile regression (BQR) numerical method to directly perform the computation of fully Bayesian posteriors for the complex quantile regression model. In particular, the Bayesian quantile regression models with the structure of Gibbs sampling algorithm for the quantile regression are constructed by updating β, $v,$ and σ from their full conditional posteriors [31]. The algorithm can be summarized by the following steps:

Step1. Determine the τ or quantile of the regression model
Step2. Determine the initial value of ${{\varvec{\upbeta}}(\uptau )}^{0}$, ${{\varvec{v}}}^{0}$ and ${\upsigma }^{0}$
Step3. Determine the number of samples, suppose the number of samples is k

$$\begin{array}{c}{{\varvec{\upbeta}}(\uptau )}^{1}\mathrm{from P }({{\varvec{\upbeta}}(\uptau )}^{1}| \mathbf{y},{{\varvec{v}}}^{ 0},{\upsigma }^{0})\\ {{\varvec{v}}}^{1 }\mathrm{from P }({{\varvec{v}}}^{1}| \mathbf{y},{{\varvec{\upbeta}}(\uptau )}^{1},{\upsigma }^{0}),\\ {\upsigma }^{1}\mathrm{from P }({\upsigma }^{1}| \mathbf{y},{{\varvec{\upbeta}}(\uptau )}^{1},{{\varvec{v}}}^{1}),\end{array}$$

$$.$$

$$\begin{array}{c}{{\varvec{\upbeta}}(\uptau )}^{k}\mathrm{from P }({{\varvec{\upbeta}}(\uptau )}^{k}| \mathbf{y},{{\varvec{v}}}^{k-1},{\upsigma }^{k-1}),\\ {{\varvec{v}}}^{k}\mathrm{from P }( {{\varvec{v}}}^{k}| \mathbf{y}, {{\varvec{\upbeta}}(\uptau )}^{k},{\upsigma }^{k-1}),\\ {\upsigma }^{k}\mathrm{from P }({\upsigma }^{k}| \mathbf{y},{{\varvec{\upbeta}}(\uptau )}^{k},{{\varvec{v}}}^{k})\end{array}$$

After obtaining the sample sequence in step 3, the sample sequence needs to be averaged empirically to obtain parameter estimation of β(τ), $v$, and $\upsigma$ [29]. Also from step 3, it is needed to check a convergence from the sample sequence that is generated from Gibbs sampling.

In this study, we used the Brq R package of MCMC with Gibbs sampling to approximate the desired country-specific marginal estimates from which posterior estimates were easily computed [31, 32]. With this regard, the Gibbs sampling algorithm was implemented with 10,000 iterations, 1,000 burn-in terms discarded, and 5 thinning intervals to make observations independent or low autocorrelation. To track the convergence of the algorithm, several diagnostic tests have been created. For this investigation, the most widely used convergence assessment methods were utilized out of a variety of testing methodologies. The three approaches trace, autocorrelation, and density plots are used in this study.

Results

Based on the result of Table 2, among the total participants included in this study, about (76.9%) were living in rural areas. From the same result, more than half (54.7%) of maternal education was not formal education. From these households, 1,072(20.1%) and 4,251(79.9%) used improved and unimproved toilet facilities respectively. Concerning water resources, the result of this study shows that 3,272 (61.5%) and 2,051 (38.5%) households have improved and unimproved drinking water sources (Table 2). A large percentage (93.2%) of mothers were married, and more than half (54.3%) of children were ever breastfed and not currently breastfed. When we look at the number of children aged under 5 in household members, 2380 (44.7%) of them had two members and the majority (65.5%) of children had from five to nine household members (Table 2).

Table 2 Summary measures for a categorical sample of the socio-economic and demographic characteristics of children

Full size table

The median BMI has the same value as the 50^th percentile or the second quantile (15.32) values. The median (50^th percentile) maternal age of the sampled household was 28 years with a range of 15 to 49 years and also similar to current age children were 29 months with a range between 0 to 59 months (Table 3).

Table 3 Study result of children's BMI, current age of children, and maternal age

Full size table

Figure 1 (A) presents the histogram for the children's BMI. Based on the figure, it could be seen that the distribution of BMI is asymmetric, thus the distribution is not normal. Figure 1 (B) shows a normal Q-Q plot for the data. This figure also proves that the normality assumption is violated linear regression model in this children's BMI data and any outliers are in the data. To model the BMI of under-five children, the quantile regression approach was then implemented in this study.

The result from the Bayesian quantile regression model identified that the significant predictor variables at different quantile levels were presented in Table 4. At 0.05 (lower) quantile level: the results of the study showed that the current age of children, number of household members, maternal age, maternal education, religion, sex of children, region, and wealth index were found to have a significant effect on the BMI of under-five children. As the result indicated, the current age of children is negatively related to under-five children's BMI. The rate of change of the BMI of under-five children is -0.007 with a 95% credible interval (CI) = (-0.010, -0.004) at a lower quantile per unit change of current age of child keeping all the other variables constant.

Table 4 Parameter estimation of Bayesian quantile regression of under-five children BMI in Ethiopia

Full size table

According to the result, the female child, number of household members (five to nine), and region (Afar, Somalia, and Gambela) are negatively related to under-five children's BMI. At the lower quantile, under-five children's BMI decreased by 0.261 with CI = (-0.341, -0.181) for females as compared to male children by retaining the other factors constant. At the lower quantile, the under-five children's BMI decreased by 0.327 with CI = (-0.574, -0.088), 0.728 with CI = (-0.964, -0.499), and 0.481 with CI = (-0.690, -0.273) for children’s families lived in Afar, Somalia, and Gambela region respectively as compared to Tigray region by setting the other variables constant (Table 4).

Whereas, the findings showed that maternal age, maternal education, religion (Protestant), region (Oromia and Addis Abeba), and wealth index (middle, richer, and richest) are positively related to an under-five children's BMI. The under-five children's BMI increased by 0.012 with CI = (0.005, 0.019) for every one-unit change in the current age of the mother, holding all the other factors constant at a lower quantile level. Similarly, the under-five children's BMI increased by 0.193 with CI = (0.086, 0.292) and 0.444 with CI = (0.294, 0.582) for mothers who attend primary education and secondary and above education respectively as compared to no formal education by leaving the other variables constant at the lowest quantile level.

At 0.85 (higher) quantile, the current age of children, duration of breastfeeding, the current age of mother, number of children who are aged five and under, religion, sex of children, region, and wealth index have a significant effect on the BMI of under-five children. From this result, the current age of children is negatively related to under-five children's BMI ($\beta$ = -0.046, CI (-0.050, -0.043)). Similarly, duration of breastfed (still breastfeeding), religion, sex of a child, and region (Somalia) are negatively related to under-five children's BMI. At the higher quantile, the under-five children's BMI decreased by 0.190 in CI (-0.326, -0.060) for children still breastfeeding as compared to ever and not currently breastfeeding by setting the other factors constant.

At the (highest) 95^th quantile, the current age of children, duration of breastfeeding, maternal age, marital status, number of children age five and under, religion, sex of children, region, and wealth index showed a significant effect on BMI of under-five children (Table 4). The study showed that the current age of children is negatively related to under-five children's BMI. At the highest quantile level, the under-five children's BMI decreased by 0.055 within CI (-0.059, -0.050) for every one-unit change in the current age of a child, holding all the other factors constant. Furthermore, maternal age and wealth index (richer) have positively related to an under-five children's BMI. The result showed that the under-five children's BMI increased by 0.436 with CI = (0.256, 0.621) for the richer wealth index family as compared to the poorest wealth index family by holding the other factors constant (Table 4).

Convergence checking at different quantile levels

As a result, shown in the trace plots in Fig. 2A, all generated samples lie within two parallel horizontal lines, straight lines that did not show up and down periods, centered at respective values, and no trends are detected. For all simulated parameters, the trace plot indicates a good convergence since the independently generated chains are mixed or overlapped. The marginal posterior density plots in Fig. 2B below inform us that the conditional posterior distributions are the desired stationary univariate normal. This shows that all posterior estimates converged.

The finding of the study shows that Fig. 2C indicates that the decrease in the empirical autocorrelation of posterior samples proves that the underlying chains are stationary. The given below independently generated chains demonstrated good chain mixture, an indication of convergence. This shows that all posterior estimates converged. Not all trace, density, and autocorrelation plots are presented here; the remaining plots can be the same as like to this. The results obtained from these convergence diagnostics indicate that our algorithm used in the Bayesian quantile regression approach could produce adequate and acceptable values of the estimated parameter.

Discussion

Based on the findings of the study using the 2019 mini EDHS data, several variables were identified connected to various quantiles of BMI in children under the age of five. One notable factor that was found to lower under-five children's BMI in both the higher and lower quantile levels was their current age. These findings highlight the importance of age-specific interventions that target different age groups of children under the age of five. Such interventions can focus on providing appropriate nutrition, dietary counseling, and health education tailored to the specific needs of children at different stages of development. This result is consistent with previous studies conducted in Ethiopia [11], Sudan [24], and China [33] which found age to be an important factor influencing children's BMI. The study found that breastfeeding has a negative association with under-five child BMI in the upper quantile. This suggests that breastfeeding helps prevent excessive weight gain and reduces the likelihood of children becoming overweight. This finding is consistent with the findings of other studies conducted in China [34] and Greece [35]. These findings emphasize the significance of exclusive breastfeeding and encouraging mothers to exclusively breastfeed their infants for the first six months and continue breastfeeding alongside appropriate complementary feeding practices can contribute to the healthy growth and development of children.

The finding of this study also showed that maternal age is positively related to the BMI of under-five children in the upper quantile level. Younger mothers may engage in more physical activities, provide active stimulation, and promote healthy eating habits, resulting in lower BMI levels for their children. This suggests that younger mothers have more energy and are better able to actively care for their children and provide better care and nutrition for their children, leading to healthier weights. This finding was in agreement with another study conducted in Ethiopia [36]. But this result contradicted study findings conducted in Ethiopia [37]. This may be attributed to the majority of children whose mother was young and adult in this study, which leads to a healthy weight.

Our findings also showed that one of the most important factors affecting under-five children's BMI at different quantiles was the sex of a child. Female children have a worse relationship with BMI at both the lower and upper quantiles for children under five than male children. Female children have a worse relationship with BMI at both the lower and upper quantiles compared to male children, suggesting that there may be gender-related differences in factors influencing BMI in early childhood. This finding is consistent with previous studies conducted in Ethiopia [11] and Sudan [24]. The finding of the study has also shown that a mother’s education significantly affects under-five children’s BMI in the lower quantile level. Children whose mothers attended primary education level had a positive association with under-five child BMI, while children whose mothers had no formal education had a negative association. This indicates that education enables mothers to implement basic health knowledge effectively. It also enhances their ability to navigate healthcare facilities, interact with healthcare professionals, adhere to treatment recommendations, and maintain a clean environment for their children. This finding is in line with the study findings conducted in Taiwan [38] and Ethiopia [39, 40], indicating that maternal education plays a crucial role in shaping children's BMI outcomes.

The findings of this study indicate that religion significantly influences under-five children's BMI at various quantiles. Specifically, families with children who practice the Protestant religion have a more favorable relationship between their children's BMI under the age of five, particularly in the lower quantiles.

Moreover, this findings of this study showed that religion significantly influences under-five children's BMI at various quantiles. According to the findings, families with children who practice the protestant religion have a favourable relationship between their children's BMI under the age of five and those who practice the Orthodox religion in the lower quintile. This finding is consistent with other studies conducted in Ethiopia [41]. However, a previous study conducted in Ethiopia [11] did not find a significant association between under-five children's BMI and religion. The variation in findings could be attributed to several factors. Firstly, different statistical models, such as the Bayesian quantile regression model used in this study, may yield different results. Secondly, the majority of children from families practicing the Protestant religion in this study came from households with better wealth indexes and educated mothers. These socio-economic and educational factors could have influenced the relationship between religion and children's BMI.

Similarly, it was discovered that geography had an impact on under-five children's BMI at various quantiles. According to our findings, a child who lives in Afar, Somalia, and Gambela regions has a worse relationship with their under-five child's BMI than a child who lives in the region of Tigray in the lower quantile. On the other hand, children living in the Amhara, Oromia, Benishangul, SNNPR, Gambela, Harari, Addis Abeba, and Dire Dawa regions have a more favorable relationship with under-five children's BMI compared to those in the Tigray region in the upper quantiles. This result is also consistent with the finding of a study in Ethiopia [11], suggesting that geography plays a role in children's BMI outcomes. The varied associations between geography and under-five children's BMI in different quantiles may reflect regional differences in factors such as access to healthcare, socio-economic conditions, cultural practices, and dietary patterns.

The BMI of under-five children at various quantiles was found to be significantly influenced by the home wealth index. In contrast to poorer wealth index families in the lower quantile level and the upper quantile level, the study's findings on the wealth index of family richer and richest wealth index families were positively related to under-five child BMI. This finding is consistent with previous research studies conducted in Ethiopia [39, 42] and also the study results in Kenya [43].

The possible reason may be, that families with higher wealth index often have greater access to resources such as nutritious food, and a more favorable living environment. These factors may contribute to increasing BMI for under five children. Furthermore, the middle-level wealth index of the family is positively related to under-five child BMI as compared to poorer wealth index families in the lower quantile. This result also seems to agree with the previous finding of the study in Bangladesh [44], further supporting the notion that a moderate level of wealth can still have a positive impact on children's BMI, relative to families with lower wealth index.

Limitations of the study

This study had certain limitations, one of which was the unavailability of variables such as maternal BMI and children's weight at birth in the mini EDHS data set. This may have an impact on the result of the study.

Conclusions

The study findings indicate that several factors have a significant effect on under-five child BMI at both lower and upper quantile levels. The study also showed that the BMI of children under the age of five in Ethiopia is significantly influenced by socioeconomic, behavioral, and demographic factors. The results revealed that the present age of the children, the sex of the children, the age of the mothers, the family's religion, the location, and the wealth index all had a significant impact on the BMI of under-five children at both the lower and upper quantile levels. Additionally, it was discovered that mothers' education levels had a substantial impact on the BMI of under-five children in lower quantile levels.

Thus, we recommend that the education sector should promote maternal education and policies to reduce cultural and gender barriers. Further research is needed to establish the causal relationships between the identified factors and under-five children's BMI in Ethiopia. This would provide a deeper understanding of the factors influencing BMI and inform more targeted interventions and policies to improve the nutritional status of young children in the country.

Availability of data and materials

The dataset used and analyzed during the current study is openly available from EDHS website (https://dhsprogram.com).

Abbreviations

ALD:: Asymmetric Laplace Distribution
BMI:: Body Mass Index
Brq:: Bayesian Quantile Regression
CI:: Credible Interval
EDHS:: Ethiopian Demographic and Health Survey
MCMC:: Markov Chain Monte Carlo
pdf:: Probability Density Function
QRM:: Quantile regression models
SNNPR:: South Nations Nationalities and Peoples Representative

References

Javed A, et al. Diagnostic performance of body mass index to identify obesity as defined by body adiposity in children and adolescents: a systematic review and meta-analysis. Pediatr Obes. 2015;10(3):234–44.
Article CAS PubMed Google Scholar
Cole TJ, Lobstein T. Extended international (IOTF) body mass index cut-offs for thinness, overweight and obesity. Pediatr Obes. 2012;7(4):284–94.
Article CAS PubMed Google Scholar
Rolland-Cachera, M.F., M. Akrout, and S. Péneau, History and meaning of the body mass index. Interest of other anthropometric measurements. The ECOG’s eBook on Child and Adolescent Obesity. 2015: 1-20.
Cai Y, Zhu X, Wu X. Overweight, obesity, and screen-time viewing among Chinese school-aged children: national prevalence estimates from the 2016 Physical Activity and Fitness in China—The Youth Study. J Sport Health Sci. 2017;6(4):404–9.
Article PubMed PubMed Central Google Scholar
Klatsky L, et al. Body mass index and mortality in a very large cohort: is it really healthier to be overweight? Permanente J. 2017;21(3):16–142.
Costa-Urrutia P, et al. Obesity measured as percent body fat, relationship with body mass index, and percentile curves for Mexican pediatric population. PloS One. 2019;14(2):e0212792.
Article CAS PubMed PubMed Central Google Scholar
Herrington WG, et al. Body-mass index and risk of advanced chronic kidney disease: Prospective analyses from a primary care cohort of 1.4 million adults in England. PloS One. 2017;12(3):e0173515.
Article PubMed PubMed Central Google Scholar
Kansra AR, Lakkunarajah S, and Jay MS. Childhood and adolescent obesity: a review. Front pediatr. 2021;8:581461.
Jeyalakshmi S, Kamalam S. Child hood obesity-a lifelong threat to health. Pondicherry J Nurs. 2019;9(2):40–5.
Google Scholar
Gebremichael MA, et al. Prevalence of overweight/obesity and associated factors among under-five children in Ethiopia: a multilevel analysis of nationally representative sample. Front Public Health. 2022;10:3055.
Article Google Scholar
Yirga AA, Ayele DG and Melesse SF. Application of quantile regression: Modeling body mass index in Ethiopia. Open Public Health J. 2018;11(1):221–33.
Gebremariam MK, et al. Change in BMI Distribution over a 24-year period and associated socioeconomic gradients: a quantile regression analysis. Obesity. 2018;26(4):769–75.
Article PubMed Google Scholar
Wei Y, et al. Applications for quantile regression in epidemiology. Curr Epidemiol Rep. 2019;6:191–9.
Article Google Scholar
Davino, C., M. Furno, and D. Vistocco, Quantile regression: theory and applications. Vol. 988. Chichester: John Wiley & Sons; 2013.
Beyerlein A, et al. Alternative regression models to assess increase in childhood BMI. BMC Med Res Methodol. 2008;8:1–9.
Article Google Scholar
Gebregziabher M, et al. Using quantile regression to investigate racial disparities in medication non-adherence. BMC Med Res Methodol. 2011;11:1–11.
Article Google Scholar
Maronna RA, et al. Robust statistics: theory and methods (with R). USA: John Wiley & Sons; 2019.
Hao L, Naiman DQ, Naiman DQ. Quantile regression. California: Sage; 2007.
Benoit DF, Van den Poel D. Binary quantile regression: a Bayesian approach based on the asymmetric Laplace distribution. J Appl Econometrics. 2012;27(7):1174–88.
Article Google Scholar
Chae S-M, et al. Association of weight control behaviors with body mass index in Korean adolescents: a quantile regression approach. J Pediatr Nurs. 2018;40:e18–25.
Article PubMed Google Scholar
Jia P, et al. Effects of school neighborhood food environments on childhood obesity at multiple scales: a longitudinal kindergarten cohort study in the USA. BMC Med. 2019;17(1):1–15.
Article CAS Google Scholar
Hoff PD. A first course in Bayesian statistical methods. Vol. 580. New York: Springer; 2009.
Asif, M., et al., Establishing body mass index growth charts for Pakistani children and adolescents using the Lambda-Mu-Sigma (LMS) and quantile regression method. Minerva Pediatrica. 2020.
Ayele DG, Abdallah ASR, Mohammed MOM. Determinants of under-five children body mass index in sudan; application of quantile regression: a systematic review. Iran J Public Health. 2021;50(1):1.
PubMed PubMed Central Google Scholar
Burgert, C.R., B. Zachary, and J. Colston, Incorporating geographic information into demographic and health surveys: a field guide to GPs data collection. Fairfax: ICF International, 2013.
Institute, E.P.H. and ICF, Ethiopia mini demographic and health survey 2019: key indicators. J Chem Inform Model. 2019; 53: 1689-1699.
Cameron AC, Trivedi PK. Microeconometrics using stata, vol. 2. Stata press College Station; 2010.
Google Scholar
Koenker R, Hallock KF. Quantile regression. J Econ Perspect. 2001;15(4):143–56.
Article Google Scholar
Kozumi H, Kobayashi G. Gibbs sampling methods for Bayesian quantile regression. J Stat Comput Simul. 2011;81(11):1565–78.
Article Google Scholar
Yu K, Moyeed RA. Bayesian quantile regression. Stat Probabil Lett. 2001;54(4):437–47.
Article Google Scholar
Alhamzawi R, Ali HT. Brq: an R package for Bayesian quantile regression. Metron. 2020;78(3):313–28.
Article Google Scholar
Alhamzawi R. Brq: Bayesian analysis of quantile regression models. R package version. 2012;1.
Li H, et al. Body mass index growth curves for Chinese children and adolescents aged 0 to 18 years. Zhonghua er ke za zhi= Chin J Pediatr. 2009;47(7):493–8.
Google Scholar
Liu F, et al. Breastfeeding and overweight/obesity among children and adolescents: a cross-sectional study. BMC Pediatr. 2022;22(1):1–8.
Article CAS Google Scholar
Mantzorou M, et al. Exclusive breastfeeding for at least four months is associated with a lower prevalence of overweight and obesity in mothers and their children after 2–5 years from delivery. Nutrients. 2022;14(17):3599.
Article PubMed PubMed Central Google Scholar
Gebremichael MA, et al. Prevalence of overweight/obesity and associated factors among under-five children in Ethiopia: a multilevel analysis of nationally representative sample. Front Public Health. 2022;10:3055.
Yirga AA. Statistical models to study the BMI of under five children in Ethopia. 2018.
Hsu P-C, et al. The impact of maternal influences on childhood obesity. Sci Rep. 2022;12(1):6258.
Article CAS PubMed PubMed Central Google Scholar
Kebede DT, Bekalo DB, and Mekuriaw DM. Multivariate analysis of correlates of children nutritional status in Harar region, Ethiopia. Int J Sci Rep. 2020;6(3):101.
Baye, B.F., Determinants of nutrition and health status of children in Ethiopia: a Multivariate Multilevel Linear regression analysis. 2010, Addis Ababa University.
Mohammed, S.B., Explaining Child Malnutrition in Ethiopia: The Role of Socioeconomic Status and Maternal Health on Nutritional Condition of Children: a Research Paper. 2013: International Institute of Social Studies. Kortenaerkade.
Yalew BM. Prevalence of malnutrition and associated factors among children age 6–59 months at lalibela town administration, North WolloZone, Anrs Northern Ethiopia. J Nutr Disorders Ther. 2014;4(132):2161–0509.
Google Scholar
Haregu T, et al. Body mass index and wealth index: positively correlated indicators of health and wealth inequalities in Nairobi slums. Global Health Epidemiol Genom. 2018;3:e11.
Article CAS Google Scholar
Hossain MM, Abdulla F, Rahman A. Prevalence and risk factors of underweight among under-5 children in Bangladesh: Evidence from a countrywide cross-sectional study. PLoS One. 2023;18(4):e0284797.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

The datasets used in this study were obtained from the DHS program. Thanks to the authorization received to download the dataset on the website.

Funding

Not applicable.

Author information

Authors and Affiliations

Department of Statistics, College of Science, Bahir Dar University, Bahir Dar, Ethiopia
Daniel M. Mekuriaw, Aweke A. Mitku & Melkamu A. Zeru
School of Mathematics, Statistics and Computer Science, College of Agriculture Engineering and Science, University of KwaZulu-Natal, Durban, South Africa
Aweke A. Mitku

Authors

Daniel M. Mekuriaw
View author publications
You can also search for this author in PubMed Google Scholar
Aweke A. Mitku
View author publications
You can also search for this author in PubMed Google Scholar
Melkamu A. Zeru
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DMM contributed to data management, data analysis, drafting of the manuscript, and revising the final manuscript. AAM played a key role in conceptualizing the research problem, study design, and manuscript revisions. MAZ contributed to the development of the study design, interpretation of data, and manuscript revisions. All authors critically reviewed the manuscript and made substantial contributions to its improvement. All authors have read and approved the final version of the manuscript.

Corresponding author

Correspondence to Aweke A. Mitku.

Ethics declarations

Ethics approval and consent to participate

Ethical approval for this study was obtained from the Ethical Approval Committee of Postgraduate, Research and Community Service at College of Science, Bahir Dar University, Ethiopia. In data collection; there was no verbal consent from study participants because the data was taken from a secondary source of Ethiopian demographic health survey data (EDHS).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Mekuriaw, D.M., Mitku, A.A. & Zeru, M.A. Bayesian modeling of quantiles of body mass index among under-five children in Ethiopia. BMC Public Health 24, 1144 (2024). https://doi.org/10.1186/s12889-024-18602-x

Download citation

Received: 13 October 2023
Accepted: 15 April 2024
Published: 24 April 2024
DOI: https://doi.org/10.1186/s12889-024-18602-x

Bayesian modeling of quantiles of body mass index among under-five children in Ethiopia