Skip to main content

Discrete-time survival analysis with survey weights: a case study of age at child death in Sierra Leone

Abstract

Background

Child death rates are often regarded as reliable indicators for overall welfare of a country since they give insight of health accessibility and development. For planning and controlling purposes, it is important to understand which ages are at higher risks of experiencing child death as well as determinants thereof.

Methods

We used the Sierra Leone DHS 2019 data which was collected using two stage sampling methods. Data collection involved interviewing women aged from 15–49 to obtain information about children they had in the past up to 2019. Age at death of child was modelled using discrete-time survival analysis with a logit link at the same time applying survey weights. The analysis also sought to estimate the determinants of child death (under-five mortality). The baseline hazard was modelled with a polynomial function.

Results

Results showed that children from rural areas had significantly lower odds of dying compared with those from urban areas (odds ratio (OR) = 0.861, p-value = 0.0003). Children of mothers who were currently using contraceptives, and those whose mothers had been using since their last birth were at higher odds of child death compared to children whose mothers had never used contraceptives before (currently using: OR = 1.118, p-value =  < .0001; used since last birth: OR = 1.372, p-value =  < .0001). Children with no health insurance had significantly higher odds of death than those with health insurance (OR = 1.036, p-value =  < .0001). Children of women who were married, and of women who were formerly married were at significantly higher odds of experiencing child death than children of women who had never been in union (married: OR = 1.207, p-value = 0.0003; formerly married: OR = 1.308, p-value = 0.0009 compared to those that have never been married). Increase in the age group of mothers increases the odds of their children experiencing child death compared to mothers in their teenage years (20-29: OR = 1.943, p-value =  < .0001, 30-39: OR = 2.397, p-value =  < .0001 and >  = 40: OR = 2.895, p-value =  < .0001 compared to mothers in their 15-19 years).

Conclusion

The study provides evidence that residing in urban areas, marital union of the mother, children having no health insurance, use of contraceptives by mother, older ages of the mother and no health insurance significantly increase the odds of child death. This points out to a possible need for improved health infrastructure to be made available to citizens in all places of delivery and more awareness on pregnancy related complications.

Peer Review reports

Introduction

Child mortality rate (under-five mortality), as a significant national indicator of health development, has become a popular area of study with researchers aiming to establish possible measures for child mortality to be kept at an absolute minimum [1]. The United Nations Millennium Development Goal 4 (MDG4) which was targeting to reduce under-five mortality by two thirds for the period from 1990 to 2015, helped to re-focus the attention on child mortality rate [2]. By the year 2015, this MDG4 global target was still unattainable and child mortality remains acute in developing and low-middle income countries [2]. At the end of 2015, a study by The Lancet global health indicated that in low-middle income countries, 1 child in 12 dies before the age of five years, whereas in high income countries 1 child in 147 dies [3]. These figures substantiate that developing countries still require more intervention and establishments to further reduce child mortality rates. Most Sub-Saharan African countries have high child death due to low level public health [4]. In this study we will focus our analysis on the child death of Sierra Leone, a Sub-Saharan African country with one of the highest child mortality rates [4]. Sierra Leone is one of the world’s poorest countries and is experiencing numerous challenges. It has one of the highest youth unemployment rates in West Africa (of about 60%), and much of its drinking water comes from unclean sources. It is also one of Africa’s lowest literacy rates and is vulnerable to health issues such as HIV/AIDS, Malaria and Yellow fever. Approximately 81% of its population is living in poverty [5].

Even though Sierra Leone has shown improvement in reducing child mortality over the years, it is classified as one of the countries with high child and infant mortality [6, 7]. Under-five mortality rate decreased from 156 deaths per 1000 live births in the year 2013 to 122 deaths per 1000 live births in 2019. Child mortality went from 95 to 75 deaths per 1000 live births and neonatal mortality went from 39 to 31 deaths per 1000 live births [8]. These figures relatively show that the child and infant mortality rates in Sierra Leone have declined from 2013–2019, but these deaths can further be prevented provided necessary precautions and health measures are taken [6]. For that reason, it might be of importance to establish the determinants of child death. This would possibly assist in making informed decisions on which areas to improve to further lower child death.

Recent studies around the world [9,10,11] have determined child mortality mostly using continuous time survival models. However, in cases where data is collected in a truly discrete manner, using continuous time models gives biased estimates due to ties [12]. Methods of handling ties would then have to be adopted, [13, 14] explain extensively how ties are handled. Therefore, discrete time survival models are preferred over continuous time models because of their ability to easily allow modeling time-varying covariates and are also naturally built in a manner that addresses the issue of ties. Furthermore, the discrete time survival models do not require a hazard-related proportionality assumption that is common in continuous-time survival analysis [12]. These methods have commonly been applied to health data by [15,16,17] and will be considered in this study.

In addition, the standard discrete time survival model is appropriate when a simple random sample of participants was used. In an event where data analyzed was sampled using multi-stage probability sampling, using a discrete time survival model with survey weights produces more consistent estimates [18].

The objective of this study was to determine time-to-child mortality as well as factors that affect child mortality in Sierra Leone using Discrete Time Survival Analysis with survey weights.

Statistical methods

Discrete survival and hazard function

For a discrete-time survival analysis, the time scale is subdivided into \(n\) intervals \(\left\{{I}_{t}=\left[{a}_{t-1},{a}_{t}\right)\right\}\) with \({a}_{0}<{a}_{1}<...<{a}_{n}<\infty .\) Censoring and occurrence of event is assumed to happen at the end of the intervals, and the intervals do not necessarily have the same length. Let \(T\) be a discrete time random variable that takes on values \({t}_{i}\) \(\{i=1,..,n\}\), which denotes an occurred event in interval \({I}_{t}\) [16]. The discrete hazard function for the \({j}^{th}\) \((j=\mathrm{1,2},\dots ,M)\) individual is defined as conditional probability of failure at time interval \(t\) given that the individual has survived up to time \(t\) and is given by

$${h}_{j}\left({t}_{i}\right)=P\left(T={t}_{i}\right| T\ge {t}_{i})$$
(1)

The hazard function is always a non-decreasing function. Consequently, the survival function, which is the probability that an event of interest has not occurred by time \(t\) is given by

$${S}_{j}\left({t}_{i}\right)=P\left(T>{t}_{i}\right)=\prod_{k=1}^{i}[1-{h}_{j}\left({t}_{k}\right)]=1-F\left({t}_{i}\right)$$
(2)

where \(F\left({t}_{i}\right)\) is the cumulative probability density function, which tells us the probability of time-to-event is less than some given value \({t}_{i}\) [19, 20].

The methods of estimating the survival and hazard functions in survival analysis are broadly classified into non-parametric, semi-parametric, parametric, and discrete time methods [20]. This paper highlights one of the parametric methods which is the discrete time model as well.

To model the discrete time survival, a logistic model may be used to link the hazard function to the linear predictor, \(\gamma \left(t\right)+\phi {X}_{t}\) as follows:

$$\mathit{logit}\{{h}_{j}\left({t}_{i}|{X}_{t}\right)\}=\mathrm{ln}\left(\frac{{h}_{j}\left({t}_{i}|{X}_{t}\right)}{1-{h}_{j}\left({t}_{i}|{X}_{t}\right)}\right)=\mathrm{ln}\left(\frac{P\left(T={t}_{i}\right| T\ge {t}_{i},{X}_{t} )}{1-P\left(T={t}_{i}\right| T\ge {t}_{i},{X}_{t} )}\right)=\gamma\left(t\right)+\phi{X}_{t}$$
(3)

So that the hazard may be obtained by solving Eq. 3 for \(logit\left\{{h}_{j}\left({t}_{i}|{X}_{t}\right)\right\}\) which may be written as

$${h}_{j}\left({t}_{i}|{X}_{t}\right)=\frac{\mathrm{exp}(\gamma \left(t\right)+\phi {X}_{t} )}{1+\mathrm{exp}(\gamma \left(t\right)+\phi {X}_{t} )}$$
(4)

where \(\gamma \left(t\right)\) is a function of time which models the baseline hazard on each time period and captures the log odds that an individual will experience the event in each time period [21] proposed different specification of the baseline hazard and \(\phi\) represents the regression coefficients. Depending on the assumptions of the model, other link functions may be used such as the complementary log–log function where the underlying data was continuous but now grouped [16] or the Probit link function where normality assumptions are made [22].

The likelihood function

To estimate the parameters \(\gamma\) and \(\phi\) from Eq. 4, the method of maximum likelihood can be used.

Let \({\delta }_{j}\) be an event indicator, where \({\delta }_{j}\) is 1 if an event occurs and is \({\delta }_{j}\) is 0 if an event does not occur. Then the contribution to the likelihood function by individual j is taken as the product of the probability of an individual to experience an event and probability of surviving beyond time \({t}_{i}\) such that,

$${L}_{j}{=h\left({t}_{i}|{X}_{t}\right)}^{{\delta }_{j }} {\left\{\prod_{i=1}^{n-1}\left[1-h\left({t}_{i}|{X}_{t}\right)\right]\right\}}^{1-{\delta }_{j}}$$

The overall likelihood for all individuals is then given by

$$L=\prod_{j=1}^{M}\left[{\left\{h({t}_{i}|{X}_{t}) \right\}}^{{\delta }_{j}}{\left\{\prod_{i=1}^{n-1}\left[1-h\left({t}_{i}|{X}_{t}\right)\right]\right\}}^{1-{\delta }_{j}}\right]$$
(6)

Taking the log transformation of Eq. 7 results to,

$$LL=\sum_{j=1}^{M}{\delta }_{J}logh({t}_{i}|{X}_{t})+\sum_{J=1}^{M}\sum_{i=1}^{n-1}\left(\left(1-{\delta }_{i}\right)\right)log\left[1-h\left({t}_{i}\right) \right]$$
(7)

Maximizing this log-likelihood and solving it with respect to the parameters will give the maximum likelihood estimates of the parameters [23, 24]. This likelihood is equivalent to that of a binary response model as used in Generalized Linear Models. This is easily implemented using readily available standard software which are designed for binary regression models [25, 12].

Survey discrete-time survival model

Apart from the assumption of having a binary response variable, discrete-time logistic regression further makes assumptions that the explanatory variables should be independent and have equal weights [26]. In other words, it assumes that a simple random sample of the sampling units was done. This, however, was not always the case in the DHS data at hand. The ordinary discrete-time logistic model does not consider the complex nature of survey design, which may lead to incorrect statistical inference [27]. The use of discrete-time complex survey logistic model mitigates these demerits of the ordinary logistic discrete-time survival model.

In complex survey sample designs, the design includes stratification, clustering, and unequal sampling weights. A function that approximates the likelihood function in the finite sampled population and known sampling weights is constructed [28, 29]. Let assume that the finite population is partitioned into \(h=\left\{\mathrm{1,2},\dots ,H\right\}\) strata, and each stratum is further divided into \(l=\left\{\mathrm{1,2},\dots ,{n}_{h}\right\}\) primary sampling units (PSUs), each which is made up of \(j=\left\{\mathrm{1,2},\dots ,{n}_{hl}\right\}\) secondary sampling units (SSUs), which consist of \({n}_{hlj}\) elements. Each sampling unit is associated with sampling weights which is given as the inverse of the probability of being included in the sample, denoted as

$${w}_{hlj}=\frac{1}{{\uppi }_{hlj}},$$
(8)

where \({\uppi }_{hlj}\) is the probability for a sampling unit to be included in the sample.

Under complex sampling design, the parameter of the regression coefficients \(\phi\), from Eq. 3, are estimated by maximum pseudo-likelihood method that incorporates the sampling design and different sampling weights [28].

To model age at child death, discrete-time survival model was used with survey weights applied to account for the complex nature of the data.

The baseline hazard

In a discrete-time survival model, time (age) is the fundamental covariate of the model. The effect of time (age) is captured in the baseline hazard. The baseline hazard, \(\gamma \left(t\right),\) can be specified using distinct functions such as polynomial or piecewise constant [30, 30, 21] highlights on several ways of specifying the baseline hazard in discrete time survival models. The use of general specification of baseline hazard involves treating the discretized time variable as a categorical variable \(\left[\gamma \left(t\right)={\gamma }_{1}\left(t\right)+{\gamma }_{2}\left(t\right)+{\gamma }_{3}\left(t\right)\dots \right]\) depending on the length of time interval. However, using the general constant specification does require the interval to be short. Another method that can be used when the baseline interval is large is non-parametric smoothing techniques of modelling the baseline hazard. These methods were used by [30, 25] extensively discussed how non-parametric of baseline hazard works. The effect of time is captured in the baseline hazard and omitting the time effect from the model implies that the hazard is constant over time. Using the Akaike Information Criterion (AIC) for the model selection, a baseline hazard with the smallest AIC statistic was preferred. Table 1 shows how the baseline hazards can be specified.

Table 1 Baseline Hazards specification

The data

This study uses data obtained from Demographic Health Surveys, which consist of data collected using different questionnaires. The data files are stored according to the nature of analysis that one intends to perform which are called recode files (i.e., Household recode, persons’ recode, women’s recode, children’s recode, etc.). Since the main objective in this study was to discover variables that are affecting child mortality rates in Sierra Leone, we used the children (kids) recode file (KR) [8].

The cross-sectional data set used in this study was obtained from the Sierra Leone Demographic Health Survey (SLDHS) 2019. The dataset is publicly available in the DHS website. Sierra Leone has five provinces, and each province is subdivided into districts. The sample for the 2019 SLDHS is a stratified sample selected using a two-stage sampling. Stratification was achieved by separating each district into urban and rural areas, which formed the enumeration areas (EA) or clusters in the first stage of stratification, 578 enumeration areas were selected according to probability proportional to the population size. From the 578 EAs, a list of households in each EA was compiled which formed the sampling frame for selection of household in second stage. A fixed number of twenty-four households were selected in second stage through equal probability systematic sampling method, which resulted to approximately 13,872 selected households [8].

The overall response rate for the 2019 SLDHS data collection was 99%, and 97% response from women aged 15 – 49 who were eligible for individual interviews. The interview included maternal history questions that helped to obtain information regarding birth dates, survival status, and age at death for children that respondents (women) gave birth to [8].

To select variables for this study, the analytical framework for the study determinants in developing countries by [31] was used. [31] proposed the incorporation of social and biological variables to study the impact that these factors have on mortality rates. As part of conceptual framework, studies by [32, 10, 11] indicated current place of residence, number of children never born, duration of breastfeeding, maternal and paternal education, household income, mother’s anemia condition, birth spacing between children and marital status to be associated with child mortality. Table 2 shows the distribution of variables used in this study, which includes some of the variables that were significant in previous studies as well as other variables selected using theoretical framework.

Table 2 Descriptive statistics

Figure 1 shows that the distribution of death by age. It is evident that majority of child death occur in between month 0 and month 5. the distribution of number of deaths by time is positively skewed.

Fig. 1
figure 1

Distribution of death by age

The proportion of children who died before the age of 60 months in Sierra Leone was 8.4% and 91.6% was reported to be alive. Descriptive statistics results from Table 2 indicate that there are 6.1% of the children residing in rural areas in Sierra Leone who died before turning 60 months while 2.3% of the children in urban areas died before the age of 60 months. Majority of children who died had mothers with no education 4.9% and 0.1% of children who died had mothers with higher education. For children with no health insurance, 8.1% of them died while 0.4% of children with health insurance died. There was 5.8% and 4.1% of children who died and did not receive prenatal and postnatal care, respectively.

Statistical software

The statistical analysis in this paper was done using the statistical software SAS Enterprise Guide 7.1 and SAS On Demand (SAS Studio). To fit the discrete-time model, we use the PROC LOGISTIC procedure in SAS. However, this fits the ordinary discrete-time logistic model and computes statistics with assumption that the sample is chosen using simple random sampling methods [33]. Since the SLDHS is a survey data with survey weights, using a PROC SURVEYLOGISTIC model is more appropriate as it allows for specification of survey weights, stratification and clusters that are included in the sample design [33].

Results and discussion

Since the age interval in the study has sixty intervals, the parametric method of fitting the baseline hazard in a discrete time survival model involved fitting polynomial functions of the intercept \(\gamma \left(t\right)\) from the model. In this study we fitted four distinct functions of the baseline hazard and according to the results in Table 3. Based on the model with smallest AIC it can be concluded that modelling the baseline hazard with cubic polynomial was best fit. Figure 2 shows the baseline hazard, the hazard of child death was steadily constant at about zero from month 0 to month 28 and began to exponentially rise after month 28.

Table 3 Baseline Hazard AIC Results
Fig. 2
figure 2

Baseline Hazard Plot

Model results

Table 4 displays estimates results from two models, namely, basic discrete-time model, and the survey discrete-time model.

Table 4 Results from the two Fitted Discrete time Models

Table 4 illustrates how the fitted discrete-time models produce partially consistent results in terms of statistical significance of explanatory variables, the significance of the explanatory variables was tested under \(5\%\) level of significance. Results from the survey discrete-time logistic model indicate that children whose mothers were from Northern and Southern region in Sierra Leone were at significantly higher odds of experiencing child death than those children whose mothers were from the Western region. \((OR = 1.216, p-value=0.0016; OR = 1.193, p-value=0.0104,\) respectively \()\)

Output further highlights that children whose mothers were from rural areas were at significantly lower odds of child mortality compared to children whose mothers were from urban areas\((OR = 0.861, p-value=0.0008)\). Use of contraceptives was also a significant predictor of child death. It is evident that children whose mothers were currently using contraceptives and those whose mothers who had been using them since their previous pregnancy \((OR=1.118, p-value=0.0003; OR=1.372, p-value=<.0001,\) respectively \()\) were more likely to experience child death than those whose mothers had never used contraceptives.

Health Insurance had positive relationship with child death in Sierra Leone. Children whose mothers had no health insurance had significantly higher odds of dying \((OR=1.337; p-value=<.0001)\) than children whose mothers had health insurance. Certain levels of marital status were statistically significant predictors of child death. It is further notable how children of married, and formerly mothers were at significantly higher odds \(\left(OR= 1.207, p-value=0.0003; OR=1.308, p-value=0.0009, \mathrm{respectively}\right)\) of experiencing child death than children whose mothers had never been in union.

Place of delivery variable indicates that children that were delivered at home and public health sectors were at significantly higher odds of dying\((OR=1.298, p-value=0.0395; OR=1.262, p-value=0.040\), respectively \()\) than children that were delivered at private health sectors. Increase in mothers age illustrates an increase in the odds of having children more prone to child death.\((20-29: OR=1.943, p-value=<.0001;30-39: OR=2.397, p-value=<.0001;>40 age: OR=2.895, p-value=<.0001\); respectively\()\). The preceding birth interval was also a statistically significant predictor of child death, it is notable children whose mothers have low interval birth spacing are at significantly higher odds of experiencing child death that children whose mothers have higher interval birth spacing\((no previous birth:OR=1.424, p-value=<.0001;1-12 months:OR=1.039, p-value=0.7433;13-24 months:OR=1.295, p-value=<.0001;25-48 months:OR=1.125, p-value=0.0040;49-60 months:OR=0.961, p-value=0.4710\)).

Discussion

The aim of this study was to model age at child death and determine factors that are associated with child death in Sierra Leone using the 2019 DHS data. This analysis was done using discrete-time survival models, and discrete-time survival with survey weights. From the fitted models, the discrete-time survival with survey weights has a slightly higher predictive power than the basic discrete-time survival model. Therefore, the analysis and interpreted results are based on this model. It was discovered that the median age of experiencing child death in at 16 months from birth and it behaved in a cubic polynomial, as indicated by the fitted baseline hazard.

Place of residence was a significant predictor of age at child death, where children residing in rural area were at lower risk of experiencing death compared to those residing in urban areas. Initiatives to improve health service and affordability, such as Free Health Care Initiative (FHCI) and Rural Health Care Initiative (RHCI), may have had a significant impact on lowering child death rates especially in rural areas. Most of the studies conducted prior to 2019, [10, 34], contradict findings from our study with regards to place of residence as they discovered that children in rural areas are more prone to experiencing child death than those in urban areas.

According to our study, the use of contraceptives was significantly associated with age at child death, for mothers who had used them historically and those that were currently using them. However, other existing literature about long term effects of contraceptives, [35,36,37], revealed that contraceptives do not have adverse effects on children. A study by [38], however, shows results that are consistent with what was discovered in this study that contraceptive use is associated with increased risk of child mortality.

Other significant factors which show consistent results with what was discovered by other authors include: marital status, preceding birth interval and mother’s age. The cross-sectional nature of the SLDHS makes it difficult to discover the actual causes of the child death [39]. Birth spacing has remained a significant predictor of child death over the years around this research topic. Our study showed that birth spacing of less than two years significantly increased the odds of child death. In addition, women who had never used contraceptives in Sierra Leone were married. This may result in short interval birth spacing as majority of married women may not be practicing family planning. Increase in birth spacing reduces risk of child death as the children are not compelled to stopping breastfeeding too soon, which has been said to affect development and growth of the child [40].

The study done by [10], where time to child death was modeled using continuous time models (Cox proportional hazard model), showed how postnatal visits reduces the hazard of child death, however in this study child postnatal checkups was not a significant predictor of child death. Gender, religion, and wealth were used as a covariate in this study and results show that they are not significant predictors of age to child death. These results are consistent with the findings from [10]. In [11]’s study there is evidence of marital status and educational level being statistically significant covariates of modeling child mortality. Similarly, in this study, some levels of marital status (married and formerly married) are significant predictors of child death. Many authors have produced research about child mortality in various parts of the world, however, there is very few available literature which focus on the time to child mortality. Therefore, this study will have significant contribution to literature as it stretches the study of child mortality to also studying how age is plays a role in child death.

Conclusion

In this study we modelled age at child death in Sierra Leone using discrete time survival with survey weights. Results show that the baseline hazard of child death is behaving in a cubic polynomial shape. Furthermore, we see how having health insurance reduced the odds of experiencing child death. Mother having children at older age also increased the odds of their children to be prone to child death. Another variable that gave significant findings is the place of delivery, as it shows how those who deliver at home and public sectors had greater odds of having their children dying before the age of 5 years than those that may have delivered at private health sectors. This may be due to lack of proper infrastructure, sufficient health services or high volumes of people dependent on limited available health care.

The results obtained from this study are important and may assist the government of Sierra Leone in policy making decisions, particularly with regards to how the country can achieve Sustainable Development Goals and maintain low child mortality rates. Along with other variables that may be significant predictors of child death, the ones identified in this study can formulate guidance on areas that require stringent monitoring to ensure that Sierra Leone reaches the sustainable development goals.

Availability of data and materials

The datasets supporting the results and conclusion in this study are publicly available from the DHS program https://dhsprogram.com/methodology/survey/survey-display-545.cfm

Abbreviations

AIC:

Akaike Information Criterion

DHS:

Demographic Health Survey

MDG4:

Millennium Development Goal 4

OR:

Odds Ratio

PSU:

Primary Sampling Units

SLDHS:

Sierra Leone Demographic Health Survey

SSU:

Secondary Sampling Units

References

  1. Sartorius BK, Sartorius K. Global infant mortality trends and attributable determinants – an ecological study using data from 192 countries for the period 1990–2011. Popul Health Metrics. 2014;12:29. https://doi.org/10.1186/s12963-014-0029-6.

  2. Bryce Jennifer, Black Robert E, Victoria Caesar G. Millennium Development Goals 4 and 5: progress and challenges. BMC Medicine. 2013;11:1–4.

    Article  Google Scholar 

  3. Molyneux M, Molyneux E. Reaching millennium development goal 4. Lancet Glob Health. 2016;4. https://doi.org/10.1016/S2214-109X(16)00009-7.

  4. Byaro M. Commentary: on the effects of health expenditure on infant mortality in sub-Saharan Africa: evidence from panel data analysis. Health Econ Rev. 2021;11:11. https://doi.org/10.1186/s13561-021-00310-6.

  5. World population, review. Sierra Leone population (Demographics, maps, graphs). 2022. https://worldpopulationreview.com/countries/sierra-leone-population.

  6. Unicef. Unicef for evey child: Sierra Leone. Unicef.org. [Online] 2016. https://www.unicef.org/sierraleone/maternal-neonatal-and-child-health.

  7. Government of Sierra Leone. Interim millenium development goals Sierra Leone . relief web . [Online] 2016. https://reliefweb.int/report/sierra-leone/sierra-leone-interim-millennium-development-goals-report-2015.

  8. StatsSL and ICF. Sierra Leone DHS Report. Demographic Health Surveys Program. [Online] 2020. https://dhsprogram.com/publications/publication-FR365-DHS-Final-Reports.cfm.

  9. XIA, FANG. Infant mortality and desired fertility: the case of the free health care initiative in Sierra Leone. Taylor & Francis online. [Online] 07 06 2022. https://doi.org/10.1080/00220388.2022.2081501.

  10. Naz L, Patel KK. "Determinants of infant mortality in Sierra Leone: applying Cox proportional hazards model". Int J Soc Econ. 2020;47(6):711–26. https://doi.org/10.1108/IJSE-08-2019-0478.

  11. Izugbara C. Single motherhood and neonatal and infant mortality in Sierra Leone, Burkina Faso and Burundi. Pub health. 2016;135:122–130. https://doi.org/10.1016/j.puhe.2016.01.017.

  12. Discrete-Time Methods for the Analysis of Event Histories; 13; https://doi.org/10.2307/270718. Allison, Paul D. 1982, JSTOR,. 70–72.

  13. Xin, Xin;. University ofGguelph. [Online] August 2011. https://atrium.lib.uoguelph.ca/xmlui/handle/10214/2974.

  14. Validity and efficiency of approximation methods for tied survival times in cox regression. Hertz-Picciotto, I., & Rockhill, B. 3, s.l. : JSTOR,. 53.

  15. Discrete Survival Time Constructions for Studying Marital Formation and Dissolution in Rural South Africa; 11; https://www.frontiersin.org/articles/https://doi.org/10.3389/fpsyg.2020.00154 . Batidzirai, Jesca, Manda , Samuel and Mwambi, Henry. 2020, Frontiers. 1664–1078.

  16. Manda S. Meyer R. Age at first marriage in malawi: a bayesian multilevel analysis using a discrete time-to-event model. Journal of the royal statistical society. Series a (Statistics in Society). 2005;168(2):439–55. http://www.jstor.org/stable/3559971.

  17. Steele F. Multilevel discrete-time event history models with applications to the analysis of recurrent employment transitions. Australian & New Zealand Journal of Statistics. 2011;53:1–20. https://doi.org/10.1111/j.1467-842X.2011.00604.x.

  18. Baker C. Don’t let complex survey data get the best of you! SAS® survey PROCs for categorical data analysis. SAS institute. [Online] 2019. https://support.sas.com/resources/papers/proceedings19/3958-2019.pdf.

  19. Survival analysis using SAS. A practical guide. Allison, P.D. 9781599946405. SAS Institute. 2010. https://books.google.co.za/books?id=nhtpmAEACAAJ.

  20. Survival analysis. A1 - Guo, S. 9780195337518. PKT GD to social work research methods. USA: Oxford University Press; 2010. https://books.google.co.za/books?id=g58RDAAAQBAJ.

  21. Neels, Prof. dr. K. KU Luven. [Online] 2014. https://lstat.kuleuven.be/about/Course_Documents/Eventhistoryanalysis/Chapter3%28Discrete-timeModels%2CAcademicYear2012-2013%29.pdf.

  22. Introducing survival and event history analysis. Mills, M. 9781446268360. Sage research methods. SAGE; 2011. https://books.google.co.za/books?id=AuVOnQAACAAJ.

  23. Muthén B, Masyn K. Discrete-time survival mixture analysis. J Educ Behav Stat. 2005;30(1):27–58. http://www.jstor.org/stable/3701319.

  24. Singer JD, Willett JB. It’s about time: using discrete-time survival analysis to study duration and the timing of events. J Educ Stat. 1993;18(2):155–95. https://doi.org/10.2307/1165085.

  25. Tutz G, Schmid M. Modeling discrete time-to-event data. 2016. https://doi.org/10.1007/978-3-319-28158-2.

  26. Thanda, Amanika. Career Foundry. [Online] 24 May 2022. https://careerfoundry.com/en/blog/data-analytics/what-is-logistic-regression/#logistic-regression-assumptions.

  27. Dlamini, welcome Jabulani. Statistical models to understand factors associated with under-five child mortality in tanzania. UKZN Research Space. [Online] 2016. http://hdl.handle.net/10413/13987.

  28. Applied logistic regression. Hosmer, D.W. Lemeshow, S. 9780471654025. Applied logistic regression. Wiley; 2004. https://books.google.co.za/books?id=Po0RLQ7USIMC.

  29. Archer K, Lemeshow S. Goodness-of-fit test for a logistic regression model fitted using survey sample data. Stata J. 2006;6:97–105. https://doi.org/10.1177/1536867X0600600106.

  30. Khaki, Jessie Jane. Discrete time hazard parameterization for child survival. University of Malawi chancellor college; 2018.

  31. Mosley WH, Chen LC. An analytical framework for the study of child survival in developing countries. Pop Develop Rev. 1984;10:25–45. https://doi.org/10.2307/2807954.

  32. Bailey, Mohamed . Oxford Academic. [Online] 1 August 1988. https://academic.oup.com/tropej/article-abstract/34/4/165/1633092.

  33. Anthony AB. Performing logistic regression on survey data with the new SURVEYLOGISTIC procedure. 2002. https://api.semanticscholar.org/CorpusID:581518.

  34. Davids S, Susuman AS, Latief A. Vulnerable diseases affecting child mortality in Sierra Leone: emerging Health Issue. J Asian African Stud. 2013;48:348–69. https://doi.org/10.1177/0021909612459017.

  35. Jellesen R, Strandberg-Larsen K, Jørgensen T, Olsen J, Thulstrup AM, Andersen AM. Maternal use of oral contraceptives and risk of fetal death. Paediatr Perinat Epidem. 2008;22(4):334–340. https://doi.org/10.1111/j.1365-3016.2008.00942.x.

  36. Pardthaisong T, Yenchit C, Gray R. The long-term growth and development of children exposed to Depo-Provera during pregnancy or lactation. Contraception. 1992;45(4):313–24. https://doi.org/10.1016/0010-7824(92)90053-v.

  37. Stover John, Winfrey Willia. The effects of family planning and other, factors on fertility, abortion, miscarriage, and stillbirths in the Spectrum model. BMC Public Health. 2017;17:43–50.

    Article  Google Scholar 

  38. Tagoe Eunice Twumwaa, et al. A predictive model and socioeconomic and demographic determinants of under-five mortality in Sierra Leone. Heliyon. 2020;6(3):2405–8440.

    Article  Google Scholar 

  39. Clark S, Hamplová D. Single motherhood and child mortality in sub-Saharan Africa: a life course perspective. Demography. 2013;50(5):1521–49. https://doi.org/10.1007/s13524-013-0220-6.

  40. Murray, Donna. Very Well Family. [Online] 20 April 2020. https://www.verywellfamily.com/sudden-weaning-from-breastfeeding-4140720.

Download references

Acknowledgements

We gratefully acknowledge the DHS program for providing with the datasets. We also acknowledge the support we received from the University of KwaZulu-Natal through the provision of research facilities.

Funding

The authors have no support or funding to report this study.

Author information

Authors and Affiliations

Authors

Contributions

FM performed the literature review, data management, statistical analysis, and wrote the initial draft of the manuscript. JB designed and developed of the methodology of the study. FM and JB read and approved the final version of the manuscript.

Corresponding author

Correspondence to Fundiswa Pearl Mdluli.

Ethics declarations

Ethics approval and consent to participate

All participants gave informed consent to participate in the survey before each interview. The DHS program keeps strict standards of ensuring anonymity and protecting the privacy of all participants. ICF Institutional Review Board (IRB) ensures that the survey complies with the U.S Department of Health and Human Services regulations and procedures to protect human subjects while the host country IRB ensures that the survey complies with the laws and norms of the nation.

Consent to publication

Not applicable.

Competing interests

The authors declare that they have no complicit of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mdluli, F.P., Batidzirai, J.M. Discrete-time survival analysis with survey weights: a case study of age at child death in Sierra Leone. BMC Public Health 23, 1522 (2023). https://doi.org/10.1186/s12889-023-16412-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12889-023-16412-1

Keywords