Skip to main content

Spatial variation and hot-spots of district level diarrhea incidences in Ghana: 2010–2014



Diarrhea is a public health menace, especially in developing countries. Knowledge of the biological and anthropogenic characteristics is abundant. However, little is known about its spatial patterns especially in developing countries like Ghana. This study aims to map and explore the spatial variation and hot-spots of district level diarrhea incidences in Ghana.


Data on district level incidences of diarrhea from 2010 to 2014 were compiled together with population data. We mapped the relative risks using empirical Bayesian smoothing. The spatial scan statistics was used to detect and map spatial and space-time clusters. Logistic regression was used to explore the relationship between space-time clustering and urbanization strata, i.e. rural, peri-urban, and urban districts.


We observed substantial variation in the spatial distribution of the relative risk. There was evidence of significant spatial clusters with most of the excess incidences being long-term with only a few being emerging clusters. Space-time clustering was found to be more likely to occur in peri-urban districts than in rural and urban districts.


This study has revealed that the excess incidences of diarrhea is spatially clustered with peri-urban districts showing the greatest risk of space-time clustering. More attention should therefore be paid to diarrhea in peri-urban districts. These findings also prompt public health officials to integrate disease mapping and cluster analyses in developing location specific interventions for reducing diarrhea.

Peer Review reports


Diarrhea is an ongoing public health threat, especially in developing countries. More than 1.7 billion episodes of diarrhea are recorded globally every year with the majority of these occurring in low and middle income countries [1,2,3,4,5,6]. Infection is mainly through contaminated water and food as a result of poor hygiene [7]. The persistence of diarrhea has been attributed to socio-economic inequalities such as low income levels, illiteracy, and inadequate safe water and sanitation [8,9,10,11,12].

In Ghana, diarrhea is the second most common health problem treated in out-patient departments. The nationwide reported diarrhea incidences increased from 725,976 cases in 2010 to 1,576,542 cases in 2014. Improvement in water and sanitation conditions still remains the long-term solution to reducing diarrhea. Under scarce budgetary resources, knowledge of the geographic hot-spots is a consequential alternative that could provide immediate solution with respect to decision making towards appropriate allocation of resources. Previous diarrhea studies in Ghana have predominantly focused either on single geographic units or the characteristics of the affected individuals [13,14,15,16,17]. These studies are unable to characterize the geographic areas of priority; hence a knowledge gap with respect to the geographical patterns still remains. Diarrhea morbidities vary across geographical areas; some areas are likely to sustain exceptionally high morbidities over time due to unplanned urbanization. The premise of this study also derives from previous population based studies [14, 15] that have suggested variation in diarrhea incidences at wider geographical units. Yet it is still unknown which areas have a higher than expected risk. It is thus imperative to identify areas of hot-spots as it is crucial to assist decision makers to assess programmatic needs, prioritize interventions and monitor progress. Children are the most vulnerable to diarrhea; knowledge of diarrhea hot-spots will also be an important step towards achieving the Sustainable Development Goal 3 (SDG 3) of ensuring healthy lives and promote well-being for all at all ages.

Our objective is to study the geographical patterns and hot-spots of diarrhea in Ghana. The demographic and socio-demographic indices amongst districts in Ghana are widely diverse as are diarrhea incidences. Since diarrhea morbidities are conditioned by socio-demographic factors, and since these factors are geographically correlated in space, we expect diarrhea morbidities to exhibit space-time clustering. For instance, unplanned rapid urbanization fueled by rural-urban migration can have substantial influence on diarrhea morbidities due to stress on existing amenities which do not meet the demands of the rising population. No previous study has explored the country-wide spatial patterns and hot-spots of diarrhea in Ghana. Our study is therefore focused on the spatial and space-time clustering of diarrhea. An additional purpose of the study is to examine the impact of urbanization on space-time clustering of diarrhea. Geographical hot-spots of diarrhea have been explored in Thailand [18] using the Local Indicator for Spatial Association (LISA) statistic. Kulldorff’s spatial scan statistic is well suited for detecting space-time clusters, hypotheses testing and making etiological inferences [19]. It has been used to study clustering of diarrhea in Ethiopia without formally recounting the possible causes of the clusters [20]. We recognize the challenge in the arbitrary selection of the maximum cluster size for spatial scan statistics [21]. We use the average behavior of spatial dependency structure, i.e. the practical range of the semi-variogram, to infer an empirical cluster window size. We consider a semi-variogram estimator that accounts for heterogeneous denominators of the rate parameter [22].

The remainder of the manuscripts is organized as follows. First, we develop empirical Bayesian smoothed maps of diarrhea. Second, we detect and map geographical areas of higher than expected incidences using the spatial scan statistics. Third, we describe the impact of urbanization on the occurrence of space-time clustering using logistic regression. We end with discussions and conclusions.

Methods and analysis

Study area and data

Ghana is centrally located on the west coast of Africa (Fig. 1) with a total land area of 238,589 km2. It is a tropical region with varying temperatures and rainfall intensities. Ghana consists of ten administrative regions which are subdivided into 170 districts. Projections by the Ghana Statistical Service (GSS) puts the current population at 27,043,093. The spatial scale of our analysis is the district level of which data had been recorded. The population data were obtained from the Ghana Statistical Service (GSS). Diarrhea morbidities on outpatient records from 2010 to 2014 were obtained from the Centre for Health Information and Management (CHIM) of the Ghana Health Services (GHS).

Fig. 1

District map of Ghana showing the its neighboring countries; Cote d’Ivaire (left), Burkina Faso (top) and Togo (right)

Mapping the spatial distribution of diarrhea risk

Area specific disease indices such as the relative risk, also called standard morbidity ratio (SMR), are important measures of neighborhood health status. The SMR is useful for guiding health interventions and allocations of health resources. In this study, we mapped the spatial distribution of the SMR rather than considering the disease rates in isolation. Let O i , i = 1 ,  …  , m, represent random variable of diarrhea cases o 1 ,  …  , o m in m districts. We assume that the O i are independently Poisson distributed O i  ~ Pois(e i r i )with mean proportional to the unknown relative risk r i , such that \( p\left({o}_i\right)=\left\{{\left({e}_i{r}_i\right)}^{O_i} \exp \left(-{e}_i{r}_i\right)\right\}/{o}_i! \), where e i is the expected number of cases in district i. Using the log-likelihood function \( \log L={\sum}_{i=1}^n\left\{{o}_i \log \left({e}_i{r}_i\right)-{e}_i{r}_i\right\} \), the maximum likelihood estimator for the unknown relative risk is obtained as \( {\widehat{r}}_i={o}_i/{e}_i \). The corresponding conditional mean and variance are \( E\left[{\widehat{r}}_i\left|{r}_i\right.\right]={r}_i \) and \( V\left[{\widehat{r}}_i\left| r\right.\right]={r}_i/{e}_i \), respectively. The expected number of casese i is defined in the absence of covariates as the number of cases in an epidemiologic “null model” of incidences e i  = πn i . Here, n i is the number of persons at risk in district i, and πis the individual level constant baseline risk estimated from the aggregated population by means of \( \pi ={\sum}_{i=1}^m{o}_i/{\sum}_{i=1}^m{n}_i \). A major drawback of this estimate is that it leads to unstable estimates with areas of small populations showing the highest variability [23,24,25]. To account for this, we use the empirical Bayesian smoothing to borrow information across neighboring districts. This smoothing consists of obtaining a weighted average between the raw estimates for each district and the neighboring average, with weights proportional to the underlying population at risk [26]. In effect, districts with relatively small populations will tend to have their estimates adjusted considerably, whereas for districts with relatively large populations, the estimates will barely change. Following Clayton and Kaldor [26] and Gatrell and Bailly [27], the smoothed estimates of the relative risk is expressed as\( {r}_i^{\mathrm{EB}}={\varpi}_i{r}_i+\left(1-{\varpi}_i\right){\overline{r}}_i \), where the respective weights ϖ i for districts equal \( {\varpi}_i={\sigma}_i^2/\left[{\sigma}_i^2+\left({\overline{r}}_i/{e}_i\right)\right] \). Here \( {\overline{r}}_i \) and \( {\widehat{\sigma}}_i^2 \)are the empirical local estimates of spatially varying prior mean and variance, respectively. We used the method of moments [25] to estimate \( {\overline{r}}_i={\Sigma}_j{w}_{i j}{o}_i/{\Sigma}_j{w}_{i j}{e}_i \)and \( {\widehat{\sigma}}_i^2=\left[{\Sigma}_j{w}_{i j}{e}_i{\left({r}_i-{\overline{r}}_i\right)}^2\right]/{\Sigma}_j{w}_{i j}{e}_i-{\overline{r}}_i/\left({\Sigma}_j{w}_{i j}{e}_i/ n\right) \). We estimated the local mean \( {\overline{r}}_i \) and the variance \( {\widehat{\sigma}}_i^2 \) based on the spatial neighborhood structure of the dataw ij , such that w ij  = 1if districts i and j are neighbors, and zero otherwise.

Spatial scan statistics

We used the spatial scan statistics developed by Kulldorff’s [21] to detect the presence of spatial and space-time clusters or hot-spots of diarrhea. We defined hot-spots as clusters with high than expected or elevated risk. The spatial scan statistic is a widely cluster detection tool to detect and evaluate geographical areas of excess risk against the null hypothesis of random distribution. It is based upon the principle that the number of cases in a geographic area follow a Poisson distribution according to a known underlying population at risk. This cluster detection method offers several advantages over other scan statistics methods (e.g. [28,29,30]): (1) it corrects for multiple comparisons, (2) it adjusts for the heterogeneous population densities amongst the different areas in the study, (3) it detects and identifies the location of the clusters without prior specification of their suspected location or size thereby overcoming pre-selection biases, and (4) it allows adjustment for covariates. The significance of Kulldorff’s scan statistic is widely acknowledged in spatial epidemiology [15, 19, 21, 31,32,33,34,35,36,37,38,39,40,41].

Cluster window size

Critical to the spatial scan statistics is the selection of the maximum window size. Since there is no clear guideline for a choice, it is often chosen somewhat arbitrarily; e.g. as a percentage of the at risk population, or either based on experience or knowledge of the extent of clustering. Hjalmars et al. [34] suggest 10% of the population at risk whereas Kulldorff et al. [42] suggest 50% of the population at risk. Too large a window size may define too large areas as clusters which might be expensive and difficult for further epidemiological investigation. Larger sizes would also indicate areas of exceptionally low rates outside the window rather than areas of exceptionally high rates within the window. Too small a window may obscure important clusters. Since the maximum window size is related to the extent of spatial continuity, we estimated this using the semi-variogram. We assumed the relative risk as second order stationary random field; its theoretical semi-variogram γ(h)between any two districts i and j is γ(h) = 0.5E[r i  − r j ]2, whereh = |i − j|is the Euclidian distance between the centroids and E denotes the mathematical expectation. The corresponding method of moments (empirical) estimator [43], after forming multiple distance pairs, equals\( {\gamma}^{\ast }(h)=0.5{\left\{ N(h)\right\}}^{-1}{\sum_{i=1}^{N(h)}\left({r}_i-{r}_j\right)}^2 \), where N(h)is the number of observation pairs separated by h. The traditional semi-variogram estimator, however, is not suited for the analysis of proportion since it does not account for heterogeneous denominators. Following Monestiez et al. [22, 44], the different pairs (r i  − r j )were weighted by their corresponding denominators \( \frac{e_i\cdot {e}_j}{e_i+{e}_j} \) to homogenize their variance terms by dividing by weights proportional to their standard deviations. The adjusted experimental semi-variogram is then

$$ {\gamma}^{\ast }(h)=0.5{\left\{ N(h)\right\}}^{-1}{\sum}_{i=1}^{N(h)}\left\{\frac{e_i\cdot {e}_j}{e_i+{e}_j}{\left({r}_i-{r}_j\right)}^2-\overline{r}\right\} $$

where \( N\left(\mathrm{h}\right)=\sum \frac{e_i\cdot {e}_j}{e_i+{e}_j} \) is a normalizing constant and \( \overline{r}=\Sigma {e}_i\cdot {r}_i/\Sigma {e}_i \)is an estimate of the weighted mean of r. Monestiez et al. [22, 44] developed the above semi-variogram to account for the spatially heterogeneous observation efforts of sparse animal sightings for mapping the relative abundance of species (Balenoptera physalus). Simulation studies indicated that this approach performs better than simple population-weighted approaches and Bayesian smoothers [45]. Permissible semi-variogram models by means of least squares were fitted to the experimental semi-variograms. From the fitted models, the largest range amongst the range parameters of the various models was noted as the maximum window size for the spatial scan statistics.

Hot-spots detection

For the detection of purely spatial hot-spots, a circular window was defined which moves over the study region, centered on the centroid of each district. This varies from 0 to the maximum window size. This window size was defined based on the largest range of the semi-variogram models described in the previous section. Possible hot-spots are tested within the window whenever it centers on the centroid of each district. The null and alternative hypothesis are \( {H}_0: r\left(\Omega \right)= r\left(\overline{\Omega}\right) \)and \( {H}_1: r\left(\Omega \right)> r\left(\overline{\Omega}\right) \), respectively, where r(Ω)and \( r\left(\overline{\Omega}\right) \) are the relative risk within and outside the widows Ω and \( \overline{\Omega} \). We can then express o(Ω) ~ Pois(e(Ω) r(Ω)) and\( o\left(\overline{\Omega}\right)\sim Pois\left( e\left(\overline{\Omega}\right)\cdot r\left(\overline{\Omega}\right)\right) \). Whenever the window finds a new case, the likelihood function for elevated risk within the window in comparison with those outside the window is calculated. The likelihood function for window Ω is proportional to.

\( L\left(\Omega \right)=\begin{array}{c}\hfill \sup \hfill \\ {}\hfill \Omega \in \boldsymbol{\Omega} \hfill \end{array}{\left(\frac{o\left(\Omega \right)}{e\left(\Omega \right)}\right)}^{O\left(\Omega \right)}{\left(\frac{o\left(\overline{\Omega}\right)}{e\left(\overline{\Omega}\right)}\right)}^{O\left(\overline{\Omega}\right)}\times I\left(\frac{o\left(\;\Omega \right)}{e\left(\Omega \right)}>\frac{o\left(\overline{\Omega}\;\right)}{e\left(\overline{\Omega}\right)}\right) \)where I( )is the indicator function. The window Ω to be scanned by the spatial scan statistic is included in the set:Ω = {Ω ik |1 ≤ i ≤ m, 1 ≤ k ≤ K i }, where Ω ik , k = 1 ,  …  , K i , is the window composed of the (k − 1) nearest neighbors to district i. The window Ω that attains the maximum likelihood is defined as the most likely hot-spot (MLH). We carried out the test of significance level by means of the Monte Carlo hypothesis testing [46]. We rejected the null hypothesis of no clustering when the simulated p-value is less than or equal to 0.05 for most likely hot-spots and 0.1 for secondary hot-spots [47].

For the detection of space-time hot-spots, a cylindrical window with a circular geographic base and height corresponding to time was used. The base of the cylinder is centered around one of several possible districts and its radius is varying continuously in size. The height of the cylinder reflects any possible time interval of less than or equal to half the total study period. The window then moves in space and time, visiting each time interval and geographic location [19, 21]. The likelihood ratio test statistic is constructed in the same way of the purely spatial hot-spots. However, the computational algorithm is in three rather than two dimensions [48]. Most likely hot-spots for different time lengths (i.e. 1, 2, 3, or 4-year length) were scanned.

Odds of space-time hot-spots and population density

We applied binary logistic regression to unfold the odds of a particular district being a space-time hot-spot conditioned on the socio-demographic status. Here, we used urbanizationρas the independent variable. Such variable is an invaluable proxy for many socio-demographic indicators known to influence diarrhea. For the observed value y, dichotomized as y = 1 if a district is a space-time cluster and y = 0otherwise, the conditional probability is \( p\left( y=1\left|\rho \right.\right)=\frac{ \exp \left({\beta}_0+{\beta}_1\rho \right)}{1+ \exp \left({\beta}_0+{\beta}_1\rho \right)} \). This is linearized by means of the logit transform logit(p) = β 0 + β 1 ρ, where \( \mathrm{logit}(p)= \log \left(\frac{p}{1- p}\right) \), β 0is the intercept term, and β 1is the fixed effect of the independent variableρ. For meaningful interpretation and inferences, we classified urbanization into three strata, i.e. rural, peri-urban, and urban. Districts with predominantly rural communities were classified as rural (< 30% urban population), those with mixed urban and rural communities were classified as peri-urban (30%–70% of urban population), and those with predominantly urban communities were classified as urban (> 70% of urban population). We estimated three different fixed effect parameters for the odds ratios (OR),exp(β k ) k = 1 , 2 , 3, corresponding to each stratum.

Results and analysis

Spatial distribution of relative risk

The overall risk of diarrhea varied with increasing trend ranging from 0.3% in 2010 to 0.58% in 2014. Seasonal variations were not analyzed due to the coarse temporal resolution of the data. Figure 2 shows the empirical Bayesian smoothed maps with remarkable spatial variations. We found temporal similarities of spatial patterns as some districts of either high or low rates remained same throughout the study period. Typically, the relative risk of districts within the mid-west parts remained pronounced and consistent throughout the study period.

Fig. 2

Spatial distributions of the empirical Bayesian smoothed estimates of the relative risks for 2010 (a), 2011 (b), 2012 (c), 2013 (d), 2014 (e), and cumulative estimates from 2010-2014 (f)

Spatial scan statistic

Cluster window

Setting the maximum window size was guided by the extent of spatial correlation of the relative risk. Experimental semi-variograms were computed using both traditional and adjusted estimators. The semi-variograms were estimated using 20 lags of 10 km. we fitted spherical, Gaussian, and exponential models using least squares. The exponential model expressed the largest range of spatial correlation, followed by the spherical and Gaussian models (See Table 1, Fig. 3a and b). When population heterogeneities were accounted for, the adjusted semi-variogram models had larger ranges and lower variances compared with the traditional estimators (Fig. 3c). Based on these, we chose a cluster window size of 70 km obtained from the practical range of exponential model of the adjusted semi-variogram estimator.

Table 1 Comparison between the adjusted and traditional semi-variogram models
Fig. 3

Empirical and theoretical semi-variogram models for both adjusted and traditional estimator. a: Adjusted semi-variogram estimator and theoretical models (exponential, spherical, and Gaussian). b: Traditional semi-variogram estimator and theoretical models (exponential, spherical, and Gaussian). c: Variations between exponential semi-variogram models estimated from the traditional and adjusted estimators

Hot-spots detection

Statistically significant primary and secondary hot-spots were observed using the maximum window size of 70 km. The primary hot-spot encompassed 15 districts with higher than expected relative risk of 1.67 (p < 0.001). This hot-spot had 595,655 observed cases compared with 370,194.21 expected cases covering almost 6.03% of the population. A total of 73 statistically significant secondary hot-spots were also observed. Table 2 presents the characteristics of the first 5 spatial hot-spots of diarrhea, while Fig. 4a shows the spatial distribution of the spatial hot-spots.

Table 2 Characteristics of the first 5 spatial hot-spots of diarrhea, 2010–2014
Fig. 4

Spatial distribution of purely spatial (a) and space-time hot-spots (b) for diarrhea, 2010–2014

Statistically significant space-time hot-spots (p < 0.001) were also observed. This consisted of a primary hot-spot and 21 secondary hot-spots (Table 3). The primary hot-spot was observed in 2013–2014 encompassing 15 districts with a likelihood ratio of 71,867.76 and relative risk of 2.16 (Table 3, Fig. 4b). The first secondary hot-spot had similar characteristics as the primary hot-spot. This hot-spot occurred in 2013–2014 and encompassed 11 districts with a likelihood ratio of 54,964.05 and relative risk of 2.07. The existence of most of the space-time hot-spots spanned for more than one year and were considered as long-term hot-spots. Space-time hot-spots which existed for only one year period were considered as emerging hot-spots.

Table 3 Characteristics of the space-time hot-spots of diarrhea, 2010–2014

Odds of space-time hot-spots and population density

From the results of the logistic regression model, the overall odds of space-time clustering was 1.62 (Table 4). The mostly likely stratum of space-time clustering is peri-urban districts. Space-time clustering is 11% higher in peri-urban districts (OR = 1.11; CI = [0.59–2.19]) than rural districts, and 43% lower in urban districts (OR = 0.57; CI = [0.23–1.39]) as compared with rural districts.

Table 4 Odds ratios and 95% confidence intervals of the logistic regression model


This study aimed to explore and map the spatial variation and hot-spots of district level diarrhea incidences in Ghana. The findings showed temporal variation in the overall risk of diarrhea, with increasing burden since 2010 to 2014. This is probably due to unmatched population increase with the provision of safe sanitation and drinking water. From 2010 to 2014, Ghana’s population has grown from ≈24.6 to ≈27.2 million, a growth rate of ≈10.6%. This high population growth rate has caused major changes in socio-economic and demographic activities especially in rural and peri-urban districts where health and sanitation is already limited.

The empirical Bayesian smoothed maps show substantial variation in the spatial distribution of diarrhea with districts of higher/lower than expected risk clustered. This is a symptom of wider socio-economic inequalities amongst districts. We found diarrhea risk was more pronounced and consistent within the mid-west parts probably because these parts are dominated with semi-deciduous and rain forests. High precipitation, which is mostly associated with the semi-deciduous and rain forests has been found to exacerbate the risk of diarrhea infection [49,50,51]. Temporal similarities in the spatial patterns is also an indication of sustained transmission of diarrhea, suggesting that the spatial variation of the risk factors haven’t changed over the period. For instance, higher than expected risks were observed at the mid-west part of Ghana throughout 2010 to 2014 while the southern part continued to exhibit lower than expected risks. Complementarily, statistical inference of patterns using the spatial scan statistics detected both primary and secondary hot-spots, with the primary hot-spot (Cluster 1) detected within the mid-west part. This was the largest hot-spot with a radius of 68.79 km and encompassed 15 districts. We observed mutual occurrences between the empirical Bayesian smoothed maps and the hot-spots detected by the spatial scan statistics. The districts within the primary hot-spot also had higher than expected relative risks from the empirical Bayesian smoothed maps. Only few of the districts with higher than expected relative risk were not identified as hot-spots, thus indicating the significance of formal testing and inference in cluster analysis. While testing whether these spatial hot-spots were emerging or long-term, the space-time scan statistics recounted most of the spatial hot-spots as long-term (Fig. 4a and b). Specifically, the first five purely spatial hot-spots detected at the mid-west part of Ghana were also statistically significant long-term hot-spots. These clustering patterns imply less progress in prevention and control as well as unimproved hygiene and sanitation practices amongst in these districts. The epidemiological implication of the hot-spots can be deduced from the varying nature of the possible risk factors of diarrhea. Many known correlates of diarrhea are environmental and socio-demographic factors which are diversely distributed amongst the districts in Ghana. Since changes in population dynamics are highly variable in space [52], their effects on socio-demographic factors are also variable in space. Since such variation is spatially dependent and continuous, the expectation is that their ripple effects on health outcomes will also be spatially dependent and clustered. This implies that countermeasures should be opportunely undertaken, and focused on the areas of long-term hot-spots.

The impact of urbanization on space-time clustering was diverse amongst the various urbanization strata. Comparatively, space-time clustering was lowest in urban districts than rural and peri-urban districts. The underlying reason might be that the richer and better educated who are knowledgeable to prevent, and can secure safe water and sanitation for their households are mostly found in urban communities. Also urban populations have greater opportunities for health education and preventative health care. We found that space-time clustering was comparatively higher in peri-urban districts than in rural, which was inconsistent with our expectation. The reason might be that peri-urban districts are mostly transitional zones often neglected by urban planners; they are constantly under pressure by increasing populations from urban and rural population influx. For instance, the high cost of housing in urban districts restrains most rural-urban migrants and the urban poor to settle in peri-urban communities, thus heightening the creation of slums and informal settlements. Ghana has been able to achieve remarkable levels of access to improved drinking water in urban areas, yet meeting the needs of unserved and underserved communities as well as growing peri-urban areas is still a considerable challenge. As a consequence, such peri-urban settlements are often plagued with poor water and sanitation problems which are the well-known driving forces of diarrhea. We found no study linking rural-urban morphology to space-time clustering of diarrhea. This prompts that further studies are required to explore detailed comparative dynamics of diarrhea morbidities between the different urbanization strata.

The implications of our findings are stated with some caution. First, homogeneity in both population and disease counts are assumed. Thus, within-district variation is assumed to be absent to restrain our study to fall within the ecological analysis framework. While such studies are necessary for neighborhood health planning and large area intervention, they do not access and infer individual level risk characteristics, the so called ecological fallacy. Secondly, confounding and interaction effects have not been accounted for in this study. It is possible that rural-urban morphology would not matter if individual level variables mediating diarrhea risk were taken into account. Thirdly, this study used rural-urban morphology as the only proxy to capture socio-demographic risk of diarrhea. Studies have associated diarrhea with a mix of attributable socioeconomic inequalities such as low income level, illiteracy, inadequate water and sanitation [8,9,10,11,12]. Our future studies seek to explore the spatially varying effects of several of these factors on diarrhea morbidities. That notwithstanding, the overriding advantage of our findings is two- fold. First, this study shows the importance of spatial locations as a covariate in identifying and mapping areas of elevated and sustained transmission of diarrhea in Ghana. These maps provide valuable information to assist in appropriate allocation of health care resources for better control and prevention. Second, it divulges the dependency of high space-time clustering on peri-urban districts. This may provide a valuable factor for consideration in neighborhood health planning.


This study has investigated the spatial variation of district level diarrhea incidences in Ghana by mapping and detecting hot-spots. Our study demonstrates the use of the extent of spatial continuity, the range parameter of the semi-variogram, to infer cluster window size for spatial scan statistics. We conclude that that the spatial distribution of diarrhea in Ghana is clustered, with evidence of emerging and long-term space-time hot-spots. The findings also infer that space-time clustering is higher in peri-urban districts compared with rural districts, and lowest in urban districts. These findings prompt health planners and policy makers to consider these patterns as critical when developing both short-term and long-term strategies to reduce diarrhea. We intend to further investigate risk factor characteristics of diarrhea within the emerging and long-term space-time hot-spots in the future.


  1. 1.

    Black RE, Cousens S, Johnson HL, Lawn JE, Rudan I. Global, regional, and national causes of child mortality in 2008: a systematic analysis. Lancet [Internet]. 2010;375. Available from:

  2. 2.

    Black RE, Morris SS, Bryce J. Where and why are 10 million children dying every year? Lancet. 2003 Jun;361(9376):2226–34.

    Article  PubMed  Google Scholar 

  3. 3.

    Boschi-Pinto C. Estimating child mortality due to diarrhoea in developing countries. Bull World Health Organ. 2008 Sep 1;86(9):710–7.

    Article  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Fischer Walker CL, Perin J, Aryee MJ, Boschi-Pinto C, Black RE. Diarrhea incidence in low- and middle-income countries in 1990 and 2010: a systematic review. BMC Public Health. 2012;12(1):1–7.

    Article  Google Scholar 

  5. 5.

    Parashar UD, Hummelman EG, Bresee JS, Miller MA, Glass RI. Global illness and deaths caused by rotavirus disease in children. Emerg Infect Dis J. 2003;9(5):565.

    Article  Google Scholar 

  6. 6.

    Liu L, Oza S, Hogan D, Perin J, Rudan I, Lawn JE, et al. Global, regional, and national causes of child mortality in 2000-13, with projections to inform post-2015 priorities: an updated systematic analysis. Lancet Lond Engl. 2015 Jan 31;385(9966):430–40.

    Article  Google Scholar 

  7. 7.

    Vesikari T, Torun B. Diarrheal diseases. In: Lankinen KS, Bergstrthn S, editors. Makela PH, and Peltomaa M, editors. London: Macmillan Press; 1994. p. 135–46.

    Google Scholar 

  8. 8.

    Dasgupta R. Exploring intra-household factors for diarrhoea diseases: a study in slums of Delhi. India J Water Health. 2008;6:289–99.

    Article  PubMed  Google Scholar 

  9. 9.

    Gupta A, Sarker G, Rout AJ, Mondal T, Pal R. Risk correlates of diarrhea in children under 5 years of age in slums of Bankura-West Bengal. J Glob Infect Dis. 2015;7:23–9.

    Article  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Mekasha A, Tesfahun A. Determinants of diarrhoeal diseases: a community based study in urban south western Ethiopia. East Afr Med J 80. 2003:77–82.

  11. 11.

    Pande S, Keyzer MA, Arouna A, GJS SB. Addressing diarrhea prevalence in the West African Middle Belt: social and geographic dimensions in a case study for Benin. Int J Health Geogr. 20087:17.

  12. 12.

    Woldemicael G. Diarrheal morbidity among young children in Eritrea: environmental and socio-economic determinants. J Health Popul Nutr. 2001;19:83–90.

    CAS  PubMed  Google Scholar 

  13. 13.

    Benneh G, Songsore J, Nabila JS, Amuzu AT, Tutu KA. Yangyuoru Y an. M, et al. in: (environmental problems and the urban household in the Greater Accra metropolitan area. GAMA)-Ghana. Stockholm Environment Institute: Stockholm; 1993.

    Google Scholar 

  14. 14.

    Gyimah SO. Interaction effects of maternal education and household facilities on childhood diarrhea in sub saharan Africa, the case of Ghana. J Health Pop Dev Countries. 2003. doi:10.12927/whp.2003.17628.

  15. 15.

    Krumkamp R, Sarpong N, Schwarz NG, Adelkofer J, Loag W, Eibach D, et al. Gastrointestinal infections and Diarrheal disease in Ghanaian infants and children: an outpatient case-control study. PLoS Negl Trop Dis. 2015 Mar 4;9(3):e0003568.

    Article  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Osumanu IK. Household environmental and behavioural determinants of childhood diarrhoea morbidity in the tamale metropolitan area (TMA). Ghana Geogr Tidsskr-Dan J Geogr. 2007;107(1):59–68.

    Google Scholar 

  17. 17.

    Shier RP, Dollimore N, Ross DA, Binka FN, Quigley M. an. S, G P. Drinking water sources, mortality and diarrhoea morbidity among young children in Northern Ghana. Trop Med Int Health. 1996;1:334–41.

    CAS  Article  PubMed  Google Scholar 

  18. 18.

    Chaikaew N, Nitin T, Marc S. Exploring spatial patterns of diarrhea in Chiang Mai. 8:36.

  19. 19.

    Kulldorff M, Nagarwalla N. Spatial disease clusters: detection and inference. In: Statistics in Medicine; 1995. p. 799–810

    Google Scholar 

  20. 20.

    Azage M, Kumie A, Worku A. Amvrossios CB. Childhood diarrhea exhibits spatiotemporal variation in Northwest Ethiopia: SaTSacn spatial statistical Analysis. 2015;10:12.

    Google Scholar 

  21. 21.

    Kulldorff M. A spatial scan statistic. Commun Stat-Theory Methods. 1997;269(6):1481–96.

    Article  Google Scholar 

  22. 22.

    Monestiez P, Dubroca L, Bonnin E, Durbec JP, Guinet C. Geostatistical modeling of spatial distribution of Balenoptera physalus in the northwestern Mediterranean Sea from sparse count data and heterogeneous observation efforts. Ecol Model. 2006;193(3–4):615–28.

    Article  Google Scholar 

  23. 23.

    Lawson. AB: Statistical Methods in Spatial Epidemiology. 2nd ed. New York: John; 2006.

  24. 24.

    Lawson AB, Browne WJ, Vidal-Rodeiro. CL: disease mapping with WinBUGS and MLwiN. Chichester 2003. Wiley and Sons;

  25. 25.

    Marshall RJ. Mapping disease and mortality rates using empirical Bayes estimators. Appl Stat. 1991:40–283.

  26. 26.

    Clayton D, Kaldor J. Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics. 1987;43(3):671–81.

    CAS  Article  PubMed  Google Scholar 

  27. 27.

    Gatrell AC, Bailly TC. Interactive spatial data analysis in medical geography. Soc Sci Med. 1996;42(6):843–55.

    CAS  Article  PubMed  Google Scholar 

  28. 28.

    Loader CR. Large-deviation approximations to the distribution of scan statistics. Adv Appl Probab. 1991;23(4):751–71.

    Article  Google Scholar 

  29. 29.

    Naus JI. Clustering of random points in two dimensions. Biometrika. 1965;52(1/2):263–7.

    Article  Google Scholar 

  30. 30.

    Turnbull BW, Iwano EJ, Burnett WS, Howe HL. Clark LC. Monitoring for clusters of disease: application to leukemia incidence in upstate. 1990;132:136–43.

    Google Scholar 

  31. 31.

    Chaput EK, Meek JI, Heimer R. Spatial analysis of human granulocytic ehrlichiosis near Lyme. Connect. 2002:8–943.

  32. 32.

    Cousens EK, Smith PG, Ward H, Everington D, Knight RSG. Geographical distribution of variant Creutzfeldt-Jakob disease in great Britain. Lancet. 2001;357(9261):1002–7.

    CAS  Article  PubMed  Google Scholar 

  33. 33.

    Green C, Hoppa RD, Young TK, Blanchard JF. Geographic analysis of diabetes prevalence in an urban area. Soc Sci Med. 2003:57–551.

  34. 34.

    Hjalmars U, Kullforff M, Gustafsson G, Nagarwalla N. Childhood leukemia in Sweden: Using GIS and a spatial scan statistics for cluster detection. Stat Med. 1996;15(7–9):707–15.

    CAS  Article  PubMed  Google Scholar 

  35. 35.

    Michelozzi P, Capon A, Kirchmayer U, Forastiere F, Biggeri A, Barca A, et al. Adult and childhood leukemia near a high-power radio station in. 2002. 155–1096 p.

  36. 36.

    Odoi A, Martin SW, Michel P, Middleton D, Holt J, Wilson J. Investigation of clusters of giardiasis using GIS and spatial scan statistics. Int J Health Geogr. 2004;3:11.

    Article  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Sabel CE, Boyle PJ, Loytonen M, Gatrell AC, Jokelainen M. Spatial clustering of amyotrophic lateral sclerosis in Finland at place of brith and place of death. Am J Epidemiol. 2003;157(10):898–905.

    CAS  Article  PubMed  Google Scholar 

  38. 38.

    Sheehan TJ, DeChelo LM. A space-time analysis of the proportion of late stage breast cancer in Massachusetts, 1988 to 1997. Int J Health Georgr. 2005;4:15.

    Article  Google Scholar 

  39. 39.

    Tiwari N, Adhikari CS, Tewari A, Kandpal V. Investigation of geo-spatial hotspost for the occureence of tuberculosis in Almora district, India, using GIS and spatial scan statistic. Int J Health Goegr. 2006;5:33.

    Article  Google Scholar 

  40. 40.

    Turnbull BW, Iwano EJ, Burnett WS, Howe HL, Clark LC. Monitoring for clusters of disease: application to leukemia incidence in upstate. N Y Am J Epidemiol. 1990;132:136–43.

    Article  Google Scholar 

  41. 41.

    Viel JF, Arveux P, Baverel J, Cahn JY. Soft-tissue sarcoma and nonHodgkin’s lymphoma clusters around a municipal solid waste incinerator with high dioxin emission. 2000. 152–13 p.

  42. 42.

    Kulldorff M, Feuer EJ, Miller BA, Freedman LS. Breast Cancer clustering in the northeast United State, a geographic approach. Am J Epidemiol [Internet]. 1997;146. Available from:

  43. 43.

    Matheron G. Les variables régionalisées et leur estimation: une application de la théorie des fonctions aléatoires aux sciences de la nature. Paris: Masson; 1965.

    Google Scholar 

  44. 44.

    Monestiez P, Dubroca L, Bonnin E, Durbec JP, Guinet C. Comparison of model based geostatistical methods in ecology: application to fin whale spatial distribution in northwestern Mediterranean Sea. In: Leuangthong O, Dordrecht DCV, editors. Geostatistics Banff. Kluwer Academic Publishers: The Netherlands; 2005. p. 777–86.

    Google Scholar 

  45. 45.

    Goovaerts P. Geostatistical analysis of disease data: estimation of cancer mortality risk from empirical frequencies using Poisson kriging. Int J Health Geogr. 2005;4:31.

    Article  PubMed  PubMed Central  Google Scholar 

  46. 46.

    Dwass M. Modified randomization tests for non-parametric hypothesis. Ann Math Stat. 1957;28:181–7.

    Article  Google Scholar 

  47. 47.

    Kulldorff M. SaTScan users guide for version 6.0. Last accessed 4. 2006.

  48. 48.

    Kulldorff M. Prospective time-periodic geographical disease surveillance using a scan statistic. J R Stat Soc A. 2001;164:61–72.

    Article  Google Scholar 

  49. 49.

    Alexander KA, Carzolio M, Goodin D, Vance E. Climate change is likely to worsen the public health threat of Diarrheal disease in Botswana. Int J Environ Res Public Health. 2013 Apr;10(4):1202–30.

    Article  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Carlton EJ, Eisenberg JNS, Goldstick J, Cevallos W, Trostle J, Levy K. Heavy rainfall events and diarrhea incidence: the role of social and environmental factors. Am J Epidemiol. 2014 Feb 1;179(3):344–52.

    Article  PubMed  Google Scholar 

  51. 51.

    Philipsborn R, Ahmed SM, Brosi BJ, Levy K. Climatic drivers of Diarrheagenic Escherichia coli incidence: a systematic review and meta-analysis. J Infect Dis. 2016 Jul 1;214(1):6–15.

    Article  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Salvacion AR, Magcale-Macandog DB. Spatial analysis of human population distribution and growth in Marinduque Island. Philippines J Mar Isl Cult. 2015 Jun;4(1):27–33.

    Article  Google Scholar 

Download references


We extend our sincere appreciation to the Centre for Health Information and Management [CHIM] of the Ghana Health Services for providing all the necessary data and background information for this research.


Not applicable.

Availability of data and materials

The data that support the findings of this study are available from Centre for Health Information and Management (CHIM) but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of CHIM.

Author information




FBO conceived of the study and carried out the analysis and drafted the manuscript. AS conceived of the study, and participated in its design and coordination and helped to draft the manuscript. both authors read and approved the final manuscript.

Corresponding author

Correspondence to Frank Badu Osei.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Osei, F.B., Stein, A. Spatial variation and hot-spots of district level diarrhea incidences in Ghana: 2010–2014. BMC Public Health 17, 617 (2017).

Download citation


  • Diarrhea Incidence
  • Spatial Scan Statistic
  • Space-time Clustering
  • Peri-urban District
  • Urbanization Strata