 Research article
 Open Access
 Open Peer Review
 Published:
Spatiotemporal correlation networks of dengue in the state of Bahia
BMC Public Healthvolume 14, Article number: 1085 (2014)
Abstract
Background
Dengue is a public health problem that presents complexity in its dissemination. The physical means of spreading and the dynamics of the spread between municipalities need to be analyzed to guide effective public policies to combat this problem.
Methods
This study uses timing varying graph methods (TVG) to construct a correlation network between occurrences of reported cases of dengue between cities in the state of BahiaBrazil. The topological network indices of all cities were correlated with dengue incidence using Spearman correlation. A randomization test was used to estimate the significance value of the correlation.
Results
The correlation network presented a complex behavior with a heavytail distribution of the network edges weight. The randomization test exhibit a significant correlation (P < 0.0001) between the degree of each municipality in the network and the incidence of dengue in each municipality.
Conclusions
The hypothesis of the existence of a correlation between the occurrences of reported cases of dengue between different municipalities in the state of Bahia was validated. The significant correlation between the node degree and incidence, indicates that municipalities with high incidence are also responsible for the spread of the disease in the state. The method proposed suggests a new tool in epidemiological control strategy.
Background
Dengue is a tropical disease of viral origin transmitted through the bite of the Aedes aegypti mosquito. Because dengue is an important arboviral disease, it is important to clearly define which control objectives can in fact be achieved and which preventive measures are required to do so. Dengue has a higher incidence in tropical countries where the climate is favorable to the proliferation of the A aegypti mosquito. In 2012, there were approximately 2.5 billion people worldwide at risk of infection. As a result, dengue is considered one of the most serious public health problems among reemerging diseases [1].
Twofifths of the world' population is at risk of dengue infection. The lack of effective drugs and vaccines makes vector control the sole tool for primary intervention. Understanding the dengue virus, the transmitting agent, and its interactions with the host is essential for the development of epidemiological control strategies [2, 3].
Many factors are responsible for the resurgence of epidemics of classic and hemorrhagic dengue in the last years of the 20th century. Demographic and social changes, such as population growth, urbanization, and modern transport, contribute to the increased incidence and geographic spread of dengue. The prevalence of this public health problem is greater in tropical areas of Africa, Asia and the Americas because the vector does not survive at low temperatures. The epidemiological situation in Latin America is similar to the situation in Southeast Asia, where multiple serotypes circulate, thus leading to an increased number of cases of classic and hemorrhagic dengue. In 2002, Latin American countries reported more than one million cases of dengue, among which approximately 17,000 cases were of hemorrhagic dengue, which resulted in 225 deaths [4, 5]. Dengue is a major cause of morbidity in the tropics, especially prior to 1999 [6, 7].
Despite being a disease with significant impact on public health policies, there are unanswered questions about dengue’s spreading dynamics, such as the importance of the means of transport or the network of disease spreading across municipalities.
This work aims to study the mechanisms for the spread of dengue across municipalities in the state of Bahia  Brazil. We used an adapted version of the correlation network method proposed by Eguiluz et al. [8] to build the relationships between municipalities.
Correlation networks have been used to characterize various dissemination processes in complex systems, such as seismic events in California [9], rainfall indices [10], firing rates in neurons [11, 12], brain activity [8, 13], climatic factors [14], and other factors related to dengue [15, 16]. In all of them, the generated networks provided insight into new properties due to the complex structure of the interactions among network elements.
In this study, we evaluated the hypothesis that there are correlations between the occurrences of cases of dengue between municipalities in the state of Bahia and that the network generated from these correlation is related to the information on the mechanism of disease spread in the state.
Methods
To assess temporal trends in connectivity between cases of dengue, we used the mathematical basis of the timevarying graph (TVG) formalism [17–19].
Timevarying graphs (TVG)
According to the conventional formalism of graph theory, a graph is defined as G (E, V), where V represents the set of vertices or nodes and E the set of edges e_{i,j}, with i and j ∈V such that E ⊆ V × V, i.e., each edge of the set E is defined by the ordered pair (V_{i}, V_{j}). Because a TVG is a dynamic system, the relationship between its elements is considered in a defined time interval T ⊆ N, where T represents the lifetime of the system, and Ν is the set of natural numbers. Thus, the TVG formalism in its simplified form can be defined as a graph G (V, E, F), where F represents the edge presence function, F: E × T ➔ {0, 1}. This function indicates whether there is an edge e_{i,j} ∈E at a given time t ∈T.
Timevarying correlation networks (TVCN)
The use of the correlation between time series to build networks was originally proposed by Eguiluz et al. [8] in studies on brain activity performed with a functional magnetic resonance imaging device. In that study, the different brain regions (voxels) represented the vertices of the network, and the occurrence of significant correlations between the time series of activity represented the edges of the network. A generalization of the method developed by Eguiluz et al. [8] was proposed by Silva et al. [12]. In the proposed method, the correlations between nodes are calculated only for a time window smaller than the size of the series. Based on this modification, the authors estimated the temporal evolution of neuronal activity networks in mice by sliding the window along the neuronal firing time series.
The method proposed in this paper adds the concept of static aggregated networks (SAN) to the method proposed by Silva et al. [9]. Using the formalism of the timevarying graph, we defined a graph as G(C, M), where M is the set of vertices of the network, and C is the set of edges that represents the existence of significant correlation between the time series assessed between each vertex i and j, with i and j ∈M and C ⊆ M × M. The TVCN is a dynamic system with a range defined by T and can thus be formalized as a timevarying graph G(M, C, F), where F represents the edge presence function, F: C × T ➔ {0,1}, representing the existence (1) or nonexistence (0) of correlation between the time series of a pair of vertices in a given time. Another way to understand the presence function F is that it indicates the existence of an edge C _{ i,j } at a given time t[12].
For the TVCN, this process can be formally defined as follows:
where c _{ i,j } (t) represents the correlation between cases of dengue in the municipalities i and j within a time window of size J centered at time t. Accordingly, this definition implies that two vertices are connected in the network only if the assessed correlation c _{ i,j } (t) is high enough ($\ge \stackrel{\u2012}{\mathrm{c}}$). Without loss of generality, this study assumes $\stackrel{\u2012}{\mathrm{c}}$ to be constant over time for simplicity.
Once the set of TVG networks is created, we can integrate the networks in time so that
where w _{ i,j } defines the weight of the edge between vertices i and j of the static aggregated network (SAN). Thus, the edges of the SAN will represent the frequency of occurrence of the edge over period T of the TVG.
Timevarying correlation networks of cases of dengue (TVCND)
The method described in Section TVCN was applied to the daily time series of cases of dengue in 417 municipalities of the state of Bahia in Brazil for the period between 01/02/2000 and 04/16/2009. Data were collected from the database of the Notifiable Diseases Information System (Sistema de Informação de Agravos de Notificação  SINAN), an entity of the Federal Government. The SINAN database is fed by the reporting and investigation of cases of diseases and clinical conditions that appear on the national list of diseases of compulsory notification. In the creation of TVCND, the municipalities represent the vertices of the network, and their correlations in time represent the edges. The window size used was 10 days because this time is the average time for occurrence of the disease symptoms. The correlation used was the Pearson correlation, and the correlation index used was the pvalue. When pvalues are used to measure the correlation, the equation (1) inverts its logical expression i.e. an edge are added when ${c}_{i\mathit{,}j}\left(t\right)\le \stackrel{\u2012}{\mathrm{c}}$.
In Figure 1, we illustrate the method applied to the dengue data. Figure 1(a) shows the time series for each municipality; Figure 1(b) shows the correlation matrix between all pairs of municipalities for the time window J. The grayscale matrix illustrates the different Pearson correlation values (pvalue). The edge is considered for values of correlation below a critical value $\left({c}_{i\mathit{,}j}\left(t\right)\phantom{\rule{0.25em}{0ex}}\le \stackrel{\u2012}{\mathrm{c}}\right)$ and is represented by the value 1 in the adjacency matrix. It is important to remember that small pvalues indicate high correlations. The network is built from the adjacency matrix in Figure 1(c).
Once a network has been built for a time t, the window is slid by a oneday increment, and a new network is calculated. The set of all networks over time forms the TVCND.
The search for the critical value of the threshold correlation $\stackrel{\u2012}{\mathrm{c}}$ that best represents the network shows that when it is very high (represents low correlation), the noises result in the creation of many edges. Conversely, when the threshold is very low (indicating high correlations), the restrictions increase, and much information is removed from the network.
Results and discussion
Static aggregated networks of dengue in Bahia (SAN)
The criterion used to find the optimal value of $\stackrel{\u2012}{\mathrm{c}}$ was to adopt the value where the total number of edges of the oneday subgraph best correlates with the sum of all cases of municipalities of that day. In Figure 2, we show the scatterplot of the network generated for a presence function with the critical correlation $\stackrel{\u2012}{\mathrm{c}}$ that exhibited the best correlation ($\stackrel{\u2012}{\mathrm{c}}=\phantom{\rule{0.25em}{0ex}}0.05$).
This correlation shows a coupling between the number of cases in a day and network connectivity on the same day, which indicates that the connection mechanism between the municipalities is an important factor in the spread of the disease.
The TVCN method was applied to cases of dengue in Bahia, generating 3393 subgraphs, where each subgraph represents a correlation network for a 10day window of data. Its respective SAN was calculated so that a single weighted network is generated where the weights of the edges represent the number of days during which there was a correlation between adjacent vertices. Even assuming a high level of correlation ($\stackrel{\u2012}{\mathrm{c}}=\phantom{\rule{0.25em}{0ex}}0.05$), many edges with low weight (<100 days) occur in the network. In Figure 3, we show the network for weights above 100 days. The figure shows that few nodes govern the pattern of correlation between cases of dengue in the municipalities. If the 100 days threshold filter is not used, the correlation between cases of dengue in municipalities present a great number of edges, blurring completely the figure, showing no information about the network connectivity.
We recall that if we do not use weights the correlation between cases of dengue in municipalities presents a great number of nodes featuring a highly connected graph.For a better evaluation of the SAN, we calculated the cumulative distribution of the weights of the edges (Figure 4). The figure shows a heavytailed distribution without a defined mean. The central region of the curve, between 10 and 365 days, exhibits a powerlaw decrease with an exponent equal to −1.90.
Correlation between the degree of the correlation network of cases of dengue and the incidence of dengue
To seek an epidemiological interpretation of timevarying correlation networks of dengue (TVCND), we calculated the correlation between the degree of each municipality in the SAN and the incidence of dengue in each municipality. Using the randomization method [20–22], we applied Spearman correlation to 100,000 randomizations of the original data. The correlation was greater than or equal to the original correlation in only 0.001% of the set of randomizations, which leads to a pvalue <0.00001. Figure 5 shows a comparison between the distribution of Spearman correlation indices of the random outputs with the correlation value obtained from the original data. This comparison indicates that, although low, the correlation is significant.
The existence of a significant correlation between the degree of correlation in SAN and the incidence of dengue in the municipality indicates that municipalities with high incidence are also responsible for the spread of the disease in the state.
The correlations between the occurrences of reported cases of dengue in different municipalities in the state of Bahia can aid in creating more efficient campaigns for prevention and the fight against dengue. The weights of the edges of the correlation network identify the most connected municipalities in the context of dengue in this state. In Table 1, in the case of an outbreak in the municipality of Abaíra, the municipalities of Jequié and Salvador should prioritize actions for preventing dengue.
Conclusions
The correlation network of cases of dengue in Bahia shows how one municipality follows the behavior of a different municipality. The correlation between the incidence data and the correlation network itself validates the hypothesis of the existence of a correlation between the occurrences of reported cases of dengue between different municipalities in the state of Bahia.
The information generated shows municipalities with high correlation, when cases of dengue increase in a municipality, there is also an increase of cases in the other correlated municipality and vice versa. This information can guide the attention of public authorities so that when diagnosing a growth in the number of cases or even an epidemic in a municipality, campaigns can be launched in the municipalities that have the highest correlations with the municipality where the outbreak occurred. Thus, epidemics that spread through several municipalities would be minimized with regard to their spread, which would reduce its period of existence.
Abbreviations
 TVG:

Timing Varying Graph
 A aegypti :

Aedes Aegypti
 TVCN:

TimeVarying Correlation Networks
 TVCND:

TimeVarying Correlation Networks of Cases of Dengue
 SINAN:

Sistema de Informação de Agravos de Notificação
 SAN:

Static Aggregated Networks of dengue in Bahia.
References
 1.
Braga IA, Valle D: Aedes aegypti: histórico do controle no Brasil [Aedes aegypti: history of control in Brazil]. Epidemiol Serv Saúde Brasília. 2007, 16:
 2.
Guo X, Xu Y, Bian G, Pike AD, Xie Y, Xi Z: Response of the mosquito protein interaction network to dengue infection. BMC Genomics. 2010, 11: 38010.1186/1471216411380.
 3.
Kappagoda S, Ioannidis J: Prevention and control of neglected tropical diseases: overview of randomized trials, systematic reviews and metaanalyses. Bull World Health Organ. 2014, 92 (5): 356366C. 10.2471/BLT.13.129601.
 4.
Huy R, Buchy P, Conan A, Ngan C, Ong S, Ali R, Duong V, Yit S, Ung S, Te V, Chroeung N, Pheaktra NC, Uok V, Vong S: National dengue surveillance in Cambodia, 1980−2008: insights on epidemiological and virological trends and impact of vector control interventions. Bull World Health Organ. 2010, 88: 650657. 10.2471/BLT.09.073908.
 5.
Nogueira R, Schatzmayr H, Santos A, Cunha F, Coelho R, Souza J, Guimarães L, Araújo F, Simone E, Baran T, Teixeira M, Miagostovich G, Pereira M: Emerging Infectious Diseases. 2005, Atlanta: EUA, 13761381. n. 9, 11
 6.
Gubler DJ, Meltzer M: Impact of dengue/dengue haemorrhagic fever on the developing world. Adv Virus Res. 1999, 53: 3570.
 7.
RorizCruz M, Sprinz E, Rosset I, Goldani L, Texeira MA: Dengue and primary care: a tale of two cities. Bull World Health Organ. 2010, 88: 244245. 10.2471/BLT.10.076935.
 8.
Eguiluz VM, Chialvo D, Cecchi GA, Baliki M, Apkarian AV: Scalefree brain functional networks. Phys Rev Lett. 2005, 94: 018102
 9.
Abe S, Suzuki N: Scalefree network of earthquakes. Europhysics Letters. 2004, 65: 581586. 10.1209/epl/i2003101081. n.4
 10.
Santana CN, Fontes AS, dos S Cidreira MA, Almeida RB, González AP, Andrade RFS, Miranda JGV: Graph theory defining nonlocal dependency of rainfall in Northeast Brazil. Ecol Complex. 2009, 6: 272277. 10.1016/j.ecocom.2009.05.011.
 11.
Mutlu AY, Bernat E, Aviyente S: A signalprocessingbased approach to timevarying graph analysis for dynamic brain network identification. Comput Math Methods Med. 2012, 2012: 451516
 12.
Silva BBM, Miranda JGV, Corso G, Copelli M, Vasconcelos N, Ribeiro S, Andrade RFS: Statistical characterization of an ensemble of functional neural networks. Eur Phys J E Condensed Matter Complex Systems. 2012, 85: 358
 13.
Quaak I, Brouns MR, van de Bor M: The dynamics of autism spectrum disorders: how neurotoxic compounds and neurotransmitters interact. revista internacional da investigação ambiental e de saúde pública. 2013, 10: 33843408. n. 8
 14.
Nakapan S, Tripathi NK, Tipdecho T, Souris M: Spatial diffusion of influenza outbreakrelated climatic factors in Chiang Mai Province, Thailand. revista internacional da investigação ambiental e de saúde pública. 2012, 9 (11): 38243842.
 15.
Lin CH, Wen TH: Using geographically weighted regression (GWR) to explore the different spatial varying relationships of immature mosquitos and human densities with the incidence of dengue. revista internacional da investigação ambiental e de saúde pública. 2011, 8 (7): 27982815.
 16.
LozanoFuentes S, ElizondoQuiroga D, FarfanAle JA, LoroñoPino MA, GarciaRejon J, GomezCarro S, LiraZumbardo V, NajeraVazquez R, FernandezSalas I, CalderonMartinez J, DominguezGalera M, MisAvila P, Morris N, Coleman M, Moore CG, Beaty BJ, Eisen L: Use of Google Earth to strengthen public health capacity and facilitate management of vectorborne diseases in resourcepoor environments. Bull World Health Organ. 2008, 86 (9): 718725. 10.2471/BLT.07.045880.
 17.
Casteigts A, Flocchini P, Quattrociocchi W, Santoro N: TimeVarying Graphs and Dynamic Networks. 2010, 20Arxiv preprint arXiv10120009
 18.
Flocchini P, Mans B, Santoro N: Exploration of periodically varying graphs. Proc. 20th Intl. Symposium on Algorithms and Computation (ISAAC). 2009, 534543
 19.
Tang J, Scellato S, Musolesi M, Mascolo C, Latora V: Smallworld behavior in timevarying graphs. Phys Rev E Stat Nonlinear Soft Matter Phys. 2010, 81 (5 Pt 2): 055101
 20.
Manly BFJ: Randomization, Bootstrap and Monte Carlo Methods in Biology. 2006, Flórida: Chapman & Hall, 460
 21.
Viola DN: Tese de Doutorado em Estatística e Experimentação Agronômica. Detecção e modelagem de padrão eários e de contagem. 2007, Piracicaba: USP: Escola Superior de Agricultura “Luiz de Queiroz”, [Detection and modeling of spatial patterns in binary and counting data. PhD thesis in Agronomic Statistics and Experimentation. USP: “Luiz de Queiroz” School of Agriculture]
 22.
Saba H, Miranda JGV, Moret MA: Selforganized critical phenomenon as a qexponential decay: avalanche epidemiology of dengue. Physica A Stat Mech Appl. 2014, 413: 205211.
Prepublication history
The prepublication history for this paper can be accessed here:http://www.biomedcentral.com/14712458/14/1085/prepub
Acknowledgments
This work received financial support from CNPq (grant numbers 306571/20110 and 308785/20118) and Coordination of Improvement of Higher Education Personnel (CAPES).
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
HS proposed the research question, collected and analyzed the data. VCV was responsible for the theoretical background in epidemiology and dengue. MAM did the statistical analyzes. JGVM developed the computational tools, assisted in data analysis and coordinated the entire process of research development. All authors took an active part in discussions and writing of the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Received
Accepted
Published
DOI
Keywords
 Dengue
 Correlation
 Transport
 Randomization
 Bahia