Internet-based surveillance of Influenza-like-illness in the UK during the 2009 H1N1 influenza pandemic
© Tilston et al. 2010
Received: 28 June 2010
Accepted: 27 October 2010
Published: 27 October 2010
Skip to main content
© Tilston et al. 2010
Received: 28 June 2010
Accepted: 27 October 2010
Published: 27 October 2010
Internet-based surveillance systems to monitor influenza-like illness (ILI) have advantages over traditional (physician-based) reporting systems, as they can potentially monitor a wider range of cases (i.e. including those that do not seek care). However, the requirement for participants to have internet access and to actively participate calls into question the representativeness of the data. Such systems have been in place in a number of European countries over the last few years, and in July 2009 this was extended to the UK. Here we present results of this survey with the aim of assessing the reliability of the data, and to evaluate methods to correct for possible biases.
Internet-based monitoring of ILI was launched near the peak of the first wave of the UK H1N1v influenza pandemic. We compared the recorded ILI incidence with physician-recorded incidence and an estimate of the true number of cases over the course of the epidemic. We also compared overall attack rates. The effect of using different ILI definitions and alternative denominator assumptions on incidence estimates was explored.
The crude incidence measured by the internet-based system appears to be influenced by individuals who participated only once in the survey and who appeared more likely to be ill. This distorted the overall incidence trend. Concentrating on individuals who reported more than once results in a time series of ILI incidence that matches the trend of case estimates reasonably closely, with a correlation of 0.713 (P-value: 0.0001, 95% CI: 0.435, 0.867). Indeed, the internet-based system appears to give a better estimate of the relative height of the two waves of the UK pandemic than the physician-recorded incidence. The overall attack rate is, however, higher than other estimates, at about 16% when compared with a model-based estimate of 6%.
Internet-based monitoring of ILI can capture the trends in case numbers if appropriate weighting is used to correct for differential response. The overall level of incidence is, however, difficult to measure. Internet-based systems may be a useful adjunct to existing ILI surveillance systems as they capture cases that do not necessarily contact health care. However, further research is required before they can be used to accurately assess the absolute level of incidence in the community.
Every year influenza A viruses cause epidemics leading to high morbidity and mortality - approximately 3 to 5 million cases and between 250,000 and 500,000 deaths worldwide . The majority of complications usually occur in the young, elderly and the immunologically compromised. Although immunity from the virus may be long term, this immunity is subtype and strain specific which means that as long as the virus retains its ability to mutate rapidly recurring outbreaks will continue to be seen [1–3].
On the 12th of June 2009, the WHO alerted the world of an influenza pandemic caused by a novel H1N1 influenza strain. The UK epidemic was characterised by two peaks: one in late July, and another in late October . Traditional influenza monitoring methods have relied on sentinel networks of physicians to diagnose and report ILI (influenza-like-illness) . These systems have been the mainstay of influenza surveillance for many years. However, they have a number of potential drawbacks, particularly during a pandemic, as they require individuals to attend physicians when they are ill. An unknown proportion of cases do not attend heath care, and this proportion may vary with age, sex or other social groups, and may change over the course of an epidemic, particularly if there are changes in levels of public concern, capacity problems, or mechanisms to divert patients away from physician's offices (such as was implemented in the UK during 2009). In response, internet-based surveillance has been suggested as a means of rapidly assessing the level of illness in the community. Two approaches have been implemented. First, people's internet browsing behaviours have been used as proxies for ILI incidence [6–13]. As these only record proxy measures of influenza, they are limited in their usefulness (they cannot, for instance, track health-care usage, or the clinical spectrum of cases). An alternative is to recruit members of the public to record, using an internet-based questionnaire, specific symptoms over time. That is, to recruit an internet-based cohort and survey patterns of illness and health care attendance in these individuals. Such systems have been implemented in Belgium and the Netherlands (under the name "Der Grote Griepmeting" - the Great Influenza Survey, GIS) in 2003 , Portugal ("Gripenet") in 2005 , and in Italy ("Influweb") in 2007 . Results, in non-pandemic years, appear to be highly correlated temporally with those obtained through traditional surveillance methods [17–19], and the systems may be able to detect increased influenza activity more rapidly than GP-based surveillance [17–19]. The incidence recorded in these systems appears to be consistent between countries, and is consistently higher than that reported by GP-based systems . However, it is not clear exactly what is being measured by the internet-based surveillance systems, as the sample is unlikely to be representative and it is not clear how to calculate the appropriate denominator (see below). We adapted these internet-based influenza monitoring systems to the UK in July 2009 (during the first wave of the UK pandemic of H1N1v ). Here we present data collected using this system. We examine different methods of calculating incidence from such a system, including examining different ways of defining influenza-like-illness. We compare our incidence estimates to the case estimates generated by the Health Protection Agency (HPA) , noting that these case estimates should themselves be treated with some caution, and with the ILI attack rates for England and Wales. The aim is to assess how well internet-based surveillance of Influenza performs when compared to a range of other measures.
The UK flusurvey was launched in mid-July 2009. With the help of a publicity campaign involving television, radio, and newspaper coverage and word of mouth, over 5,000 participants were recruited in the first week. Participants were recruited from all parts of the UK (Additional file 1).
The study was approved by the London School of Hygiene and Tropical Medicine Ethics Committee (Application number 5530).
Registration for the UK flusurvey took place through the web page http://www.flusurvey.org.uk. Upon registration, following provision with a password-protected account, participants were requested to complete a background questionnaire. This covered age, gender, household size and composition, occupation, location of home and workplace, receipt of influenza vaccine during the 2008/2009 influenza season, and membership of a risk group (self-report of any of the following: diabetes, asthma, other chronic lung disease, immunocompromised, chronic heart disease, other chronic disease, pregnant - individuals reporting any of these are referred to hereafter as being in a "risk group"). As well as registering on their own behalf, participants were able to create accounts on behalf of other members of their family/household, thus enabling, for instance, parents to record data about their children.
Each week, whether or not they had symptoms, participants were asked to complete a symptoms questionnaire (see below) and a questionnaire about their social contacts. A questionnaire about vaccine uptake was added in autumn 2009, once the pandemic-specific vaccine became available. Each questionnaire was intended to take no more than a couple of minutes to complete.
An email newsletter was sent to participants each week to remind them to complete the symptoms questionnaire. To maintain participants' interest in the survey the weekly newsletters contained a summary of the latest influenza facts and news, and the flusurvey website was updated on a daily basis with items including the estimated incidence and the spatial distribution of cases being continuously updated.
The list of symptoms used in the symptoms questionnaire.
Muscle pain and/or joint pain
Participants who reported the presence of symptoms were asked when their symptoms began and whether fever onset (if any) was rapid; they were asked about healthcare seeking behaviour - whether they consulted a medical professional and, if so, whether in person or by phone/internet, who they consulted, and when the consultation took place; they were asked whether they took any medication as a result of their symptoms, and, if so, when; they were asked whether they took time off work as a result of their symptoms and, if so, for how long they were off work. Participants were also asked whether they had encountered anyone with flu like symptoms in the past week.
Here we analyse data collected via the symptoms questionnaire from the period 16/07/09 until 31/12/09. In all, 5,738 participants contributed 20,901 records to this dataset.
In an attempt to reduce the effect of individuals who only register and take part as a one-off response to their current symptoms we followed the method of Friesema et al. , by restricting our dataset to participants' second and subsequent reports (15,163 records). We compared estimates of incidence from this dataset with those produced using a third dataset that contained all reports made by participants who participated more than once (17,532 reports), and found that they were very similar. Here, therefore, we use the latter dataset, and will from now refer to this sample as the "censored sample".
Comparison of the spatial distribution of our sample with the UK general population 
Flusurvey Population (2369)
UK Population (61383200)
East of England
Yorkshire & the Humber
Wales, Scotland & NI
Wi = Weight of individual i
Pi UK = Proportion of the UK population in the same age and gender group as individual i
Pi Flusurvey = Proportion of the Flusurvey population in the same age and gender group as individual i
Analysis and calculations were carried out in Stata 11 and Excel 2007.
In an ideal world it would be possible to use a simple list of symptoms to determine, with perfect sensitivity and specificity, whether someone had experienced an influenza infection. However, the symptoms of influenza are subject to too much variation from person to person, and there are too many other infections with similar symptoms. Therefore, instead, we seek to use the symptoms questionnaire to decide which participants had experienced ILI. This distinction between influenza itself and influenza-like-illness is one made by most public health bodies [27, 28], in the absence of laboratory confirmation.
ILI can be determined using a symptoms list but, to further complicate matters, several different definitions were in use during the H1N1v pandemic. For example, the Health Protection Agency (HPA) defined ILI as "Fever plus two or more of the following symptoms - cough, sore throat, runny nose, limb/joint pain, headache, vomiting/diarrhoea" , whereas the US Centers for Disease Control and Prevention (CDC) defined it as "Fever plus cough plus headache plus any of the following symptoms - runny nose, blocked nose, joint pain, muscle pain, weakness, sore throat, chest pain, abdominal pain, nasal congestion) . The GIS, on which the UK flusurvey was based, uses the following definition of ILI : "a sudden onset of fever with muscle pain accompanied by either a cough, sore throat or chest pain".
We compare these three definitions to explore the effect they would have on estimated incidence of ILI. However, for comparability with previous work , and to allow data to be quickly compared from country to country, we will make use of the GIS definition of ILI unless stated otherwise.
Because not every participant recorded the date of symptom onset, and because of the difficulty in determining an appropriate denominator (see below), we measure ILI incidence in week j as the number of participants recording, in week j, symptoms consistent with ILI.
Different numbers of participants completed the symptoms questionnaire each week. Therefore, it would be misleading simply to report the number of incidences of ILI week by week. Instead, we consider an appropriate denominator, to allow us to estimate ILI incidence rates. The key to calculating this denominator is to determine who in the sample population is at risk of acquiring ILI. The most straightforward approach is to take as the denominator for any given week all those who completed the symptoms questionnaire that week. However, although we continually encouraged those without symptoms to report their lack of symptoms, it is highly probable that individuals were more motivated to complete the surveys on those weeks when they experienced symptoms; thus, one could argue that the denominator should include all flusurvey participants, whether or not they completed the symptoms questionnaire that week.
Another issue arises when we consider in more detail what is actually being measured; although we classify symptoms as "ILI" or "non-ILI", it is likely that during summer/autumn 2009 much of what was classified as ILI was in fact H1N1v influenza. If this is so, then participants will be unable to experience a second case of "ILI", and should be removed from the population considered to be at risk. We examine two possibilities: the first, in which we assume that participants could experience more than one case of ILI, and that they are immediately at risk of subsequent infection (i.e. a susceptible-infected-susceptible (SIS) infection); the second in which we assume that ILI represents H1N1v, and that participants are removed from the population at risk once they have experienced ILI (i.e. a susceptible-infected-removed (SIR) infection).
In summary, we have 4 different methods of calculating a plausible denominator: either SIS or SIR, and either assuming the weekly population or the total population to be at risk. In what follows, we describe these 4 possibilities as
Unless otherwise stated we use the SIS week assumption for incidence.
About 17% (391 out of 2,369) of our participants had ILI at least once, out of whom 29% (113 out of 391) said they were in the risk group, while 8% (33 out of 391) said they were health care workers. About 43% (170 out of 391) of our ILI cases claimed to have met an infectious contact in the week prior to their illness.
Data from the UK suggests that the standard (GP-based) surveillance of ILI during the 2009 pandemic gave a distorted view of epidemic progress. In particular, the height of the summer peak was exaggerated in comparison to the autumn peak (Figure 3). This occurred because of changes in the pattern of access to care that occurred during the epidemic. For instance, data from the Flusurvey suggests that about 45% of those with ILI sought medical attention in July, dropping to around 20% in August, and recovering again to around 30% during the autumn peak (Brooks-Pollock E, Tilston N, Edmunds WJ, Eames KTD: Re-estimating H1N1 cases numbers in England in light of changing health-seeking behaviour Submitted). To get a clearer picture of the time course of the epidemic the HPA instigated random testing of patients accessing different health-care settings, which allowed the true numbers of cases to be estimated. However, this is expensive and induced further delays in the data stream. As is shown here, internet-based monitoring of ILI in the community can provide a direct and timely alternative - the relative size and timing of the peaks being close to the adjusted HPA case estimates. Because data were collected from people who did not attend health care, it is likely that internet-based surveillance captures a wider range of cases than traditional (i.e. GP-based) surveillance systems, and allows changes in utilisation of care to be monitored and accounted for in real-time. Internet-based surveillance is relatively quick and inexpensive to initiate, and is straightforward to scale up. However, as is also evident from this study considerable care must be taken to appropriately account for the different biases in the system.
The level of incidence observed is dependent on the definition of ILI and the denominator used. While there is a marked difference in the attack rates generated by different ILI definitions, all are higher than that estimated by a mathematical model . There are a number of possible explanations for this observation: first, not everything that is diagnosed as ILI will have been H1N1v influenza - the fact that some individuals reported more than one episode of ILI illustrates this. Second, the survey may result in response biases; we have, through weighting the sample, attempted to adjust for demographic (age and gender) biases in response, but it is not possible to adjust for unknown confounders. In addition it is likely that individuals preferentially took part when they were infected. Although censoring the sample by removing those individuals from the analysis who only report on one occasion reduces the bias somewhat, as illustrated by the estimated incidence curves, it is likely that some remains. In presenting the use of different denominators for incidence estimates we have explored two possibilities: one that assumes that completion of the survey in a given week is independent of symptom status, and the other that assumes that all participants with ILI in a given week will complete the survey that week. It is likely that reality lies somewhere in between, and further work is clearly needed to determine the nature of this response bias and how to adjust for it.
In making comparisons between Flusurvey estimates and HPA case numbers, it must be noted that the HPA figures are themselves only estimates, based on recorded treatment seeking behaviour (contact with GPs and use of the National Pandemic Flu Service), adjusted for antiviral positivity and a constant which represents the fraction of cases that consult GPs. Given that treatment-seeking behaviour changed over the course of the pandemic, there is no authoritative time series against which to compare Flusurvey incidence estimates. Furthermore, because our internet-based survey measures ILI, we would expect the difference between the Flusurvey and HPA case estimates to be largest when pandemic H1N1v was rare. This, indeed, was the case, with the Flusurvey (and RCGP) recording continuing ILI activity through December, whereas the HPA case estimates (which account for the fraction that were confirmed as H1N1v) were very low at this point (Figure 5).
It is likely that systems such as the UK Flusurvey will be increasingly used in the future, for both pandemic and seasonal influenza, as they are a relatively low-cost method of collecting ILI data, and allow access to cases that do not necessarily seek formal care. However, these systems are novel and need to be properly evaluated. One suggestion for validation would be to virologically test patients with ILI symptoms (perhaps by sending out self-swabbing packs to participants who report ILI symptoms), similar to the testing scheme implemented for the NPFS during the 2009 epidemic.
Internet-based surveillance has the potential to capture a wider range of cases than traditional (GP-based) surveillance systems, as well as to track changes in health-care attendance patterns in real-time. Our results suggest that trends in incidence can be captured by such systems perhaps even more reliably than standard GP-based systems, but it remains unclear how accurate they are for estimating the absolute level of incidence. Internet-based surveillance does not offer a replacement of traditional GP-based methods, but can provide an important adjunct, allowing the collection of valuable additional information.
Authors thank Michele Barley from the RCGP Research and Surveillance Centre for providing weekly incidence data for England and Wales.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.