Skip to main content

Using data from patient interactions in primary care for population level chronic disease surveillance: The Sentinel Practices Data Sourcing (SPDS) project



Population health planning within a health district requires current information on health profiles of the target population. Information obtained during primary care interactions may provide a valuable surveillance system for chronic disease burden. The Sentinel Practices Data Sourcing project aimed to establish a sentinel site surveillance system to obtain a region-specific estimate of the prevalence of chronic diseases and mental health disorders within the Illawarra-Shoalhaven region of New South Wales, Australia.


In September 2013, de-identified information for all patient interactions within the preceding 24 months was extracted and collated using a computerised chronic disease management program that has been designed for desktop application (Pen Computer Systems Clinical Audit Tool: (PCS CAT)). Collated patient data included information on all diagnosed pathologies and mental health indicators, clinical variables such as anthropometric measures, and patient demographic variables such as age, sex, geographical location of residence and indigenous status. Age-standardised prevalence of selected health conditions was calculated.


Of the 52 general practices within the 6 major Statistical Local Areas (SLAs) of the health district that met the inclusion criteria, 17 consented to participate in the study, yielding data on n = 152,767 patients, and representing 39.7% of the regional population. Higher than national average estimates were found for the age-adjusted prevalence of chronic diseases such as obesity/overweight (65.9% vs 63.4%), hypertension (11.9% vs 10.4%) and anxiety disorders (5.0% vs 3.8%), but a lower than national average age-adjusted prevalence of asthma (8.0% vs 10.2%) was also identified.


This proof-of-concept study has demonstrated that the scope of data collected during patient visits to their general practitioners (GPs), facilitated through the Medicare-funded primary health care system in Australia, provides an opportunity for monitoring of chronic disease prevalence and its associated risk factors at the local level. Selection of sentinel sites that are representative of the population being served will facilitate an accurate and region-specific system for the purpose of population health planning at the primary care level.

Peer Review reports


Valid data on morbidity, at the regional level, is essential for planning of primary healthcare services that are specifically tailored to the needs, demands and requirements of the local population. The currently available peer reviewed literature indicates multiple avenues of population health surveillance including national surveys, administrative data and electronic health records. National surveys are surveillance methods that have commonly been used [1, 2], but that demonstrate multiple shortcomings. Prevalence estimates obtained from national and state level surveys are based on self-reported measures, provided by respondents of these surveys, the accuracy of which is questionable [2]. Secondly, surveys do not include all aspects of health, and therefore do not provide a full and accurate picture of health [3], resulting in a lack of generalisability to smaller regional populations [4]. Additionally, different subgroups of the population may demonstrate different response rates to surveys, which impacts on the generalisability of the survey data to the general population [5, 6]. Surveys, however, provide prevalence rates comparable to routinely collected clinical administrative data [13], indicating that clinical administrative data is a potential avenue for surveillance. The use of information obtained from administrative data, including primary care medical records (data collected during general practitioner (GP) visits), physician billing, specialist visits, pharmacy data (prescription dispensation) and hospital data (inpatient/outpatient information); has been shown to provide reliable and valid prevalence estimates of chronic disease conditions [13, 7, 8].

Clinical administrative data is widely available [1] and its collection and reporting systems are currently in place in both primary care and tertiary levels of care [9]. Further, administrative data is validated by the clinical judgment of medical practitioners and may be generalisable to smaller/regional populations [10]. Patient data that is entered into electronic medical recording software at the point of contact with primary health care practitioners is often supported by diagnostic testing and clinical examination, and is thus likely to be more valid than self-reported health information [10]. The peer-reviewed literature hence provides a vast amount of current evidence on the effectiveness of utilising administrative and/or primary care data for population health surveillance; however, there exists a current lack of such data based disease monitoring models in Australia.

The Sentinel Practices Data Sourcing (SPDS) project aimed to implement a sentinel site surveillance system within the Illawarra-Shoalhaven region of the state of New South Wales (NSW) in Australia to obtain a region-specific prevalence of chronic diseases and mental health disorders through the use of patient data obtained during primary care patient interactions. A pre-tested method of data extraction [10] was used, aimed at informing the population health planning within health service catchments of regional Australia.


The study was conducted in the Illawarra-Shoalhaven region of the state of NSW in Australia (Figure 1). Within NSW there are 15 Local Health Districts which are responsible for the acute, sub-acute and tertiary care service delivery in the state through the public hospital system; and 17 Medicare Locals which are responsible for the primary healthcare planning and delivery for their constituent regions [10]. Unlike other regions, the geographical catchment boundary of the Illawarra-Shoalhaven Local Health District (ISLHD) is the same as that covered by a single Medicare Local namely the Illawarra-Shoalhaven Medicare Local (ISML); which places the region in a unique and advantageous position in terms of planning and implementing a chronic disease surveillance system. Additionally the region has a diverse socio-economic profile and has pockets of both higher and lower socio-economic disadvantage, comparing the Index of Relative Socio-Economic Disadvantage (IRSD) scores between the region and for Australia as a whole (Figure 2). IRSD is a composite summary measure constructed by the Australian Bureau of Statistics (ABS) for all regions in Australia and is a based on income, educational attainment, employment status, occupation type, family structure, dwellings, house ownership, marital status and ethnicity [11].

Figure 1
figure 1

Illawarra-Shoalhaven region of NSW and its Statistical Local Areas (SLAs) on the map of Australia.

Figure 2
figure 2

Socioeconomic snapshot of the Illawarra-Shoalhaven region: Index of Relative Socio-Economic Disadvantage (IRSD) scores compared with NSW state and Australian national averages. *Illawarra-Shoalhaven score is a population weighted average of individual SLA scores.

The study undertook secondary analysis of administrative data through extraction of de-identified clinical patient information and the project was rolled out in 4 phases: -

Phase 1: Practice recruitment

The study aimed to recruit 12–18 practices within the Illawarra-Shoalhaven region based on the Statistical Local Area (SLA) geography and the demographic profile of the catchment. Eligible practices were identified by requesting the Illawarra-Shoalhaven Medicare Local (ISML) for a list of the region’s practices that fulfilled the following inclusion criteria (n = 52):

location in one of the 7 SLAs that represent the Illawarra-Shoalhaven region;

multiple (more than 1) GPs working at the practice site (solo practitioner sites are likely to have smaller patient numbers);

employment of either more than one full-time GP or more than two part-time GPs (i.e. who work for at least 20 hours a week);

additional criteria: -

  • installation of a clinical auditing software package on desktop software or a desire to install and use the Pen Computer Systems (PCS) Clinical Audit Tool: ™(PCS CAT) (multiple licensing for PCS CAT has been procured by ISML and is therefore freely available to all general practices within its catchment); and

  • a willingness to provide de-identified practice data extracts to the researchers for surveillance purposes.

Seventeen general practices in the catchment volunteered and consented to participate in the study (response rate = 33%). Only two electronic medical record software packages were being used by participating practices in the study, either Medical Director™ (n = 8) or Best Practice™ (n = 9).

Phase 2: Data cleansing and enhancement of data accuracy

Recruited practices undertook comprehensive “Data Cleansing” training to understand the usage and the various functionalities of PCS CAT, and to update and clean the data stored in their clinical systems. With the consent of the primary GPs and managers within each practice, the “Data Cleansing” training was conducted by the researchers. The cleaning of practice records improves searches (in both the practice electronic medical record software program and the PCS CAT), to identify patients with particular conditions and thus to target health research and patient management activities. This data cleaning process allowed the complete patient database that had been entered during GP consultations to be identified when searching for specific variables. The data cleansing phase of the study was conducted using the data maintenance utility tools which are available within both the GP electronic medical record software programs used in the study. Data cleansing included: -

  • encouraging all practice staff to use the ‘drop down box functionality’ of their clinical software to define and code all medical diagnoses and other sections of the patient record;

  • strictly avoiding free text entries in all sections of the patient record;

  • finding all identifiable free text non-coded past medical history items, and either linking them to appropriate coded items or replacing them with the correct coded item; and

  • coding all inactive patients as ‘Inactive’ (an ‘active patient’ is one who has attended the practice three or more times in the past two years as defined in the Royal Australian College of General Practice: Standards for general practices [12]).

Phase 3: Data collection

Patient data that had been de-identified by practice employees was extracted to a database. Data items extracted from general practice clinical systems included: demography (population by age and sex and population geography including postcodes and suburbs), chronic disease surveillance items (hypertension, type 2 diabetes mellitus, depression, anxiety, COPD, asthma, congestive heart disease, stroke, osteoarthritis, osteoporosis, high Body Mass Index (BMI) – overweight and obese), and Medicare Benefits Schedule (MBS) items uptake relevant to primary care services for GP and other non-referred attendances. A cleaned, de-identified PCS CAT data extract was performed in September 2013 for all recruited practices which included all information obtained from patient interactions in the preceding 24 months for all diagnosed pathologies, clinical variables such as anthropometric measures, and patient demographic information such as age, sex, geographical location of residence (postcodes and suburbs) and indigenous status.

Phase 4: Data collation and analysis

The research team collated all extracted data, cross-matched residential suburb and postcode information with health and clinical information using de-identified unique link ID tags, converted all resultant information into usable database formats, and then analysed the datasets using Microsoft Excel (V2007: Microsoft Corporation, Redmond Washington, USA). The final datasets hence included clinical diagnosis and patient demographic information as entered by GPs within each participating practice. Basic epidemiological measures, including age-specific prevalence and total prevalence were calculated for all major disease conditions. The prevalence figures were compared against comparable indicators reported for the same age groups by the Australian Health Survey (AHS) 2011–12 conducted by the ABS [13].

The age-specific disease prevalence figures obtained from the study sample and the estimated national prevalence figures reported by the AHS 2011–12 were then age-standardised using the 2011 estimated resident population of Australia [14]. Comparisons across age-standardised prevalence were conducted for all major chronic conditions that the SPDS project is targeting for regular surveillance namely; obesity, overweight, diabetes mellitus, hypertension, asthma, mental health disorders such as clinically diagnosed depression and anxiety disorder, coronary heart disease, stroke, and chronic bone diseases such as osteoarthritis and osteoporosis. Both Microsoft Excel (V2007: Microsoft Corporation, Redmond Washington, USA) and the PCS CAT tool (v.3.1: were used for graphical illustration of demographic data and age-specific disease prevalence.

The study undertook secondary analysis of administrative data through extraction of de-identified clinical patient information. The study was performed with the approval of the Human Research Ethics Committee (Health and Medical) of the University of Wollongong (HE 12/447). Written informed consent was not obtained from individual patients due to the retrospective nature of the study design, however all data was exclusively extracted and de-identified by trained practice clinical staff only.


The number of patients that had visited the 17 general practices within the previous 24 months (September 2011 to September 2013) was 164,435 (152,767 from within the Illawarra-Shoalhaven and 11,668 from outside of the catchment). The Illawarra-Shoalhaven catchment sample of 152,767 included 70,103 men, 82,506 women and 158 without an identified gender.

While 144 patients did not have their age recorded, the median age for the study sample (n = 152,767) was 39 years (IQR = 20 – 58 years). Adults aged 20–24 years comprised the largest age group at 7% of the total sample, followed by the 40–44 year old age group (6.8%), and 5–9 year old children (6.6%). Older adults aged 65 years and above comprised 18.2% of the sample. The population pyramid of the study sample along with the comparison with the population structure of the 2011 estimated resident population of the Illawarra-Shoalhaven catchment is shown in Figure 3. The proportion of the local residential population of the SLAs that consulted the study practices during the study period is shown in Table 1[14]. The majority of the study sample (92.9%) were found to reside within the Illawarra-Shoalhaven catchment SLAs.

Figure 3
figure 3

Population pyramid comparisons: study sample and the 2011 resident population of the Illawarra-Shoalhaven catchment.

Table 1 Proportion of local population that had consulted the 17 general practices during the previous 24 months (September 2011 to September 2013)

The age-specific population and disease counts within the study sample for major chronic conditions and high BMI are shown in Tables 2 and 3 respectively, while the crude and age-standardised prevalence comparisons of the sample and the Australian national estimates [13] are shown in Tables 4 and 5 respectively. Overall the study sample population exhibits figures higher than Australian averages for the age-standardised prevalence of chronic conditions such as anxiety, cancer, hypertension, obesity and overweight/obesity (Figure 4). An illustration of age-specific burden of disease (Figure 5) indicates that prevalence (non-age standardised) of asthma and mental health conditions (depression and anxiety) are significantly higher amongst younger age groups compared to older adults.

Table 2 Age-specific population and chronic disease counts within the study sample during the last 24 months (September 2011 to September 2013)
Table 3 Age-specific population and high BMI counts within the study sample during the last 24 months (September 2011 to September 2013)
Table 4 Crude prevalence proportions of chronic conditions in the study sample compared to Australian national averages
Table 5 Age-standardised prevalence of chronic conditions in the study sample compared to Australian national averages
Figure 4
figure 4

Comparisons of age-standardised chronic disease prevalence between the study sample and Australian national averages. *Includes adults only.

Figure 5
figure 5

Age-specific prevalence of chronic diseases within the study sample.


In Australia, nationally representative data is available through the National Health Surveys (NHS) conducted by the Australian Bureau of Statistics (ABS) [15] and regionally through state surveys such as the annual New South Wales Population Health Survey [16]; however, extrapolations of these data to smaller geographical areas such as Local Government Areas (LGAs) and/or small area geographic regions within LGAs such as SLAs and suburbs is limited [10].

In 2011–12, the percentage of adults who saw a GP in the preceding 12 months varied across Australia ranging from 74% to 86%, with the Illawarra-Shoalhaven catchment recording the highest percentage nationally [17]. This indicates that extraction of patient and GP interactions over a 24 month period has the potential to include health information of almost the entire resident population of catchment regions such as the Illawarra-Shoalhaven; however, not all SLAs were evenly represented in terms of coverage in this study.

All general practice clinical and electronic medical record softwares utilise one of the several nationally validated health coding and medical classification systems such as SNOMED-CT, DOCLE, PYEFINCH and ICPC2+ [10]. These medical vocabularies enable recording of nationally/internationally recognised coded disease diagnosis, which also assists in maintaining accurate and consistent primary care clinical data that can be extracted and analysed [10]. Additionally, recent introduction of the Personally Controlled Electronic Health Records (PCEHR) in Australia, further requires general practices to “work towards recording the majority of diagnoses for active patients electronically” [18]. This enables accurate recording and easy identification of medical conditions and hence provides an opportunity for successful public health and chronic disease surveillance. However, Aizpuru et al. [19] suggest that chronic disease data from electronic health records provide a lower prevalence of conditions, as compared to health surveys because actions taken by physicians are often not recorded leading to cases being missed out. The limitations of primary care practice based data collection have also been illustrated in systematic reviews. Common problems include inconsistent diagnostic coding vocabulary of different clinical systems [20] and errors in data entry and recording [7, 21]. Major barriers faced by general practice staff in this regard include difficulties with clinical coding of diagnoses; complexities with software applications; preference for entering free text rather than the pre-coded options; inadequate skills in information technology; time constraints; poor motivation and low prioritisation of data entry compared to other clinical duties; inconsistency of data entry; coding of a condition in order to justify choice of prescribed treatment; and the additional burden of including laboratory test results in patient records, as well as a need to enter a diagnosis, even in the early preclinical stages of the disease (Attard E, Ghosh A, Charlton K: Barriers faced by general practice staff in maintaining clean primary care databases: a systematic review, unpublished).

Studies conducted in Canada [22, 23], Italy [24] and the UK [25] have demonstrated methods to improve the quality and accuracy of practice-based disease surveillance models. Keshavjee et al. [22] trained and employed ‘data managers’ in an attempt to standardise disease coding and de-identify patient information. Similarly, Griever et al. [23] employed a trained data entry clerk to check missing or incorrectly coded records. Cricelli et al. [24] trained GPs, themselves, in data entry and use of standard software. Pearson et al. [25] provided initial training and updates to all doctors and practice staff and carried out validation procedures such as verification of clinical coding, checking for rare diagnoses and those made outside the usual age and sex parameters through random validation visits to all participating practices.

A number of limitations to the study need consideration. The SPDS study identified various common data entry errors, including misspelt suburb names or postcodes that did not match the suburb entry, missing geographic information (postcodes and suburbs), missing values for age and sex, incorrect/mismatched entries within data entry fields such as height inserted in the weight field and/or vice-versa, and missing entries for weight and/or height measurements. While geospatial analysis of disease patterns is highly useful to target services towards areas of need [26], the SPDS data has highlighted difficulties in obtaining consistent information on patient residential postcodes and suburbs, including missing entries and mismatched entries, for example, a record with suburb of the Kiama (A) SLA and postcode of 3000 which is the incorrect postcode for this suburb. It was often unclear which variable to change in order to resolve this inconsistency and hence led to the deletion of such records from the analysis. Data quality and accuracy discrepancies required immense post-extraction data cleaning/editing efforts by the researchers which is vital to improve data linkage quality [27]. It is therefore imperative to undertake further research and technological innovation into improving utility and interface functionality of practice clinical desktop systems and creation of valid and easy to use advanced data aggregation systems which could vastly improve the processes of primary care clinical data extraction and modelling resulting in furthermore accurate prevalence estimation.

Both international literature and Australian evidence identifies a higher reported prevalence of overweight and/or obesity within primary care settings [28, 29]. It has been argued that obese patients are more likely than healthy or underweight patients to visit their physician and also more likely to be weighed and measured by practice staff and clinicians. This results in lower population denominators for obesity and overweight, as also seen in our study (Table 3), and arguably higher prevalence figures. This is another limitation of the proposed method of surveillance.

The seventeen general practices recruited from the major SLAs within the Illawarra-Shoalhaven region include approximately 40% of the resident population of the catchment area but generalisability of the findings to the general population of the Illawarra-Shoalhaven region cannot be assumed. Additionally there was a clear coverage disparity between the 7 Illawarra-Shoalhaven SLAs with high representation of Kiama (A) and Shellharbour (C) residents, moderate representation of the Shoalhaven (C) - Pt A and Shoalhaven (C) - Pt B residents and low proportional share of Wollongong (C) Inner and Wollongong (C) Bal SLAs within the study sample (Table 1). This can be attributed to the recruitment of practices that voluntarily consented to participate rather than routine surveillance as such. Thus, disease prevalence estimates drawn from the study sample may not be representative of the true population disease status for the region.

Another limitation to the study is that it only investigated the interaction between one extraction tool (the PCS CAT) and two general practice electronic medical record (EMR) software systems (Best Practice™ and Medical Director™). Although these are the most commonly used systems in Australia, the findings cannot be extrapolated to other systems. Additionally the validity of a PCS CAT extract has not been completely investigated. While the tool is co-developed by the Royal Australian College of General Practice (RACGP), the peak body of general practice in Australia and is advocated by them as an integrated product aimed at improving the way patient information can be used to better inform decisions in both clinical and business settings [30]; to date there has not been any empirical validation of the PCS CAT as a general practice data extraction tool. Further research into validation of the PCS CAT extract and the assessment of its agreement with manual data review/audit is required. A final limitation is that we only included data that could potentially be extracted from the electronic medical record software programs. While the data cleansing phase of the study focused heavily on avoidance of any free text entered into medical or clinical notes by GPs and practice staff; if a practitioner still made free text entries rather than using the codable sections of the record, then neither the extraction tool nor our manual case record reviews/audits would be able to detect those cases.

Despite these limitations the SPDS study has significant implications for public health planning, primary health care delivery and epidemiological research. Apart from ongoing chronic disease surveillance, the study methodology and protocol also has the potential to provide evidence-based direction to population health planning strategies aimed at addressing the local health needs of regional areas of Australia. The most recently reported planning documents for the Illawarra-Shoalhaven region of NSW, both from the Local Health District level [31] and the Medicare Local level [32], illustrate disease rates and health status indicators drawn from statistically modelled estimates from the 2006–07 Australian National Health Surveys. These figures are significantly outdated for planning purposes in 2014 and their generalisability for regional and smaller area disease prevalence and health status is questionable [10]. The proposed surveillance system also provides opportunity for monitoring trends in chronic disease prevalence across regular time intervals and promotes the engagement of general practice staff and clinicians in maintaining primary care clinical data quality and accuracy. The inclusion of a larger number of sentinel sites that are generalisable to the population being served would provide an accurate and region-specific system for the purposes of population health planning at the primary care level in order to improve the overall health of the community.


This study has demonstrated that extraction of patient clinical data from general practice settings is both a feasible and valid method to obtain a region-specific estimate of the prevalence of chronic diseases and mental health disorders within regional NSW, Australia. General practices that agreed to participate were included in the study, however further sampling methodology is required to identify which sentinel sites would provide an accurate and truly representative surveillance system. Technological updates/changes to general practice clinical software systems are recommended to improve functionality and data quality within general practice databases. Drop down menus with fixed nationally recognised lists of suburb names, cross matched with correct geographical concordance postcode and state information is currently lacking within the general practice clinical software systems. Additionally, making age, sex, postcode and suburb mandatory fields for creating a new patient record could eliminate the issue of missing data for these essential socio-demographic variables. Investment in computer skills and data entry training for general practice staff and advancements in data aggregation instruments are essential to improve quality of clinical data and their collection methods for effective utilisation by researchers and population health planners for surveillance purposes. Annually obtained chronic disease prevalence figures through the surveillance methodology implemented by the SPDS project, could provide more updated and granular health information for prompt health service planning.



General practitioners


Illawarra-Shoalhaven Medicare local


Australian Bureau of Statistics


Local Government Areas


Statistical Local Areas


Medicare Locals


National Health Surveys


Illawarra Shoalhaven Local Health District


Public Health Information Development Unit


New South Wales


Pen Computer Systems Clinical Audit Tool


Systematized Nomenclature of Medicine Clinical Term


Doctor Command Language


International Classification for Primary Care 2+


Australian Health Survey


Medicare Benefits Schedule.


  1. Gini R, Francesconi P, Mazzaglia G, Cricelli I, Pasqua A, Gallina P, Brugaletta S, Donato D, Donatini A, Marini A, Zocchetti C, Cricelli C, Damiani G, Bellentani M, Sturkenboom M, Schuemie M: Chronic disease prevalence from Italian administrative databases in the valore project: A validation through comparison of population estimates with general practice databases and national survey. BMC Public Health. 2013, 13 (5): 1-11.

    Google Scholar 

  2. Bulloch A, Currie S, Guyn L, Williams J, Lavorato D, Patten S: Estimates of the treated prevalence of bipolar disorders by mental health services in the general population: Comparison of results from administrative and health survey data. Chronic Dis Inj Can. 2011, 31 (3): 129-134.

    CAS  PubMed  Google Scholar 

  3. Jackson B, Suzuki S, Coultas D, Singh K, Bae S: Chronic obstructive pulmonary disease and health-related quality of life in the 2009 Texas behavioural risk factor survey. Health Educ Behav. 2012, 40 (4): 469-479.

    Article  PubMed  Google Scholar 

  4. Fleming DM, Cross KW, Barley MA: Recent changes in the prevalence of diseases presenting for health care. Br J Gen Pract. 2005, 55 (517): 589-595.

    PubMed  PubMed Central  Google Scholar 

  5. Link M, Mokdad A, Stackhouse H, Flowers N: Race, ethnicity, and linguistic isolation as determinants of participation in public health surveillance surveys. Prev Chronic Dis: Public Health Res, Prac Policy. 2006, 3 (1): 1-12.

    Google Scholar 

  6. Qayad M, Chowdhury P, Hu S, Balluz L: Respondent differences and length of data collection in the behavioural risk factor surveillance system. Stat Can. 2010, 36 (2): 223-227.

    Google Scholar 

  7. Birtwhistle R, Keshavjee K, Lanning L, Godwin M: Building a pan-Canadian primary care sentinel surveillance network: initial development and moving forward. J Am Board Fam Med. 2009, 22 (4): 412-422.

    Article  PubMed  Google Scholar 

  8. Knox SA, Harrison CM, Britt HC, Henderson JV: Estimating prevalence of common chronic morbidities in Australia. Med J Aust. 2008, 189 (2): 66-70.

    PubMed  Google Scholar 

  9. Bernatsky S, Lix J, Hanly J, Hudson M, Badley E, Peschken C, Pineau C, Clarke A, Fortin P, Smith M, Belisle P, Lagace C, Bergeron L, Joseph L: Surveillance of systematic autoimmune rheumatic diseases using administrative data. Rheumatol Int. 2011, 31: 549-554.

    Article  CAS  PubMed  Google Scholar 

  10. Ghosh A, Charlton K, Girdo L, Batterham M, McDonald K: Addressing the deficiencies in the evidence-base for primary practice in regional Australia - Sentinel Practices Data Sourcing (SPDS) Project: A pilot study. BMC Fam Pract. 2013, 14: 109-

    Article  PubMed  PubMed Central  Google Scholar 

  11. Grande DE, Taylor A, Jury H, Greenland N: The health status of South Australians by socio-economic status (SEIFA): South Australian monitoring and surveillance system. 2002, July - June 2003

    Google Scholar 

  12. The Royal Australian College of General Practitioners: Standards for General Practices. 4th edition. 2010, South Melbourne: The RACGP,,

    Google Scholar 

  13. Australian Bureau of Statistics: Australian Health Survey: First Results. 2011, cat. no. 4364.0.55.001, –12

    Google Scholar 

  14. Australian Bureau of Statistics (ABS): Regional Population Growth, Australia. 2011, cat. no. 3218.0

    Google Scholar 

  15. Australian Bureau of Statistics (ABS), Health Survey (National):,

  16. NSW Ministry of Health, New South Wales Population Health Survey:,

  17. National Health Performance Authority (NHPA): Healthy Communities: Australians’ experiences with access to health care in. 2011,, –12,

    Google Scholar 

  18. Medicare Australia, eHealth Incentive:,

  19. Aizpuru F, Latorre A, Ibanez B, Garcia K, Vergara I, de Arriba JP, Samper R, Lopex-Pcado A, Apinaniz A, Bilbao JL: Variability in the detection and monitoring of chroic patients in primary care according to what is registered in the electronic health record. Fam Pract. 2012, 29: 696-705.

    Article  PubMed  Google Scholar 

  20. Khan NF, Harrison SE, Rose PW: Validity of diagnostic coding within the general practice research database: a systematic review. Br J Gen Pract. 2010, 60 (572): e128-e136.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Thiru K, Hassey A, Sullivan F: Systematic review of scope and quality of electronic patient record data in primary care. Br Med J. 2003, 15 (7398): 635-641.

    Google Scholar 

  22. Keshavjee K, Chevendra V, Martin K, Jackson D, Aliarzadeh B, Kinsella L, Turcotte R, Sabri S, Chen T: Design and testing of an architecture for a national primary care chronic disease surveillance network in Canada. Stud Health Technol Inform. 2011, 164: 341-345.

    PubMed  Google Scholar 

  23. Greiver M, Barnsley J, Aliarzadeh B, Krueger P, Moineddin R, Butt DA, Dolabchian E, Jaakkimainen L, Keshavjee K, White D, Kaplan D: Using a data entry clerk to improve data quality in primary care electronic medical records: a pilot study. Inform Prim Care. 2011, 19 (4): 241-250.

    PubMed  Google Scholar 

  24. Cricelli C, Mazzaglia G, Samani F, Marchi M, Sabatini A, Nardi R, Ventriglia G, Caputi AP: Prevalence estimates for chronic diseases in Italy: exploring the differences between self-report and primary care databases. J Public Health. 2003, 25 (3): 254-257.

    Article  Google Scholar 

  25. Pearson N, O’Brien J, Thomas H, Ewings P, Gallier L, Bussey A: Collecting morbidity data in general practice: the Somerset morbidity project. Br Med J. 1996, 312: 1517-

    Article  CAS  Google Scholar 

  26. Dubowitz T, Williams M, Steiner ED, Weden MM, Miyashiro L, Jacobson D, Lurie N: Using geographic information systems to match local health needs with public health services and programs. Am J Public Health. 2011, 101: 1664-1665.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Randall SM, Ferrante AM, Boyd JH, Semmens JB: The effect of data cleaning on record linkage quality. BMC Med Inform Decis Mak. 2013, 13: 64-

    Article  PubMed  PubMed Central  Google Scholar 

  28. Gauthier BM, Hickner JM, Ornstein S: High prevalence of overweight children and adolescents in the Practice Partner Research Network. Arch Pediatr Adolesc Med. 2000, 154 (6): 625-628.

    Article  CAS  PubMed  Google Scholar 

  29. McCallum Z, Wake M, Gerner B, Harris C, Gibbons K, Gunn J, Waters E, Baur LA: Can Australian general practitioners tackle childhood overweight/obesity? Methods and processes from the LEAP (Live, Eat and Play) randomized controlled trial. J Paediatr Child Health. 2005, 41 (9–10): 488-494.

    Article  PubMed  Google Scholar 

  30. The Royal Australian College of General Practitioners: Standards for Health Services in Australian Prisons: Criterion 3.1.1 Quality Improvement Activities. 2012, The RACGP,,

    Google Scholar 

  31. Illawarra Shoalhaven Local Health District (ISLHD): Working Together Building Healthy Futures: Health Care Services Plan. 2012,, –2022,

    Google Scholar 

  32. Ghosh A, McDonald K, Marshall K: Illawarra-Shoalhaven Medicare Local - Population Health Profile: 2013. 2013, NSW: Grand Pacific Health Ltd

    Google Scholar 

Pre-publication history

Download references


The authors would like to acknowledge the support provided by the clinical as well as administrative staff at all participating general practices. Their inputs and enthusiasm in database cleansing and providing an accurate, de-identified data extract was invaluable.

The authors would also like to acknowledge the financial support from the Grand Pacific Health Ltd and its funding body the Australian Government Department of Health.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Abhijeet Ghosh.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Authors’ contributions

AG formulated the study design and conceptualisation, performed data extraction and all statistical analysis, conducted data interpretation, carried out literature search and collated contributions from co-authors to draft the paper. AG also carried out all manuscript revisions as per peer-reviewer’s comments. KEC contributed to study design and conceptualisation, provided editorial input, conducted data interpretation and carried out literature search. LG trained and educated practice staff in undertaking the data cleansing of their practice clinical database and carried out literature search. MB reviewed the statistical methodology and data analysis, and contributed to editing the final paper manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ghosh, A., Charlton, K.E., Girdo, L. et al. Using data from patient interactions in primary care for population level chronic disease surveillance: The Sentinel Practices Data Sourcing (SPDS) project. BMC Public Health 14, 557 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: