Do people with risky behaviours participate in biomedical cohort studies?
© Taylor et al; licensee BioMed Central Ltd. 2006
Received: 07 July 2005
Accepted: 23 January 2006
Published: 23 January 2006
Analysis was undertaken on data from randomly selected participants of a bio-medical cohort study to assess representativeness. The research hypotheses was that there was no difference in participation and non-participations in terms of health-related indicators (smoking, alcohol use, body mass index, physical activity, blood pressure and cholesterol readings and overall health status) and selected socio-demographics (age, sex, area of residence, education level, marital status and work status).
Randomly selected adults were recruited into a bio-medical representative cohort study based in the north western suburbs of the capital of South Australia – Adealide. Comparison data was obtained from cross-sectional surveys of randomly selected adults in the same age range and in the same region. The cohort participants were 4060 randomly selected adults (18+ years).
There were no major differences between study participants and the comparison population in terms of current smoking status, body mass index, physical activity, overall health status and proportions with current high blood pressure and cholesterol readings. Significantly more people who reported a medium to very high alcohol risk participated in the study. There were some demographic differences with study participants more likely to be in the middle level of household income and education level.
People with risky behaviours participated in this health study in the same proportions as people without these risk factors.
Numerous large and important cohort studies have been established overseas [1–6] and in Australia [7–9]. Many of the cohort studies undertaken are based on volunteers or clinical/convenience samples and the follow-up is based on self-report data or record linkages. The establishment of population-based, biomedical, cohort studies using a random sample is less common [1, 2, 5, 9].
Cohort studies are based on the assessment of an individual at several points in time and, by recalling or re-contacting each individual, assessing the process and transition of the individual along the disease and life-course continuum [10, 11]. As argued by Szklo  there are numerous, and obvious, advantages if population-based cohort studies are representative of their defined population. The translations of the data into population estimates over time enable casual relationships to be explored and the ability to separate out the effects of age and maturation, although the main aims are to undertake intra-group comparisons and follow changes over time [12, 13]. Increasing emphasis is being placed upon these longitudinal data to inform policy makers, health promoters and health planners. With the ageing of the population and the resultant cost pressures placed upon health systems, these data are also being used to make informed predictions about the future use of health services, mortality and morbidity patterns. Cohort studies can provide unique data that provides a more detailed understanding of complex health issues, providing life-course analytical and useful evaluation research opportunities.
Biomedical cohort studies are very costly and logistically difficult to administer [12, 14]. The benefits and uses of the data are compromised if bias exists. A high initial response rate, a representative initial sample and a low attrition rate are areas where effort needs to be invested to limit selection bias [15–17]. While loss to follow-up is a somewhat expected consequence of the longitudinal nature of all cohort studies, the representativeness of the initial sample, and subsequent ongoing continual assessment of representativeness, are important aspects that warrant investigation [11–13].
This paper investigates the representativeness of an initial cohort to determine if people undertaking risky behaviours were less likely to participate in a major biomedical cohort study and to study the direction and magnitude of any bias found. As argued by Grimes , cohort studies should be upfront in identifying and describing the potential effects of any bias and assess similarities and dissimilarities of respondents. This paper aims to identify and describe these biases for a major cohort study established in the western and northern suburbs of Adelaide, the capital of South Australia in which over 4000 randomly selected adults have been recruited. The overall aim of this cohort study is to follow the continuum of selected chronic diseases and associated risk factors.
The North West Adelaide Health Cohort Study (NWAHCS) recruited between 2000 and 2002 with a total of n = 4060 adults participating. Randomly selected telephone numbers listed in the relevant postcodes (that equated to the boundaries of the suburbs selected to be included in the study) were drawn from the most current Electronic White Pages. A letter of invitation to participate was sent to these households followed within 10 days by a telephone call from trained health study recruiters. A randomly selected adult within the household (those with the next birthday aged 18 years and older) was asked to participate in the study. At each appointment, the participant was given additional detailed information about the study and asked to sign consent forms for participation in the study. The information given highlighted the longitudinal nature of the study, and participants were informed that they may be invited to participate in health-related sub-studies. Prior to the study commencing, approval for the research was obtained from the North West Adelaide Health Service Ethics of Human Research Committee.
Appointments were made for participants in one of the two hospital-based clinics in the region and participants were sent an information folder that included a questionnaire with questions on chronic disease, alcohol consumption, physical activity levels, quality of life and socio-economic details (including highest education level, marital status, work status, country of birth and household income level). Age, sex, smoking status, height, weight, and ever being told they had high blood pressure or high cholesterol were asked in the recruitment telephone interview. At the clinic a range of assessments were made including taking blood (to test fasting plasma glucose, lipids, HbA1c), skin prick tests to common allergens and spirometry lung function tests.
The overall response rate of the completed telephone interview, self-completed questionnaire and clinic biomedical assessment (including blood sample) was 49.6% (69% of those interviewed). This paper assesses data associated with the respondents who completed all aspects of the study. Full details of the methodology have been previously published [18–20].
To examine the representiveness of the NWAHS sample with regard to age, sex, area of residence and socio-economic status, a comparison was made using Australian Bureau of Statistics (ABS) Census figures. Socio-economic status was measured using the Socio Index for Areas, Index of Relative Social Disadvantage (SEIFA IRSD) .
To compare the other demographics and social characteristics of the respondents and the population estimates of key health risk factors, a comparison against a population-based survey, the South Australian Surveillance and Monitoring System (SAMSS), was undertaken. SAMSS is a representative, on-going, population household telephone interview surveillance/survey of the South Australian population based on EWP sampling and has operated each month since July 2002 using a consistent methodology . This involves a random sample of SA households with one person selected at random in each household according to next birthday. Trained health interviewers interview respondents using computer assisted telephone interviewing (CATI) technology and there is no replacement for non-respondents. From July 2002 to June 2004, n = 2904 adults in the NW suburbs of Adelaide were interviewed providing a non-replacement response rate of 68.7%. To compare physical activity rates, data from the South Australian Health Monitor were used. Methodology of this CATI survey, operated three times a year, is similar to SAMSS and has been detailed elsewhere . This is a separate comparable survey with a separate sample.
While the questions asked in NWAHCS and SAMSS were identical for age, sex, country of birth, household income, alcohol consumption, height and weight (to calculate body mass index (BMI)), current high blood pressure, current high cholesterol, physical activity and self-reported health status there were slight differences in wording of the question for highest education level, marital status, work status variables and smoking status. Questions on height, weight, blood pressure and cholesterol were only asked of the second half of the respondents although measurements in the clinic were undertaken on all participants.
All analyses were limited to data on respondents aged 18 years and over in the same geographical area to correspond to the NWAHCS sample. Data were weighted by age, sex, region and probalility of selection within the household to the 2001 ABS Census data for SA to provide estimates that were representative of the region's population. The comparison for age and sex using the ABS data used both weighted and un-weighted data. Significance was tested using SPSS V12.0 and EpiInfo Version 6 X 2 tests with a 0.05 level of significance. Adjusted standardized residuals were obtained using the methods of Haberman  and were used to test deviations from expected values separately in each cell. Bonferroni corrections were applied for multiple testing.
Age and sex comparison between 2001 Census and NWAHCS
ABS 2001 Census
18 to 29 years
30 to 39 years
40 to 49 years
50 to 59 years
60 to 69 years
70 years and over
Area of residence
SEIFA Index of relative socio-economic disadvantage – comparison between 2001 Census and NWAHCS
ABS 2001 Census
2001 SEIFA Index of Relative Socio-economic Disadvantage – quintile
Demographic comparison between NWAHCS and other comparable survey
(NW Adelaide Study area)
Highest education level obtained
Trade / Apprenticeship / Certificate / Diploma
Bachelor degree or higher
Married or living with partner
Home duties, retired, other
Country of birth
UK or Ireland
Europe, The USSR & the Baltic States
Gross annual household income
Up to $40,000
$40,001 – $80,000
More than $80,000
Risk factor comparison between NWAHCS and other comparable survey
(NW Adelaide Study area)
Non or ex-smokers
Non drinkers, no risk
Intermediate to very high risk
Current high blood pressure
Current high cholesterol
Low exercise level
Moderate exercise level
High exercise level
Undertakes low to high levels of exercise
Self-reported general health status
Excellent, very good or good
Fair or poor
This cohort study has offered a unique opportunity to study chronic disease and related risk factors and to define the relationship between lifestyle and health and disease in the Australian population. Cohort studies are one of the most important tools for epidemiological investigation but random sampling cohort studies are often marred by biased samples, low response rates and high loss to follow-up [12, 17]. Erroneous conclusions can be made if confounding factors are not incorporated in analytical comparisons and models . Bias can be corrected providing confounding was anticipated and confounding factors are appropriately controlled [26, 27]. This analysis has highlighted the variables that need consideration in future analyses associated with the NWAHCS.
This analysis has shown that in terms of bias associated with risk factors, the cohort participants are not dramatically unlike the community they represent. Their overall self-reported health status is the same, there is the same proportion of current smokers, their overall BMI status (except for underweight) is similar and the same proportion had current high blood pressure or high cholesterol readings. The only major difference was in terms of alcohol consumption with the cohort participants more likely to consume alcohol at an intermediate to high risk level.
In terms of demographic and social characteristics there were no differences by marital status or work status. The un-weighted comparison showed that less younger people and more older persons were recruited into the study. NWAHCS participants were also more likely to be born in UK or Ireland and less likely to be Australian born. These demographic differences could be explained by the fact that NWAHCS participants knew they were being recruited into a bio-medical cohort study with an outlay of personal time and effort required. The clinical meaning associated with the bias associated with this recruitment means we are missing out on the younger persons (expected to be healthier) and gaining more older persons (expected to be unhealthy) although the weighting of the data would counteract some of this bias. The country of birth differences should not affect clinical results as Australian-born and those born in UK/Ireland have similar heritages and social indicators. All future analyses and assumptions will take into account these differences and the fact that the study participants were more likely to have middle levels of education and be more likely to be in the middle levels of income.
The strength of this study lies in its representative nature, the large random sample and the high response rate. Although the response rate associated with the complete study involvement, including obtaining blood and other bio-medical measurements, was 49.6% (69% of people interviewed by telephone), this is high when compared to other recent, comparable Australian studies. The AusDiab study recorded a response rate of 28% and a recent pilot for a national Australian biomedical study reported obtaining blood from 23% of their sample . There is a trend towards lower response rates in all types of population surveys as people protect their privacy, are overwhelmed by marketing telephone calls or mail outs. The additional commitments associated with involvement in a cohort study add to respondent burden. To overcome some of the initial bias afforded to the response rate, the data were weighted by age and sex. The weights reflect unequal sample inclusion probability and compensate for differential non-response. Theoretically the weighted analyses should provide reliable population estimates of health phenomena.
To increase the initial response rate the study team implemented a range of well recognized survey techniques. These included consideration of timing of the initial phone contact, timing of phone call and technique (questionnaire length, size), training of recruitment staff, marketing, branding, and a free-call telephone number for inquiries . Early in the recruitment stage, qualitative interviews were conducted with subjects unwilling to participate in the cohort study and the findings incorporated into further recruitment procedures.
Limitations to this analysis include the use of data collected using mixed modes with comparisons based on data collected by telephone and self completion. Bias is known to exist by method of collection, especially in regard to socially desirable responses . An additional weakness of the study was the lack of data on non-responders. Although some data were collected on people who refused to participate in the biomedical components of the study, a comparison with people unable to be contacted (non-responders to the recruitment telephone call) was not possible due to data limitations on non-responders. The self-report nature of the data collection could also contain an element of bias and therefore be seen as a limitation. The use of biomedical data in addition to the self-reported data in this study (height, weight, and blood pressure and cholesterol measurements) will allow comparisons to be made between reported and measured variables. This analysis is planned.
Many studies have assessed the characteristics of participants and non-participants in population surveys and questionnaire-based cohort studies. Details on, in-depth analysis of, and subsequent publishing of the initial samples for cohort studies that have been initiated in the last decade, in which participants commit to clinic assessments, are few. This study has shown that for this population, people who have risk factors for ill-health were just as likely as others to participate. This is of relevance for researchers interested in establishing a bio-medical cohort study and offers positive encouragement that the huge financial and human resource costs are worthwhile.
We are most grateful to the clinic and recruiting staff for their enormous contribution to the success of the study, and for the generosity of the NWAHS participants in the giving of their time and effort.
The study was initially funded by grants from The University of Adelaide and the South Australian Department of Health.
- Messerli HF, Mittler BS: Framingham at 50. The Lancet. 1998, 352 (9133): 1006-10.1016/S0140-6736(05)60073-6.View ArticleGoogle Scholar
- Marwick C: NHANES III – Health data relevant for aging nation. JAMA. 1997, 277 (2): 100-102. 10.1001/jama.277.2.100.View ArticlePubMedGoogle Scholar
- Schouten LJ, Goldbohm RA, van den Brandt PA: Height, weight, weight change and ovarian cancer risk in the Netherlands cohort study on diet and cancer. Am J Epidemiol. 2003, 157 (5): 424-33. 10.1093/aje/kwf224.View ArticlePubMedGoogle Scholar
- Michels KB, Rosner BA, Manson JE, Stampfer MJ, Walker AM, Willett WC, Hennekens CH: Prospective study of calcium channel blocker use, cardiovascular disease and total mortality among hypertensive women: the Nurses Health Study. Circulation. 1998, 97 (16): 1540-1548.View ArticlePubMedGoogle Scholar
- Bergmann MM, Bussaa U, Boeing H: Follow-up procedures in EPIC – Germany – data quality aspects. European Prospective Investigation into Cancer and Nutrition. 1999, 43 (4): 225-34.Google Scholar
- Smith K, Joshi H: The Millennium Cohort Study. Population Trends. 2002, 107: 30-4.PubMedGoogle Scholar
- Giles GG, English DR: The Melbourne Collaborative Cohort Study. 2002, IARC Scientific Publications, 156: 69-70.Google Scholar
- Brown WJ, Bryson L, Byles JE, Dobson AJ, Lee C, Mishra G, Schofield M: Women's Health Australia: recruitment for a national longitudinal cohort study. Women and Health. 1998, 28 (1): 23-40. 10.1300/J013v28n01_03.View ArticlePubMedGoogle Scholar
- Knuiman MW, Divitini ML, Olynyk KJ, Cullen DJ, Bartholomew HC: Serum ferritin and cardiovascular disease: a 17-year follow-up study in Busselton, Western Australia. Am J of Epidemiology. 2003, 158 (2): 144-9. 10.1093/aje/kwg121.View ArticleGoogle Scholar
- Eldredge J: Cohort studies in health sciences librarianship. Journal of Medical Library Association. 2002, 90 (4): 380-392.Google Scholar
- Grimes DA, Schultz KF: Cohort studies: marching towards outcomes. The Lancet. 2002, 359 (9303): 341-345. 10.1016/S0140-6736(02)07500-1.View ArticleGoogle Scholar
- Szklo M: Population-based cohort studies. Epidemiological Reviews. 1998, 20 (1): 81-90.View ArticleGoogle Scholar
- Deeg DJH: Attrition in longitudinal population studies: Does it affect the generalizability of the findings?. Journal of Clinical Epidemiology. 2002, 55: 213-215. 10.1016/S0895-4356(01)00472-3.View ArticleGoogle Scholar
- Samet JM, Munoz A: Perspective: Cohort studies. Epidemiological reviews. 1998, 20 (1): 135-136.View ArticleGoogle Scholar
- Willett WC, Colditz GA: Approaches for conducting large cohort studies. Epidemiological Reviews. 1998, 20 (1): 91-99.View ArticleGoogle Scholar
- Zaccai JH: How to assess epidemiological studies. Post Graduate Medicine. 2003, 80: 140-147.View ArticleGoogle Scholar
- Doll R: Cohort studies: history of the method. I Prospective cohort studies. Sozial- und Praventivmedizin. 2001, 46 (2): 75-86. 10.1007/BF01299724.View ArticlePubMedGoogle Scholar
- Grant JF, Chittleborough CR, Taylor AW, Dal Grande E, Wilson DH, Phillips PJ, Adams RJ, Cheek J, Price K, Gill T, Ruffin RE, the North West Adelaide Health Study Team: The North West Adelaide Health Study: methodology and baseline segmentation of a cohort along a chronic disease continuum. Epidemiologic Perspectives & Innovations.
- Taylor A, Dal Grande E, Chittleborough C, Cheek J, Wilson D, Phillips P, Ruffin R: Key biomedical findings, policy implications and research recommendations. 2002, Adelaide: SA Department of HealthGoogle Scholar
- Population Research and Outcome Studies. North West Adelaide Health Study. Brief report 2002–07. [accessed: May 2005 ], [http://www.dh.sa.gov.au/pehs/PROS/br-nwahs-study02-07.pdf]
- Australian Bureau of Statistics. Census of Population and Housing: Socio-economic Indexes for Areas. Information paper. ABS Catalogue no. 2039.0. 1996Google Scholar
- The South Australian Monitoring and Surveillance System (SAMSS) Brief Report. 2002 – 20. Department of Human Services, South Australia, [http://www.dh.sa.gov.au/pehs/PROS/br-samss02-20.pdf]
- Population Research and Outcome Studies. The Health Monitor Survey methodology. Brief report 2002–12. [accessed: May 2005], [http://www.dh.sa.gov.au/pehs/PROS/br-monitor-method02-12.pdf]
- Haberman SJ: Analyses of qualitative data. 1978, New York, Academic Press, 1:Google Scholar
- Davis HTO, Crombie IK: Bias in cohort studies. Hospital Medicine. 2000, 61 (2): 133-135.View ArticleGoogle Scholar
- Grimes DA, Schulz : Bias and causal associations in observational research. The Lancet. 2002, 359 (9302): 248-252. 10.1016/S0140-6736(02)07451-2.View ArticleGoogle Scholar
- Fang CT, Shau WY: Epidemiological Research. The Lancet. 2002, 360: 258-10.1016/S0140-6736(02)09475-8.View ArticleGoogle Scholar
- Coyne TJ, Findley MG, Ibiebele TI, Forman DW: Overweight and obesity in Australia: an underestimation of the true prevalence. Medical J of Australia. 2004, 180 (2): 9304-Google Scholar
- Hunt JR, White E: Retaining and tracking cohort study members. Epidemiological Reviews. 1998, 20 (1): 57-70.View ArticleGoogle Scholar
- De Vaus DE: Surveys in Social Research. 2002, Allen & Unwin. Australia, 130-ISBN 1 86508 611 8, 5Google Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2458/6/11/prepub