The National Women's Health Study: assembly and description of a population-based reproductive cohort
© Maconochie et al; licensee BioMed Central Ltd. 2004
Received: 07 February 2004
Accepted: 07 August 2004
Published: 07 August 2004
Miscarriage is a common event but is remarkably difficult to measure in epidemiological studies. Few large-scale population-based studies have been conducted in the UK.
This was a population-based two-stage postal survey of reproductive histories of adult women living in the United Kingdom in 2001, sampled from the electronic electoral roll. In Stage 1 a short "screening" questionnaire was sent to over 60,000 randomly selected women in order to identify those aged 55 and under who had ever been pregnant or ever attempted to achieve a pregnancy, from whom a brief reproductive history was requested. Stage 2 involved a more lengthy questionnaire requesting detailed information on every pregnancy (and fertility problems), and questions relating to socio-demographic, behavioural and other factors for the most recent pregnancy in order to examine risk factors for miscarriage. Data on stillbirth, multiple birth and maternal age are compared to national data in order to assess response bias.
The response rate was 49% for Stage 1 and 73% for the more targeted Stage 2. A total of 26,050 questionnaires were returned in Stage 1. Of the 17,748 women who were eligible on the grounds of age, 27% reported that they had never been pregnant and had never attempted to conceive a child. The remaining 13,035 women reported a total of 30,661 pregnancies. Comparison of key reproductive indicators (stillbirth and multiple birth rates and maternal age at first birth) with national statistics showed that the data look remarkably similar to the general population.
This study has enabled the assembly of a large population-based dataset of women's reproductive histories which appears unbiased compared to the general UK population and which will enable investigation of hard-to-measure outcomes such as miscarriage and infertility.
Despite improvements in obstetric care in the UK over the past fifty years, it is estimated that around one in five pregnancies will end in miscarriage (fetal death before 24 weeks) [1, 2]. The personal and public health impact of pregnancy loss is a neglected area in medical research and strategies of prevention remain outside mainstream medical services.
Although many large-scale population-based studies of miscarriage risk have been conducted elsewhere [3–10], relatively few such studies have been conducted in the UK, and most of these have been occupational [11–14]. There are no registers of miscarriage or routine data collection systems which would allow linkage of miscarriages to individual women in the UK . There are thus no national prevalence estimates which can be used as reference for UK-based clinical or epidemiological studies. In addition, although there is now greater knowledge of how the risk of miscarriage changes with maternal age and previous history of miscarriage , the influence and interaction of biological, behavioural and social risk factors are less well-understood. The lack of reliable information on risk factors, and the confusion surrounding ad hoc reports of spurious associations, makes research in this area of great importance.
Studies of miscarriage have tended to be clinical-based, and are thus subject to selection bias. For example, gestations are later among miscarriages reaching hospital-based clinics. Many miscarriages are managed at home, and some are not reported to a clinician. Not only is miscarriage hard to measure, and different clinical sources rarely see the full range of cases, but reported risks of miscarriage tend to be pregnancy-rather than woman-based: estimates of risk tend to relate to the proportion of pregnancies ending in miscarriage, and there are very few studies examining the risk of experiencing one, two or more miscarriages, or the chances of conceiving following a miscarriage . Large prospective cohort studies are theoretically the ideal design, but take time and are prohibitively expensive . An alternative and practical approach is a survey asking the women themselves for their full reproductive history, including all fetal losses at all gestations.
An increasing number of couples are also seeking help for problems achieving a pregnancy. Although it is estimated that up to 15% couples experience such problems , few population-based prevalence studies have been conducted in the UK, particularly where fertility problems have been treated solely by the general practitioner using ovarian stimulation.
We now report on a large UK population-based survey of reproductive health, the National Women's Health Study. The study design was developed from several other large epidemiological surveys of reproductive outcome which showed that a postal method could be used to obtain full reproductive histories from large study populations [13, 14, 17, 18]. The aim of the study was to obtain population-based prevalence estimates relating to miscarriage and infertility, and to obtain good quality data on potential risk factors for miscarriage to be used when advising and counselling women who have suffered miscarriage and those who wish to reduce their risk of future pregnancy loss. The design of the study, together with response rates and description of the study population, is presented in this report. Further reports on risk factors for miscarriage, plus population-based estimates of miscarriage and of pregnancies conceived using assisted reproduction techniques will follow.
This was a population-based cross-sectional postal survey of reproductive histories of adult women living in the United Kingdom in 2001, designed to enable the construction of a retrospective population-based reproductive cohort and a case-control study of risk factors for miscarriage. A sample of women was randomly selected from electronic electoral registers for England, Wales, Scotland and Northern Ireland held by the company Eurodirect . All UK citizens aged 18 and over are eligible to vote; registration is voluntary, but in 2001 around 98% of the entire resident population were on the electoral register , the remainder being largely non-UK citizens and iterant population. At the time of survey there was no opt-out clause for those who did not wish to be on an electronic version of the electoral register, so the sampling frame contained all UK residents eligible (and registered) to vote.
In order to reduce possible biases associated with memory, we aimed for a sample aged 55 years and under at survey. Date of birth is not, however, routinely recorded on the electoral register. To avoid unnecessary mailing and expense, we therefore made use of a probabilistic process offered by Eurodirect based on forename, whereby the sampling frame was restricted to women thought likely to be aged 55 and under on the basis of their name. This process was based on empirical data relating to birth certificates going back to the beginning of the 20th century, from which it could be calculated that, for example, those named "Elsie" are likely to be aged over 55, and those named "Kylie" under 55 years. Predictions are further refined by examination of combinations of names within a household (a "Jane" married to or living with an Alfred likely to be older than a "Jane" married to or living with a "Darren") and length of residency (e.g. someone registered to vote at the same address for 12 years has to be over 30). We requested a random sample of 61,000 women likely to be aged 55 and under (sample size calculations based on achieving at least 80% power for key risk factors in the case-control analysis, and cost). After removing those known to be under age 18 at study (those turning 18 in the year of registration are allowed to register early, giving date of birth), the final sample consisted of 60,814 women.
The study received approval from the Trent Multi-Centre Research Ethics Committee and the Ethics Committee of the London School of Hygiene & Tropical Medicine.
The postal survey had two stages. Stage one consisted of a single-page "screening" questionnaire which asked for details of all pregnancies experienced by study participants, as well as periods of infertility and infertility treatment. This form was sent to the whole sample and included "opt-out" boxes to be ticked if the recipient had never been pregnant and had never attempted to have children, and/or was over age 55, and/or did not wish to take part. The second stage of the study consisted of a longer postal questionnaire which was sent to all those responding to Stage 1 who had ever been pregnant or who reported ever attempting to conceive and who agreed to be re-contacted. Excluded from this second stage were women who had had one or more termination for non-medical reasons (i.e. for reasons other than that a defect had been identified in the fetus or that continuing the pregnancy would put the mother at risk) and no other pregnancies. The Stage 2 questionnaire requested more general detail about the women (including height, age at menarche, educational level, marital status and details of infertility problems, treatment and diagnosis, if appropriate); detailed information on all pregnancies (including whether the pregnancy was the planned, the result of infertility treatment, father's date of birth and whether father had remained the same); plus socio-demographic and behavioural details relating to the most recent pregnancy. These details included questions relating to weight at start of pregnancy, nausea, smoking, coffee and alcohol consumption, diet, vitamin intake, ill health, air travel, sexual intercourse, occupation and stress levels. The most recent pregnancy was selected to minimise biases related to recall, and since it could be at the start, middle or end of the reproductive careers of these women whose ages at survey ranged from 18 to 55 years potential biases relating to ending reproductive careers on a "success" were not expected to be large. For those whose most recent pregnancy had ended in miscarriage (defined as fetal death at <24 weeks gestation), brief information relating to clinical management of miscarriage and the advice given was also requested. Permission to access clinical notes relating to outcomes reported in the questionnaire, and to contact the women for further study if needed, was also requested. In order to increase the number of cases for the case-control analysis of risk factors for miscarriage, women who had had a miscarriage recently (since 1995) but whose last pregnancy was not a miscarriage were sent a third questionnaire. This was a shortened version of the Stage 2 questionnaire, containing only those questions relating to biological, socio-demographic and behavioural details of the most recent pregnancy, but now requesting these details in relation to the most recent miscarriage. Such women then had two pregnancies in case-control analyses and standard errors were computed using a robust method based on the "sandwich estimate" to account for this statistically.
A free telephone helpline was run throughout the study, to answer queries and refer on to other organizations for professional help, if appropriate, and this was well used.
All analyses in this paper were performed using Stata statistical software . To investigate possible selection bias we compared stillbirth and multiple delivery rates with rates in the general population. For this we obtained annual registered stillbirth risks and registered multiple delivery rates by maternal age for England and Wales, 1980–2001  (data for 2002 was estimated from that for 2001). Standardised registered stillbirth ratios (SRSR) and standardised multiple delivery rates (SMDR) were then calculated using logistic regression analysis (offsetting the log odds of the population risk) . The unit of analysis for stillbirths was a registered birth. A registered livebirth is defined as a baby born alive at any gestation, registered stillbirth being defined as a fetal death at 28 weeks or more gestation until the end of 1992, and at 24 weeks or more gestation from 1993 onwards. Where gestational age was not available from Stage 2 data, a pregnancy was considered to be a stillbirth if it was so described. Forty-one (40%) of the total 102 stillbirths in the analysis fell into this category. For multiple delivery, the unit of analysis was a pregnancy containing at least one livebirth or registered stillbirth (as described above). For the purposes of the analyses presented in this paper (comparisons with the general population), a pregnancy was only considered multiple if it contained two or more babies who were liveborn or (registered) stillborn in order to be consistent with the definitions used in the national data. Thus, for example, a twin pregnancy occurring before 1993 and resulting in a livebirth and a fetal death at less than 28 weeks was considered to be a singleton pregnancy in this analysis. Average maternal age at first birth, if live, was also compared with that in the general population. Annual average maternal age at first (registered) birth, if live, was obtained with denominators for England and Wales, 1980–2001  and re-calculated for 5-year periods. This national data was available for births within marriage only. Marital status of mother at time of birth was known only for the most recent pregnancy (or most recent miscarriage since 1995) in this dataset. For the NWHS average maternal age was therefore calculated for all first registered births, if live. No formal statistical comparisons of maternal age were made, partly because the numbers were so large that slight, non-meaningful, nuances in the data would give a statististically significant result, and render the comparison meaningless, and partly because the average ages in the general population, though comparable, were expected to be similar but slightly older in the general population data owing to the fact that the data related to births within marriage only. Births where the date of birth or maternal age were not known were excluded from all comparisons with population data.
The National Women's Health Survey – response rates
TOTAL QUESTIONNAIRES POSTED
Did not wish to participate
Aged >55 years or otherwise ineligible3
Aged < = 55 years but never attempted to have children
Aged < = 55, ever attempted to have children
- Never pregnant
- Ever pregnant
TOTAL QUESTIONNAIRES POSTED
No longer wished to participate
- Attempted pregnancy, never pregnant
- Ever pregnant 4
NWHS Stages 1 and 2 – description of women reporting one or more pregnancy, and of the pregnancies they reported
TOTAL NO. WOMEN IN ANALYSIS
Age at survey (years)
> = 40
Mean age (SD) 1
Year of first pregnancy
Total number of pregnancies reported per woman
> = 6
2 (1 – 18)
2 (1 – 18)
No dates given for any pregnancies
All pregnancies occurred before 1980
Pregnancies before and after 1980
Pregnancy history commenced 1980 onwards
All pregnancies conceived after 31/03/2000
TOTAL REPORTED PREGNANCIES
Outcome of pregnancy
Livebirth, surviving >7 days
Livebirth, early neonatal death
Termination for medical reasons5
Termination for non-medical reasons6
Ongoing (current) pregnancy
Year of pregnancy end
11,424 (88%) women ever attempting to have children (successfully or unsuccessfully) agreed to participate in the second stage of the study. Of these 596 (5%) were not sent a Stage 2 questionnaire, 212 because they had only ever had one or more termination of pregnancy for non-medical reasons, and 384 because their Stage 1 form arrived back after mailing had ended. A total of 10,828 women were thus sent a second stage questionnaire. The response to this second stage was high (73%), though 2% of women had decided that they no longer wished to participate (Table 1). The 7,702 women completing a Stage 2 questionnaire, and the 18,391 pregnancies they reported, are described in Table 2. Their characteristics are almost identical to those of Stage 1, indicating that Stage 2 responders were an apparently unbiased subset of those responding to Stage 1. 5,777 (75%) women responding to Stage 2 gave signed consent for us to access their medical notes, with 6,963 (90%) agreeing to be contacted again in the future, if required.
Comparison with national data
Comparison with population birth data of reported births in Stages 1 and 21 of the National Women's Health Study occurring since 19802
Total livebirths & stillbirths3
SRSR4 (95% CI)
(94 – 139)
(79 – 132)
MULTIPLE (REGISTERED) DELIVERY 5
No. multiple deliveries5
SMDR4 (95% CI)
(99 – 126)
(93 – 126)
AVERAGE MATERNAL AGE AT FIRST 6 (LIVE)BIRTH (years)
No. first6 livebirths
Mean (SD) age7
England & Wales8
Year of delivery
Age at first (live) birth was remarkably similar to national data for both Stage 1 and Stage 2 data (Table 3). Exactly as expected, though showing no evidence to suggest any biases with respect to maternal age, average age at first birth was very slightly higher for the national data, since it related to births within marriage only, whereas the NWHS data related to all births (marital status at delivery was unknown).
Using a novel method, the National Women's Health Study has enabled a large UK population-based dataset to be assembled, comprising full reproductive histories, including any history of infertility, for 13,035 women, 12,695 of whom had conceived 30,661 pregnancies. We have obtained further detailed information for 7,702 of these women (18,391 pregnancies), including fertility diagnoses for both male and female partner (if appropriate), and lifestyle and behavioural risk factors for the most recent pregnancy. Seventy-five percent of these women consented to their medical notes being accessed in relation to information reported in the questionnaire, and 90% agreed to be contacted again, thus providing the means to carry out a population-based cohort study of these women at some time in the future.
UK population-based data, collected at government level by England & Wales, Scotland and Northern Ireland, relate to registered births (live and still) and terminations of pregnancy, with Scotland also routinely collecting maternity data on hospital deliveries at any gestation. The National Women's Health Study goes one step further than this, providing the whole reproductive picture. Rather than being a pregnancy-based, cross-sectional survey, the data collected for each woman covers the complete spectrum of reproductive outcomes from infertility problems through miscarriage, ectopic pregnancies and terminations (for both medical and non-medical reasons), to live and stillbirths, and does not rely on legal definitions for inclusion in the dataset. Furthermore, unlike most epidemiological studies of adverse reproductive outcome such as miscarriage, the data source is not clinical (which, for miscarriage, leads to inevitable biases relating to gestational age), but relates to women selected randomly from the UK electoral register. And for outcomes such as infertility no other data currently exist to enable estimation of how many pregnancies in the population as a whole result from fertility treatment.
The study does rely on maternal recall and this could be a source of bias. Studies of self-reported reproductive history and exposures relating to reproductive events have, however, found maternal recall to have acceptably high reliability, and to be little affected by time from event [24–26].
In terms of the key reproductive indicators of stillbirth, multiple delivery rates and maternal age at first birth, the data look remarkably similar to the general population. We therefore feel confident that response was unlikely to be related to adverse reproductive outcome. Indeed, the average age at survey of around 40 years, coupled with average ages at first birth which are exactly as would be expected from general population data, could be seen to indicate that non-responders to the survey tended to concentrate among younger women who had not yet tested their fertility. In addition, we feel confident that those responding to the more detailed Stage 2 questionnaire are an unbiased sample of those responding to Stage 1. Both Stage 1 and Stage 2 data can thus can be considered unbiased with respect to reproduction, and representative of patterns among all women in the UK population who have ever tried to have children, hence prevalence estimates might be taken as unbiased estimates of hard-to-measure outcomes such as miscarriage and pregnancies conceived through assisted reproduction techniques. Such data will be invaluable as population-based reference data for epidemiological studies of reproduction.
In addition to both pregnancy-and woman-based population prevalence estimates, further papers to follow include reports of case-control analyses of behavioural and lifestyle risk factors for miscarriage.
In summary, we have assembled a large population-based dataset of women's reproductive histories which appears representative of the general UK population and which will enable investigation of hard-to-measure outcomes such as miscarriage and infertility.
We thank Ruth Bender Atik and Barbara Hepworth-Jones from The Miscarriage Association who instigated the study. The project was funded by the National Lottery Community Fund (through the Miscarriage Association), and by the Miscarriage Association.
- Garcia-Enguidanos A, Calle ME, Valero J, Luna S, Dominguez-Rojas V: Risk factors in miscarriage: a review. Eur J Obstet Gynecol Reprod Biol. 2002, 102 (2): 111-9. 10.1016/S0301-2115(01)00613-3.View ArticlePubMedGoogle Scholar
- Savitz DA, Hertz-Picciotto I, Poole C, Olshan AF: Epidemiologic measures of the course and outcome of pregnancy. Epidemiol Rev. 2002, 24 (2): 91-101. 10.1093/epirev/mxf006.View ArticlePubMedGoogle Scholar
- Naylor AF, Warburton D: Sequential analysis of spontaneous abortion. II. Collaborative study data show that gravidity determines a very substantial rise in risk. Fertil Steril. 1979, 31 (3): 282-6.View ArticlePubMedGoogle Scholar
- Risch HA, Weiss NS, Clarke EA, Miller AB: Risk factors for spontaneous abortion and its recurrence. Am J Epidemiol. 1988, 128 (2): 420-30.PubMedGoogle Scholar
- Rowland AS, Baird DD, Shore DL, Weinberg CR, Savitz DA, Wilcox AJ: Nitrous oxide and spontaneous abortion in female dental assistants. Am J Epidemiol. 1995, 141 (6): 531-8.PubMedGoogle Scholar
- Nybo Andersen A-M, Wohlfahrt J, Christens P, Olsen J, Melbye M: Maternal age and fetal loss: population based register linkage study. BMJ. 2000, 320: 1708-12. 10.1136/bmj.320.7251.1708.View ArticlePubMedGoogle Scholar
- Li DK, Odouli R, Wi S, Janevic T, Golditch I, Bracken TD, Senior R, Rankin R, Iriye R: A population-based prospective cohort study of personal exposure to magnetic fields during pregnancy and the risk of miscarriage. Epidemiology. 2002, 13 (1): 9-20. 10.1097/00001648-200201000-00004.View ArticlePubMedGoogle Scholar
- Slama R, Werwatz A, Boutou O, Ducot B, Spira A, Hardle W: Does male age affect the risk of spontaneous abortion? An approach using semiparametric regression. Am J Epidemiol. 2003, 157 (9): 815-24. 10.1093/aje/kwg056.View ArticlePubMedGoogle Scholar
- Xiong X, Fraser WD, Demianczuk NN: History of abortion, preterm, term birth, and risk of preeclampsia: a population-based study. Hum Reprod. 2002, 17 (6): 1649-56. 10.1093/humrep/17.6.1649.View ArticleGoogle Scholar
- Osborn JF, Cattaruzza MS, Spinelli A: Risk of spontaneous abortion in Italy, 1978-and the effect of maternal age, gravidity, marital status, and education. Am J Epidemiol. 2000, 151 (1): 98-105.View ArticlePubMedGoogle Scholar
- Pandya PP, Snijders RJ, Psara N, Hilbert L, Nicolaides KH: The prevalence of non-viable pregnancy at 10–13 weeks of gestation. Ultrasound Obstet Gynecol. 1996, 7 (3): 170-3. 10.1046/j.1469-0705.1996.07030170.x.View ArticlePubMedGoogle Scholar
- Elliott RC, Jones JR, McElvenny DM, Pennington MJ, Northage C, Clegg TA, Clarke SD, Hodgson JT, Osman J: Spontaneous abortion in the British semiconductor industry: An HSE investigation. Health and Safety Executive. Am J Ind Med. 1999, 36 (5): 557-72. 10.1002/(SICI)1097-0274(199911)36:5<557::AID-AJIM8>3.0.CO;2-Q.View ArticlePubMedGoogle Scholar
- Doyle P, Maconochie N, Roman E, Davies G, Smith P, Beral V: Fetal deaths and congenital malformation in the offspring of nuclear industry employees: a report from the Nuclear Industry Family Study . Lancet. 2000, 356: 1293-1299. 10.1016/S0140-6736(00)02812-9.View ArticlePubMedGoogle Scholar
- Maconochie N, Doyle P, Davies G, Lewis S, Prior S, Sampson P, Pelerin M: The study of reproductive outcome and the health of offspring of UK veterans of the Gulf war: methods and description of the study population. BMC Public Health. 2003, 3: 4-10.1186/1471-2458-3-4.View ArticlePubMedPubMed CentralGoogle Scholar
- Regan L, Braude PR, Trembath PL: Influence of past reproductive performance on risk of spontaneous abortion. BMJ. 1989, 299 (6698): 541-5.View ArticlePubMedPubMed CentralGoogle Scholar
- Templeton A: Infertility-epidemiology, aetiology and effective management. Health Bull (Edinb). 1995, 53 (5): 294-8.Google Scholar
- Roman E, Doyle P, Ansell P, Bull D, Beral V: The health of children of radiographers in the UK. Occup Environ Med. 1996, 53: 73-79.View ArticlePubMedPubMed CentralGoogle Scholar
- Doyle P, Roman E, Beral V, Brookes M: Spontaneous abortion in dry cleaning workers potentially exposed to perchloroethylene. Occup Environ Med. 1997, 54 (12): 848-53.View ArticlePubMedPubMed CentralGoogle Scholar
- Eurodirect. [http://www.eurodirect.co.uk]
- Office for National Statistics: Parliamentary electors on the 2000 and 2001 electoral registers – revised figures. United Kingdom; England; Wales; Scotland; Northern Ireland. London. 2001, [http://www.statistics.gov.uk/StatBase/xsdataset.asp?vlnk=3258&Pos=2&ColRank=2&Rank=816]Google Scholar
- Statacorp: Stata Statistical Software: Release 8.0.College Station, TX: Stata Corporation. 2003,Google Scholar
- Office for National Statistics: Birth Statistics. London: HMSO. 1970, 27-30. –2001 Series FM1 nos 13,18Google Scholar
- Breslow NE, Day NE: Statistical methods in cancer research. The analysis of cohort studies Lyons: International Agency for Research on Cancer. 1980, II:Google Scholar
- Olson JE, Shu XO, Ross JA, Pendergrass T, Robison LL: Medical record validation of maternally reported birth characteristics and pregnancy-related events: a report from the Children's Cancer Group. Am J Epidemiol. 1997, 145 (1): 58-67.View ArticlePubMedGoogle Scholar
- Farrow A, Farrow SC, Little R, Golding J: The repeatability of self-reported exposure after miscarriage. ALSPAC Study Team. Avon Longitudinal Study of Pregnancy and Childhood. Int J Epidemiol. 1996, 25 (4): 797-806.View ArticlePubMedGoogle Scholar
- Yawn BP, Suman VJ, Jacobsen SJ: Maternal recall of distant pregnancy events. J Clin Epidemiol. 1998, 51 (5): 399-405. 10.1016/S0895-4356(97)00304-1.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2458/4/35/prepub
This article is published under license to BioMed Central Ltd. This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.