Public health research outputs from efficacy to dissemination: a bibliometric analysis
© Milat et al; licensee BioMed Central Ltd. 2011
Received: 11 July 2011
Accepted: 15 December 2011
Published: 15 December 2011
Skip to main content
© Milat et al; licensee BioMed Central Ltd. 2011
Received: 11 July 2011
Accepted: 15 December 2011
Published: 15 December 2011
More intervention research is needed, particularly 'real world' intervention replication and dissemination studies, to optimize improvements in health. This study assessed the proportion and type of published public health intervention research papers over time in physical activity and falls prevention, both important contributors to preventable morbidity and mortality.
A keyword search was conducted, using Medline and PsycINFO to locate publications in 1988-1989, 1998-1999, and 2008-2009 for the two topic areas. In stage 1, a random sample of 1200 publications per time period for both topics were categorized as: non-public health, non-data-based public health, or data-based public health. In stage 2 data-based public health articles were further classified as measurement, descriptive, etiological or intervention research. Finally, intervention papers were categorized as: efficacy, intervention replication or dissemination studies. Inter-rater reliability of paper classification was 88%.
Descriptive studies were the most common data-based papers across all time periods (1988-89; 1998-1999;2008-2009) for both issues (physical activity: 47%; 54%; 65% and falls 75%; 64%; 63%), increasing significantly over time for physical activity. The proportion of intervention publications did not increase over time for physical activity comprising 23% across all time periods and fluctuated for falls across the time periods (10%; 21%; 17%). The proportion of intervention articles that were replication studies increased over the three time periods for physical activity (0%; 2%; 11%) and for falls (0%; 22%; 35%). Dissemination studies first appeared in the literature in 2008-2009, making up only 3% of physical activity and 7% of falls intervention studies.
Intervention research studies remain only a modest proportion of all published studies in physical activity and falls prevention; the majority of the intervention studies, are efficacy studies although there is growing evidence of a move towards replication and dissemination studies, which may have greater potential for improving population health.
Research evidence can enhance public health policy and practice by assisting in the identification and definition of priorities, informing decisions on policy development and implementation and by evaluating the impact of policies and programs [1, 2]. There is increasing recognition of the potential value of research evidence as one of the many factors considered by policy makers and practitioners [2, 3]. Recent years have seen significant efforts by governments and health research funding agencies to facilitate closer links between policy makers and researchers and to target funding to increase the uptake of evidence into policy and practice [4, 5].
Figure 1 describes public health research progression from descriptive studies undertaken to understand a problem, through to intervention dissemination studies testing widespread implementation and adoption of interventions. It is acknowledged that the progression described in the diagram represents an ideal and that intervention research development doesn't always progress in a linear fashion.
In this conceptualization, research that is targeted at understanding the problem includes measurement studies, descriptive studies exploring the frequency, patterns, correlates or predictors of a behavior or health issue and etiological studies, that is, epidemiological and other research studies that investigated a causal relationship between exposure to a risk factor and subsequent illness, disease or health outcome of public health significance.
Efficacy studies are those which evaluate the impact of an intervention (whether it does more good than harm among individuals in the target population) typically when delivered under optimal conditions (or in an ideal setting) . Though there is a distinction between 'explanatory trials' which are implemented under highly controlled conditions and 'pragmatic trials' which are delivered in real world practice , using the Nutbeam and Bauman conceptualization both are considered efficacy research as they use random allocation of participants and their primary focus is on internal validity, establishing causal relationships between an intervention and an outcome . They are also geared to focus on outcomes rather than process evaluation or contextual factors and as such may have poor generalizability .
Intervention replication studies are those which assess the practicality of implementing interventions with demonstrated efficacy, in new populations or settings under real world conditions . They examine whether the core components of the original intervention are faithfully transported to the real-world setting (i.e. the degree of fidelity of the intervention with the original efficacy study), shifting the emphasis from outcomes to consider more closely how to identify the conditions for success. Intervention replication research also begins to assess the impact of the climate and culture in an organization or community on intervention effectiveness [8, 12].
Finally, dissemination studies systematically study processes and factors that lead to widespread use of an evidence-based intervention across a target population or whole population. Its focus is to identify the best methods to enhance the widespread uptake and utilization of effective interventions [11, 12]. Typically in dissemination research, the relative balance between outcome and process evaluation has moved further still, with the primary focus on measuring the process of change and assessing the reach of program dissemination across settings and target populations as the scale of intervention increases. This research involves a comprehensive assessment of impact of the climate and culture in an organization or community on intervention effectiveness to inform potential system adoption or widespread intervention roll-out in communities. As such this research can also measure program implementation in the context of quality management systems [8, 13].
It is increasingly acknowledged that in order to maximize impact on health outcomes, intervention research must move beyond efficacy studies [14, 15] to intervention replication and dissemination studies [8, 16]. These study types implement interventions, ideally with demonstrated efficacy, under real life conditions and examine practical issues important to policy makers such as the extent to which a program can be adapted to meet variations in local need, circumstances and at what cost. These studies lead to widespread implementation and adoption of interventions into usual public health practice . Without more intervention replication and in particular dissemination studies, the potential contribution of research to policy and practice is unlikely to be realized [15, 17].
Recent efforts have been made to codify and quantify the types of research published in the peer-reviewed literature using variants of the typology shown in Figure 1. Sanson-Fisher et al., (2008) found a consistent dearth of intervention research in the fields of smoking, alcohol and physical activity, constituting only 9% to 31% of publications, while descriptive research was the most commonly published research type across time (1987-88, 1997-98 and 2005-06) . However, this study did not distinguish between the types of intervention research conducted.
In the present study, the issues of physical inactivity and injurious falls were examined as both make significant contributions to preventable morbidity and mortality in developed countries [18, 19] and are targeted by public health and chronic disease prevention funding agencies . Research in these areas can make major contributions to providing evidence to underpin important public health policy initiatives [21, 22].
The aims of the study were to: a) determine the proportion of published literature for both health issues that were 'intervention research' and to further categorize 'intervention research' papers into three sub-categories of 'efficacy studies', 'intervention replication studies' and 'dissemination studies'; and b) to determine whether the relative proportion of 'intervention research' and it's sub-categories changed over three time periods.
This study differs from previous research in four important ways: first, it extends previous methods by classifying categorized 'intervention' papers into three new sub-categories to determine the proportion of intervention studies across the spectrum from efficacy to dissemination; second, the time period covered is longer than previous research; third, it assesses larger samples sizes of relevant papers, and finally, addresses a new health priority of falls prevention.
A literature search was conducted using online databases Medline and PsycINFO to locate publications across three time periods: January 1988-December 1989, January 1998-December 1999, and January 2008-December 2009. Citations were located by key word searches and exploding medical subject headings (MeSH). A MeSH and key word pretesting process was undertaken prior to the final data collection using non study time periods in order to maximize the number of intervention studies identified by the search process and to refine the categorization system. A description of the final search process follows.
For Medline, the key word 'physical activity' and the MeSH terms, 'exercise' and 'physical fitness' were searched. For PsycINFO, the MeSH terms 'physical activity', 'exercise' and 'physical fitness' were searched. Abstracts were categorized as physical activity if they: examined physical activity as a primary research question and/or study outcome. While those that examined muscular movement without an intended health or performance benefit were excluded i.e. biomechanical studies describing physical impairment.
For Medline, the key word 'falls' and the MeSH terms 'accidental falls', 'accident prevention', 'prevention', 'aged' were searched individually. For PsycINFO, the MeSH terms 'falls', 'accidental falls', 'accident prevention', 'prevention' and the key word 'aged' were searched. Followed by a combined MeSH and keyword search: accidental falls AND aged; accident prevention AND aged; falls AND aged; accidental falls AND prevention; falls AND prevention.
Falls were defined as an unexpected event in which a person comes to rest on the ground or floor or a lower level . Falls prevention was considered a series of behavioral, medical, pharmacological and or environmental strategies employed to reduce the number of accidental falls suffered by older people . Studies focusing on falls from heights or falls in populations other than older people were excluded. Falls prevention-related medical and pharmacological interventions were considered non-public health if they were not accompanied by some form of education, counseling, environmental and or health care delivery system change.
When the key word search generated a sample greater than 1200 papers for designated health issues the random number generation function in R Statistics  was used to randomly select 1200 records for inclusion in the study sample. The pilot testing coding of 300 papers per health area in a non-study time period suggested that a sample of 1200 would provide between 150-300 data-based public health articles per time period. It was estimated that this sample would be sufficient to estimate a difference in independent proportions of between 5-8% at the 95% confidence level. In stages 1 and 2 abstracts were independently coded by two of the authors (AJM, NC). In stage 3 full citations were used to classify intervention research papers independently by two authors and where different classifications emerged these were discussed amongst coders against the coding criteria and assigned a final categorization. Inter-rater reliability was assessed using approximately 50 cross over records for each time period yielding a total of n = 295 records for comparison. Inter-rater agreement was 88%.
In stage 1 relevant articles on topics of interest were first categorized into one of three categories: non-public health, non-data-based public health, or data-based public health . For this study, research that described efforts to protect and promote health, and to prevent illness, injury and disability on a population or population subgroup level was considered 'public health'. Preventive non-pharmacological interventions conducted in clinical populations were classified as public health in this study, that is, it included studies focusing on primary, secondary and tertiary prevention . Preventive interventions focusing on those suffering rare conditions with population prevalence rates of < 1% i.e. cystic fibrosis, were excluded.
These papers included laboratory and clinical research that explored the topic areas from physiological or basic science perspectives, for example, examination of the impact of physical activity on organ function or metabolism; or the effects of medications on bone density. When the effects of some form of education or counseling or system change were examined in addition to medication, the research was classified as 'public health'.
These papers did not report new data. They included reviews, discussion papers, or commentaries, news, editorials, case reports, conference reports and descriptions of research projects or health programs. Meta-analyses were classified as non-data based public health in this study, as they are generally secondary analyses of already published primary data.
These were original articles reporting new data, both quantitative and qualitative. Included papers examined: prevalence; health behaviors, and associated risk and protective factors have with health, illness, and other variables; determinants or correlates; knowledge and attitudes; and intervention research findings.
In stage 2 data-based public health research articles were further categorized into one of the following categories: measurement, descriptive, etiological or intervention. In Figure 1, measurement, descriptive and etiological studies would all be categorized as 'understanding the problem'.
Papers developed or examined the qualities of a measurement instrument such as reliability, validity, or acceptability. Data collection methods included the use of questionnaires, interviews, physiological assessments, risk screening and observations. Papers that focused on both measurement and descriptive issues were coded as measurement research.
Papers explored the frequency, patterns, correlates or predictors of physical activity, or fall-related injury, or related variables such as knowledge, attitudes, healthcare practices, policy, or legislation. They included epidemiological studies examining frequency or patterns of risk factors and how these may be related to disease at a community or population level and other correlates research.
Epidemiological and other research studies that investigated a causal relationship between exposure to a risk factor and subsequent illness, disease or health outcome of public health significance.
Papers that tested the effectiveness of an intervention to modify preventive health-risk behaviors and/or the implementation of best practices by health care professionals. Intervention publications were defined by the research aims rather than the study design or type of intervention. Papers that focused on both descriptive and intervention issues were classified as intervention research. Intervention research was then further categorized.
Assessment criteria for intervention research categorization
Type of Research
Focus on outcome/impact measures
Focus on process evaluation
External validity (generaliz-ability)
Measures of system Adoption
Cost per participant
*Limited to study subjects - small
*Limited to study subjects (multiple sites) - moderate
*Study subjects in new target groups or settings
*Step wedge trial
*Step wedge trial
*System level - large
*Lower cost per participant.
*Broad population reach (Multiple populations and settings)
*Measures of system adoption costs.
After attempting to apply the above categorization a fourth classification emerged of low quality efficacy research. This latter category assessed the impact of small scale interventions using weak research designs including 'before and after studies' or measured impact of interventions using organizational quality assurance mechanisms. These studies were only identified in the falls prevention literature and as there were very few examples of these studies in the total sample (n = 10) they were collapsed into the efficacy research category.
Frequencies, chi square statistics and chi-square statistics for trend (Χ 2 trend) were calculated in winPEPI .
Number and percentage of publications classified as non-public health, data-based, or non-data-based public health
Publications generated by search
Publications relevant to topica
Non-public health researchb
Data-based public health researchb
Non-data based public health researchb
Of the physical activity papers that were relevant to the topic, the proportion that were data-based public health increased significantly overtime (X2 trend = 69.2, p = 0.001), from 18% in 1988-1989, 29% in 1998-1999 and 36% in 2008-2009. Of the falls papers that were relevant to the topic, the proportion identified as data-based publications increased initially from 48% in 1988-1989, to at 59% in 1998-1999 and stabilized at 55% in 2008-2009.
Number and percentage of data-based public health publications classified as measurement, descriptive, etiological or - intervention research
For physical activity, there was a significant increase in the proportion of descriptive studies published over time (X2 trend = 15.66, p = 0.001) from 47% in 1988-1989 to 65% in 2008-2009. The proportion of measurement articles dropped slightly over time constituting 8% of data-based public health papers in 1988-1989 and 5% in 2008-2009, while there was a significant decrease in the proportion of etiological studies published over time (X2 trend = 23.54, p = 0.001). The proportion of intervention research publications remained stable over time staying at 23% of data-based publications across all time periods.
For falls prevention, the proportion of descriptive studies published progressively decreased over time from 75% in 1988-1989, 64% in 1998-1999 to 63% in 2008-2009. While, the proportion of measurement studies significantly increased over time from 8% in 1988-1989 to 18% in 2008-2009 (X2 trend = 7.23, p = 0.007), etiological studies decreased significantly over time (X2 trend = 7.53, p = 0.006) from 8% in 1988-1989 to 2% in 2008-2009. For falls intervention publications there was a notable but non-significant increase between 1988-1989 from 10% and to 21% in 1998-1999, followed by a modest drop to 17% in 2008-2009 (X2 trend = 0.00, p = 0.96).
Number and percentage of intervention papers classified as efficacy, intervention replication and intervention dissemination studies
There was a significant increase in the proportion of data-based public health papers published over time for physical activity, but not for falls prevention. At all three time periods, publications in each topic were predominantly descriptive, with smaller numbers of intervention, measurement and etiological publications. The proportion of descriptive research studies increased significantly over time for physical activity, despite increasing policymaker interest in solutions generation . While, descriptive research marginally declined for falls prevention perhaps reflecting differences in the research progression in the two fields.
Although recent statements by both governments and funding agencies indicate that more information is required about the impact of health policies and programs [4–6] the results of this study fail to demonstrate that more intervention research is occurring. Rates remained stable over the past three decades for physical activity (23%) and fluctuated for falls prevention (10% 1988-1989; 21% 1998-1999; 17% 2008-2009). The continued dominance of descriptive research in this as well as other earlier studies  is a matter for concern, with relatively few studies testing the effectiveness of interventions across the two topic areas. They highlight a consistent and ongoing under-representation of intervention research over time and across topic areas, suggesting that research development is not progressing toward its full potential in public health. In this review, the problem of sustained and even increased descriptive research applies to physical activity, an area in great need of generalizable evidence.
Descriptive research may be easier to conceptualize, complete and publish than intervention research . In addition researchers may find it easier to attract research resources to undertake descriptive research, and such research is often less intrusive for participants, and may be more rapidly completed and published than intervention research . This coupled with the fact that performance indicators of individual researchers and research groups usually incorporate the number of publications, grants, postgraduate research students, and the amount of competitive research funding as core performance metrics , there is ample incentive to focus on descriptive research over intervention research. Some argue that the system provides incentives for counting what can be measured rather than measuring what counts .
Efficacy studies made up the majority of intervention publications across health issues and time periods. Though efficacy studies remain important, these optimally designed studies are often developed using a level of research resources and individual assessment that are not easily reproduced under real world conditions. Once efficacy is established, the relative scientific comfort of clinical trials needs to give way to the greater uncertainty of working in larger populations and with organizations and communities to engage in more generalizable program and intervention delivery .
As Greenhalgh (2004) states: 'Context and "confounders" lie at the heart of the diffusion, dissemination, and implementation of complex innovations. They are not extraneous to the object of study; they are an integral part of it.'  p. 615. Greenhalgh stresses that the 'context' must become a legitimate objective for scientific study and a legitimate influence on decision making. Intervention replication and dissemination studies provide policy makers and practitioners with context specific information on what needs to be done, by whom, to what standard and at what cost, all prerequisites for optimal adoption of intervention programs into policy or practice.
Notably, there has however been some progression in the types of intervention research published over time, with the proportion of replication studies increasing from zero in 1988-1989 to11% in 2008-2009 for physical activity and of particular note, increasing from zero in 1988-1989 to 35% in 2008-2009 for falls. This latter estimate indicates nearly half of published falls prevention interventions are now more generalizable tests of the replicability of efficacy studies into new and broader settings.
Though it is difficult to speculate as to the reason for the difference in research progression between the fields, a likely contributor is the nature of the falls prevention action. The fact that most falls prevention interventions use individual level outcomes as units of measurement, may make them more amenable to evaluation using randomized control trial designs, subsequently making the transition from efficacy to replication studies easier than it is for population-based physical activity interventions.
However, there remains a dearth of examples of progression to the next arguably more important dissemination phase in the literature. Though dissemination studies increased over time for both issues 3% and 7% in 2008-2009 for physical activity and falls respectively, they still remain a very small proportion of published intervention research, particularly for physical activity. Although rare, current dissemination studies suggest directions for the future, for example, Fortinsky et al. 2008  examined the extent to which fall risk assessment and management practices for older patients were implemented in 19 Medicare-certified home health agencies (HHAs) in southern New England, United States of America. Our findings clearly demonstrate that such systematic study of processes and factors that lead to widespread use of evidence-based interventions remain the exception rather than the rule .
Though there has been some progress in the area of falls prevention, this study reinforces the need for active efforts by funding agencies, research community and governments alike to foster strategic progression of studies across the continuum from efficacy through to dissemination with the ultimate aim of impacting on policy and practice. Though there is increasing acknowledgement of the importance of research translation and funding intervention research , our findings suggest that significant cultural and system change are still required amongst funding decision makers and researchers alike.
The limitations of randomized controlled trial designs for population-level dissemination evidence needs to be better acknowledged by research publishers and funders . Funding agencies and journals should be encouraged to support the generation and publication of more methodologically rigorous intervention replication and dissemination studies. Where appropriate journals should apply broader reviewer and reporting criteria that take into account guidance such as the Medical Research Council 'Developing and evaluating complex interventions', Cochrane Collaboration 'Effective Practice and Organization of Care (EPOC) Criteria for reviewing study designs and data quality'  and Consolidated Standards of Reporting Trials (CONSORT) statement . While funding agencies should also adopt these broader interpretations of research design commensurate with generating policy-relevant evidence , as well as funding more targeted dissemination research grant schemes.
Closer examination of other health areas may again reveal considerable diversity in the extent to which good-quality intervention studies are available for particular topics. This study highlights the importance of the investigation of research progression in other areas as a way of identifying evidence-generating research needs. Investigations of this kind are also needed to shed light on whether the translational research rhetoric matches reality.
The current study has built on bibliometric methods previously used in the published literature . While there may be individual variation in coding, these differences are unlikely to change the trend data, proportion sampled or inter-topic differences. The literature search used in this study provided a snapshot of research outputs within three selected time periods; but as with previous work in this area  the use of only two major health literature databases may under-estimate the true number of published articles. However, this underestimation is likely to be non-differential, in that it would not change relationships in these trends in publication. Another potential limitation of this study is its reliance on MeSH search terms, which have been shown in some instances to identify fewer published studies . Though this study predominantly used MeSH search terms, it also employed key words searches that were pretested prior to use to ensure higher capture rates for relevant intervention articles.
Though dissertations were included in this review, policy documents, conference abstracts, and grey literature published for government departments were not. Sanson-Fisher et al., (2008) argue that well-designed studies are likely to be published in peer-reviewed journals and that it is unlikely that rigorous intervention studies would be under-represented as a result . Based on the lack of dissemination research found in this study, we surmise that either this research is not occurring or that it may be more likely to be published in the grey literature. As such, it is recommended that future assessments of intervention research outputs should attempt to include these types of literature despite the methodological challenges this may present.
Despite recent efforts by policy makers and funders to increase intervention research outputs, there remains a need to increase the quantity and quality of such research, with a greater focus on the conduct and publication of intervention replication and particularly dissemination studies.
The authors would like to thank Ms Rona Macniven and Ms Josephine Chau for their assistance with the bibliometric methods development.
This research was partially supported by the New South Wales Department of Health Capacity Building Grants Scheme.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.