Effectiveness of smoking cessation therapies: a systematic review and meta-analysis
© Wu et al. 2006
Received: 11 August 2006
Accepted: 11 December 2006
Published: 11 December 2006
Skip to main content
© Wu et al. 2006
Received: 11 August 2006
Accepted: 11 December 2006
Published: 11 December 2006
Smoking remains the leading preventable cause of premature deaths. Several pharmacological interventions now exist to aid smokers in cessation. These include Nicotine Replacement Therapy [NRT], bupropion, and varenicline. We aimed to assess their relative efficacy in smoking cessation by conducting a systematic review and meta-analysis.
We searched 10 electronic medical databases (inception to Sept. 2006) and bibliographies of published reviews. We selected randomized controlled trials [RCTs] evaluating interventions for smoking cessation at 1 year, through chemical confirmation. Our primary endpoint was smoking cessation at 1 year. Secondary endpoints included short-term smoking cessation (~3 months) and adverse events. We conducted random-effects meta-analysis and meta-regression. We compared treatment effects across interventions using head-to-head trials and when these did not exist, we calculated indirect comparisons.
We identified 70 trials of NRT versus control at 1 year, Odds Ratio [OR] 1.71, 95% Confidence Interval [CI], 1.55–1.88, P =< 0.0001). This was consistent when examining all placebo-controlled trials (49 RCTs, OR 1.78, 95% CI, 1.60–1.99), NRT gum (OR 1.60, 95% CI, 1.37–1.86) or patch (OR 1.63, 95% CI, 1.41–1.89). NRT also reduced smoking at 3 months (OR 1.98, 95% CI, 1.77–2.21). Bupropion trials were superior to controls at 1 year (12 RCTs, OR1.56, 95% CI, 1.10–2.21, P = 0.01) and at 3 months (OR 2.13, 95% CI, 1.72–2.64). Two RCTs evaluated the superiority of bupropion versus NRT at 1 year (OR 1.14, 95% CI, 0.20–6.42).
Varenicline was superior to placebo at 1 year (4 RCTs, OR 2.96, 95% CI, 2.12–4.12, P =< 0.0001) and also at approximately 3 months (OR 3.75, 95% CI, 2.65–5.30). Three RCTs evaluated the effectiveness of varenicline versus bupropion at 1 year (OR 1.58, 95% CI, 1.22–2.05) and at approximately 3 months (OR 1.61, 95% CI, 1.16–2.21). Using indirect comparisons, varenicline was superior to NRT when compared to placebo controls (OR 1.66, 95% CI 1.17–2.36, P = 0.004) or to all controls at 1 year (OR 1.73, 95% CI 1.22–2.45, P = 0.001). This was also the case for 3-month data. Adverse events were not systematically different across studies.
NRT, bupropion and varenicline all provide therapeutic effects in assisting with smoking cessation. Direct and indirect comparisons identify a hierarchy of effectiveness.
Smoking is the leading cause of preventable death in the world. Given the multitude of health benefits of smoking cessation, considerable effort has been focused on identifying mechanisms to assist smokers in quitting. However, smoking cessation is challenging and behavioural interventions have had only modest success .
Drug therapy has been increasingly relied upon to assist in smoking cessation. The most common of these has been nicotine replacement therapy [NRT] . More recently, attention has focused on the use of anti-depressant therapy and specifically the agent bupropion. Five new trials have demonstrated the effectiveness of a new agent with a novel mechanism of action, varenicline, in improving cessation rates [5–9].
How effective are pharmacologic smoking cessation therapies? No systematic review and meta-analysis has examined this issue since the availability of information of the newest agent. We conducted a meta-analysis of Randomized Controlled Trials [RCTs] to identify the effectiveness of the various pharmacological interventions in improving cessation rates.
Our primary outcome of interest was smoking cessation at 1 year. Our secondary outcomes were short-term smoking cessation defined as 3 months after initiating treatment or closest available data to that time point, within one month. Our additional secondary outcome evaluated adverse events. We included any RCT of NRT of any delivery method, bupropion or varenicline. We included only RCTs of at least 1 years duration with chemical confirmation of smoking cessation, as there is reason to doubt patient self-report regarding addictions. Methods of assessing smoking cessation also vary from study to study. The most common method is self-report, however, this can have false cessation rates as high as 30%. False reporting is most likely to occur in a trial setting or in assessing smoking status after a medical event. In both situations the smoker is under considerable pressure to quit. Laboratory tests are, therefore, often used to verify smoking status, especially in clinical trials. Methods of biological verification include serum and saliva thiocyanate (SCN), expired carbon monoxide (CO), plasma, saliva and urinary cotinine and plasma and urinary nicotine. Each of these have various strengths and weaknesses. Studies had to report smoking cessation as either sustained abstinence at the time periods or point-prevalence of abstinence. When both outcomes were available, we considered sustained abstinence to be a superior clinical marker of abstinence. We excluded non-RCTs, post-hoc analyses, maintenance therapy, and studies that reported outcomes as self-report.
In consultation with a medical librarian (PR), we established a search strategy. We searched independently, in duplicate, the following 10 databases (from inception to September 10, 2006): MEDLINE, EMBASE, Cochrane CENTRAL, AMED, CINAHL, TOXNET, Development and Reproductive Toxicology, Hazardous Substances Databank, Psych-info and Web of Science, databases that included the full text of journals (OVID, ScienceDirect, and Ingenta, including articles in full text from approximately 1700 journals since 1993). In addition, we searched the bibliographies of published systematic reviews[3, 12–18, 4, 19] and health technology assessments. Searches were not limited by language, sex or age.
Two investigators (EM, PW) working independently, in duplicate, scanned all abstracts and obtained the full text reports of records, that indicated or suggested that the study was a RCT evaluating a smoking cessation therapy on the outcomes of interest. After obtaining full reports of the candidate trials (either in full peer-reviewed publication or press article) the same reviewers independently assessed eligibility from full text papers.
Two reviewers (PW, PD) conducted data extraction independently using a standardized pre-piloted form. Reviewers collected information about the smoking cessation intervention tested, the population studied (age, sex, underlying conditions), treatment dosages and dosing schedules, the treatment effect at 1 year and at 3 months, the specific measurement of abstinence (sustained or point-prevalence), and the chemical confirmation methods. Study evaluation included general methodological reporting quality features including allocation concealment, sequence generation, blinding status, intention-to-treat, and appropriate descriptions of loss to follow-up. Quality of reporting could be considered as analogous to methodological quality if one assumes that failure to report on a component of study design (for example blinding) actually indicated that the component was not employed. We entered the data into an electronic database such that duplicate entries existed for each study; when the two entries did not match, we resolved differences through discussion and consensus.
In order to assess inter-rater reliability on inclusion of articles, we calculated the Phi statistic (ϕ), which provides a measure of inter-observer agreement independent of chance. We calculated the Odds Ratios [OR] and appropriate 95% Confidence Intervals [CIs] of outcomes according to the number of events of abstinence reported in the original studies or sub-studies. In circumstances of zero outcome events in one arm of a trial, we added 1 to each arm, as suggested by Sheehe. We first pooled studies of all NRT interventions versus all controls using the DerSimonian-Laird random effects method, which recognizes and anchors studies as a sample of all potential studies, and incorporates an additional between-study component to the estimate of variability. We calculated the I2 statistic for each analysis as a measure of the proportion of the overall variation that is attributable to between-study heterogeneity. Forest plots are displayed for each primary analysis, showing individual study effect measures with 95% CIs, and the overall DerSimmonian-Laird pooled estimate. We then conducted a meta-regression analysis on the NRT studies with predictors of heterogeneity including the following covariates: placebo control; reporting of sequence generation; reporting of allocation concealment; use of gum or patch; and, method of chemical confirmation of abstinence. When the meta-regression indicated heterogeneity, we conducted alternative sensitivity tests using z-tests to determine differences between the studies reporting the covariates to the pooled all-studies effect size. We additionally conducted separate pooled analyses of NRT versus placebo, gum versus control and patch versus control. We conducted all analyses at 1 year and also at 3 months. For bupropion trials, we pooled all bupropion trials (standard and sustained-release) versus all controls and conducted a meta-regression analysis using the following covariates: placebo control; reporting of sequence generation; reporting of allocation concealment; method of chemical confirmation of abstinence; and plans to quit. We conducted separate meta-regression analyses and calculated the relevant ORs for the covariates as the exponent of the point estimates. The point estimates and 95% CI around denote the expected change in the pooled effect size when the covariates are considered. The OR of the point estimates and 95% CI confer the likelihood of the covariate affecting individual trial outcomes. We additionally pooled all placebo-controlled trials and evaluated effect sizes at 1 year and at 3 months. For head-to-head trials of bupropion versus NRT, we conducted pooled random-effects analyses at 1 year and at 3 months. For varenicline trials, we conducted pooled random-effects analyses of varenicline versus placebo at 1 year and at 3 months and for head-to-head trials of varenicline versus bupropion at 1 year and at 3 months. Head-to-head trials provide the strongest inferences regarding intervention superiority. However, in the absence of head-to-head trials of varenicline versus NRT, we conducted indirect comparisons of these interventions versus placebo using methods described by Bucher et al and conducted z-tests to confirm. This method maintains the randomization from each trial and compares the summary estimates of pooled interventions with CIs. We calculated adverse events, where reported, using Peto's Odds Ratio [OR] with 95% CIs. Analyses were conducted using StatsDirect (version 2.5.2) and Comprehensive Meta-analysis (version 2).
Univariable meta-regression of NRT studies
Lower 95% limit
Upper 95% limit
Lower 95% limit
Upper 95% limit
Planned to quit
Thirty-three studies evaluated NRT gum [30–49, 51–53, 55, 61, 63, 64, 71, 83, 87, 95, 96, 98, 99], 23 evaluated NRT patch [50, 56, 57, 60, 62, 65, 67–70, 72, 76, 78–80, 82, 84, 86, 88, 90, 92, 94, 97]. The remaining studies evaluated the efficacy of nicotine inhalers, nasal spray or lozenges [54, 58, 59, 66, 73–75, 77, 81, 85, 89, 91, 93].
All of the studies provided sufficient details to evaluate NRT versus control at 1 year [30–99]. Fifty-nine provided sufficient details to evaluate NRT versus control at or about 3 months[30–33, 36, 37, 39–41, 44, 47, 49–54, 56–59, 43, 59–61, 63–86, 88–96, 98, 99].
We also found 11 studies evaluating bupropion versus placebo [5, 6, 8, 82, 100–106] and one RCT evaluating bupropion with no intervention (See Figure 1 and Additional File 2). Further, 2 of these evaluated bupropion versus NRT [82, 97].
Finally, we identified 4 studies evaluating varenicline versus placebo (See Figure 1 and Additional File 3) [5, 6, 8, 9]. Of these, 3 also evaluated varenicline versus bupropion[5, 6, 8]. Agreement on original inclusion of all trials was excellent (ϕ = 0.88).
Of the 70 trials assessing NRT versus controls, studies were varied in reporting important methodological features, including: sequence generation (22/70), allocation concealment (11/70), blinding status (64/70), appropriate blinding (45/70), intention-to-treat (67/70), and appropriate descriptions of loss to follow up (44/70).
Studies assessing bupropion versus controls were similarly varied in reporting, including: sequence generation (4/12), allocation concealment (4/12), blinding status (11/14), intention to treat (12/12) and appropriate descriptions of loss to follow up (10/12).
The 4 studies assessing varenicline appropriately reported the methodological criteria, except 1 study that did not report the method of sequence generation.
We anticipated variable between-study heterogeneity, considering the interventions used, methodological issues and measurement tools. Table 1 displays the covariates predicting heterogeneity in the primary outcomes of the NRT analysis using meta-regression. In this analysis, significant predictors of heterogeneity included: allocation concealment, use of NRT gum; and, methods of chemical confirmation (CO, cotinine, and urine markers). Using sensitivity analysis, only studies (n = 3[49, 53, 71]) using urine as a marker were significantly different from the pooled estimate (P = 0.03), however, all but 1 of these studies also used CO as a chemical marker (P = 0.5).
Univariable meta-regression of bupropion studies.
Lower 95% limit
Upper 95% limit
Lower 95% limit
Upper 95% limit
Planned to quit
Additional File 1 displays the common adverse events associated with NRT. Inadequate detail was provided for pooling. For NRT trials, we found that the following adverse events were reported significantly more often in active groups than control groups: mouth or throat irritation (n = 12); skin irritation (n = 11); nausea/vomiting (n = 10); coughing (n = 9); hiccoughs (n = 6); dyspepsia (n = 4); watering of eyes (n = 3); headaches (n = 3); heart palpitations (n = 3); sneezing (n = 3); sleep disturbances and dream abnormalities (n = 2); insomnia (n = 2); rhinitis (n = 2); vertigo (n = 1); taste disturbances (n = 1) and muscle aches (n = 1).
For bupropion trials, the following adverse events were reported significantly more in the active groups than control groups: dry mouth (9 trials[5, 6, 8, 82, 102–106], n = 4,885, OR 1.90, 95% CI, 1.50–2.42, P =< 0.0001); insomnia (10 trials[5, 6, 8, 82, 101–106], n = 4,775, OR 2.02, 95% CI, 1.53–2.68, P =< 0.0001); gastrointestinal upset (7 trials[5, 6, 8, 82, 104–106], n = 4,026, OR 1.34, 95% CI, 1.06–1.70, P = 0.01) and constipation (6 trials[5, 6, 8, 104–106], n = 3,622, OR 2.48, 95% CI, 1.62–3.80, P =< 0.0001). Other severe events associated with trial participants in the active arms were: septic shock; grand mal seizure; sleep disorders; and anxiety. These were single cases and did not achieve significance.
For varenicline trials, the following adverse events were reported significantly more often than in the placebo groups: nausea (4 trials [5, 6, 8, 9], n = 2,506, OR 3.17, 2.35–4.29, P =< 0.0001); flatulence (2 trials[5, 9], n = 1,323, OR 2.04, 95% CI, 1.16–3.57, P = 0.01); and, constipation (4 trials[5, 6, 8, 9], n = 2,506, OR 2.57, 95% CI, 1.21–5.45, P =< 0.0001). Other, severe events that occurred in the active groups included: atrial fibrillation, pneumonia, possible stroke, chest pain, and elevated blood pressure. These were, however, single cases and did not achieve significance.
In this study we present the results of three main meta-analyses examining the effectiveness of various pharmacological strategies to improve smoking cessation. Our primary meta-analyses specifically examined the effectiveness of NRT versus control, bupropion versus control, and varenicline versus bupropion or placebo. We demonstrated the consistent effectiveness of each intervention in short-term and long-term smoking cessation.
Our findings confirm the effectiveness of two established pharmacological therapies, NRT and bupropion, to improve cessation rates. The primary new information from this review is the evidence of effectiveness of varenicline. This modality of treatment was identified to be more effective than placebo, bupropion and in indirect comparisons with NRT. However, this meta-analysis consisted of the fewest studies and although the studies were well reported and apparently well conducted, further studies comparing varenicline versus either NRT or bupropion will strengthen inferences about the superiority of this intervention. Nevertheless the initial findings are suggestive of the potential superiority of this latest pharmacological intervention.
This review has several strengths and some limitations that deserve mention. The strengths of this review include the comprehensive search strategy that improved the likelihood of identifying all relevant studies. Duplicate extraction of data reduced the potential for bias in this component of the synthesis process. By limiting this review to randomized trials we ensured that the included studies would have reduced likelihood of systematic error and therefore have high internal validity. Our use of meta-regression to identify sources of heterogeneity in the meta-analyses is a strength and demonstrated that several of the a priori chosen covariates were predictors of heterogeneity. While our primary endpoint was one-year cessation, as defined by the study, we evaluated the robustness of these findings through multiple sensitivity analyses which included one-year sustained cessation, a strong predictor of sustained smoking cessation[108, 109]. Unlike some meta-analyses and systematic reviews of smoking cessation therapies [3, 4, 12–19], we included only studies that chemically confirmed the cessation of smoking at the specific time-points. This is a strength of our analysis as we demonstrated that even the different chemical markers for cessation can contribute greatly to between-study differences. Our meta-regression analysis indicates that the type of chemical markers employed yields greater variability of effects between studies over more commonly investigated covariates, such as methodological issues.
Limitations of this meta-analysis include the potential for publication bias, specifically the possibility that small negative studies would not be published. We included only published trials so it is possible that other trials have been conducted and never published. However, it is unlikely that the presence of these studies would have altered the findings of the NRT meta-analysis given the large number of studies included. Similarly, since varenicline is a new agent and the studies synthesized were the first clinical trials evaluating its effectiveness, it is improbable other negative studies exist. We also limited our search to English language databases (although we would include non-English articles if identified) so the possibility of quality studies in other languages does exist. We used both direct and indirect comparisons to evaluate the relative effectiveness of agents. Head-to-head trials provide the strongest inferences regarding intervention superiority. However, in the absence of head-to-head trials, specifically NRT versus varenicline, we conducted indirect comparisons. We used the indirect comparison method proposed by Bucher et al., which respects the principle of randomization between trials . The utility of indirect comparisons is debatable and some evidence suggests that indirect comparisons may provide misleading measures of superiority, although the direction of bias cannot be assumed until direct comparison trials have been completed. Figure 8 displays an interesting methodologic outcome related to choosing random-effects models over a fixed-effects model for meta-analyses. The pooled OR of the two head-to-head trials is a non-significant difference of 1.14 (95% CI, 0.20–6.42, P = 0.88) when using the random effects model, as planned in our protocol [82, 97]. However, had we used a fixed effects model, this would have resulted in a significant pooled OR of 1.84 (95% CI, 1.11–3.06, P = 0.02). We believe that this is due to the very small number of trials included (n = 2) [82, 97], where one trial is much larger than the other (n of 488 compared to an n of 60). Finally, our meta-analyses did not examine in detail other aspects of therapy, such as compliance and tolerance, which may influence real life clinical effectiveness.
There are several important implications of these meta-analyses. Identifying effective mechanisms to improve smoking cessation is essential. Smoking is the number one preventable cause of death in the world. Approximately one out of every two long-term smokers will die of a smoking related death. Furthermore, smoking cessation has been clearly demonstrated to reduce the likelihood of future morbidity. Smoking cessation in individuals with chronic obstructive lung disease reduces the deterioration of FEV1. Smoking cessation also reduces the likelihood of developing several smoking related cancers. Smoking cessation in patients with coronary artery disease has been consistently shown to be associated with dramatically improved survival. Observational studies demonstrate that smokers who quit smoking after a myocardial infarction have mortality rates approximately 40% less than those that continue to smoke, suggesting that smoking cessation could be one of the most effective therapies to improve survival in this group of patients. Former smokers have also been demonstrated to have considerably lower rates of cerebrovascular disease than ongoing smokers.
The findings of this review point the direction for future studies. It is interesting that the three modalities examined have distinct biological mechanisms of action. NRT presumably works by reducing symptoms of nicotine withdrawal, thereby increasing the likelihood of smoking cessation. Bupropion is a weak dopamine and nor-epinephrine reuptake inhibitor. One of the primary symptoms of smoking cessation has been depressive symptoms and it has been hypothesized that smokers may be increasing central dopamine levels by reducing monoamine oxidase inhibitor activity. The mechanism of action of bupropion, therefore, may be to maintain central levels of dopamine through the process of cessation, although its effectiveness has been identified to be independent of symptoms of depression. Varenicline is a nicotinic acetylcholine receptor partial agonist. The authors of one the studies demonstrating its efficacy comment that "Partial agonists at this (receptor) could stimulate the release of sufficient dopamine to reduce craving and withdrawal while simultaneously acting as a partial antagonist by blocking the binding and consequent reinforcing effects of smoked nicotine." Regardless of the exact mechanism of action of the three modalities, it is clear they are distinct and suggest the possibility of combination therapy or therapy targeted on the particular type of symptoms experienced during cessation. Future studies could examine these options, given that despite the effectiveness of these therapies rates of smoking remain high at one year in the treatment groups . Furthermore, given the benefits described of smoking cessation as secondary prevention, the use of these cessation modalities in patients with active smoking related disease warrants further study . Future studies should further examine the safety and effectiveness in reducing morbidity and mortality of all three of these modalities in patients with active smoking related disease.
The contribution of Gordon Guyatt is gratefully acknowledged. KW and EM are supported by the Canadian Institutes of Health Research (CIHR). Thanks to Pearl Raju for information systems assistance.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.