It has been argued that placebos may not have important clinical impacts in general. However, there is increasing evidence of a publication bias among trials published in journals. Therefore, we explored the potential for publication bias in randomized trials with active treatment, placebo, and no-treatment groups.
Three-armed randomized trials of acupuncture, acupoint stimulation, and transcutaneous electrical stimulation were obtained from electronic databases. Effect sizes between treatment and placebo groups were calculated for treatment effect, and effect sizes between placebo and no-treatment groups were calculated for placebo effect. All data were then analyzed for publication bias.
For the treatment effect, small trials with fewer than 100 patients per arm showed more benefits than large trials with at least 100 patients per arm in acupuncture and acupoint stimulation. For the placebo effect, no differences were found between large and small trials. Further analyses showed that the treatment effect in acupuncture and acupoint stimulation may be subject to publication bias because study design and any known factors of heterogeneity were not associated with the small study effects. In the simulation, the magnitude of the placebo effect was smaller than that calculated after considering publication bias.
Randomized three-armed trials, which are necessary for estimating the placebo effect, may be subject to publication bias. If the magnitude of the placebo effect is assessed in an intervention, the potential for publication bias should be investigated using data related to the treatment effect.
Citation: Koog YH, We SR, Min B-I (2011) Three-Armed Trials Including Placebo and No-Treatment Groups May Be Subject to Publication Bias: Systematic Review. PLoS ONE 6(5): e20679. doi:10.1371/journal.pone.0020679
Editor: Ulrich Thiem, Marienhospital Herne - University of Bochum, Germany
Received: December 9, 2010; Accepted: May 9, 2011; Published: May 31, 2011
Copyright: © 2011 Koog et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The authors have no support or funding to report.
Competing interests: The authors have declared that no competing interests exist.
The “powerful placebo”  was widely accepted until recently, when consecutive reviews of the placebo effect were published –. In these reviews, authors defined the placebo effect as the difference in outcome measures between placebo and no-treatment groups . All possible randomized trials with three arms (i.e., an active treatment group, a placebo group, and a no-treatment group) were rigorously collected. The authors found that although the effect varied from large to non-existent, the placebo generally did not have a powerful impact in clinical situations .
Because such conclusions were based on publicly reported clinical trials, trials used for analysis should be unbiased. However, there have been concerns over publication bias –, where small studies with negative results in an active group would be less likely to be published. In a recent study on antidepressant agents , 37 of 38 trials that were deemed positive by the Food and Drug Administration of the United States were published in journals, whereas only 3 of 36 trials with negative results were published. In fact, 11 of 36 trials with negative results were published in journals in a way that conveyed a positive outcome.
If three-armed trials that include placebo and no-treatment groups are subject to publication bias, the conclusion for the placebo effect might be misleading. To address the publication bias in three-armed trials, we investigated two datasets on active treatment versus placebo groups and placebo versus no-treatment groups. Because acupuncture has been a hot-button issue in discussions of the placebo effect , , our study focuses on acupuncture and its relevant interventions (i.e., acupoint stimulation and transcutaneous electrical nerve stimulation (TENS)).
All trials were identified by searching randomized trials using the search terms pertaining to each treatment via MEDLINE (PubMed), EMBASE (or SCOPUS), and the Cochrane Central Register of Controlled Trials from their inception through October 2009. For example in PubMed, we used terms for acupuncture [“acupuncture” OR “electroacupuncture”], terms for acupoint stimulation [“acupressure” OR “acustimulation” OR “acupoint stimulation” OR “acupoint massage” OR “capsicum plaster” OR “transcutaneous electrical stimulation” OR “functional electrical stimulation”], and terms for TENS [“transcutaneous electrical stimulation” OR “transcutaneous electrical nerve stimulation” OR “TENS”], with limits to randomized controlled trials and humans. We used EMBASE for acupuncture and SCOPUS for the other two interventions because the availability of EMBASE expired at Asan Medical Library during our search. We defined acupoint simulation as any treatment that simulates the traditional acupuncture points without penetrating human skin.
The titles and abstracts of all resulting papers were read by two independent reviewers. However, those retrieved for TENS via the Cochrane Central Register of Controlled Trials database were read jointly. We then independently selected trials that included the following: (1) a randomized clinical trial; (2) a group where an intervention was pragmatically labeled as placebo; and (3) comparison of treatment, placebo, and no-treatment groups under identical conditions in one trial. However, we found only four –, four –, and zero trials with binary outcomes for acupuncture, acupoint stimulation, and TENS, respectively. Because regression-based tests are reported to have low statistical power for 10 or fewer trials , we decided to present the results of trials with continuous outcomes.
Prior to data extraction, we prepared a protocol. First, we attempted to select the main outcome that was considered primary or was used for power calculation. When the above conditions were not fulfilled, there were two methods we could choose: (1) selecting the outcome on which the conclusion was based or (2) choosing the outcome reported first in the table or figure. When we examined the data of a previous report  using 31 eligible trials that did not explicitly report the main outcome, the former method resulted in 27 matches, whereas the latter resulted in 23 matches. Therefore, we extracted data using the former method. Second, we attempted to extract end-point data, because 52 eligible trials reported end-point data, whereas only 15 reported data on change from baseline. If such data were not available, the data on change from baseline were used. Third, we attempted to extract data evaluated at the end of the treatment, because most trials reported data assessed at the end of the treatment. We (KYH and WSR) then independently extracted data from eligible trials and referenced the previous reviews ,  in open discussion. When necessary, we contacted the corresponding authors of included trials.
However, we met one problem in a trial  where standard deviations could not be obtained. Because the outcome used in this trial was unique within all eligible trials, we extracted data on the outcome used in a previous report .
In addition, we (KYH and WSR) independently extracted information on disease type and data type, as well as methodological characteristics (i.e., allocation concealment, assessor blinding, attrition rate, and intention-to-treat analysis). Allocation concealment was considered adequate if researchers responsible for patient selection could not predict the next treatment for a patient. Assessor blinding was considered adequate if outcome measures of interest were evaluated by researchers blinded to the treatment allocation or by objective instruments. Attrition rate was considered adequate if the flow of the patients' dropout throughout the trial was explicitly stated, and the attrition rate of all randomized patients who were assessed at baseline was below 15%. Intention-to-treat analysis was considered adequate if all randomized patients who were assessed at baseline were included in the analysis.
In each trial, we calculated effect sizes (standardized mean differences) between the active treatment and placebo groups and between the placebo and no-treatment groups. The effect sizes between active treatment and placebo groups were defined as “treatment effect” and those between placebo and no-treatment as “placebo effect”. We excluded trials from the calculations that reported only median and range because estimation from median and range might produce bias . Indeed, the effect size calculated from median and range was overestimated in our previous study . We also excluded trials that were clear outliers. To do this, we performed a test based on the blocked adaptive computationally efficient outliers nominator algorithm , with a significance level of 0.15.
Identification of small study effects
We used four methods to address small study effects, where the smaller studies in a meta-analysis show larger treatment effects. First, we considered trials with more than 200 patients at baseline in two relevant arms as “large” trials and trials with fewer than 200 patients as “small” trials . For example, when we considered a trial where 300 patients were randomized to an active treatment group (n = 150), a placebo group (n = 75), and a no-treatment group (n = 75), it was classified as a large trial for the treatment effect and as a small trial for the placebo effect. We then calculated the effect sizes of large and small trials separately using a random effects model  and derived the differences between the effect sizes of large and small trials. The p value was based on an interaction test, which is defined as the difference in effect sizes divided by the standard error of the difference . For summary estimates, we combined all differences between large and small trials using a random effects model. Second, we drew a contour-enhanced funnel plot . In this study, a plot was divided into areas of significance (two-sided P≤0.05) and areas of non-significance (two-sided P>0.05). Thirdly, we evaluated funnel plot asymmetry using the asymmetry coefficient, which is defined as the difference in effect size per standard error increase . To this end, we predicted a treatment or placebo effect from a weighted linear regression with the standard error as an independent variable. We then combined all asymmetry coefficients using a random effects model, crude and adjusted for methodological characteristics, clinical condition (pain or non-pain), and data type (subjective or objective outcome). Fourth, we performed an Egger's regression test .
Identification of sources of small study effects
When the small study effects were detected, we performed two additional tests to exclude the other sources of small study effects (i.e., quality of methodological design and true heterogeneity) . In the first test, we categorized trials by methodological characteristics and compared the pooled effect sizes between trials with or without characteristics based on an interaction test. Even if the small study effects were detected in only one treatment, we decided to show all three treatments to maintain the internal consistency of our study. In the second test, we investigated the causes of heterogeneity by univariate meta-regression using the following conditions: clinical conditions (pain or non-pain), disease duration (acute or chronic), cointervention (present or none), outcome type (objective or subjective), trial duration, and treatment session.
First, we estimated two different effect sizes for the treatment and placebo effect in each intervention: (1) pooled effect sizes from all identified trials and (2) effect sizes predicted at standard error = 0 for hypothetical trials of infinite size ,  from random effects meta-regression analysis with the standard error as an independent variable , . Second, we produced hypothetical trials that could be suppressed by publication bias using a non-parametric trim and fill analysis with a fixed effects model on original data for the treatment effect . We estimated effect sizes for the treatment effect from identified trials and hypothetical trials. In hypothetical trials, we then assumed that active treatment was at least as effective as no-treatment. We added these trials to the original data on placebo versus no-treatment and estimated effect sizes for the placebo effect. Finally, we compared the three effect sizes for the treatment and placebo effect.
For heterogeneity, we assessed the values of between-trial variance (τ2). The data are presented as the mean with 95% confidence interval. Microsoft Excel 2003 was used for interaction tests and STATA version 11.0 for all further analyses.
Figure 1 describes the procedure for selecting eligible trials. We included 63 trials with continuous outcomes: 32 trials for acupuncture (Text S1), 14 trials for acupoint stimulation (text S2), and 17 trials for TENS (Text S3) (Table 1). Of these, the overall number of large trials with more than 200 patients in two relevant arms was small: 6 (18.8%) trials for treatment effect and 3 (9.4%) for placebo effect in acupuncture, 2 (11.8%) in TENS, and none in acupoint stimulation. In total, 3060 patients were included at baseline in the active treatment group, 2576 patients were included in the placebo group, and 2533 patients were included in the no-treatment group. In the eligible trials, many different clinical conditions were assessed. Acupuncture and TENS trials frequently studied pain-related disease, and acupoint stimulation trials frequently investigated nausea-related disease. Placebo type also varied within each intervention. Acupuncture needles that were normally inserted or minimally inserted at irrelevant points were commonly used as a placebo in acupuncture trials. Stimulation on irrelevant points was mostly used as the placebo in acupoint stimulation. Simulated TENS with electricity off was mostly used as the placebo in TENS.
Figure 1. Study flow diagram.doi:10.1371/journal.pone.0020679.g001
Table 1. Characteristics of trials with continuous outcomes.doi:10.1371/journal.pone.0020679.t001
Of 63 eligible trials, 2 reported outcomes with median and range: one  for acupuncture and another  for acupoint stimulation. One trial  for TENS presented insufficient data (e.g., no patient number). One trial  for TENS was a clear outlier. Therefore, we excluded these trials from our analysis. Within the remaining trials, the summary treatment effects were 0.41 (0.24 to 0.58), 0.64 (0.28 to 0.99), and 0.30 (0.11 to 0.49) for acupuncture, acupoint stimulation, and TENS, respectively. The summary placebo effects were 0.34 (0.19 to 0.49), 0.21 (0.07 to 0.35), and 0.05 (−0.06 to 0.17), respectively. When heterogeneity was compared between the treatment and placebo effects, the placebo effects were less heterogeneous than the treatment effects in all interventions.
When small and large trials were compared for treatment effect (Figure 2), the difference in effect sizes between large and small trials was statistically significant in acupuncture (P = 0.009) and acupoint stimulation (P = 0.0005). For acupuncture, small trials showed more benefits by 0.39 (0.10 to 0.68) in effect size than large trials, and for acupoint stimulation, more benefits by 0.64 (0.28 to 0.99) in effect size. However, there was no significant difference between small and large trials in TENS. The summary difference of −0.37 (−0.69 to −0.05) over the three interventions was statistically significant. When small and large trials were compared for placebo effect, a significant difference was found only in the acupoint stimulation (P = 0.004). The summary difference of −0.06 (−0.25 to 0.13) was not statistically significant.
Figure 2. Difference in effect sizes between large trials with at least 100 patients per arm and small trials with fewer than 100 patients.
ES = effect size.doi:10.1371/journal.pone.0020679.g002
Figure 3 presents the funnel plots, where predicted treatment or placebo effect lines (i.e., coefficient asymmetries) were included. For the treatment effect in acupuncture and acupoint stimulation, the left portion of the triangle was clearly missing when an imaginary triangle was drawn with the lowest standard error as a peak. In addition, the predicted treatment effect lines were not upright (P = 0.047 in acupuncture and P = 0.006 in acupoint stimulation) (Figure 3 and Table 2). However, the scatter plot of effect sizes in TENS was clearly symmetrical, and the predicted treatment effect line was upright (P = 0.975). The summary asymmetry coefficient was 2.48 (−0.54 to 5.50). Even when the summary asymmetry coefficient was adjusted for methodological characteristics, clinical condition, and data type, it was still similar to the crude value. For the placebo effect in the three interventions, the scatter plots of the effect sizes were clearly symmetrical and the predicted placebo effect lines were upright (P = 0.459, 0.638, and 0.683 for acupuncture, acupoint stimulation, and TENS, respectively) (Figure 3 and Table 2). The summary asymmetry coefficient was −0.11 (−0.99 to 0.78), and it did not differ after adjustment of methodological characteristics, clinical condition, and data type.
Figure 3. Contour-enhanced funnel plot including predicted lines from univariable meta-regression models.doi:10.1371/journal.pone.0020679.g003
Table 2. Asymmetry coefficients.doi:10.1371/journal.pone.0020679.t002
Table 3 shows the results of Egger's regression tests. For the treatment effect, bias was present in acupuncture (P = 0.012) and acupoint stimulation (P = 0.005), although no bias was found in TENS (P = 0.716). For the placebo effect, no significant bias was found in any of the interventions (P = 0.376, 0.607, and 0.665 for acupuncture, acupoint stimulation, and TENS, respectively).
Table 3. Egger's regression tests.doi:10.1371/journal.pone.0020679.t003
Table 4 presents the pooled treatment effects of three interventions categorized by methodological characteristics. P values for the interaction test did not show any significant differences between trials in any of the three interventions. When the causes of heterogeneity were examined, no factor was associated with the effect sizes in acupuncture or acupoint stimulation.
Table 4. Treatment effect of trials with or without methodological characteristics.doi:10.1371/journal.pone.0020679.t004
Figure 4 shows the results of the pooled effect sizes of all eligible trials, the predicted effect sizes for hypothetical trials with infinite size, and the simulated effect sizes for data from non-parametric trim and fill analysis. For the treatment effect (Figure 4A and B), the effect sizes combined over eligible trials were greater than those predicted for hypothetical trials with infinite size or those simulated on data from nonparametric trim and fill analysis. For the placebo effect (Figure 4A and B), the effect sizes combined over eligible trials were smaller than those simulated on data from non-parametric trim and fill analysis, although they were included within the range of the 95% confidence interval for hypothetical placebo effect from meta-regression with standard error = 0.
Figure 4. Results of effect sizes combined over all trials, effect sizes predicted for trials from random effects meta-regression analysis with standard error = 0, and effect sizes simulated on data from nonparametric trim and fill analysis.
SE = standard error.doi:10.1371/journal.pone.0020679.g004
In this study on three-armed trials for placebo effect, we found that small trials showed a greater effect than large trials (i.e., small study effects) when examining the treatment effect for acupuncture and acupoint stimulation, defined by the effect size between active and placebo groups. We did not find any such tendency in the placebo effect for the three interventions, defined by the effect size between placebo and no-treatment groups. In further analysis, the small study effects in acupuncture and acupoint stimulation did not appear to be related to trial methodology or true heterogeneity, thus indicating publication bias.
It is surprising that some three-armed trials may be published according to the significance of an active treatment group. If trials with a significantly greater effect of an active treatment compared with a placebo are more likely to be published, the magnitude of the placebo effect may be seriously biased. In fact, when the missing trials were considered, the summary treatment effects for acupuncture and acupoint stimulation decreased from those combined over all eligible trials (Figure 4). In contrast, the summary placebo effects for acupuncture and acupoint stimulation increased from those pooled over all eligible trials (Figure 4). Consequently, publication bias distorted the results of meta-analyses based on identified trials for both effects.
However, it should be noted that the magnitude of the placebo effect cannot be accurately predicted, because excellent statistical analyses cannot predict missing trials accurately. In fact, a trim and fill analysis using a random effects model detected no missing trials in three interventions. Although missing trials are identified by some analyses, the magnitude of placebo effect cannot be easily conjectured. In the simulation, we assumed that active treatment was at least as effective as no-treatment. However, this assumption cannot always be applied in general situations. Because active treatment may be superior to no-treatment in most situations , the magnitude of the placebo effect will be much greater than that predicted in the simulation. Therefore, we are not sure, at present, whether the placebo effect from meta-regression with standard error = 0 can predict the placebo effect that was recalculated after considering publication bias, although the former predicted the latter in our simulation.
Previous reviews – have shown that placebos may not have important clinical impacts in general. This finding led some researchers to conclude that the concept of a “powerful placebo” remains groundless . However, our finding implies that the small overall placebo effect might be produced by publication bias. Because publication bias is dependent on the significance of treatment effect, trials published in journals are more likely to have a relatively smaller placebo effect. If such trials are combined, the overall placebo effect would be small.
Previous reviews ,  have also shown that when placebos were examined in acupuncture trials with high quality, they were associated with greater effect in some situations and with non-existing effect in other situations. However, our finding implies that the variable magnitude of the placebo effect may be secondary to the natural process of publication. For example, if one intervention is developed as a new therapy, trials with greater effect of intervention begin to be published. At this time, the heterogeneity of the placebo effect would be small. However, trials with a smaller or negative effect will be published in the future. In this case, the magnitude of the placebo effect begins to be variable. To confirm this, we categorized acupuncture trials by publication year. Interestingly, as time passed, the value of τ2 for the placebo effect gradually increased from 0.00 to 0.10 with a shape of ~.
Previous reviews – have found that the placebo effect on pain-related clinical conditions was great. We did not address this point in our study, but we did address other questions regarding whether methodological characteristics or some other factors were associated with publication bias. We found that none were associated with publication bias. However, we only investigated three interventions. Furthermore, we could not extract diverse factors from trials of each intervention (e.g., all TENS trials were focused on pain-related conditions). Therefore, all interventions should be investigated to determine whether certain factors are associated with publication bias.
Although a previous review  and our study investigated the placebo effect using the same criteria, interpretations were very different. The discrepancies can be explained in several ways.
First, we analyzed many other trials, including the most recent ones. We reviewed all randomized trials, even if the abstract was not in the web databases. Surprisingly, this simple search strategy yielded more trials than the updated review that used complex search strategies aimed at detecting all three groups in one trial. When trials published up to March 2008 were considered, we consequently included seven more acupuncture trials –, two more acupoint stimulation trials , , and four more TENS trials , – than the previous review.
Second, we investigated two datasets on the treatment and placebo effects of each individual intervention. Using two datasets, we attempted to study whether three-armed trials for the placebo effect were biased. We found that the placebo effect should be explored after examining the potential for bias on the treatment effect. Meanwhile, the previous review studied only one dataset on the placebo effect. The previous review also investigated the potential for bias. However, our finding suggests that it is difficult to find any bias in such early investigations of data related to the placebo effect.
In our study, we attempted to prove publication bias. To this end, we tried to review all randomized trials and thus included many relevant three-armed trials. However, reviewing all randomized trials is labor-intensive and time-consuming. Unfortunately, we may have missed some relevant trials. In addition, we did not use several potential sources to identify further trials. First, we did not consult the existing relevant review . When our study was compared with the previous review , we found that one trial  was not included in our study. Second, we did not search the public trial registries, such as ClinicalTrials.gov (http://www.clinicaltrials.gov/) or the International Standard Randomized Controlled Trial Number Web site (http://www.controlled-trials.com/isrctn/). It is possible that three-armed trials might be reported as two-armed trials in journals for many reasons (e.g., authors' performance). Unless such trials explicitly report this point, they cannot be easily identified without searching the clinical trials registers. The failure to use means at our disposal to identify additional trials represents a limitation of this study.
In this study, we did not fully address the issue of publication bias. Because there are no definitive methods to evaluate publication bias , we assessed small study effects and then excluded two potential sources of small study effects (i.e., quality of methodological design and true heterogeneity). However, some researchers  may wonder whether the small study effects were associated with real treatment effects. It is possible that patients at high risk of disease in smaller trials could have received substantial benefits from interventions. However, when we examined acupuncture trials reporting pain intensity, patients with more severe pain did not receive increased benefits (P = 0.50). Other researchers  may wonder whether interventions have been implemented less thoroughly in larger trials, thus resulting in more positive results than in smaller trials. When acupuncture trials were considered as an example, this appeared to be unlikely because relatively larger trials – utilized semi-individualized treatments, whereas small trials , – only utilized standardized treatments.
We think that our findings have laid the groundwork for debate on the use of placebos in clinical practice. Clinical evidence in support of the placebo effect has been accumulated in a wide range of conditions –. However, the evidence has been discounted because it was derived from randomized trials that did not include a no-treatment group . In contrast, previous reviews – addressing this defect argued that the placebo effect was limited in general. We revealed that this argument might be misleading. To sum up, the placebo effect appears to be a common phenomenon. Therefore, ethical guidelines for the use of placebos should be discussed . We also think that our findings provide different viewpoints on the placebo effect. Two previous reviews ,  concluded that the greater placebo effect was associated with pain-related clinical conditions, and a recent study  added that physical placebo interventions were also associated. However, according to our findings, the placebo effect for TENS was not great in pain-related conditions. Therefore, our findings indicate that analyzing the placebo effect as categorized by intervention is also important.
Consequently, randomized three-armed trials necessary for estimating the placebo effect were published in journals according to the significance of an active treatment group in some interventions. Publication bias distorted results for the placebo effect in meta-analyses based soley on identified trials. Therefore, if the magnitude of the placebo effect is being assessed in some interventions, the potential for publication bias should be investigated in data related to the treatment effect.
Trials for acupuncture.
Trials for acupoint stimulation.
Trials for transcutaneous electrical nerve stimulation.
We were inspired by paper by Nüesch E et al. . Dr Min Sun Park searched and reviewed all eligible acupuncture trials. We gave special thanks to relevant authors for providing data and to anonymous reviewers for providing comments.
Conceived and designed the experiments: YHK B-IM. Performed the experiments: YHK SRW B-IM. Analyzed the data: YHK SRW B-IM. Contributed reagents/materials/analysis tools: YHK SRW. Wrote the paper: YHK SRW B-IM.
- 1. Beecher HK (1955) The powerful placebo. JAMA 159: 1602–6.
- 2. Hróbjartsson A, Gøtzsche PC (2001) Is the placebo powerless? An analysis of clinical trials comparing placebo with no treatment. N Engl J Med 344: 1594–602.
- 3. Hróbjartsson A, Gøtzsche PC (2004) Is the placebo powerless? Update of a systematic review with 52 new randomized trials comparing placebo with no treatment. J Intern Med 256: 91–100.
- 4. Hróbjartsson A, Gøtzsche PC (2010) Placebo interventions for all clinical conditions. Cochrane Database Syst Rev. (1).CD003974 p.
- 5. Gøtzsche PC (1994) Is there logic in the placebo? Lancet 344: 925–6.
- 6. Easterbrook PJ, Berlin JA, Gopalan R, Matthews DR (1991) Publication bias in clinical research. Lancet 337: 867–72.
- 7. Ioannidis JP (1998) Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA 279: 281–6.
- 8. Topol EJ (2004) Failing the public health—rofecoxib, Merck, and the FDA. N Engl J Med 351: 1707–9.
- 9. Kyzas PA, Loizou KT, Ioannidis JP (2005) Selective reporting biases in cancer prognostic factor studies. J Natl Cancer Inst 97: 1043–55.
- 10. Tang JL (2005) Selection Bias in Meta-Analyses of Gene-Disease Associations. PLoS Med 2: e409.
- 11. Turner EH, Matthews AM, Linardatos E, Tell RA, Rosenthal R (2008) Selective publication of antidepressant trials and its influence on apparent efficacy. N Engl J Med 358: 252–60.
- 12. Kaptchuk TJ, Kelley JM, Conboy LA, Davis RB, Kerr CE, et al. (2008) Components of placebo effect: randomised controlled trial in patients with irritable bowel syndrome. BMJ 336: 999–1003.
- 13. Madsen MV, Gøtzsche PC, Hróbjartsson A (2009) Acupuncture treatment for pain: systematic review of randomised clinical trials with acupuncture, placebo acupuncture, and no acupuncture groups. BMJ 338: a3115.
- 14. Fanti L, Gemma M, Passaretti S, Guslandi M, Testoni PA, et al. (2003) Electroacupuncture analgesia for colonoscopy. A prospective, randomized, placebo-controlled study. Am J Gastroenterol 98: 312–6.
- 15. Rusy LM, Hoffman GM, Weisman SJ (2002) Electroacupuncture prophylaxis of postoperative nausea and vomiting following pediatric tonsillectomy with or without adenoidectomy. Anesthesiology 96: 300–5.
- 16. Aune A, Alraek T, LiHua H, Baerheim A (1998) Acupuncture in the prophylaxis of recurrent lower urinary tract infection in adult women. Scand J Prim Health Care 16: 37–9.
- 17. Dundee JW, Chestnutt WN, Ghaly RG, Lynas AG (1986) Traditional Chinese acupuncture: a potentially useful antiemetic? Br Med J (Clin Res Ed) 293: 583–4.
- 18. Tarçin O, Gürbüz AK, Poçan S, Keskin O, Demirtürk L (2004) Acustimulation of the Neiguan point during gastroscopy: its effects on nausea and retching. Turk J Gastroenterol 15: 258–62.
- 19. Alkaissi A, Stålnert M, Kalman S (1999) Effect and placebo effect of acupressure (P6) on nausea and vomiting after outpatient gynaecological surgery. Acta Anaesthesiol Scand 43: 270–4.
- 20. McMillan CM (1994) Transcutaneous electrical stimulation of neiguan anti-emetic acupuncture point in controlling sickness following opioid analgesia in major orthopaedic surgery. Physiotherapy 80: 5–9.
- 21. Alkaissi A, Evertsson K, Johnsson VA, Ofenbartl L, Kalman S (2002) P6 acupressure may relieve nausea and vomiting after gynecological surgery: an effectiveness study in 410 women. Can J Anaesth 49: 1034–9.
- 22. Sterne JA, Gavaghan D, Egger M (2000) Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J Clin Epidemiol 53: 1119–29.
- 23. Röschke J, Wolf C, Müller MJ, Wagner P, Mann K, et al. (2000) The benefit from whole body acupuncture in major depression. J Affect Disord 57(1-3): 73–81.
- 24. Higgins JPT, Green S, editors. (2006) Chichester, UK: John Wiley & Sons, Ltd.
- 25. Koog YH, Min BI (2010) Effects of botulinum toxin A on calf muscles in children with cerebral palsy: a systematic review. Clin Rehabil 24: 685–700.
- 26. Weber S (2010) Bacon: An effective way to detect outliers in multivariate data using Stata (and Mata). STATA Journal 10: 331–8.
- 27. Nüesch E, Trelle S, Reichenbach S, Rutjes AW, Tschannen B, et al. (2010) Small study effects in meta-analyses of osteoarthritis trials: meta-epidemiological study. BMJ 341: c3515.
- 28. DerSimonian R, Laird N (1986) Meta-analysis in clinical trials. Control Clin Trials 7: 177–88.
- 29. Altman DG, Bland JM (2003) Interaction revisited: the difference between two estimates. BMJ 325: 219.
- 30. Peters JL, Sutton AJ, Jones DR, Abrams KR, Rushton L (2008) Contour-enhanced meta-analysis funnel plots help distinguish publication bias from other causes of asymmetry. J Clin Epidemiol 61: 991–6.
- 31. Shang A, Huwiler-Müntener K, Nartey L, Jüni P, Dörig S, et al. (2005) Are the clinical effects of homoeopathy placebo effects? Comparative study of placebo-controlled trials of homoeopathy and allopathy. Lancet 366: 726–32.
- 32. Egger M, Davey Smith G, Schneider M, Minder C (1997) Bias in meta-analysis detected by a simple, graphical test. BMJ 315: 629–34.
- 33. Moreno SG, Sutton AJ, Turner EH, Abrams KR, Cooper NJ, et al. (2009) Novel methods to deal with publication biases: secondary analysis of antidepressant trials in the FDA trial registry database and related journal publications. BMJ 339: b2981.
- 34. Duval S, Tweedie R (2000) A nonparametric “trim and fill” method of accounting for publication bias in meta-analtsis. J Am Stat Assoc 95: 89–98.
- 35. Rösler A, Otto B, Schreiber-Dietrich D, Steinmetz H, Kessler KR (2003) Single-needle acupuncture alleviates gag reflex during transesophageal echocardiography: a blinded, randomized, controlled pilot trial. J Altern Complement Med 9(6): 847–9.
- 36. Arai YC, Kato N, Matsura M, Ito H, Kandatsu N, et al. (2008) Transcutaneous electrical nerve stimulation at the PC-5 and PC-6 acupoints reduced the severity of hypotension after spinal anaesthesia in patients undergoing Caesarean section. Br J Anaesth 100(1): 78–81.
- 37. Tonella RM, Araújo S, Da Silva ÁMO (2006) Transcutaneous electrical nerve stimulation in the relief of pain related to physical therapy after abdominal surgery. Revista Brasileira de Anestesiologia 56: 630–42.
- 38. Defrin R, Ariel E, Peretz C (2005) Segmental noxious versus innocuous electrical stimulation for chronic pain relief and the effect of fading sensation during treatment. Pain 115: 152–60.
- 39. Ernst E, Lee MS (2008) A trial design that generates only “positive” results. J Postgrad Med 54: 214–6.
- 40. John C, Bailar JC 3rd (2001) The Powerful Placebo and the Wizard of Oz. N Engl J Med 344: 1630–2.
- 41. Gioia L, Cabrini L, Gemma M, Fiori R, Fasce F, et al. (2006) Sedative effect of acupuncture during cataract surgery: prospective randomized double-blind study. J Cataract Refract Surg 32: 1951–4.
- 42. Freire AO, Sugai GC, Chrispin FS, Togeiro SM, Yamamura Y, et al. (2007) Treatment of moderate obstructive sleep apnea syndrome with acupuncture: a randomised, placebo-controlled pilot trial. Sleep Med 8: 43–50.
- 43. Facco E, Liguori A, Petti F, Zanette G, Coluzzi F, et al. (2008) Traditional acupuncture in migraine: a controlled randomized study. Headache 48: 398–407.
- 44. Ziaei S, Hajipour L (2006) Effect of acupuncture on labor. Int J Gynaecol Obstet 92: 71–2.
- 45. Schuler MS, Durdak C, Höl NM, Klink A, Hauer KA, et al. (2005) Acupuncture treatment of geriatric patients with ischemic stroke: a randomized, double-controlled, single-blind study. J Am Geriatr Soc 53: 549–50.
- 46. Johnstone PA, Bloom TL, Niemtzow RC, Crain D, Riffenburgh RH, et al. (2003) A prospective, randomized pilot trial of acupuncture of the kidney-bladder distinct meridian for lower urinary tract symptoms. J Urol 169: 1037–9.
- 47. Gosman-Hedström G, Claesson L, Klingenstierna U, Carlsson J, Olausson B, et al. (1998) Effects of acupuncture treatment on daily life activities and quality of life: a controlled, prospective, and randomized study of acute stroke patients. Stroke 29: 2100–8.
- 48. Arai YC, Kato N, Matsura M, Ito H, Kandatsu N, et al. (2008) Transcutaneous electrical nerve stimulation at the PC-5 and PC-6 acupoints reduced the severity of hypotension after spinal anaesthesia in patients undergoing Caesarean section. Br J Anaesth 100: 78–81.
- 49. Maa SH, Tsou TS, Wang KY, Wang CH, Lin HC, et al. (2007) Self administered acupressure reduces the symptoms that limit daily activities in bronchiectasis patients: pilot study findings. J Clin Nurs 16: 794–804.
- 50. Breit R, Van der Wall H (2004) Transcutaneous electrical nerve stimulation for postoperative pain relief after total knee arthroplasty. J Arthroplasty 19: 45–8.
- 51. Presser M, Birkhan J, Adler R, Hanani A, Eisenberg E (2000) Transcutaneous electrical nerve stimulation (TENS) during epidural steroids injection: A randomized controlled trial. Pain Clinic 12: 77–80.
- 52. Galloway DJ, Boyle P, Burns HJ, Davidson PM, George WD (1984) A clinical assessment of electroanalgesia following abdominal operations. Surg Gynecol Obstet 159: 453–6.
- 53. Naumann VC, Lange A (1989) The use of transcutaneous electrical nerve stimulatin for analgesia in the postoperative phase. Z Physiother 41: 9–13.
- 54. Stern JAC, Harbord RM (2004) Funnel plots in meta-analysis. STATA Journal 4: 127–41.
- 55. Smith GD, Egger M (1994) Who benefits from medical interventions? Treating low risk patients can be a high risk strategy. BMJ 308: 72–4.
- 56. Brinkhaus B, Witt CM, Jena S, Linde K, Streng A, et al. (2006) Acupuncture in patients with chronic low back pain: a randomized controlled trial. Arch Intern Med 166: 450–7.
- 57. Melchart D, Streng A, Hoppe A, Brinkhaus B, Witt C, et al. (2005) Acupuncture in patients with tension-type headache: randomised controlled trial. BMJ 331: 376–82.
- 58. Linde K, Streng A, Jürgens S, Hoppe A, Brinkhaus B, et al. (2005) Acupuncture for patients with migraine: a randomized controlled trial. JAMA 293: 2118–25.
- 59. Häuser W, Bartram-Wunn E, Bartram C, Reinecke H, Tölle T (2011) Systematic review: placebo response in drug trials of fibromyalgia syndrome and painful peripheral diabetic neuropathy-magnitude and patient-related predictors. Pain.
- 60. Zhang W, Robertson J, Jones AC, Dieppe PA, Doherty M (2008) The placebo effect and its determinants in osteoarthritis: meta-analysis of randomised controlled trials. Ann Rheum Dis 67: 1716–23.
- 61. Kemeny ME, Rosenwasser LJ, Panettieri RA, Rose RM, Berg-Smith SM, et al. (2007) Placebo response in asthma: a robust and objective phenomenon. J Allergy Clin Immunol 119: 1375–81.
- 62. Patel SM, Stason WB, Legedza A, Ock SM, Kaptchuk TJ, et al. (2005) The placebo effect in irritable bowel syndrome trials: a meta-analysis. Neurogastroenterol Motil 17: 332–40.
- 63. Cho HJ, Hotopf M, Wessely S (2005) The placebo response in the treatment of chronic fatigue syndrome: a systematic review and meta-analysis. Psychosom Med 67: 301–13.
- 64. Hróbjartsson A (2002) What are the main methodological problems in the estimation of placebo effects? J Clin Epidemiol 55: 430–5.
- 65. Finniss DG, Kaptchuk TJ, Benedetti F (2010) Biological, clinical, and ethical advances of placebo effects. Lancet 375: 686–95.