The culture of designing hepato-biliary randomised trials

doi:10.1016/j.jhep.2005.12.006

Journal of Hepatology

Volume 44, Issue 3, March 2006, Pages 607-615

https://doi.org/10.1016/j.jhep.2005.12.006 Get rights and content

Introduction

The Scottish naval surgeon James Lind started his controlled trial of 12 scurvy-ridden sailors on 20th May 1747 [1]. Lind divided them: two got oranges and lemons, two cider, two vinegar, two elixir vitriol, two a concoction of spices, garlic, and mustard seeds, and two sea water. Within 6 days, the two sailors given oranges and lemons became well. The others did not. Lind was intelligent. His trial marks a major breakthrough. The 20th May is now the International Clinical Trials' Day [2]. Lind was lucky. We seldom see such dramatic intervention effects. We are usually looking for smaller, but still important effects. Such effects, however, may be blurred by random and systematic errors. Scientists have, therefore, developed larger trials using central randomisation, blinding, and intention-to-treat analyses, aiming at reducing random errors and systematic errors to a minimum [1], [3], [4], [5], [6].

Although randomised trials provide the fairest way to test the effects of interventions [1], [3], [4], [5], [6], over 200 years went before the first hepato-biliary randomised trial was published [7]. Thomas C Chalmers and co-workers conducted their two factorial-designed trials on diet, rest, and physical reconditioning in 460 patients with acute infectious hepatitis in 1955 [7]. Other trials in liver diseases followed [8] and hepato-biliary randomised trials appeared regularly from the 1970s (Fig. 1) [9]. Currently over 500 publications on hepato-biliary randomised trials are published each year (Fig. 1) [9]. Here, I describe some of the issues one has to consider when assessing or designing a randomised clinical trial. Further, I contrast the cultures of hepato-biliary randomised trials to randomised trials from any other medical field.

Section snippets

Why is it important to randomise?

The hierarchy of evidence is well-established [10], [11], [12], [13]. It is based on the risks of bias in the different study designs. Randomised trials are internationally considered the gold standard for intervention comparisons [1], [3], [4], [5], [6], [10], [11], [12], [13]. The results from randomised trials form the basis for determining which diagnostics, drugs, drills, or devices are effective. Randomisation forms the basis for making fair comparisons [6].

Historically controlled

What kind of participants to include and which data to collect?

The participants going to be included in a trial should be clearly defined. You should be able to list few entry criteria and few exclusion criteria. The reason for stressing few is that we often see trials with so many in- and exclusion criteria that it becomes difficult to identify such patients in clinical practice. Such trials may have adequate internal validity, but are less valuable due to lack of external validity.

When designing a new trial you want to include patients having a known

Which experimental intervention?

Apart from questions about which diagnostic method, drug dosage, endoscopic technique, or surgical technique to test, it is essential to decide if you want to conduct an explanatory trial or a pragmatic trial.

Explanatory trials test whether an intervention is efficacious. That is, whether the intervention has a beneficial effect in an ideal situation. The explanatory trial seeks to maximise the internal validity by assuring rigorous control of all variables. Explanatory trials often have a

Which comparator: placebo or active?

If there is no evidence-based intervention offered in clinical practice for the potential trial participants, then placebo or ‘sham’ procedure is the right comparator choice. Claims that the Food and Drug Administration and the European Medicines Agency require placebo-controlled trials are wrong.

If a systematic review of low-bias trials or other convincing evidence show that the potential participants should be offered an intervention, the intervention must be offered. There are three

Parallel-group or cross-over randomised trial?

Whether you read a report on a trial or you are going to design a trial, one of the questions you have to answer is: should this trial be a parallel-group or a cross-over trial? Both parallel-group and cross-over trials offer the opportunity to randomise to experimental intervention and comparator. It is, however, a delicate decision when to use one design in stead of the other [42], [43], [44], [45].

In parallel-group trials one randomises consecutive participants fulfilling entry criteria and

Multiple promising interventions: the factorial design

Randomised trials may create plenty of problems if you have one experimental intervention and a comparator. What should you do if you have two experimental interventions that both look promising? You can of course conduct a three-armed randomised trial (experimental A versus experimental B versus control C). If the interventions do not interact, you are far better off conducting a 2 × 2 factorial trial. You obtain the same information with fewer patients plus at the same time you assess any

Cluster randomised trials

Asking a clinician to offer an intervention to half of the patients, you run the risk of contamination in the other half. In such situations you may want to apply your intervention at a higher level than the individual participant, e.g. the individual clinician, group of clinicians, hospital wards, cities, regions, or countries. You hereby randomise trial participants in clusters [48]. Because the responses of participants within clusters can be expected to be more similar than responses of

What is the goal of the trial?

To find the goal of a trial you have to answer the three questions: do you want to show your experimental intervention is superior, equivalent, or non-inferior to your comparator?

The superiority trials are the usual trials (Fig. 2). You want to establish if your experimental intervention is superior to your control. If you do not have a convincing evidence-based intervention that works, the choice of a superiority trial is straightforward. Thirty years ago there were variable approaches to

Sample size estimation in randomised trials

Your sample size estimation depends on the goal of the trial (superiority, equivalence, or non-inferiority) and the type of the primary outcome measure (dichotomous or continuos).

In a superiority trial with a dichotomous primary outcome, the sample size is determined from four pieces of information based on the primary outcome measure [4]:

•
The expected proportion of patients with the primary outcome during the trial in the control arm. Very often this variable is grossly overestimated. The

Sample size of randomised trials

Most hepato-biliary randomised trials are too small [9], [34], [37], [40], [51], [52], [54], [55], [56] (Table 2, Table 3). The number of patients included in hepato-biliary randomised trials only varied a little depending on the journal in which they were published [34], [37], [51], [52] (Table 3). The median number of participants per intervention arm was 23 (10th–90th percentiles from 7 to 102) in hepato-biliary trials published in 12 journals during 1985–1996 [54] (Table 2). In

Methodological quality: the risk of bias

Conducting randomised trials with high methodological quality (i.e. avoiding selection, performance, assessment, attrition, and other biases) decreases the risks of bias [28], [29], [30], [31], [32], [33]. We have examined the methodological quality of hepato-biliary randomised trials (Table 2, Table 3, Table 4). Most trials have one or more methodological deficiencies [9], [34], [37], [51], [52], [54], [55], [56].

The low methodological quality raises the question if biased estimates of

Conflicting interests

The impact of conflicts of interests may have profound effects on the results of trials as well as how results are interpreted [63], [64], [65], [66]. It is clear to many that the influence of the drug and device industry has become too large [67].

Discussion

During the last 50 years we have witnessed a very positive increase in the number of randomised trials being conducted (Fig. 1). Compared to randomised trials in general, hepato-biliary trials are less often cross-over trials and more often conducted with adequate generation of allocation sequence and adequate allocation concealment. These are very positive observations. On the other hand, the size, the bias risks, the analysis of and the interpretation of hepato-biliary trials still leave a

First page preview

Click to open first page preview

View PDF

References (80)

T.C. Chalmers
Randomization of the first patient
Med Clin North Am
(1975)
E. Christensen
Prognostic models including the Child-Pugh, MELD and Mayo risk scores—where are we and where should we go?
J Hepatol
(2004)
V. Beral et al.
Evidence from randomised trials on the long-term effects of hormone replacement therapy
Lancet
(2002)
G. Bjelakovic et al.
Antioxidants for preventing gastrointestinal cancers: a systematic Cochrane review and meta-analysis
Lancet
(2004)
C. Young et al.
Putting clinical trials into context
Lancet
(2005)
D. Moher et al.
Does quality of reports of randomised trials affect estimates of intervention efficacy reported in meta-analyses?
Lancet
(1998)
L.L. Kjaergard et al.
Validity of randomized clinical trials in gastroenterology from 1964–2000
Gastroenterology
(2002)
A.-W. Chan et al.
Epidemiology and reporting of randomised trials published in PubMed journals
Lancet
(2005)
C. Gluud et al.
Quality assessment of reports on clinical trials in the journal of hepatology
J Hepatol
(1998)
L.L. Kjaergard et al.
Funding, disease area, and internal validity of hepato-biliary randomised trials
Am J Gastroenterol
(2002)

S.F. Assmann et al.

Subgroup analysis and other (mis)uses of baseline data in clinical trials

Lancet

(2000)

C. Gluud

‘Negative trials’ are positive!

J Hepatol

(1998)

G.A. Diamond et al.

Prior convictions: Bayesian approaches to the analysis and interpretation of clinical megatrials

J Am Coll Cardiol

(2004)

C. Gluud

Trials and errors in clinical research

Lancet

(1999)

James Lind Library. Available from...

European Clinical Research Infrastructures Nework (ECRIN), May 20th 2005, the first International Clinical Trials’ Day....

S. Yusuf et al.

Why do we need some large, simple randomized trials?

Stat Med

(1984)

S.J. Pocock

Clinical trials–a practical approach

(1996)

C. Gluud et al.

New developments in the conduct and management of multi-center trials: an international review of clinical trial units

Fundam Clin Pharmacol

(1995)

I. Chalmers

Comparing like with like: some historical milestones in the evolution of methods to create unbiased comparison groups in therapeutic experiments

Int J Epidemiol

(2001)

T.C. Chalmers et al.

The treatment of acute infectious hepatitis. Controlled studies of the effects of diet, rest, and physical reconditioning on the acute course of the disease and on the incidence on relapses and residual abnormalities

J Clin Invest

(1955)

T.C. Chalmers

Randomised controlled clinical trials in diseases of the liver

Prog Liver Dis

(1976)

Gluud C, Als-Nielsen B, D'Amico G, Gluud LL, Khan S, Klingenberg SL, et al. Cochrane Hepato-Biliary Group. About The...

D.L. Sackett et al.

Evidence-based Medicine, how to practise and teach EBM

(2000)

G. Guyatt et al.

Users' guides to the medical literature: a manual of evidence-based clinical practice

(2002)

D. Moher et al.

The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials

J Am Med Assoc

(2001)

C. Gluud et al.

Evidence based diagnostics

BMJ

(2005)

R.E. Sanborn et al.

Gastrointestinal stromal tumors and the evolution of targeted therapy

Clin Adv Hematol Oncol

(2005)

A. Tatsioni et al.

Challenges in systematic reviews of diagnostic technologies

Ann Intern Med

(2005)

G. D'Amico et al.

Survival and prognostic indicators in compensated and decompensated cirrhosis

Dig Dis Sci

(1986)

J.P. Ioannidis et al.

Better reporting of harms in randomised trials: an extension of the CONSORT statement

Ann Intern Med

(2004)

R. Chou et al.

Challenges in systematic reviews that assess treatment harms

Ann Intern Med

(2005)

Vist GE, Hagen KB, Devereaux PJ, Bryant D, Kristoffersen DT, Oxman AD. Outcomes of patients who participate in...

Jespersen CM, Als-Nielsen B, Damgaard M, Fischer Hansen J, Hansen S, Helø OH, et al. A randomised, placebo controlled,...

Gong Y, Gluud C. Methotrexate for primary biliary cirrhosis. The Cochrane Database of Systematic Reviews, Issue 3,...

S.S. Ellenberg et al.

Data monitoring committees in clinical trials. A practical perspective

(2003)

K.F. Schulz et al.

Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials

J Am Med Assoc

(1995)

L.L. Kjaergard et al.

Reported methodological quality and discrepancies between large and small randomised trials in meta-analyses

Ann Intern Med

(2001)

E.M. Balk et al.

Correlation of quality measures with estimates of treatment effect in meta-analyses of randomised controlled trials

J Am Med Assoc

(2002)

Als-Nielsen B, Chen W, Gluud LL, Siersma V, Hilden J, Gluud C. Are trial size and quality associated with treatment...

Cited by (47)

A two-center radiomic analysis for differentiating major depressive disorder using multi-modality MRI data under different parcellation methods
2022, Journal of Affective Disorders
Citation Excerpt :
Another reason for the inconsistencies in conclusions among previous studies of MDD may be the absence of involvement of multi-center imaging data. Multi-center training-validation patterns can greatly improve the reliability and generality of classifier (Gluud, 2006; Xia et al., 2019). However, to the best of our acknowledge, there are few studies of MDD concurrently using multi-modality and multi-center brain imaging to quantitatively explore neural differences between patients with MDD and healthy controls.
The present study aimed to explore the difference in the brain function and structure between patients with major depressive disorder (MDD) and healthy controls (HCs) using two-center and multi-modal MRI data, which would be helpful to investigate the pathogenesis of MDD.
The subjects were collected from two hospitals. One including 140 patients with MDD and 138 HCs was used as primary cohort. Another one including 29 patients with MDD and 52 HCs was used as validation cohort. Functional and structural magnetic resonance images (MRI) were acquired to extract four types of features: functional connectivity (FC), amplitude of low-frequency fluctuations (ALFF), regional homogeneity (ReHo), and gray matter volume (GMV). Then classifiers using different combinations among the four types of selected features were respectively built to discriminate patients from HCs. Different templates were applied and the results under different templates were compared.
The classifier built with the combination of FC, ALFF, and GMV under the AAL template discriminated patients from HCs with the best performance (AUC=0.916, ACC=84.8%). The regions selected in all the different templates were mainly located in the default mode network, affective network, prefrontal cortex.
First, the sample size of the validation cohort was limited. Second, diffusion tensor imaging data were not collected.
The performance of classifier was improved by using multi-modal MRI imaging. Different templates would be suitable for different types of analysis. The regions selected in all the different templates are possibly the core regions to investigate the pathophysiology of MDD.
The effect of adding psychodynamic therapy to antidepressants in patients with major depressive disorder. A systematic review of randomized clinical trials with meta-analyses and trial sequential analyses
2012, Journal of Affective Disorders
Major depressive disorder afflicts an estimated 17% of individuals during their lifetimes at tremendous suffering and costs. Psychodynamic therapy may be a treatment option for depression, but the effects have only been limitedly assessed in systematic reviews.
Using Cochrane systematic review methodology, we compared the benefits and harms of psychodynamic therapy versus ‘no intervention’ or sham for major depressive disorder. We accepted any co-intervention, including antidepressants, as long as it was delivered similarly in both intervention groups. Trials were identified by searching the Cochrane Library's CENTRAL, MEDLINE via PubMed, EMBASE, Psychlit, Psyc Info, and Science Citation Index Expanded until February 2010. Two authors independently extracted data. We evaluated risk of bias to control for systematic errors. We conducted trial sequential analysis to control for random errors.
We included five trials randomizing a total of 365 participants who all received antidepressants as co-intervention. All trials had high risk of bias. Four trials assessed ‘interpersonal psychotherapy’ and one trial ‘short psychodynamic supportive psychotherapy’. Meta-analysis showed that psychodynamic therapy significantly reduced depressive symptoms on the 17-item Hamilton Rating Scale for Depression (mean difference − 3.01 (95% confidence interval − 3.98 to − 2.03; P < 0.00001), no significant heterogeneity between trials) compared with ‘no intervention’. Trial sequential analysis confirmed this result.
Our results are based on few trials with high risk of bias and a limited number of participants so our results may be questionable.
Adding psychodynamic therapy to antidepressants might benefit depressed patients, but the possible treatment effect measured on the Hamilton Rating Scale for Depression is small.
Effects of cognitive therapy versus interpersonal psychotherapy in patients with major depressive disorder: A systematic review of randomized clinical trials with meta-analyses and trial sequential analyses
2012, Psychological Medicine
Disclosure of investigators' recruitment performance in multicenter clinical trials: A further step for research transparency
2011, PLoS Medicine
Vitamin D supplementation for chronic liver diseases in adults
2021, Cochrane Database of Systematic Reviews
Error Matrix Tool to Overview the Validity of Evidence on Radix Sophorae flavescentis for Chronic Hepatitis B
2019, Journal of Alternative and Complementary Medicine

View all citing articles on Scopus

View full text

ReviewThe culture of designing hepato-biliary randomised trials

Introduction

Section snippets

Why is it important to randomise?

What kind of participants to include and which data to collect?

Which experimental intervention?

Which comparator: placebo or active?

Parallel-group or cross-over randomised trial?

Multiple promising interventions: the factorial design

Cluster randomised trials

What is the goal of the trial?

Sample size estimation in randomised trials

Sample size of randomised trials

Methodological quality: the risk of bias

Conflicting interests

Discussion

First page preview

Med Clin North Am

J Hepatol

Lancet

Lancet

Lancet

Lancet

Gastroenterology

Lancet

J Hepatol

Am J Gastroenterol

Lancet

J Hepatol

J Am Coll Cardiol

Lancet

Why do we need some large, simple randomized trials?

Stat Med

Clinical trials–a practical approach

New developments in the conduct and management of multi-center trials: an international review of clinical trial units

Fundam Clin Pharmacol

Comparing like with like: some historical milestones in the evolution of methods to create unbiased comparison groups in therapeutic experiments

Int J Epidemiol

The treatment of acute infectious hepatitis. Controlled studies of the effects of diet, rest, and physical reconditioning on the acute course of the disease and on the incidence on relapses and residual abnormalities

J Clin Invest

Randomised controlled clinical trials in diseases of the liver

Prog Liver Dis

Evidence-based Medicine, how to practise and teach EBM

Users' guides to the medical literature: a manual of evidence-based clinical practice

The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials

J Am Med Assoc

Evidence based diagnostics

BMJ

Gastrointestinal stromal tumors and the evolution of targeted therapy

Clin Adv Hematol Oncol

Challenges in systematic reviews of diagnostic technologies

Ann Intern Med

Survival and prognostic indicators in compensated and decompensated cirrhosis

Dig Dis Sci

Better reporting of harms in randomised trials: an extension of the CONSORT statement

Ann Intern Med

Challenges in systematic reviews that assess treatment harms

Ann Intern Med

Data monitoring committees in clinical trials. A practical perspective

Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials

J Am Med Assoc

Reported methodological quality and discrepancies between large and small randomised trials in meta-analyses

Ann Intern Med

Correlation of quality measures with estimates of treatment effect in meta-analyses of randomised controlled trials

J Am Med Assoc

Review
The culture of designing hepato-biliary randomised trials