Open Access
Open Peer Review

This article has Open Peer Review reports available.

How does Open Peer Review work?

Publication proportions for registered breast cancer trials: before and following the introduction of the results database

Research Integrity and Peer Review20161:10

DOI: 10.1186/s41073-016-0017-4

Received: 24 April 2016

Accepted: 6 July 2016

Published: 18 July 2016



To limit selective and incomplete publication of the results of clinical trials, registries including were introduced. The registry added a results database in 2008 to enable researchers to post the results of their trials as stipulated by the Food and Drug Administration Amendment Act of 2007. This study aimed to determine the direction and magnitude of any change in publication proportions of registered breast cancer trials that occurred since the inception of the results database.


A cross-sectional study design was employed using, a publicly available registry/results database as the primary data source. Registry contents under the subcategories ‘Breast Neoplasms’ and ‘Breast Neoplasms, Male’ were downloaded on 1 August 2015. A literature search for included trials was afterwards conducted using MEDLINE and DISCOVER databases to determine publication status of the registered breast cancer trials.


Nearly half (168/340) of the listed trials had been published, with a median time to publication of 24 months (Q1 = 14 months, Q3 = 42 months). Only 86 trials were published within 24 months of completion. There was no significant increase in publication proportions of trials that were completed before the introduction of the results database compared to those completed after (OR = 1.00, 95 % CI = .61 to 1.63; adjusted OR = 0.84, 95 % CI = .51 to 1.39). Characteristics associated with publication included trial type (observational versus interventional adjusted OR = .28, 95 % CI = .10 to .74) and completion/termination status (terminated versus completed adjusted OR = .22, 95 % CI = .09 to .51).


Less than a half of breast cancer trials registered in are published in peer-reviewed journals.


Breast cancer Incomplete publication Registries Results databases


“Full and transparent reporting of trials is crucial to ensure that decisions about health care are based on the best available evidence.” [1]. Selective, incomplete or delayed reporting of trial results (for instance, due to their statistical significance, effect size or direction) distorts the available evidence, compromises systematic reviews, renders evidence-based decisions inaccurate, wastes resources and is unethical [210]. There is evidence of selective, incomplete and delayed reporting across several if not all fields, including cancer, cardiovascular disease and neurological disorders [11, 12]; pain [13]; surgery [14]; arthritis [15]; oral health [16]; eczema [17]; and acupuncture [18]. Registration of trials at their inception and appropriate follow-up was proposed to enable the tracking of protocol changes as well as improve research transparency, accountability and integrity [8]. Examples of trial databases/registries established to accomplish these objectives are and the European Clinical Trials Database (EudraCT) launched by the US National Institute of Health (NIH) in 2000 and the European Medicines Agency (EMA) in 2004 respectively [19, 20].

Despite the establishment of these registries, the ultimate aim of trial registration (full and transparent reporting) has not been achieved. Empirical evidence demonstrates a continued low compliance to the required reporting standards [8]. For instance, Ross and colleagues [21] reported that less than a half of trials funded by the National Institute of Health (n = 635) were published within 30 months of being completed. In addition to the incomplete reporting, there was a lack of transparency with trial outcomes being changed to favour positive or statistically significant findings [22]. These results were consistent to those reported before trial registries were established [2327].

Earlier studies have reported that investigators are less likely to submit negative or null findings [6, 24]. Originally, trial registries/databases did not mandate the submission of trial findings; however, it was hoped that an additional mandatory requirement to post results of registered trials would improve completeness and transparency in reporting [8]. Once posted, it would be impossible to conceal negative or null findings. Theoretically, removing this reason for not publishing would possibly, among other factors, increase trial publication rates. The Food and Drug Administration Amendments Act of 2007 (FDAAA) [28] paved way for mandatory (specified group of trials) as well as voluntary posting of results to a results database that was added to the registry in 2008. Although there are some contradictory reports [29], both trial registration and mandatory results reporting have been shown to improve publication rates [3032].

Studies have previously assessed delayed or incomplete publication in several specialties (including cancer trials in general) [11, 12, 33, 34], with some assessing the influence of the introduction of the results database on publication rates [29, 30, 32]. A search of the literature failed to identify reports related to this issue in relation to breast cancer trials. Considering that breast cancer is one of the leading causes of morbidity and mortality in the world today [35, 36], this study sought to (1) determine the proportion of registered breast cancer trials (i.e. initiated after February 2000) completed/terminated on or before 31 July 2013 that were published in peer-reviewed journals overall and within 24 months of completion; (2) compare publication rates for trials completed before introduction of a results database (from initiation of the database in February 2000 to 31 September 2008) with those completed afterwards (from 1 October 2008 to 31 July 2013); and (3) outline the main trial characteristics associated with results’ publication. was used because not only is it the largest publicly available registry (195,624 registered trials as of 30 July 2015) but it also combines the registry with a results database [8, 37].


Study design

This was a cross-sectional study of breast cancer trials registered at

Search strategy was searched on 1 August 2015 by browsing through the different conditions by category/topic. Under the category ‘Cancers and Other Neoplasms’, the two subcategories ‘Breast Neoplasms’ and ‘Breast Neoplasms, Male,’ were chosen and the registry contents downloaded.

Selection of relevant trials

Inclusion criteria

Both experimental and observational trials under the subcategories ‘Breast Neoplasms’ and ‘Breast Neoplasms, Male,’ with primary completion or termination dates up to 31 July 2013 were included (this was to allow for a 24 month period; 1 August 2013 to 31 July 2015). The primary completion date was defined as the date of collection of the primary outcome measure for the last included patient [38] (when unreported, the expected completion date as defined by the trial investigators was used); whereas termination date as a premature/early date on which participant recruitment, examination and, or treatment stops with the trial not scheduled to start again [38].

Exclusion criteria

‘Ongoing’ trials, trials initiated before/during February 2000, trials completed after 31 July 2013, terminated trials that did not enrol a single participant and trials with ‘unknown’ status or unknown completion dates were excluded. ‘Ongoing’ trials were defined as trials that were ‘active, but not recruiting’, ‘suspended’, ‘enrolling by invitation’, ‘recruiting’, ‘not yet recruiting’, ‘available for expanded access’ and ‘temporarily not available for expanded access’, whereas trials with ‘unknown status’ were those trials whose recruitment status had not been verified within the previous two years for ‘recruiting’ or ‘not yet recruiting’ studies [38]. A track of trials included per study period was kept with trials in a given period excluded once an accumulated number of 170 trials in the same period was reached.

Size of selected sample

This was based on the ‘sample size calculation formula for a difference in proportions’ [39]. Using the Ramsey and Scoggins [33] publication rate of p 1 = 17.6 % (before the release of the results database), 90 % power and assuming a type I error rate of 5 %, power calculations were undertaken under various publication rate increments including 10 % (724 trials), 15 % (340 trials) and 20 % (199 trials). The 15 % absolute change in publication rate (i.e. p 2 = 32.6 %), which produced 340 trials in total or 170 trials per study period, was decided on since it reflected a reasonably large increment whilst still requiring a manageable sample size.

Sampling technique

A web-based random number generator [40] was used to generate a sequence of random numbers from within the range of 1 to 6389, 6389 being the number of breast cancer trials registered with as of 1 August 2015 (Table 1). Using the generated sequence, the first 170 trials from each trial period fulfilling inclusion/exclusion criteria (below) were selected.
Table 1

Breast cancer trials registered with as of 1 August 2015

Breast cancer category (number of trials) number (study rank)

Allocated survey number

Breast Neoplasms (6109 studies)



Breast Neoplasms, Male (280 studies)



Data extraction

A Microsoft Office Excel template was used to extract information from downloaded content including trial characteristics (design: type, phase, randomization status, control status, blinding, interventional model, and endpoint classification; population: age and gender; sample size; study location; registration before/after initiation; and, primary sponsor); completion/termination status (completed or terminated, month and year of completion/termination, completed before/after 1 October 2008 and registered before/after completion/termination); results’ posting in (results posted or not; time to posting results); and trial publication status (results published or not, journal of publication, and time to publication).

Search for corresponding publications

For the literature search of published trials, the publication link (or citation) within the, if available, was used. If no link (or citation) was available, a search was conducted in MEDLINE using the identification number (CTN). If no publication was found using this number, another search using the keywords ‘breast cancer’ and the study intervention(s)’ primary outcomes and/or principal investigator (if named) was conducted. Articles were matched to the registry information using the study description, primary and secondary objectives, location, enrolment start and end dates, etc. For multiple publications, the publications that most closely fitted the study description in the registry records were chosen. If a decision could not be made on this basis (i.e. for two or more publications both/all closely fitting the registry records) or, if the same publication was published twice, the earliest publication (including electronic publications) was used. For studies whose publication records were not found using the MEDLINE search, a similar search was conducted using University of Liverpool’s DISCOVER. DISCOVER is the University of Liverpool’s electronic library database which contains/links to sources from more than 489 other databases including MEDLINE, Scopus, Science Citation Index, Science Direct, CINAHL Plus, Cochrane database of systematic reviews, EBSCO, ProQuest, Bandolier, etc. [41]) The search included publications for all languages with non-English publications included if a translated abstract and/or main text was available. For published trials, the journal and date of publication, the database in which published records were found and the study’s visibility using a CTN search as well as the reporting of the CTN in the title/abstract were captured. Unpublished studies that had meeting abstracts were also documented but not included as full publications.

From both the and publication records, other measures were derived including registered before/after initiation/completion, registered within 21 days if not prospectively registered, time in months from receipt of certification or request for extension to delay results, time in months from completion to publication/posting in and whether or not results were posted 12 (or published 24) months or less. Finally and depending on the year of publication, each journals’ impact factor was obtained by viewing the respective ‘Journal Citation Reports’ available from the ISI Web of Science. Data collection was completed on 31 August 2015.

Data cleaning

This involved the review of trial registries to check for and rectify typographical errors. Missing/unclear/inconsistent data variables were resolved as shown below (completion dates were not altered since they had been used to classify the registry records into the two study periods):
  • Some missing data was resolved (imputed) by using other information. For instance, since all single-group assignments/designs are non-randomized, open-label and have no control group [38], all studies that had missing information under ‘randomization status’, ‘blinding status’ and/or ‘control group’ were imputed accordingly if they were labelled ‘single-group assignment’. For those that had inconsistent data, e.g. a ‘single-group assignment’ also labelled as ‘placebo-controlled,’ the study’s description in the narrative text, if available, and/or the study’s objectives were used to resolve the issue. Similarly, the study’s objectives were used to impute missing ‘endpoint classifications’ whereas studies with more than one listed site or conducted in more than one country had to be ‘multicentre’.

  • For studies with publications, unresolved missing/inconsistent data was imputed using information from published records. For inconsistencies between registry and published records. e.g. achieved sample size, published records took precedence.

  • For information especially age ranges that did not perfectly match the data categories as per the data extraction template, the closest category was recorded, e.g. a study recruiting participants ≥21 years but <70 years was considered to be in the ‘Adults (≥18 years, <65 years) only’ category.

Finally, to aid data analysis, all missing/blank fields were completed using a ‘0’ except for those fields that required ‘time to completion, in months’ (in relation to results posting within and publication in peer-reviewed journals), ‘sample size’ and ‘number of multiple centres’.


The IBM Statistical Package for Social Sciences (SPSS) Version 21 was used. Descriptive analysis was used to identify the main trial characteristics as well as the proportions of the main outcome measures. Associations between selected variables were examined using chi-square tests (both variables categorical), independent sample T test/Mann-Whitney/logistic regression (one categorical, one quantitative) or linear regression (both quantitative). Conducted tests included:
  • Tests between the two study periods versus the trial characteristics to ascertain whether/not trial characteristics were different across the two periods

  • Tests between the two study periods versus publication status (published/not within 24 months and time to publication) to compare publication rates and timeliness across the two periods

  • Tests between publication status (results published/not) versus trial characteristics to determine which trial characteristics were associated with trial publication

The Kaplan-Meier survival method was used to compare the cumulative probability of trials being published within 24 months for the two study periods (all trials without publications after a 24-month follow-up were censored). Finally, all factors that were associated to publication status (p value ≤.25) [42] were introduced in a multivariate logistic regression to identify factors independently associated with publication status. To examine the effects of multiple testing, the Bonferroni adjustment method [43] was employed.


Trial selection and characteristics

A total of 340 registered trials were included as shown in Fig. 1 (also see Additional file 1). The majority of these trials were interventional, phase I/II or II, non-randomized, had no control, open-label, of single-group assignment and assessed both safety and efficacy (Table 2). The median sample size was 48 (first quartile, Q 1 = 24, third quartile, Q 3 = 118) with most trials including female participants aged ≥18 years old. Most trials were multicentre; and, of the multicentre trials that reported the number of centres (n = 87), the median number of sites was 9 (Q 1 = 4, Q 3 = 23). The primary sponsor was categorized as non-industry/non-government in 60.9 % of the trials. The registration, completion/termination and results posting statuses of the same trials are also presented in Table 2. A notable observation is that most (70.9 %) trials were not prospectively registered (i.e. before initiation). When registration within 21 days of initiation was considered, an extra 34 trials were registered adequately. The median year or trial completion was 2008 (Q 1 = 2007, Q 3 = 2011). Only 59 (17.4 %) trials had posted results with a median time to posting results of 24 months (Q 1 = 13 months, Q 3 = 49 months) (of these, only 14 had posted results within the required 12 months). For trials without posted results, only 7 had applied for a certification or request for extension to delay results. However, these trials had not posted results even after a median time of 31 months (from receipt of the certification/request by The greatest proportions of unknown/missing values were recorded in the categories ‘number of centres’ and ‘number of centres, if multiple’ which respectively had 37.1 and 47.0 % values missing.
Fig. 1

Selection of trials

Table 2

Trial characteristics, comparison between the two study periods and publication status.

Trial characteristics



p valuea

Publication Statusd

p valuea


Before 1 Oct 2008

On/after 1 Oct 2008


Not Published



297 (87 %)

158 (93 %)

139 (82 %)


159 (54 %)

138 (47 %)




43(13 %)

12 (7 %)

31 (18 %)


9 (21 %)

34 (79 %)




47 (14 %)

28 (21 %)

19 (17 %)


21 (45 %)

26 (55 %)



I/II or II

145 (43 %)

80 (59 %)

65 (60 %)


85 (59 %)

60 (41 %)



45 (13 %)

24 (18 %)

21 (19 %)


29 (64 %)

16 (36 %)



8 (2 %)

4 (3 %)

4 (4 %)


6 (75 %)

2 (25 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

52 (15 %)


Randomization status


107 (32 %)

60 (39 %)

47 (34 %)


63 (59 %)

44 (41 %)



186 (55 %)

95 (61 %)

91 (66 %)


96 (52 %)

90 (48 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

4 (1 %)


Control status

Placebo only

29 (9 %)

16 (10 %)

13 (10 %)


15 (52 %)

14 (48 %)



Active-control only

74 (22 %)

34 (22 %)

40 (29 %)


41 (55 %)

33 (45 %)


Both placebo and active-control

17 (5 %)

13 (8 %)

4 (3 %)


12 (71 %)

5 (29 %)


No control

172 (51 %)

92 (59 %)

80 (58 %)


90 (52 %)

82 (48 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

5 (2 %)




255 (75 %)

128 (82 %)

127 (93 %)


134 (53 %)

121 (48 %)




9 (3 %)

5 (3 %)

4 (3 %)


4 (44 %)

5 (56 %)


Double (or Triple) blind

30 (9 %)

24 (15 %)

6 (4 %)


20 (67 %)

10 (33 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

3 (1 %)


Interventional model


107 (32 %)

54 (35 %)

53 (39 %)


60 (56 %)

47 (44 %)



9 (3 %)

7 (5 %)

2 (2 %)


4 (44 %)

5 (56 %)



4 (1 %)

2 (1 %)

2 (2 %)


4 (100 %)

0 (0 %)


Single-group assignment

172 (51 %)

92 (59 %)

80 (58 %)


90 (52 %)

82 (48 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

5 (2 %)


Endpoint classification

Safety only

28 (8 %)

15 (10 %)

13 (10 %)


12 (43 %)

16 (57 %)


Efficacy only

103 (30 %)

49 (32 %)

54 (40 %)


49 (48 %)

54 (52 %)


Both Safety and Efficacy

160 (47 %)

91 (59 %)

69 (51 %)


96 (60 %)

64 (40 %)


Not applicable (observational studies)

43 (13 %)


Unknown (missing values)

6 (2 %)



Included adults (≥18 years, >65 years)

51 (15 %)

28 (18 %)

23 (15 %)


29 (57 %)

22 (43 %)



Included adults and older adults (≥65 years)

237 (70 %)

121 (76 %)

116 (75 %)


117 (49 %)

120 (51 %)


Included adults and children (<18 years)

5 (2 %)

2 (1 %)

3 (2 %)


2 (40 %)

3 (60 %)


Included older adults (≥65 years) only

20 (6 %)

8 (5 %)

12 (8 %)


9 (45 %)

11 (55 %)


Included children (<18 years) only

0 (0 %)

0 (0 %)

0 (0 %)


0 (0 %)

0 (0 %)


Unknown (missing values)

27 (8 %)



Male only

1 (0 %)

0 (0 %)

1 (1 %)


0 (0 %)

1 (100 %)



Female only

225 (66 %)

106 (62 %)

119 (70 %)


105 (47 %)

120 (53 %)


Both male and female

114 (34 %)

64 (38 %)

50 (29 %)


63 (55 %)

51 (45 %)


Number of Centres


50 (15 %)

18 (17 %)

32 (29 %)


34 (68 %)

16 (32 %)



164 (48 %)

87 (83 %)

77 (71 %)


112 (68 %)

52 (32 %)


Unknown (missing values)

126 (37 %)


Number of Countries


301 (89 %)

148 (88 %)

153 (91 %)


143 (48 %)

158 (53 %)



36 (11 %)

20 (12 %)

16 (10 %)


25 (69 %)

11 (31 %)


Unknown (missing values)

3 (1 %)



US/Canada only

209 (62 %)

105 (63 %)

104 (62 %)


100 (48 %)

109 (52 %)



International (outside USA/Canada) only

102 (30 %)

49 (29 %)

53 (31 %)


48 (47 %)

54 (53 %)


USA/Canada and International

26 (8 %)

14 (8 %)

12 (7 %)


20 (77 %)

6 (23 %)


Unknown (missing values)

3 (1 %)


Primary sponsor


91 (27 %)

45 (27 %)

46 (27 %)


45 (50 %)

46 (51 %)


Government (US and non-US)

42 (12 %)

24 (14 %)

18 (11 %)


24 (57 %)

18 (43 %)



207 (61 %)

101 (59 %)

106 (62 %)


99 (48 %)

108 (52 %)


Registered before initiation


59 (17 %)

22 (14 %)

37 (27 %)


35 (59 %)

24 (41 %)



241 (71 %)

139 (86 %)

102 (73 %)


119 (49 %)

122 (51 %)


Unknown (missing dates)

40 (12 %)


Registered within 21 days of initiation


93 (27 %)

30 (19 %)

63 (30 %)


47 (51 %)

46 (50 %)



217 (64 %)

127 (81 %)

90 (59 %)


110 (51 %)

107 (49 %)


Unknown (missing dates)

30 (9 %)




288 (85 %)

148 (87 %)

140 (82 %)


155 (54 %)

133 (46 %)



52 (15 %)

22 (13 %)

30 (18 %)


13 (25 %)

39 (75 %)


Registered before completion/termination?


283 (84 %)

131 (78 %)

152 (90 %)


135 (48 %)

148 (52 %)



55 (16 %)

38 (23 %)

17 (10 %)


32 (58 %)

23 (42 %)


Unknown (missing dates)

2 (1 %)


Results posted in


59 (17 %)

20 (12 %)

39 (23 %)


31 (53 %)

28 (48 %)



281 (83 %)

150 (88 %)

131 (77 %)


137 (49 %)

144 (51 %)


Publication link available


77 (23 %)

41 (24 %)

36 (21 %)


77 (100 %)

0 (0 %)



263 (77 %)

129 (76 %)

134 (79 %)


91 (35 %)

172 (51 %)


aPearson chi-square p values used unless otherwise. For cells with expected cell count less than 5, exact p values (Fisher’s exact test or Monte Carlo significance) were used

bLikelihood ratio p value

cPercentages by column for frequencies and study period comparisons (some percentages may not add up to 100 % due to rounding)

dPercentages by row for publication comparisons

A comparison of trial characteristics between the two study periods for continuous variables using the Mann-Whitney U test, i.e. sample size (N = 340, U = 14,447.0, Sig. =.997), number of multiple centres (N = 87, U = 915.0, Sig. =.857) and time to posting of results (N = 59, U = 237.5, Sig. =.014) showed that only the latter was statistically significantly different across the periods. Other comparisons are shown in Table 2. Overall, 8/22 factors/characteristics differed between the two periods including trial type, blinding status, number of centres (single versus multiple), registration before trial initiation, registration within 21 days of trial initiation, registration before completion/termination and result’s posting (including time to posting) in

Overall publication of study results

Of the included trials (n = 340), only 77 had valid publication links. The links (n = 77) and a MEDLINE search using the ‘ identifier (CTN)’ (n = 5) and ‘applicable search terms’ (n = 86) produced a total of 168 trials (Additional file 2) published by MEDLINE-indexed journals (with a median impact factor of 4.17, Q 1 = 2.63, Q 3 = 6.41). Apart from 16 meeting abstracts (excluded), the DISCOVER search did not yield any more publications. Therefore, 49.4 % (168/340) of breast cancer trials registered with were published, with a median time to publication of 24 months (Q 1 = 14 months, Q 3 = 42 months). The publication trend based on year of completion/termination is shown in Fig. 2. Considering publication timeliness, only 86 (51.2 %) of the 168 published trials (which translates to 25.3 % of the 340 registered trials) were published within 24 months of trial completion. With regard to ‘visibility’, less than half (45.8 %, 77/168), a third (27.4 %, 46/168) and a fifth (14.9 %, 25/168) of published trials had publication links, were obtainable using the CTN search and reported the CTN in the title/abstract, respectively.
Fig. 2

Publication (number of trials) by year of completion/termination

Publication across the two study periods

Of the 168 trials published overall, 93 (55.4 %) trials were completed/terminated before 1 October 2008 compared to 75 (44.6 %) that were completed/terminated on/after 1 October 2008. Both trial periods had an equal number of trials (n = 43, 25.3 % of 170 trials) being published within 24 months. Chi-square tests did not reveal an association between completion/termination before/after 1 October 2008 and results being published within 24 months/not (OR = 1.00, 95 % CI = .61 to 1.63, p value = 1.000, n = 340; OR adjusted for type and completion/termination status = 0.84, 95 % CI = .51 to 1.39, p value = .493, n = 340). Concerning time to publication, Kaplan-Meier survival analysis (Fig. 3) did not find a significant difference between the two time periods (log rank test p value = .989).
Fig. 3

A comparison of the cumulative probability of trials being published over time across the two study periods (all trials followed up for 24 months)

Study characteristics associated with trial publication

The associations between publication status (results published/not) with trial characteristics (Table 2) revealed that observational studies were less likely to be published than interventional studies (OR = .23, 95 % CI = .11 to .50, p value < .0001, n = 340) whereas terminated trials were less likely to be published than those completed as planned (OR = .29, 95 % CI = .15 to .56, p value <.0001, n = 340). There was also a significant difference in publication rates between trials conducted in single countries and those conducted in multiple countries (OR = .40, 95 % CI = .19 to .84, p value = .013, n = 337). Finally, the country of location was significantly associated with publication status (p value = .013). Although the availability of a publication link is shown in Table 2 (p value < .0001), the presence of a publication link was not considered a predisposing factor to publication because of a clear temporal relationship (publication precedes establishment of a link). None of the continuous variables was significantly associated with publication status (Mann-Whitney U tests were sample size, N = 340, U = 12716.0, Sig. =.056; number of multiple centres, N = 87, U = 275.5, Sig. =.944; and time to posting of results, N = 59, U = 407.5, Sig. =.687 whereas logistic regression ORs (95 % CIs) were sample size, OR = 1.00, 95 % CI = 1.00 to 1.00; number of multiple centres, OR = .98, 95 % CI = .94 to 1.03; and time to posting of results, OR = 1.00, 95 % CI = .98 to 1.02).

Chi-square comparisons between early phase (phase I and I/II or II) versus late phase (phase II/III or III and IV), blinded (single, double or triple blind) versus not-blinded (open-label) and industry versus non-industry funding with regard to results publication were also compared. The results (together with associations of other dichotomous trial characteristics) are presented in Fig. 4.
Fig. 4

Non-adjusted ORs (95 % CIs) of results publication for selected trial (dichotomous) characteristics

On multivariate analysis, only trial type (observational versus interventional studies) (adjusted OR = .28, 95 % CI = .10 to .74) and completion/termination status (terminated versus completed as planned) (adjusted OR = .22, 95 % CI = .09 to .51) were associated with publication status.

Examination of the effects of multiple testing

A total of 45 tests (21 comparing trial characteristics across the study periods, 2 between the study periods and publication status and 22 examining trial characteristics associated with publication status) were conducted. According to Bland [43], these many tests inflate the α (type I) error. For the overall study error to remain at the set α level, the Bonferroni adjustment sets a new error rate of 1 − (1 − α) n (which approximates α/n), where n is the number of tests performed. 0.05/45 = .001. Using this p value, only two associations (results published/not with trial type and results published/not with completion/termination status) remained significant.


Principal findings

This study was designed to measure the publication proportion for registered breast cancer trials and detect any changes in the same after the 2008 introduction of the results database. First and foremost, the proportion of trials published within 24 months was very low at 25.3 % (increasing to 49.4 % when publication timeliness is not considered). Although these figures are higher than those earlier reported (for oncology trials) [11, 33], a majority of registered trials remain unpublished, which is worrisome given that oncology was among the first medical fields to widely adopt trial registration [44]. Overall, more trials were published in the period prior to compared to that after 1 October 2008 (55.4 versus 44.6 %). However, it is not possible to associate this difference to the introduction of the results database since trials conducted before 1 October 2008 had a longer follow-up [45]. Indeed, this difference ceased to exist when all trials were followed up for the same 24-month period (50 versus 50 %).

For both study periods, a majority of studies remained unpublished. This study however did not evaluate the reasons such studies were not published. Reasons for non-publication of research studies have nevertheless been discussed elsewhere [7, 11, 33, 4548]. It is, however, also possible that some studies designated as ‘unpublished’ were still undergoing preparation for publication or editorial review and will soon be published [49, 50]. This is especially important since the study’s minimum follow-up period was 2 years yet according to Hopewell et al. [50], positive and negative findings are on average published within 4–5 and 6–8 years respectively. Schmucker and colleagues also noted that the probability of publication within 2 years, for their study, ranged from 7 to 30 % [51]. As Manzoli [52] demonstrated publication rates increase with time, with a follow-up of 355 vaccine trials showing publication rates of 12, 29, 53 and 73 % after 12, 24, 36 and 48 months after completion, respectively. For this study therefore, trials completed before 1 October 2008 with a minimum follow-up of 7 years should have had enough time for publication. Worryingly, 45.3 % (77/170) of these trials were not published which is in line with Khan et al.’s [15] report that 20 to 70 % of trials remain unpublished even after a follow-up of several years.

Only trial type (interventional versus observational) and completion/termination status were associated with trial publications. As mentioned earlier, registration of trials was introduced among other reasons to ensure the complete reporting of the same trials [8]. With FDAAA [28] among other laws not mandating their registration, exacerbated by the fact that the medical community has given them little attention with regard to their need to be registered [53], it is no surprise that the publication proportion of observational trials was much less than that of interventional trials. However as Williams and colleagues [53] explain, most, if not all, ethical and scientific reasons that prompted registration of interventional studies also apply to observational studies. They should therefore also be registered (and results publicly disseminated). With regard to completion/termination status, reasons similar/related to those for termination (e.g. recruitment failure, safety concerns, futility, economic reasons, etc.) [31, 54] may discourage those initiating/conducting/funding the same trials from investing more time/money into their publication. However, as Shamliyan and Kane [31] report, reporting of such trials is especially important if they were terminated for safety concerns. Lessons learnt from such trials are important in shaping future research [3].

Study strengths and weaknesses

To the best of the researchers’ knowledge, this is the first study to determine publication rates in the breast cancer field and compare such rates before and after the introduction of the results database. is currently the largest registry/results database which contains trials conducted from almost all areas of the world. It is also publicly available meaning any researcher can access and replicate the study if scientifically/ethically justifiable. By allowing a 2-year period, the study ensured that completed/terminated trials had at least 2 years to be published in peer-reviewed journals. However, several limitations are recognized.
  • First, the study was designed to detect a 15 % change in publication rate yet the observed change was 10.6 % (54.7 to 44.1 %). A difference of 10 % required a sample size of 724 trials. The study’s smaller sample size produced less precise results (wider CIs) which could have affected the statistical significance of some of the associations as well as their interpretation [42].

  • Secondly, although is the largest publicly available database, the majority of trials originate from the USA/Canada (Table 2) which as Herrmann et al. [55] note might limit generalizability of the findings.

  • Thirdly, the study relied on the accuracy of information in the registry. However, this has previously been found to be unreliable [52, 56]. Incomplete, inconsistent or inaccurate records (e.g. some included studies were published before their registered completion dates) negatively impact the validity of the study’s findings. For instance, as Ross et al. report, studies providing completion dates are generally more likely to be published than those not providing the same dates [7]. This implies that this study, by not including studies without completion dates, may have overestimated the proportion of published studies in the registry. Additionally, because studies did not provide the date/day of completion, the time to publication (months) was approximate. Where it was possible to rectify inconsistencies/inaccurate/incomplete information, this was done as described under the ‘Data cleaning’ section.

  • Fourthly, although the Bonferroni adjustment was used to assess the effects of multiple testing, it was not employed to adjust for the type I error. The Bonferroni adjustment discourages multiple tests yet these are necessary for the interpretation of findings; it also increases the type II error (‘false negatives’) [57]. As Perneger [57] summarizes, “…simply describing what tests of significance have been performed, and why, is generally the best way of dealing with multiple comparisons.” The number of tests conducted (and why) were described in the analysis section. It should be noted that the ‘primary’ outcome (proportion of published trials) which was in the first instance not significant would not be affected by any Bonferroni adjustment. The only affected tests were ‘secondary’ (i.e. used to interpret the ‘primary’ finding) and were therefore of less relevance to the overall effect of multiple testing on the study’s results.

  • Fifthly, the study design did not include the assessment of unpublished findings. Nevertheless, the registry does not contain contact information for completed trials (contains only for ‘recruiting’ or ‘not yet recruiting’ trials) [58] which would have made it difficult had the researcher opted to contact investigators to assess unpublished findings.

  • Sixthly, it is possible that the study missed some published reports. However, as Manzoli states, any publications missed using the study’s systematic search are in essence not ‘publicly available’ or ‘visible’ to the public and are unlikely to be identified during subsequent/routine searches [52].

  • Finally, a single researcher derived information from the registry in addition to searching for publication records. The lack of a second independent search/data extraction lessens scientific vigour since it is impossible to rectify (and/or quantify) issues identifiable thorough researcher disagreements [49].

Policy implications

This study has several implications. First, the lack of publications or delays in publishing trial results means that at any one time, available evidence will be incomplete [210, 59]. Incomplete evidence in turn affects most, if not all, populations including researchers (conducting systematic reviews or further research that needs to be guided by current evidence), health service providers (making treatment decisions), policy makers (making treatment guidelines), patients with their friends and families (final healthcare consumers) and healthcare funders who pay for the treatment interventions. Wrong policy decisions, treatments given or research conducted is wasteful of limited resources. Secondly, patients, healthcare providers, funders, ethical boards, etc. participate in/fund or approve research with the hope that results will be disseminated and used to inform clinical practice. A failure to publish research findings violates these agreements/understandings and is not only unethical but could be regarded as scientific misconduct. In the long run, it erodes the public trust in clinical research [210, 59]. It is hoped that all concerned (including but not limited to researchers, journal editors, peer reviewers, sponsors, policymakers, regulators, Institutional Review Boards or Research Ethics Committees, etc.) will continue to work toward the attainment of complete and transparent reporting in the breast cancer and other fields (the OPEN (Overcome failure to Publish nEgative fiNdings)) project has developed targeted recommendations to that effect [60]) so that ultimately patients, healthcare professionals, funders, researchers, etc. benefit through ethical, scientific and efficient utilization and advancement of medical knowledge.


Less than a half of breast cancer trials registered in are published in peer-reviewed journals. Although this is an overall improvement from earlier reported oncology trials [11, 33], the results of the findings suggests the results of the majority of breast cancer trials remain unavailable to the public. These findings raise both ethical and scientific concerns and question both the completeness and validity of the evidence base that guides treatment decisions/guidelines, further research among others with regard to breast cancer treatments. Additionally, they question the effectiveness of the measures including registries and results databases (or their implementation) that have over time been introduced to limit selective, incomplete and delayed publication.


As Zarin and Tse explain with regard to trial registration, “…the infrastructure…is in place, but culture change by all stakeholders…is necessary before key goals can be reached.” [61] It is important therefore that this ‘cultural change’ be facilitated and the existing infrastructure strengthened. This among others includes increasing the capability of to detect and enable rectification of incomplete, inaccurate or inconsistent entries (especially applicable optional data elements) (see Table 2 for proportions of incomplete entries). Responsible authorities should be made accountable for complete, accurate and up-to-date information. should retain contact information for completed (especially unpublished) trials. The trial registry should also include a field that enables viewers determine whether/not a study is required to post results as mandated by FDAAA. There should be enforcements (rewards and punishments) at all levels; e.g. Institutional Review Boards, peer reviewers or journal editors should not respectively approve, approve for publication or publish (unless otherwise/justifiable) applicable trials that are not prospectively registered, do not have trial registration numbers or have not posted results as required by the FDAAA (many published trials did not fulfil these requirements).


CONSORT, Consolidated Standards of Reporting Trials; EMA, European Medicines Agency; EudraCT, European Clinical Trials Database; FDAAA, Food and Drug Administration Amendment Act of 2007; NIH, US National Institute of Health; OPEN, Overcome failure to Publish nEgative fiNdings; WHO, World Health Organization



The authors are grateful to the Laureate Online Education/University of Liverpool May 2015 Dissertation Class Cohort and the Research Methods Training Class instructor Allison Gilchrist for lending support to this project.


There was no external source of funding. The funds for the design of the study and collection, analysis and interpretation of data and in writing the manuscript were availed by the authors.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article (under the Additional files 1 and 2).

Authors’ contributions

IGA was responsible for conceiving the study, collecting and analysing the data. IGA and RD were both involved in designing the study and drafting the manuscript. Both authors read and approved the final manuscript.

Authors’ information

IGA is a senior pharmacist under Goodhope District Health Management Team, Botswana. He has undertaken and completed a Master of Science in Clinical Research at the University of Liverpool. RD is an online instructor and dissertation advisor on the Master of Science in Clinical Research programme at Laureate Online Education/University of Liverpool. She is also the Director of the Liverpool Reviews and Implementation Group.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Authors’ Affiliations

University of Liverpool
Liverpool Reviews and Implementation Group (LRiG)


  1. Consolidated standards of reporting trials. Now published: CONSORT 2010 statement. 2010. Accessed 4 Jun 2015.
  2. Hopewell S, Loudon K, Clarke MJ, Oxman AD, Dickersin K. Publication bias in clinical trials due to statistical significance or direction of trial results. Cochrane Database Syst Rev. 2009; Issue 1. Art. No.: MR000006.
  3. Dickersin K, Chalmers I. Recognising, investigating and dealing with incomplete and biased reporting of clinical research: from Francis Bacon to the WHO. J R Soc Med. 2011;104(12):532–8.View ArticleGoogle Scholar
  4. Dickersin K, Chan S, Chalmers TC, Sacks HS, Smith H. Publication bias and clinical trials. Control Clin Trials. 1987;8:343–53.View ArticleGoogle Scholar
  5. Macleod MR, Michie S, Roberts I, Dirnagl U, Chalmers I, Ioannidis JPA, Salman RA, Chan A, Glasziou P. Biomedical research: increasing value, reducing waste. Lancet. 2014;383(9912):101–4.View ArticleGoogle Scholar
  6. Reynolds T. Eliminating publication bias: the effect of negative trial results. JNCI. 2000;92(9):682.View ArticleGoogle Scholar
  7. Ross JS, Mulvey GK, Hines EM, Nissen SE, Krumholz HM. Trial publication after registration in a cross-sectional analysis. PLoS Med. 2009;6(9):e1000144.View ArticleGoogle Scholar
  8. Tse T, Zarin DA, Williams RJ, Ide NC. The role and importance of clinical trial registries and results databases. In: Gallin JI, Ognibene FP, editors. Principles and practices of clinical research. 3rd ed. Amsterdam: Elsevier; 2012. p. 171–80.View ArticleGoogle Scholar
  9. Zarin DA, Tse T, Williams RJ, Califf RM, Ide NC. The results database—update and key issues. N Engl J Med. 2011;364(9):852–60.View ArticleGoogle Scholar
  10. Zarin DA, Williams RJ, Bergeris A, Dobbins HD, Ide NC, Loane RF, Robbins A, Tse T. and related projects: improving access to information about clinical studies. A report to the Board of Scientific Counselors April 2013, Technical Report TR-2013-001. Bethesda: Lister Hill National Center for Biomedical Communications, US National Library of Medicine; 2013. Accessed 30 Jul 2015.
  11. Nguyen T, Dechartres A, Belgherbi S, Ravaud P. Public availability of results of trials assessing cancer drugs in the United States. J Clin Oncol. 2013;31(24):2998–3003.View ArticleGoogle Scholar
  12. Hakala A, Kimmelman J, Carlisle B, Freeman G, Fergusson D. Accessibility of trial reports for drugs stalling in development: a systematic assessment of registered trials. BMJ. 2015;350:h1116.View ArticleGoogle Scholar
  13. Smith SM, Wang AT, Pereira A, Chang RD, McKeown A, Greene K, Rowbotham MC, Burke LB, Coplan P, Gilron I, Hertz SH, Katz NP, Lin AH, McDermott MP, Papadopoulos EJ, Rappaport BA, Sweeney M, Turk DC, Dworkin RH. Discrepancies between registered and published primary outcome specifications in analgesic trials: ACTTION systematic review and recommendations. Pain. 2013;154(12):2769–74.View ArticleGoogle Scholar
  14. Killeen S, Sourallous P, Hunter IA, Hartley JE, Grady HL. Registration rates, adequacy of registration, and a comparison of registered and published primary outcomes in randomized controlled trials published in surgery journals. Ann Surg. 2014;259(1):193–6.View ArticleGoogle Scholar
  15. Khan NA, Singh M, Spencer HJ, Torralba KD. Randomized controlled trials of rheumatoid arthritis registered at what gets published and when. Arthritis Rheumatol. 2014;66(10):2664–74.View ArticleGoogle Scholar
  16. Smail-Faugeron V, Esposito M. Should all clinical trials of oral health be registered? Eur J Oral Implantol. 2014;7(4):327–8.Google Scholar
  17. Nankervis H, Baibergenova A, Williams HC, Thomas KS. Prospective registration and outcome-reporting bias in randomized controlled trials of eczema treatments: a systematic review. J Invest Dermatol. 2012;132:2727–34.View ArticleGoogle Scholar
  18. Su C, Han M, Ren J, Li W, Yue S, Hao Y, Liu J. Empirical evidence for outcome reporting bias in randomized clinical trials of acupuncture: comparison of registered records and subsequent publications. Trials. 2015;16:28.View ArticleGoogle Scholar
  19. European Medicines Agency. EudraCT Public Web Report for January 2015. 2015. Accessed 28 May 2015.
  20. US National Institute of Health. history, policies and laws. 2015. Accessed 28 May 2015.
  21. Ross SJ, Tse T, Zarin DA, Xu H, Zhou L, Krumholz HM, Hines HH. Publication of NIH funded trials registered in cross sectional analysis. BMJ. 2012;344:d7292.View ArticleGoogle Scholar
  22. Mathieu S, Boutron I, Moher D, Altman DG, Ravaud P. Comparison of registered and published primary outcomes in randomized controlled trials. JAMA. 2009;302(9):977–84.View ArticleGoogle Scholar
  23. Easterbrook PJ, Gopalan R, Berlin JA, Matthews DR. Publication bias in clinical research. The Lancet. 1991;1(8746):867–72.View ArticleGoogle Scholar
  24. Dickersin K, Min YI, Meinert CL. Factors influencing publication of research results. Follow-up of applications submitted to two institutional review boards. JAMA. 1992;267(3):374–8.View ArticleGoogle Scholar
  25. Ioannidis JPA. Effect of the statistical significance of results on the time to completion and publication of randomized efficacy trials. JAMA. 1998;279(4):281–6.View ArticleGoogle Scholar
  26. Simes RJ. Publication bias: the case for an international registry of clinical trials. J Clin Oncol. 1986;4(10):1529–41.Google Scholar
  27. Stern JM, Simes RJ. Publication bias: evidence of delayed publication in a cohort study of clinical research projects. BMJ. 1997;315:640–5.View ArticleGoogle Scholar
  28. Food and Drug Administration Amendments Act of 2007. Title VIII—Clinical trial databases, Sec. 801. Expanded Clinical Trial Registry Data Bank. 2007. Accessed 2 Jun 2015.
  29. Gopal RK, Yamashita TE, Prochazka AV. Research without results: inadequate public reporting of clinical trial results. Contemp Clin Trials. 2012;33:486–91.View ArticleGoogle Scholar
  30. Munch T, Dufka FL, Greene K, Smith SM, Dworkin RH, Rowbotham MC. RReACT goes global: perils and pitfalls of constructing a global open-access database of registered analgesic clinical trials and trial results. Pain. 2014;155(7):1313–7.View ArticleGoogle Scholar
  31. Shamliyan TA, Kane RL. Clinical research involving children: registration, completeness, and publication. Pediatrics. 2012;129(5):e1291–300.View ArticleGoogle Scholar
  32. Shamliyan TA, Kane RL. Availability of results from clinical research: failing policy efforts. J Epidemiol Global Health. 2013;4(1):1–12.View ArticleGoogle Scholar
  33. Ramsey S, Scoggins J. Practicing on the tip of an information iceberg? Evidence of underpublication of registered clinical trials in oncology. Oncologist. 2008;13(9):925–9.View ArticleGoogle Scholar
  34. You B, Gan HK, Pond G, Chen EX. Consistency in the analysis and reporting of primary end points in oncology randomized controlled trials from registration to publication: a systematic review. J Clin Oncol. 2012;30(2):210–6.View ArticleGoogle Scholar
  35. American Cancer Society. Cancer Facts & Figs. 2016. Accessed 31 Mar 2016.
  36. Cancer Research UK. Breast cancer statistics. 2014. Accessed 8 Jun 2015.
  37. US National Institute of Health. trends, charts and maps. 2015. Accessed 31 Jul 2015.
  38. US National Institute of Health. glossary of clinical trials terms. 2015. Accessed 12 Jun 2015.
  39. Whitley E, Ball J. Statistics review 4: sample size calculations. Crit Care. 2002;6:335–41.View ArticleGoogle Scholar
  40. GraphPad Software, Inc. Random number generator. 2015. Accessed 1 Aug 2015.
  41. University of Liverpool. Electronic library: A-Z database. 2015. Accessed 9 Jun 2015.
  42. Bowers D. Medical statistics from scratch: an introduction for health professionals, 2nd ed. Chichester: John Wiley and Sons Ltd; 2008.Google Scholar
  43. Bland JM. Multiple significance tests: the Bonferroni method. BMJ. 1995;310:170.View ArticleGoogle Scholar
  44. Rasmussen N, Lee K, Bero L. Association of trial registration with the results and conclusions of published trials of new oncology drugs. Trials. 2009;10:116.View ArticleGoogle Scholar
  45. Malicki M, Marusic A. Is there a solution to publication bias? Researchers call for changes in dissemination of clinical research results. J Clin Epidemiol. 2014;67(10):1103–10.View ArticleGoogle Scholar
  46. Ohnmeiss DD. The fate of prospective spine studies registered on Spine J. 2015;15:487–91.View ArticleGoogle Scholar
  47. Scherer RW, Ugarte-Gil C, Schmucker C, Meerpohl JJ. Authors report lack of time as main reason for unpublished research presented at biomedical conferences: a systematic review. J Clin Epidemiol. 2015;68(7):803–10.View ArticleGoogle Scholar
  48. Smith HN. Comparison of arthroplasty trial publications after registration in J Arthroplasty. 2012;27(7):1283–8.View ArticleGoogle Scholar
  49. Vawdrey DK, Hripcsak G. Publication bias in clinical trials of electronic health records. J Biomed Inform. 2013;46:139–41.View ArticleGoogle Scholar
  50. Hopewell S, Clarke MJ, Stewart L, Tierney J. Time to publication for results of clinical trials. Cochrane Database Syst Rev. 2007; Issue 2. Art. No.: MR000011.
  51. Schmucker C, Schell LK, Portalupi S, Oeller P, Cabrera L, Bassler D, Schwarzer G, Scherer RW, Antes G, von Elm E, Meerpohl JJ. Extent of non-publication in cohorts of studies approved by research ethics committees or included in trial registries. PLoS One. 2014;9(12):e114023.View ArticleGoogle Scholar
  52. Manzoli L. Non-publication and delayed publication of randomized trials on vaccines: survey. BMJ. 2014;348:g3058.View ArticleGoogle Scholar
  53. Williams RJ, Tse T, Harlan WR, Zarin DA. Registration of observational studies: Is it time. CMAJ. 2010;182(15):1638–42.View ArticleGoogle Scholar
  54. Williams RJ, Tse T, DiPiazza K, Zarin DA. Terminated trials in the results database: evaluation of availability of primary outcome data and reasons for termination. PLos One. 2015;10(5):e0127242.
  55. Herrmann KH, Wolff R, Scheibler F, Waffenschmidt S, Hemkens LG, Sauerland S, Antes G. All nations depend on the global knowledge pool—analysis of country of origin of studies used for health technology assessments in Germany. PLoS One. 2013;8(3):e59213.View ArticleGoogle Scholar
  56. Korevaar DA, Ochodo EA, Bossuyt PM, Hooft L. Publication and reporting of test accuracy studies registered in Clin Chem. 2014;60(4):651–9.View ArticleGoogle Scholar
  57. Perneger TV. What’s wrong with Bonferroni adjustments? BMJ. 1998;316(7139):1236–8.View ArticleGoogle Scholar
  58. US National Institute of Health. protocol data element definitions (DRAFT). 2014. Accessed 26 Oct 2015.
  59. Zarin DA. Overview of webinar series. 2011. Accessed 30 Jul 2015.
  60. Meerpohl JJ, Schell LK, Bassler D, Gallus S, Kleijnen J, Kulig M, La Vecchia C, Marusic A, Ravaud P, Reis A, Schmucker C, Strech D, Urrutia G, Wager E, Antes G. Evidence-informed recommendations to reduce dissemination bias in clinical research: conclusions from the OPEN (Overcome failure to Publish nEgative fiNdings) project based on an international consensus meeting. BMJ Open. 2015;5(5):e006666.View ArticleGoogle Scholar
  61. Zarin DA, Tse T. Trust but verify: trial registration and determining fidelity to the protocol. Ann Intern Med. 2013;159(1):65–7.View ArticleGoogle Scholar


© The Author(s) 2016