Cutoff Finder: A Comprehensive and Straightforward Web Application Enabling Rapid Biomarker Cutoff Optimization

Jan Budczies; Frederick Klauschen; Bruno V. Sinn; Balázs Győrffy; Wolfgang D. Schmitt; Silvia Darb-Esfahani; Carsten Denkert

doi:10.1371/journal.pone.0051862

Abstract

Gene or protein expression data are usually represented by metric or at least ordinal variables. In order to translate a continuous variable into a clinical decision, it is necessary to determine a cutoff point and to stratify patients into two groups each requiring a different kind of treatment. Currently, there is no standard method or standard software for biomarker cutoff determination. Therefore, we developed Cutoff Finder, a bundle of optimization and visualization methods for cutoff determination that is accessible online. While one of the methods for cutoff optimization is based solely on the distribution of the marker under investigation, other methods optimize the correlation of the dichotomization with respect to an outcome or survival variable. We illustrate the functionality of Cutoff Finder by the analysis of the gene expression of estrogen receptor (ER) and progesterone receptor (PgR) in breast cancer tissues. This distribution of these important markers is analyzed and correlated with immunohistologically determined ER status and distant metastasis free survival. Cutoff Finder is expected to fill a relevant gap in the available biometric software repertoire and will enable faster optimization of new diagnostic biomarkers. The tool can be accessed at http://molpath.charite.de/cutoff.

Citation: Budczies J, Klauschen F, Sinn BV, Győrffy B, Schmitt WD, Darb-Esfahani S, et al. (2012) Cutoff Finder: A Comprehensive and Straightforward Web Application Enabling Rapid Biomarker Cutoff Optimization. PLoS ONE 7(12): e51862. https://doi.org/10.1371/journal.pone.0051862

Editor: Paul van Diest, University Medical Centre Utrecht, The Netherlands

Received: May 8, 2012; Accepted: November 12, 2012; Published: December 14, 2012

Copyright: © 2012 Budczies et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This work was funded by the European Commission within FP7, grants #200327 (METAcancer) and #257669 (ARROWS), and by OTKA, (Orszagos Tudomanyos Kutatasi Alap Hungarian National Scientific Research Fund) PD 83154. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The use of diagnostic, prognostic and predictive biomarkers is of increasing importance in many areas of medicine. Many markers are measured in laboratory assay as continuous variables or as ordinal variables. Gene expression is usually measured on a metric scale, using qRT-PCR or hybridization based methods, for example using single gene assays such as TaqMan or on a global scale using DNA microarrays. In tissue-based diagnostics, protein expression is usually evaluated by immunohistochemistry (IHC) and quantified on an ordinal scale using the percentage of stained cells, staining intensity or combinations of these. Examples of such combinations include the Allred score that takes values between 0 and 8 [1], [2] and the immunoreactive score (IRS) that takes values between 0 and 12 [3], [4].

In order to translate a continuous or ordinal diagnostic variable into a clinical decision, it is necessary to determine a cutoff point and to stratify patients into distinct groups each requiring a different kind of treatment [5]. Although the mean or median value of the diagnostic factor was used before, it is often desirable to determine cutoff points based on the distribution of the variable or by optimizing the correlation with clinical outcome or response to a treatment. Methods for cutoff optimization include minimization of p-values or maximization of combinations of test sensitivity and test specificity [6]. X-Tile is a freely available tool that is specialized to the analysis of survival data and uses a minimal p-value approach for cutoff optimization [7]. Previously, like many others, we manually investigated and optimized cutoff points of molecular markers [8]–[10]. However, methods for cut-off determination vary among published studies and the underlying algorithms remain obscure in many instances.

The literature is filled with newly identified biomarkers, but only a small minority has found acceptance in patient care so far. This is in part due to overestimation of the true effects in the first study and a low reproducibility in subsequent validation studies. Cutoff optimization was demonstrated to contribute to overestimation of results when multiple cutoff points are investigated and the problem of multiple hypothesis testing is ignored [11], [12]. Conflicting the need of a higher degree of objectivity [13], cut-off point determination is often done in a non-systematic manner and therefore among the causes for a poor reproducibility of biomarker studies.

The reason for this is the lack of comprehensive and easy-to-use tools for cutoff determination. Therefore, we developed Cutoff Finder, a bundle of optimization and visualization methods that are accessible via the internet (http://molpath.charite.de/cutoff). The purpose of the web application is twofold: (i) As not a single methods can be considered to be optimal for all kind of data and all clinical situations, we implemented a multitude of five methods for cutoff determination. (ii) To avoid overestimation of the true effect, the robustness when varying the cutoff point can be investigated in overview plots showing effect sizes with confidence intervals. The workflow through the web application is shown in Fig. 1.

Download:

Figure 1. Workflow of Cutoff Finder.

The track below the icons refers to the places where the steps of data processing are done. These are the steps of data procession: 1. Data are uploaded from a tab-separated file or imported from one of three example data sets. 2. The user selects the biomarker and optionally outcome and survival variables from the table columns. 3. The user selects the method for cutoff determination. 4. The user chooses the set of plots to be generated. 5. The optimal cutoff point is determined and analysis plots are generated using R as statistical engine. 6. Cutoff point and plots are shown at the results webpage.

https://doi.org/10.1371/journal.pone.0051862.g001

Methods

Implementation and Availability

Cutoff Finder is a freely available web application that can be accessed using an arbitrary web browser (http://molpath.charite.de/cutoff). The web application is implemented as Java Server Pages (JSPs) that connect to R using the R package and TCP/IP server Rserve [14]. The R statistical language [15] serves as an engine for all statistical computing and visualization. The R code for cutoff optimization and the plot functions can be found in Text S1. The R file is loaded as static source to the R server. For each cutoff analysis, the web application calls the wrapper R function get.cutoff() that serves as controller. The controller calls specialized R functions for each kind of cutoff optimization and for each kind of plot. Results of the cutoff optimization and plots are returned to the web server. Using the R code, results and plots of the web page can be reproduced 100%.

Download:

Figure 2. Distribution based cutoff optimization (independent of outcome and survival data) in the GSE2034 breast cancer data.

Histograms of ER (A) and PgR (B) gene expression in 286 lymph-node negative breast cancers. A mixture model of two Gaussian distributions is fitted to each of the histograms (red lines). Vertical lines designate the optimal cutoffs derived from the mixture model.

https://doi.org/10.1371/journal.pone.0051862.g002

Analysis Workflow

The steps of data procession are shown in Figure 1:

The data are uploaded from a tab-separated table, rows representing patients and columns representing variables. The maximum number of rows is 5000, the maximum number of columns is 50.
The user selects the biomarker and optionally outcome and survival variables from the table columns.
The user selects the method for cutoff determination.
The user chooses the set of plots to be generated.
The inquiry is send to the statistical engine R. The optimal cutoff point is determined and analysis plots are generated.
The user is directed to the results webpage, where the optimal cutoff point and plots are shown.

Download:

Figure 3. Cutoff optimization by correlation with a binary variable or survival in the GSE2034 breast cancer data.

(A) For each possible cutoff, ESR1 gene expression is correlated with the immunohistologically determined ER status. The odds ratio (OR) including 95% CI is plotted in dependence of the cutoff. A vertical line designates the dichotomization showing the most significant correlation with immunohistologically determined ER status. (B) For each possible cutoff, PgR gene expression was correlated with distance metastasis free survival. The hazard ratio (HR) including 95% CI is plotted in dependence of the cutoff. A vertical line designates the dichotomization showing the most significant correlation with survival. The distribution of the gene expression values in the 286 tumors is shown as a rug plot at the bottom of the figures.

https://doi.org/10.1371/journal.pone.0051862.g003

Cutoff Determination

A multitude of five methods is offered for cutoff optimization. The first method is based solely on the distribution of the biomarker in the sample cohort. Methods 2–4 deal with optimization of the correlation with a binary variable, for example response to treatment. The last method is devoted to optimization of the correlation with survival.

Fit of mixture model: A mixture model of two Gaussian distributions is fitted to the histogram of the biomarker. This procedure is implemented using the function flexmix from the R package flexmix [16]. The optimal cutoff is determined as the value where the probability density functions of the mixing distribution coincide.
Significance of correlation with binary variable: This method correlates the dichotomized biomarker with a binary outcome variable using logistic regression. Logistic regression is executed using the function glm from R package stats [15]. The optimal cutoff is defined as the point with the most significant (Fisher’s exact test) split. Odds ratios (ORs) as well as sensitivity and specificity including 95% confidence intervals are calculated. Confidence intervals for proportions are estimated using Wilson’s method as it is implemented in the R package binom [17].
Based on ROC curve: Two methods determine the cutoff point by minimizing the distance on the ROC curve to the left top edge of the diagram. The first method minimizes the Euclidean distance between these points. The second method minimizes the Manhattan distance between the points. Here, the sum of sensitivity and specificity is maximized, equivalent to maximization of Youden’s statistics J = sensitivity+specificity –1 [18].
Minimum sensitivity or specificity: For each or these two methods, the user enters a percentage value. The cutoff point is chosen as the first threshold for the biomarker, where the sensitivity (or specificity) exceeds this predefined value.
Significance of correlation with survival variable: This method fits Cox proportional hazard models to the dichotomized variable and the survival variable. Survival analysis is executed using the functions coxph and survfit from the R package survival [19]. The optimal cutoff is defined as the point with the most significant (log-rank test) split. Hazard ratios (HRs) including 95% confidence intervals are calculated.

Download:

Figure 4. Plot of the differences in survival time.

The mean survival time is estimated in the samples where the PgR is highly expressed and in the samples where the PgR is lowly expressed. The difference of the mean survival times including is plotted. The distribution of PgR expression is shown as rug plot at the bottom of the figure.

https://doi.org/10.1371/journal.pone.0051862.g004

Additionally, the user can manually enter a cutoff point that is used for calculations and visualization.

Download:

Figure 5. Detailed analysis of the optimal dichotomization of the GSE2034 breast cancer data.

(A) Comparison of gene expression based and immunohistochemical determination of estrogen receptor status. The classification using ESR1 expression and the optimal cutoff taken from Fig. 2A is compared to the IHC result. (B) Kaplan Meier analysis of PgR expression using the optimal cutoff taken from Fig. 2B. Distant metastasis free survival (dmfs) was significantly longer for patients with PgR expression above the cutoff.

https://doi.org/10.1371/journal.pone.0051862.g005

Visualization

Two different kinds of plots can be generated: overview plots and plots at the cutoff point. Overview plots give a summary of all possible cutoff points with the optimal cutoff marked by a vertical line. The second kinds of plots are Waterfall and Kaplan-Meier plots that are generated for a fixed cutoff point. The overview plots include plots of ORs, HRs and differences in survival. ORs are calculated using the function glm form the R package stats [15]. HRs are calculated using the function coxph from the R package survival [19]. Differences in survival are calculated from the mean survival times in the good prognosis and the poor prognosis group. Mean survival times are estimated from the area under the Kaplan-Meier curve using the maximum time that occurs in the data as uniform time endpoint.

Download:

Table 1. Cutoff point optimization for estrogene receptor (ER) expression (reporter 205225_at).

https://doi.org/10.1371/journal.pone.0051862.t001

Example Data Sets

To illustrate the functionality of Cutoff Finder we analyzed gene expression data from breast cancer tissues. To this end, the gene expression series GSE2034, GSE7390 and GSE11121 were downloaded from the Gene Expression Omnibus (GEO) repository [20], [21]. We analyzed measurements of probe 205225_at for estrogen receptor (ESR1) and of 208305_at for progesterone receptor (PgR). The gene expression data are analyzed on log scale, such that a difference of can be interpreted as fold change of 2.

Download:

Table 2. Cutoff point optimization for progesterone receptor (PgR) expression (reporter 208305_at).

https://doi.org/10.1371/journal.pone.0051862.t002

Results

We illustrate the functionality of Cutoff Finder by the analysis of the gene expression of estrogen receptor (ER) and progesterone receptor (PgR) in breast cancer tissues. In routine diagnostic protocols, ER and PgR status are determined based on the percentage of positive cells in immunohistochemically stained tissue sections. As an alternative approach, assessment of hormone receptor status using gene expression is currently under investigation in several studies [22]. Therefore, we extracted expression data of ER and PgR from the publicly available microarray data sets GSE2034 (286 patients), GSE7390 (198 patients) and GSE11121 (200 patients). Cutoff points were defined for ER and PgR and the correlation of the resulting split with immunohistochemically determined ER status and distance metastasis free survival was analyzed. All the figures presented in the following were automatically generated by the web application.

Download:

Table 3. Development of a test for ER status with at least 90% sensitivity and 90% specificity.

https://doi.org/10.1371/journal.pone.0051862.t003

Fit of a Mixture Model

This method for cutoff determination is convenient for a molecular variable with bimodal shaped distribution. The cutoff point is optimized based on the hypothesis that the variable is distributed according to a mixture of two Gaussian distributions. Using this method, the histograms ER and PgR expression were generated and optimal cutoff points were determined (Fig. 2). Both distributions had a pronounced bimodal shape with cutoffs points located at 10.6 (ER) and at 5.0 (PgR).

Optimizing the Correlation with Outcome

One of the most straightforward methods for cutoff optimization concerning a binary outcome variable is to maximize the significance of the 2×2 table. Other methods are discussed in a paragraph below. As example, we searched for a cutoff point for ER expression that optimally classifies with respect to ER status. In this analysis, ER status is determined by immunohistochemistry (IHC), the gold standard for investigation of ER and PgR. Fig. 3A shows the correlation of gene expression and ER status according to the designated cutoff point (Fig. 3A). The correlation was significant for almost all (99.6%) of the investigated cutoff points. The optimal cutoff was determined as 10.1 with odds ratio OR = 67.8 (30.2–152.1).

Optimizing the Correlation with Survival

The straightforward method for determination of a prognostic cutoff point is to optimize the significance of the split in the Kaplan-Meier plot. As an example, we analyzed the correlation of gene expression determined PgR status and distance metastasis-free survival according to the designated cutoff point (Fig. 3B). The correlation was significant over a large range (70%) of the investigated cutoff points. The optimal cutoff was determined as 2.5 with hazard ratio HR = 0.46 (0.30–0.71). Additionally, a plot showing the difference in survival times between the tumors of high PgR expression and low PgR expression can be generated (Fig. 4). These results are in accordance with the strong prognostic relevance of PgR status for distance metastasis-free und overall survival in breast cancer [23], [24].

Plots at Cutoff Point

Next, we investigated the dichotomization induced by the IHC-data-based and the survival-data-based cutoff determination in more detail. Using ERS1 expression measured by the microarray, determination of ER status was feasible with a sensitivity of 85.7% and a specificity of 91.9% (Fig. 5A). Kaplan Meier analysis showed a significantly better outcome for patients with high PgR expression (Fig. 5B, p = 0.00028).

Comparison of Different Methods for Cutoff Optimization

For cutoff determination in the ER expression data, we compared the mixture model approach and three approaches based on comparison with immunohistologically determined ER status (Tab. 1). The latter approaches included optimization of the significance and of the distance of a point on the ROC curve from the upper left edge of the ROC diagram. One of the ROC based methods minimizes the Euclidean distance to this point, while the other minimizes the Manhattan distance to this point. The cutoff points determined by the significance method and by the Euclidean distance method were exactly the same for all data sets, while the Manhattan distance method agreed with these for one of the data sets and derived by only 0.3 for the other data set. The mixture model method showed a larger deviation from these results and between the three data sets. However, all cutoffs were between 9.6 and 11.2 resulting sensitivities greater than 88% and specificities greater than 81%.

For cutoff determination in the PgR expression data, we compared the mixture model approach and the approach based on the significance of the split in the Kaplan-Meier plot. The distribution based cutoffs for PgR expression were between 4.2 and 5.6 (Tab. 2). Interestingly, survival-data based cutoff determinate led to a reduction of the cutoff value in some data sets (GSE2034 and GSE11121) and to an increase of the cutoff value in another data set (GSE7390).

Use Case: Development of a Molecular Test with Sensitivity and Specificity Greater than 90%

Finally, we discuss the development of a test for ER status from ER gene expression measured by the microarray probe 205552_at (Tab. 3). Using immunohistochemistry as gold standard and GSE2034 as training set, we executed two runs of Cutoff Finder to determine cutoff points with sensitivity >90% and with specificity >90%. Based on that, we defined a test result as positive, if it was above the upper threshold (11.49) and as negative, if it was below the lower threshold (10.22). The interval between the lower threshold and the upper threshold was defined as equivocal zone that contained only 6.6% of the samples in the training set. Using GSE7390 as validation set, test sensitivity was again above 90%, while test specificity was slightly below 90%, but inside the confidence interval estimated from the training set.

Discussion

Cutoff Finder is a freely available in the internet (http://molpath.charite.de/cutoff). The web application is straightforward to use: The user can upload a molecular data set that optionally includes outcome and/or survival data. Then, the user selects the variables for analysis, the method for cutoff determination and the set of plots. On a result web page, the optimal cutoff point, overview plots and plots at the optimal cutoff point are displayed. The plots can be downloaded and included in scientific publications.

The problem of the choosing the optimal cutoff is difficult to answer generally. The best method for cutoff determination may depend on the biomarker, the assay and the clinical application under investigation. We exemplified this situation by analyzing the agreement between RNA based with immunohistology based determination of estrogen receptor status using different methods. It turned out that the cutoff points determined by three optimization methods in two data sets were very similar (all between 10.1 and 10.7). We believe that this is a consequence of the high concordance between gene expression and immunohistology in this use case, while in other situations cutoffs determined by different methods can differ substantially [6]. To cope with different biomarkers and assays, Cutoff Finder offers a multitude of five methods for cutoff optimization.

Stratification into two groups, but not into three or more groups, seems to be most natural for translation into clinics where most of the decisions are binary (treatment or no treatment). However, some of the gene tests for breast cancer stratify patients into a low risk and a high group (Mammaprint [25] and Endopredict [26]), while other additionally introduce a medium risk group (Oncotype DX [27]). An interesting feature of the software X-Tile is simultaneous optimization of two cutoffs leading to the identification of high, medium and low risk populations using a minimal p-value approach. X-Tile is a bioinformatic tool for prognostic cutoff optimization that is freely available to academic users [7]. Cutoff Finder is primarily specialized to the dichotomization situation. However, we provided an example of stratification into three groups by two subsequent runs: Patients were divided into a group where the test is highly sensitive, a group where the test is highly specific and a group of the remaining patients with equivocal test results.

Many of the methods for cutoff determination are based on optimization of a target quantity in dependence of the cutoff. For example, the target quantity can be the test accuracy or the significance of correlation of the biomarker with outcome or survival. It is clear that in such situation a multiple testing problem occurs that can lead to overestimation of the significance of the optimal cutoff and the effect size at the optimal cutoff [11], [12], [28]. Correction of p-values and confidence intervals for maximally selected chi square statistics and maximal selected log rank test was investigates in the 1980s and the 1990s [29]–[32]. Cutoff Finder addresses the multiple testing problem by visualizing odds ratios or hazard rations including confidence intervals for each of the cutoffs under investigation. This kind of plots together with the proportion of significant tests can help to assess the stability and significance of the dichotomization. Further, we recommend integrating cutoff optimization in a sequential biomarker study design: In the first step of such an approach, the cutoff is optimized in one or more retrospective studies. In the second step, the predefined cutoff is validated in one or more retrospective or prospective studies. Both steps can be executed using Cutoff Finder.

Pathologists often use packages like SPSS (SPSS, Inc., an IBM Company), GraphPad Prism (GraphPad Software, Inc.) or Winstat (R.Fitch Software) in order to correlate biomarkers with outcome or survival data. However, neither of them includes functions for cutoff optimization. Therefore, cutoffs are often chosen as median or mean value of the biomarker or adjusted manually. For example, percentile based cutoffs were also implemented in our previous project where biomarkers can be assessed for their association to survival [33]. The commercial software JMP (SAS, Inc.) allows for optimizing a dichotomization for correlation with a binary outcome variable in a multivariate context. However, distribution-based cutoff optimization or cutoff optimization in context of a survival variable is not supported. The STEPP method allows graphical assessment of treatment-covariate interactions using Cox models on subsets of the data [34]–[36]. However, STEPP addresses the analysis of a clinical study in a series of overlapping patient subgroups rather than optimization of a cutoff point.

A specialty of Cutoff Finder is the plot of ORs and HRs including confidence intervals. The plots include information on both, the strength and significance of the prognostic stratification. This is in particular important in complex situations where ORs or HRs change strongly in dependence of the cutoff. For example, in a study on Ki67 protein expression in lung cancer, we observed positive HRs for low cutoffs and negative HRs for high cutoffs (data not shown).

Cutoff Finder was developed to facilitate cutoff optimization for tissue biomarkers that are investigated in pathology research. Additionally, we expect our software to be interesting for a wider group of scientists, including clinical chemists and other biomedical investigators, who want to convert a metric or ordinal variable into a dichotomization. As an example, we analyzed the expression of hormone receptors in breast cancer tissues. Cutoff point determination using different analysis modes turned out to be feasible and useful. In summary, we presented a comprehensive and straightforward tool for cutoff point optimization that can help to improve the quality of biomarker studies.

Supporting Information

Text S1.

R code of the functions that are used for cutoff optimization and the generation of plots.

https://doi.org/10.1371/journal.pone.0051862.s001

(R)

Author Contributions

Conceived and designed the experiments: JB BVS SDE CD. Analyzed the data: FK BVS BG WDS SDE. Wrote the paper: JB. Designed the software: JB.

References

1. Allred DC, Clark GM, Elledge R, Fuqua SA, Brown RW, et al. (1993) Association of p53 protein expression with tumor cell proliferation rate and clinical outcome in node-negative breast cancer. J Natl Cancer Inst 85: 200–206.
- View Article
- Google Scholar
2. Harvey JM, Clark GM, Osborne CK, Allred DC (1999) Estrogen receptor status by immunohistochemistry is superior to the ligand-binding assay for predicting response to adjuvant endocrine therapy in breast cancer. J Clin Oncol 17: 1474–1481.
- View Article
- Google Scholar
3. Remmele W, Stegner HE (1987) Recommendation for uniform definition of an immunoreactive score (IRS) for immunohistochemical estrogen receptor detection (er-ica) in breast cancer tissue. Pathologe 8: 138–140.
- View Article
- Google Scholar
4. von Minckwitz G, Müller BM, Loibl S, Budczies J, Hanusch C, et al. (2011) Cytoplasmic poly(adenosine diphosphate-ribose) polymerase expression is predictive and prognostic in patients with breast cancer treated with neoadjuvant chemotherapy. J Clin Oncol 29: 2150–2157.
- View Article
- Google Scholar
5. Mazumdar M, Glassman JR (2000) Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments. Stat Med 19: 113–132.
- View Article
- Google Scholar
6. Perkins NJ, Schisterman EF (2006) The inconsistency of “optimal” cutpoints obtained using two criteria based on the receiver operating characteristic curve. Am J Epidemiol 163: 670–675.
- View Article
- Google Scholar
7. Camp RL, Dolled-Filhart M, Rimm DL (2004) X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res 10: 7252–7259.
- View Article
- Google Scholar
8. Darb-Esfahani S, Wirtz RM, Sinn BV, Budczies J, Noske A, et al. (2009) Estrogen receptor 1 mrna is a prognostic factor in ovarian carcinoma: determination by kinetic PCR in formalin-fixed paraffin-embedded tissue. Endocr Relat Cancer 16: 1229–1239.
- View Article
- Google Scholar
9. Sinn BV, Darb-Esfahani S, Wirtz RM, Faggad A, Weichert W, et al. (2009) Vascular endothelial growth factor c mrna expression is a prognostic factor in epithelial ovarian cancer as detected by kinetic rt-pcr in formalin-fixed paraffin-embedded tissue. Virchows Arch 455: 461–467.
- View Article
- Google Scholar
10. Sinn BV, Darb-Esfahani S, Wirtz RM, Budczies J, Sehouli J, et al. (2011) Evaluation of a hormone receptor-positive ovarian carcinoma subtype with a favourable prognosis by determination of progesterone receptor and oestrogen receptor 1 mrna expression in formalin-fixed paraffin-embedded tissue. Histopathology 59: 918–927.
- View Article
- Google Scholar
11. Faraggi D, Simon R (1996) A simulation study of cross-validation for selecting an optimal cutpoint in univariate survival analysis. Stat Med 15: 2203–2213.
- View Article
- Google Scholar
12. Hilsenbeck SG, Clark GM, McGuire WL (1992) Why do so many prognostic factors fail to pan out? Breast Cancer Res Treat 22: 197–206.
- View Article
- Google Scholar
13. McShane LM, Altman DG, Sauerbrei W, Taube SE, Gion M, et al. (2005) Reporting recommendations for tumor marker prognostic studies. J Clin Oncol 23: 9067–9072.
- View Article
- Google Scholar
14. Urbanek S (2011) Rserve: binary R server. http://cran.r-project.org/package=Rserve.
15. R Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (Ed.). http://www.r-project.org.
16. Leisch F (2004) Flexmix: a general framework for finite mixture models and latent class regression in R. Journal of Statistical Software. 11(8): 1–18.
- View Article
- Google Scholar
17. Dorai-Raj S (2009) Binom: binomial confidence intervals for several parameterizations. http://carn.r-project.org/package=binom.
18. Youden WJ (1950) Index for rating diagnostic tests. Cancer 3: 32–35.
- View Article
- Google Scholar
19. Therneau T, Lumley T (2011) Survival: survival analysis, including penalised likelihood. http://cran.r-project.org/package=survival.
20. Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, et al. (2011) NCBI GEO: archive for functional genomics data sets - 10 years on. Nucleic Acids Res 39: D1005–10.
- View Article
- Google Scholar
21. Edgar R, Domrachev M, Lash AE (2002) Gene expression omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30: 207–210.
- View Article
- Google Scholar
22. Müller BM, Kronenwett R, Hennig G, Euting H, Weber K, et al. (2011) Quantitative determination of estrogen receptor, progesterone receptor, and her2 mRNA in formalin-fixed paraffin-embedded tissue – a new option for predictive biomarker assessment in breast cancer. Diagn Mol Pathol 20: 1–10.
- View Article
- Google Scholar
23. Clark GM, McGuire WL, Hubay CA, Pearson OH, Marshall JS (1983) Progesterone receptors as a prognostic factor in stage ii breast cancer. N Engl J Med 309: 1343–1347.
- View Article
- Google Scholar
24. Magdelénat H, Pouillart P, Jouve M, Palangié T, Garcia Giralt E, et al. (1982) Progesterone receptor as a more reliable prognostic parameter than estrogen receptor in patients with advanced breast cancer. Breast Cancer Res Treat 2: 195.
- View Article
- Google Scholar
25. van de Vijver MJ, He YD, van’t Veer LJ, Dai H, Hart AAM, et al. (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009.
- View Article
- Google Scholar
26. Filipits M, Rudas M, Jakesz R, Dubsky P, Fitzal F, et al. (2011) A new molecular predictor of distant recurrence in er-positive, her2-negative breast cancer adds independent information to conventional clinical risk factors. Clin Cancer Res 17: 6012–6020.
- View Article
- Google Scholar
27. Sparano JA, Paik S (2008) Development of the 21-gene assay and its application in clinical practice and clinical trials. J Clin Oncol 26: 721–728.
- View Article
- Google Scholar
28. Altman DG, Lausen B, Sauerbrei W, Schumacher M (1994) Dangers of using “optimal” cutpoints in the evaluation of prognostic factors. J Natl Cancer Inst 86: 829–835.
- View Article
- Google Scholar
29. Halpern J (1982) Maximally selected chi square statistics for small samples. Biometrics 38: 1017–1023.
- View Article
- Google Scholar
30. Koziol JA (1991) On maximally selected chi-square statistics. Biometrics 47: 1557–1561.
- View Article
- Google Scholar
31. Lausen B, Schumacher M (1992) Maximally selected rank statistics. Biometrics 48: 73–85.
- View Article
- Google Scholar
32. Miller R, Sigmund D (1982) Maximally selected chi square statistics. Biometrics 38: 1011–1016.
- View Article
- Google Scholar
33. Györffy B, Lanczky A, Eklund AC, Denkert C, Budczies J, et al. (2010) An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat 123: 725–731.
- View Article
- Google Scholar
34. Bonetti M, Gelber RD (2000) A graphical method to assess treatment-covariate interactions using the cox model on subsets of the data. Stat Med 19: 2595–2609.
- View Article
- Google Scholar
35. Bonetti M, Gelber RD (2004) Patterns of treatment effects in subsets of patients in clinical trials. Biostatistics 5: 465–481.
- View Article
- Google Scholar
36. Bonetti M, Zahrieh D, Cole BF, Gelber RD (2009) A small sample study of the stepp approach to assessing treatment-covariate interactions in survival data. Stat Med 28: 1255–1268.
- View Article
- Google Scholar

[ref1] 1. Allred DC, Clark GM, Elledge R, Fuqua SA, Brown RW, et al. (1993) Association of p53 protein expression with tumor cell proliferation rate and clinical outcome in node-negative breast cancer. J Natl Cancer Inst 85: 200–206.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Harvey JM, Clark GM, Osborne CK, Allred DC (1999) Estrogen receptor status by immunohistochemistry is superior to the ligand-binding assay for predicting response to adjuvant endocrine therapy in breast cancer. J Clin Oncol 17: 1474–1481.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Remmele W, Stegner HE (1987) Recommendation for uniform definition of an immunoreactive score (IRS) for immunohistochemical estrogen receptor detection (er-ica) in breast cancer tissue. Pathologe 8: 138–140.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. von Minckwitz G, Müller BM, Loibl S, Budczies J, Hanusch C, et al. (2011) Cytoplasmic poly(adenosine diphosphate-ribose) polymerase expression is predictive and prognostic in patients with breast cancer treated with neoadjuvant chemotherapy. J Clin Oncol 29: 2150–2157.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Mazumdar M, Glassman JR (2000) Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments. Stat Med 19: 113–132.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Perkins NJ, Schisterman EF (2006) The inconsistency of “optimal” cutpoints obtained using two criteria based on the receiver operating characteristic curve. Am J Epidemiol 163: 670–675.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Camp RL, Dolled-Filhart M, Rimm DL (2004) X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res 10: 7252–7259.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Darb-Esfahani S, Wirtz RM, Sinn BV, Budczies J, Noske A, et al. (2009) Estrogen receptor 1 mrna is a prognostic factor in ovarian carcinoma: determination by kinetic PCR in formalin-fixed paraffin-embedded tissue. Endocr Relat Cancer 16: 1229–1239.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Sinn BV, Darb-Esfahani S, Wirtz RM, Faggad A, Weichert W, et al. (2009) Vascular endothelial growth factor c mrna expression is a prognostic factor in epithelial ovarian cancer as detected by kinetic rt-pcr in formalin-fixed paraffin-embedded tissue. Virchows Arch 455: 461–467.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Sinn BV, Darb-Esfahani S, Wirtz RM, Budczies J, Sehouli J, et al. (2011) Evaluation of a hormone receptor-positive ovarian carcinoma subtype with a favourable prognosis by determination of progesterone receptor and oestrogen receptor 1 mrna expression in formalin-fixed paraffin-embedded tissue. Histopathology 59: 918–927.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref11] 11. Faraggi D, Simon R (1996) A simulation study of cross-validation for selecting an optimal cutpoint in univariate survival analysis. Stat Med 15: 2203–2213.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref12] 12. Hilsenbeck SG, Clark GM, McGuire WL (1992) Why do so many prognostic factors fail to pan out? Breast Cancer Res Treat 22: 197–206.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. McShane LM, Altman DG, Sauerbrei W, Taube SE, Gion M, et al. (2005) Reporting recommendations for tumor marker prognostic studies. J Clin Oncol 23: 9067–9072.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Urbanek S (2011) Rserve: binary R server. http://cran.r-project.org/package=Rserve.

[ref15] 15. R Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (Ed.). http://www.r-project.org.

[ref16] 16. Leisch F (2004) Flexmix: a general framework for finite mixture models and latent class regression in R. Journal of Statistical Software. 11(8): 1–18.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref17] 17. Dorai-Raj S (2009) Binom: binomial confidence intervals for several parameterizations. http://carn.r-project.org/package=binom.

[ref18] 18. Youden WJ (1950) Index for rating diagnostic tests. Cancer 3: 32–35.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref19] 19. Therneau T, Lumley T (2011) Survival: survival analysis, including penalised likelihood. http://cran.r-project.org/package=survival.

[ref20] 20. Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, et al. (2011) NCBI GEO: archive for functional genomics data sets - 10 years on. Nucleic Acids Res 39: D1005–10.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref21] 21. Edgar R, Domrachev M, Lash AE (2002) Gene expression omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30: 207–210.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref22] 22. Müller BM, Kronenwett R, Hennig G, Euting H, Weber K, et al. (2011) Quantitative determination of estrogen receptor, progesterone receptor, and her2 mRNA in formalin-fixed paraffin-embedded tissue – a new option for predictive biomarker assessment in breast cancer. Diagn Mol Pathol 20: 1–10.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref23] 23. Clark GM, McGuire WL, Hubay CA, Pearson OH, Marshall JS (1983) Progesterone receptors as a prognostic factor in stage ii breast cancer. N Engl J Med 309: 1343–1347.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref24] 24. Magdelénat H, Pouillart P, Jouve M, Palangié T, Garcia Giralt E, et al. (1982) Progesterone receptor as a more reliable prognostic parameter than estrogen receptor in patients with advanced breast cancer. Breast Cancer Res Treat 2: 195.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref25] 25. van de Vijver MJ, He YD, van’t Veer LJ, Dai H, Hart AAM, et al. (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref26] 26. Filipits M, Rudas M, Jakesz R, Dubsky P, Fitzal F, et al. (2011) A new molecular predictor of distant recurrence in er-positive, her2-negative breast cancer adds independent information to conventional clinical risk factors. Clin Cancer Res 17: 6012–6020.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref27] 27. Sparano JA, Paik S (2008) Development of the 21-gene assay and its application in clinical practice and clinical trials. J Clin Oncol 26: 721–728.
View Article
Google Scholar

[72] View Article

[73] Google Scholar

[ref28] 28. Altman DG, Lausen B, Sauerbrei W, Schumacher M (1994) Dangers of using “optimal” cutpoints in the evaluation of prognostic factors. J Natl Cancer Inst 86: 829–835.
View Article
Google Scholar

[75] View Article

[76] Google Scholar

[ref29] 29. Halpern J (1982) Maximally selected chi square statistics for small samples. Biometrics 38: 1017–1023.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref30] 30. Koziol JA (1991) On maximally selected chi-square statistics. Biometrics 47: 1557–1561.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref31] 31. Lausen B, Schumacher M (1992) Maximally selected rank statistics. Biometrics 48: 73–85.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref32] 32. Miller R, Sigmund D (1982) Maximally selected chi square statistics. Biometrics 38: 1011–1016.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref33] 33. Györffy B, Lanczky A, Eklund AC, Denkert C, Budczies J, et al. (2010) An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat 123: 725–731.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref34] 34. Bonetti M, Gelber RD (2000) A graphical method to assess treatment-covariate interactions using the cox model on subsets of the data. Stat Med 19: 2595–2609.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref35] 35. Bonetti M, Gelber RD (2004) Patterns of treatment effects in subsets of patients in clinical trials. Biostatistics 5: 465–481.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref36] 36. Bonetti M, Zahrieh D, Cole BF, Gelber RD (2009) A small sample study of the stepp approach to assessing treatment-covariate interactions in survival data. Stat Med 28: 1255–1268.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

Figures

Abstract

Introduction

Methods

Implementation and Availability

Analysis Workflow

Cutoff Determination

Visualization

Example Data Sets

Results

Fit of a Mixture Model

Optimizing the Correlation with Outcome

Optimizing the Correlation with Survival

Plots at Cutoff Point

Comparison of Different Methods for Cutoff Optimization

Use Case: Development of a Molecular Test with Sensitivity and Specificity Greater than 90%

Discussion

Supporting Information

Text S1.

Author Contributions

References