Leave-one-patient-out cross-validation (LopoCV) and quantification of predictive uncertainty

Leland S. Hu, Lujia Wang, Andrea Hawkins-Daarud, Jennifer M. Eschbacher, Kyle W. Singleton, Pamela R. Jackson, Kamala Clark-Swanson, Christopher P. Sereduk, Sen Peng, Panwen Wang, Junwen Wang, Leslie C. Baxter, Kris A. Smith, Gina L. Mazza, Ashley M. Stokes, Bernard R. Bendok, Richard S. Zimmerman, Chandan Krishna, Alyx B. Porter, Maciej M. Mrugala, Joseph M. Hoxworth, Teresa Wu, Nhan L. Tran, Kristin R. Swanson, Jing Li

To determine the predictive accuracy of each GP model (without vs. with Transductive Learning), we employed LopoCV. In this scheme, one patient at a time (and all of that patient's biopsy samples) served as the test case, while the remaining patients (and their biopsy data) served to train the model. Training consisted of fitting a GP regression to the entire training set; the trained GP regression model was then used to predict all samples from the test patient. For each biopsy sample of the test patient, the GP model output a predictive distribution comprising a predictive mean and a predictive variance. We used the predictive mean as the point estimator of the CNV on the transformed scale and used it to classify each biopsy sample as either EGFR amplified (CNV > 3.5) or EGFR non-amplified (CNV ≤ 3.5). This process was iterated until every patient had served as the test case (see the first code sketch below). Note that LopoCV in theory provides greater rigor than k-fold cross-validation or leave-one-out cross-validation (LOOCV), which leaves out only a single biopsy sample as the test case. LopoCV also more closely simulates clinical practice, in which the model is applied on a per-patient basis rather than on a per-sample basis.

In addition to the predictive mean, each GP model also outputs a predictive variance for each sample, which allows quantification of predictive uncertainty. Specifically, for each prediction on each biopsy, we tested the hypothesis that the sample belongs to the class predicted by the mean (H1) versus not (H0) using a standard one-sided z test (see the second code sketch below). The p value of this test reflects the certainty of the prediction, such that smaller p values correspond to lower predictive uncertainty (i.e., greater certainty) for each sample classified by the model. We regarded predictions in the lowest range of p values (p < 0.05) as having the lowest predictive uncertainty. We also evaluated incremental p-value ranges (e.g., < 0.10, < 0.15, etc.) as gradations of progressively increasing predictive uncertainty.
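The following is a minimal sketch of the LopoCV loop described above, assuming biopsy-level features X, EGFR CNV values y on the transformed scale, and a per-sample patient identifier. It uses scikit-learn's GaussianProcessRegressor with an RBF-plus-noise kernel as a generic stand-in for the study's GP models (it does not implement Transductive Learning); the kernel choice, hyperparameters, and variable names are illustrative assumptions rather than the authors' exact implementation.

```python
# Illustrative sketch only: a generic GP regression stands in for the study's
# GP models; kernel, hyperparameters, and variable names are assumptions.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

CNV_THRESHOLD = 3.5  # EGFR amplified if predicted CNV > 3.5 (transformed scale)

def lopo_cv(X, y, patient_ids):
    """Leave-one-patient-out CV: each fold holds out every biopsy of one patient."""
    predictions = {}
    for patient in np.unique(patient_ids):
        test = patient_ids == patient                       # all biopsies from the test patient
        gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
        gp.fit(X[~test], y[~test])                          # train on the remaining patients
        mu, sigma = gp.predict(X[test], return_std=True)    # predictive mean and std per biopsy
        predictions[patient] = {
            "mean": mu,
            "std": sigma,
            "egfr_amplified": mu > CNV_THRESHOLD,           # point classification from the mean
        }
    return predictions
```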
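The one-sided z test for predictive uncertainty can be computed directly from each sample's predictive mean and standard deviation. In this hedged sketch, the p value is taken as the predictive probability that the true CNV falls on the opposite side of the 3.5 threshold from the predicted class, which is one natural formulation of the test described above; the function name and the usage lines are illustrative.

```python
# Hedged sketch: p value = predictive probability mass on the "wrong" side of
# the amplification threshold, so smaller p means lower predictive uncertainty.
import numpy as np
from scipy.stats import norm

def prediction_p_values(mu, sigma, threshold=3.5):
    """One-sided z test per biopsy: H1 = sample belongs to the class predicted
    by the mean; H0 = it does not (true CNV lies across the threshold)."""
    z = np.abs(np.asarray(mu) - threshold) / np.asarray(sigma)
    return 1.0 - norm.cdf(z)

# Illustrative use with the output of lopo_cv() above: flag the most certain
# predictions at p < 0.05, then examine coarser gradations (p < 0.10, p < 0.15).
# p = prediction_p_values(results[some_patient]["mean"], results[some_patient]["std"])
# most_certain = p < 0.05
```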
