PCA is the general name for a technique that uses sophisticated underlying mathematical principles to transform a number of possibly correlated variables into a smaller number of variables called principal components63–65. The process for the analysis is as follows:
I) Select sample parameters. Normalization seeks to obtain comparable scales, which allow for attribute comparisons. The dimensionality reduction approach involves minimizing the squared errors via a vector coordinate transformation, and the measurement data are defined based on the following equation:
where n is the measured value of the sample number (i.e., yield of tomato fruit, WUE, PFP and fruit quality in this study), and p is the variable number.
II) Sample parameters are converted to standardized values. It is convenient to standardize the sample with the following equation:
where , , and n is the measured value of the sample number.
III) The correlation matrix is calculated for the different irrigation and fertilization treatments and is defined based on the following equation:
where r ij is the correlation coefficient of the original variable, r ij = r ji, and r ij is given by the following equation:
IV) The eigenvalues of the R values and the eigenvectors for each sample number are calculated. A Jacobi iteration is used to determine the eigenvalues, as defined in the following equation:
where λ is the eigenvalue, E is the identity matrix and R is the correlation matrix. Next, these eigenvalues are sized down as λ 1 ≥ λ 2 ≥ … ≥λ p ≥ 0, and the respective eigenvector e i (i = 1, 2, ……) solved for:
where e ij is the j-th component of e i.
V) The contribution rate (C r) and accumulative contribution rate (AC r), with eigenvalues, are calculated using the following equations:
From the calculation results, the principal components corresponding to the characteristic value were greater than 1. The sample number of the principal components was selected as t; then, the factor of the former t was used as the corresponding data object associated with the component matrix S 1, S 2, ···, S t.
VI) The mathematical model is established based on the PCA, as defined in the following equation:
where S 1i, S 2i, …, S ti (i = 1, 2, …, t) are the eigenvectors corresponding to the principal components, and X 1, X 2, …, X p are the standardized values, the value of which is converted based on the sample parameters.
VII) The evaluation process is determined based on the comprehensive evaluation index (F). The F value is defined by the following equation:
where λ 1, λ 2 … λ t are the characteristic values corresponding to the principal components, and Q 1, Q 2, …, Q t are the evaluation values of the different irrigation and fertilization treatments. A higher comprehensive evaluation index indicates a better treatment.
Do you have any questions about this protocol?
Post your question to gather feedback from the community. We will also invite the authors of this article to respond.