Given a collection of points in two, three, or higher dimensional space, a "best fitting" line can be defined as one that minimizes the average squared distance from a point to the line. The next best-fitting line can be similarly chosen from directions perpendicular to the first. Repeating this process yields an orthogonal basis in which different individual dimensions of the data are uncorrelated. These basis vectors are called principal components, and several related procedures principal component analysis (PCA).
PCA is mostly used as a tool in exploratory data analysis and for making predictive models. It is often used to visualize genetic distance and relatedness between populations. PCA is either done by singular value decomposition of a design matrix or by doing the following 2 steps:

calculating the data covariance (or correlation) matrix of the original data
performing eigenvalue decomposition on the covariance matrixUsually the original data is normalized before performing the PCA. The normalization of each attribute consists of mean centering – subtracting its variable's measured mean from each data value so that its empirical mean (average) is zero. Some fields, in addition to normalizing the mean, do so for each variable's variance (to make it equal to 1); see z-scores. The results of a PCA are usually discussed in terms of component scores, sometimes called factor scores (the transformed variable values corresponding to a particular data point), and loadings (the weight by which each standardized original variable should be multiplied to get the component score). If component scores are standardized to unit variance, loadings must contain the data variance in them (and that is the magnitude of eigenvalues). If component scores are not standardized (therefore they contain the data variance) then loadings must be unit-scaled, ("normalized") and these weights are called eigenvectors; they are the cosines of orthogonal rotation of variables into principal components or back.
PCA is the simplest of the true eigenvector-based multivariate analyses. Often, its operation can be thought of as revealing the internal structure of the data in a way that best explains the variance in the data. If a multivariate dataset is visualised as a set of coordinates in a high-dimensional data space (1 axis per variable), PCA can supply the user with a lower-dimensional picture, a projection of this object when viewed from its most informative viewpoint. This is done by using only the first few principal components so that the dimensionality of the transformed data is reduced.
PCA is closely related to factor analysis. Factor analysis typically incorporates more domain specific assumptions about the underlying structure and solves eigenvectors of a slightly different matrix.
PCA is also related to canonical correlation analysis (CCA). CCA defines coordinate systems that optimally describe the cross-covariance between two datasets while PCA defines a new orthogonal coordinate system that optimally describes variance in a single dataset.Robust and L1-norm-based variants of standard PCA have also been proposed.

View More On Wikipedia.org
  1. Y

    Some people dont understand that electronics component can die

    Some people dont understand that electronics component can die. Yes, they do go dead without apparent reasons.If i tell you your galaxy Note,Tab eMMC is dead just accept the fact,cos we really understand the core of cell phone engineering that we base on.Just give us time to start rebelling...
  2. J

    How to Identify Component Symbols on Nokia Schematic Diagram?

    Identifying with Symbols on Schematic Diagram is very easy task, it's just like reading ABC's on English alphabet. Since we are talking about mobile phone's circuit here, we are going to tackle only on its symbols being used herein, unlike in some major electronics components which...
  3. M

    Apple Iphone 3G,3GS,iPad Parts Datasheets (Component,Schematics,Diagram and Info)

    Apple Iphone 3G and 3GS IC Parts Datasheets(Component Schematic Diagram and Info) For Hardware Pro's and Advance troubleshooter's Guide 3G 3GS SST SST25VF080B 1 MB SERIAL FLASH http://www.alldatasheet.com/datashee...T25VF080B.html Samsung ARM-based CPU (markings: 339S0036 ARM...
Back
Top