Bayesian PCA (BPCA) is a further variant of PCA in that it imposes a prior and encodes
a basis selection mechanism. Even though the model is fully Bayesian, do.bpca
faithfully follows the original paper by Bishop in that it returns only the posterior
mode as an estimate, in conjunction with an ARD-motivated prior and a noise variance
that is itself estimated. Unlike PPCA, it uses the full set of basis vectors and
returns a relative weight for each of them: the smaller the \(\alpha\) value, the more
likely the corresponding column vector of mp.W is to be selected as a potential basis.
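As a small illustration of the ARD interpretation above, the following sketch fits BPCA on the iris data and sorts the relative weights; the element name mp.alpha used for these weights is an assumption here, so check names() of the returned object for the exact slot in your installed version.

library(Rdimtools)
data(iris)
X   <- as.matrix(iris[, 1:4])
fit <- do.bpca(X, ndim = 2)

## smaller alpha means the corresponding column of mp.W is more relevant;
## `mp.alpha` is an assumed element name -- inspect names(fit) to confirm
if (!is.null(fit$mp.alpha)) {
  print(sort(fit$mp.alpha))
}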
do.bpca(X, ndim = 2, ...)
X: an \((n\times p)\) matrix or data frame whose rows are observations and columns represent independent variables.
ndim: an integer-valued target dimension.
...: extra parameters (see the sketch after this list), including
the maximum number of iterations (default: 100), and
the relative tolerance stopping criterion (default: 1e-4).
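A hedged sketch of passing these controls through the ... argument; the parameter names maxiter and reltol are assumptions based on common conventions in the package, so verify them against ?do.bpca before relying on them.

library(Rdimtools)
X <- as.matrix(iris[, 1:4])

## defaults: at most 100 EM iterations, relative tolerance 1e-4
fit_default <- do.bpca(X, ndim = 2)

## assumed names `maxiter` and `reltol` for the extra parameters
fit_strict  <- do.bpca(X, ndim = 2, maxiter = 500, reltol = 1e-6)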
a named Rdimtools S3 object containing the following elements (see the sketch after this list):
an \((n\times ndim)\) matrix whose rows are embedded observations.
a \((p\times ndim)\) matrix whose columns are basis vectors for projection.
the number of iterations taken for the EM algorithm to converge.
the estimated \(\sigma^2\) value from the EM algorithm.
a length-\((ndim-1)\) vector of relative weights for each column (base) of mp.W.
an \((ndim\times (ndim-1))\) matrix from the EM update.
the name of the algorithm.
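A brief sketch of inspecting the returned object; the Y element is used in the example at the end of this page, while the slot name projection is an assumption here and should be confirmed via names().

library(Rdimtools)
X   <- as.matrix(iris[, 1:4])
fit <- do.bpca(X, ndim = 2)

names(fit)             # list every returned element
dim(fit$Y)             # 150 x 2 matrix of embedded observations

## `projection` is an assumed slot name -- confirm it appears in names(fit)
if ("projection" %in% names(fit)) {
  print(dim(fit$projection))   # expected 4 x 2 basis for projection
}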
Bishop C (1999). “Bayesian PCA.” In Advances in Neural Information Processing Systems, volume 11, 382--388.
if (FALSE) {
## use iris dataset
data(iris)
set.seed(100)
subid = sample(1:150,50)
X = as.matrix(iris[subid,1:4])
lab = as.factor(iris[subid,5])
## compare BPCA with others
out1 <- do.bpca(X, ndim=2)
out2 <- do.pca(X, ndim=2)
out3 <- do.lda(X, lab, ndim=2)
## visualize
opar <- par(no.readonly=TRUE)
par(mfrow=c(1,3))
plot(out1$Y, col=lab, pch=19, cex=0.8, main="Bayesian PCA")
plot(out2$Y, col=lab, pch=19, cex=0.8, main="PCA")
plot(out3$Y, col=lab, pch=19, cex=0.8, main="LDA")
par(opar)
}