Fast Iterative Kernel Principal Component Analysis

S. Günter, N. N. Schraudolph, and S. Vishwanathan. Fast Iterative Kernel Principal Component Analysis. Journal of Machine Learning Research, 8:1893–1918, 2007.

Download


2.0MB	424.5kB	2.7MB

Abstract

We develop gain adaptation methods that improve convergence of the kernel Hebbian algorithm (KHA) for iterative kernel PCA (Kim et al., 2005). KHA has a scalar gain parameter which is either held constant or decreased according to a predetermined annealing schedule, leading to slow convergence. We accelerate it by incorporating the reciprocal of the current estimated eigenvalues as part of a gain vector. An additional normalization term then allows us to eliminate a tuning parameter in the annealing schedule. Finally we derive and apply stochastic meta-descent (SMD) gain vector adaptation (Schraudolph, 1999, 2002) in reproducing kernel Hilbert space to further speed up convergence. Experimental results on kernel PCA and spectral clustering of USPS digits, motion capture, image denoising, and image super-resolution tasks confirm that our methods converge substantially faster than conventional KHA. To demonstrate scalability, we perform kernel PCA on the entire MNIST data set.

BibTeX Entry

@article{GueSchVis07,
     author = {Simon G\"unter and Nicol N. Schraudolph and
               S.~V.~N. Vishwanathan},
      title = {\href{http://nic.schraudolph.org/pubs/GueSchVis07.pdf}{
               Fast Iterative Kernel Principal Component Analysis}},
      pages = {1893--1918},
    journal =  jmlr,
     volume =  8,
       year =  2007,
   b2h_type = {Journal Papers},
  b2h_topic = {>Stochastic Meta-Descent, Kernel Methods, Unsupervised Learning},
   abstract = {
    We develop gain adaptation methods that improve convergence of
    the kernel Hebbian algorithm (KHA) for iterative kernel PCA
    (Kim et al., 2005). KHA has a scalar gain parameter which is
    either held constant or decreased according to a predetermined
    annealing schedule, leading to slow convergence. We accelerate
    it by incorporating the reciprocal of the current estimated
    eigenvalues as part of a gain vector. An additional normalization
    term then allows us to eliminate a tuning parameter in the
    annealing schedule. Finally we derive and apply stochastic
    meta-descent (SMD) gain vector adaptation (Schraudolph, 1999,
    2002) in reproducing kernel Hilbert space to further speed up
    convergence. Experimental results on kernel PCA and spectral
    clustering of USPS digits, motion capture, image denoising, and
    image super-resolution tasks confirm that our methods converge
    substantially faster than conventional KHA. To demonstrate
    scalability, we perform kernel PCA on the entire MNIST data
    set.
}}