C-NLPCA: extracting non-linear principal components of image datasets
Abstract:
In this paper we apply a Neural Network (NN)to reduce image dataset, distilling the massive datasets down to a new space of smaller dimension. Due to the possibility of these data have nonlinearities, traditional multi-variate analysis, like the Principal Component Analysis (PCA), may not represent reality. Alternatively, Nonlinear Principal Component Analysis (NLPCA) can be performed by a NN model to fulfill that deficiency. However, when the dimension of the image increases, NN may easily saturate . This work presents an original methodology associated with the use of a set of cascaded multi-layer NN with a bottleneck structure to extract nonlinear information of the large set of image data. We illustrate its good performance with a set of tests against comparisons using this methodology and PCA in the treatment of oceanographic data associated with mesoscale variability of an oceanic boundary current.