Dimensional reduction of large image datasets using non-linear principal components
Abstract:
In this paper we apply a Neural Network (NN) to reduce image dataset,distilling the massive datasets down to a new space of smaller dimension. Due to the possibility of these data have nonlinearities, traditional multivariate analysis, like the Principal Component Analysis (PCA), may not represent reality. Alternatively, Nonlinear Principal Component Analysis (NLPCA) can be performed by a NN model to fulfill that deficiency. However, when the dimension of the image increases, NN may easily saturate. This work presents an original methodology associated with the use of a set of cascaded multi-layer NN with a bottleneck structure to extract nonlinear information of the large set of image data. We illustrate its good performance with a set of tests against comparisons using this methodology and PCA in the treatment of oceanographic data associated with mesoscale variability of an oceanic boundary current.