Performance Evaluation and Applications of the Self-Organizing Map

    How to extract characteristic patterns from a long time series of maps? Snapshot may be the simplest method, but it is hard to find the typical patterns that can represent the entire time series. Temporal averaging may be simple, but it is difficult to define suitable time scale over which to average; thus, mean patterns may be very misleading. The Empitical Orthogonal Function (EOF) technique is widely used. It requires the data to be gaps-free; moreover, conventional EOF, as a linear method, may not be as useful in extracting nonlinear information. There is a need for effective feature extraction methods.

    The Kohonen Self-Organizing Map (SOM) is a neural network that maps high-dimensional input data onto a low-dimensional (usually two-dimensional) output space while preserving the topology. It finds its widespread use across a number of disciplines. Recently, it has been applied to meteorology and oceanography as an effective tool of nonlinear clustering, pattern recognition and feature extraction.

    Yonggang Liu is one of the SOM users in the oceanography community. He and his colleagues have successfully applied the SOM in a series of oceanogrpahic data analyses. Some of his SOM papers are summarized as follows:

    1. Performance evaluation of the SOM for feature extraction

    Despite its wide applications as a tool for feature extraction, the SOM remains a black box to most meteorologists and oceanographers. Liu, Weisberg, and Mooers [2006] evaluates the feature extraction performance of the SOM by using artificial data representative of known patterns. The SOM is shown to extract the patterns of a linear progressive sine wave. Sensitivity studies are performed to ascertain the effects of the SOM tunable parameters. By adding random noise to the linear progressive wave data, it is demonstrated that the SOM extracts essential patterns from noisy data. Moreover, the SOM technique successfully chooses among multiple sets of patterns in contrast with an Empirical Orthogonal Function method that fails to do this. A practical way to apply the SOM is proposed and demonstrated using several examples, including long time series of coastal ocean currents from the West Florida Shelf.

    A 3x4 SOM representation of the linear progressive wave data. The top 12 panels show the SOM patterns with the frequency of occurrence atop each plot. The bottom plot is the BMU time series.

    (This is Figure 1 of Liu, Weisberg, and Mooers [2006])

    2. SOM analysis of the coastal currents

    Patterns of ocean current variability are examined on the West Florida Shelf using time series of moored velocity data that span the interval October 1998–September 2001 [Liu and Weisberg 2005]. On the synoptic weather timescale the variations of the SOM patterns are coherent with the local winds. On the seasonal timescale the variations of the SOM patterns are coherent with both the local winds and complementary sea surface temperature patterns. The spatial patterns extracted by the SOM method are asymmetric, a feature that is not captured by the conventional EOF method.

    3. Joint SOM analysis of the HF radar and ADCP data

    Patterns of ocean current variability are extracted from a joint HF radar and ADCP dataset to assess the 3D structures and temporal evolutions of distinct physical processes on the West Florida Shelf [Liu, Weisberg, and Shay 2007]. Three separate ocean–atmosphere frequency bands are considered: semidiurnal, diurnal, and subtidal. The currents in the semidiurnal band are relatively homogeneous in space, barotropic, clockwise polarized, and have a neap-spring modulation consistent with semidiurnal tides. The currents in the diurnal band are less homogeneous, more baroclinic, and clockwise polarized, consistent with a combination of diurnal tides and near-inertial oscillations. The currents in the subtidal frequency band are stronger and with more complex patterns consistent with wind and buoyancy forcing.

    4. GHSOM analysis of the satellite SST data

    The GHSOM is used to extract patterns of the SST variability on the West Florida Shelf from time series of daily SST maps from 1998 to 2002 [Liu, Weisberg, and He 2006]. Four characteristic SST patterns are extracted in the first-layer GHSOM array: winter and summer season patterns, and two transitional patterns. Three of them are further expanded in the second layer, yielding more detailed structures in these seasons. The winter pattern is one of low SST, with isotherms aligned approximately along isobaths. The summer pattern is one of high SST distributed in a horizontally uniform manner. The spring transition includes a midshelf cold tongue. Similar analyses performed on SST anomaly data provide further details of these seasonally varying patterns. It is demonstrated that the GHSOM analysis is more effective in extracting the inherent SST patterns than the widely used EOF method.

    5. Dual SOM anlaysis

    The SOM is used to examine spatial patterns of the Loop Current variability and to identify regions of SSH variability in the Gulf of Mexico from satellite altimetry data (1993 to 2015). More details can be seen in dual SOM analysis.

    6. Other SOM and GHSOM applications

    References:

    Useful MATLAB toolboxes:


    See Yonggang Liu's publications

    Back to Yonggang Liu's homepage

    (Last updated on 07/18/2016)











    eXTReMe Tracker