sammon_data


sammon_data, a MATLAB code which generates 6 files of test data for multivariate data clustering.

Usage:

sammon_data

Licensing:

The computer code and data files described and made available on this web page are distributed under the MIT license

Languages:

sammon_data is available in a MATLAB version and a Python version.

Related Data and Programs:

asa113, a MATLAB code which implements the Banfield and Bassill clustering algorithm using transfers and swaps.

asa136, a MATLAB code which implements the Hartigan and Wong clustering algorithm.

kmeans, a MATLAB code which contains several different algorithms for the K-Means problem.

martinez, a dataset directory which contains datasets for computational statistics;

mds, a dataset directory which contains datasets for M-dimensional scaling;

pcl, a dataset directory which contains datasets from a gene expression experiment on Arabidopsis, which are candidates for data cluster analysis;

ring_data , a MATLAB code which can create, plot, or save data generated by sampling a number of concentric, possibly overlapping rings.

sammon_data, a MATLAB code which generates six sets of M-dimensional data for cluster analysis.

sammon_data_test

spaeth, a dataset directory which contains datasets for cluster analysis;

spaeth2, a dataset directory which contains datasets for cluster analysis;

Reference:

  1. Ronald Fisher,
    The use of multiple measurements in taxonomic problems,
    Annual Eugenics,
    Volume 7, part II, 1936, pages 179-188.
  2. John Sammon,
    A nonlinear mapping for data structure analysis,
    IEEE Transactions on Computers,
    Volume C-18, Number 5, May 1969, pages 401-409.

Source Code:


Last revised on 08 February 2019.