SPAETH2
Cluster Analysis Datasets
SPAETH2
is a dataset directory which
contains data for testing
cluster analysis algorithms. The programs come from reference 1.
Licensing:
The computer code and data files described and made available on this web page
are distributed under
the GNU LGPL license.
Related Data and Programs:
HARTIGAN,
a dataset directory which
contains datasets for testing clustering algorithms;
MARTINEZ,
a dataset directory which
contains datasets for computational statistics,
including cluster analysis;
PCL,
a dataset directory which
contains datasets from a gene expression experiment on Arabidopsis,
which are candidates for data cluster analysis;
SAMMON,
a dataset directory which
contains six sets of M-dimensional data for cluster analysis.
SPAETH,
a dataset directory which
contains datasets for cluster analysis;
SPAETH2,
a FORTRAN90 library which
can cluster data according to various principles.
Reference:
-
Helmuth Spaeth,
Cluster Dissection and Analysis,
Theory, FORTRAN Programs, Examples,
Ellis Horwood, 1985,
QA278 S68213.
-
Helmuth Spaeth,
Cluster Analysis Algorithms
for Data Reduction and Classification of Objects,
Ellis Horwood, 1980,
QA278 S6813.
Data files:
-
spaeth2_01.txt, a sample data set
of 15 1D points.
-
spaeth2_02.txt, the same data as
spaeth2_01.txt, but in a different order.
-
spaeth2_03.txt, a sample data set
of 22 2D points, representing the relative locations of cities
in West Germany.
-
spaeth2_03.png,
a PNG image of
the data.
-
spaeth2_04.txt, 10 points in the
plane (2D).
-
spaeth2_04.png,
a PNG image of
the data.
-
spaeth2_05.txt, a sample data set
of 59 2D points, representing the relative locations of cities
in West Germany.
-
spaeth2_05.png,
a PNG image of
the data.
-
spaeth2_06.txt, a sample data set
of 23 12D points, representing the monthly values of 23 seasonal
quantities.
-
spaeth2_07.txt, for each of 89
Bavarian postal codes, the area, population, and population density.
-
spaeth2_08.txt, for each of 89
Bavarian postal codes, the number of workers in 4 categories.
-
spaeth2_09.txt, a "distance"
matrix representing the perceived pairwise distance between
11 car models.
-
spaeth2_10.txt, a "distance"
matrix representing the rank ordering of 15 breakfasts.
-
spaeth2_11.txt, the values of 5 socio-economic variables in 12 municipalities.
-
spaeth2_12.txt, a 4 by 4 matrix
of 0's and 1's, which is to be symmetrically permuted so that
the 1's form blocks.
-
spaeth2_13.txt, a 27 by 27 matrix
of 0's, 1's, 2's and 3's, which is to be symmetrically permuted
so equal numbers form blocks as far as possible.
You can go up one level to
the DATASETS directory.
Last revised on 31 August 2005.