SPAETH
Cluster Analysis Datasets
SPAETH is a dataset directory which contains examples for analyzing data by grouping into clusters.
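As a rough illustration of what "grouping into clusters" can mean for the 2D point sets, here is a minimal k-means sketch in Python. It is not taken from the Spaeth references or from the FORTRAN90 library; the function name `kmeans`, the naive choice of the first `k` points as starting centers, and the sample points in the usage note are all assumptions made for this example.

```python
import math


def kmeans(points, k, iterations=100):
    """Plain k-means on a list of (x, y) tuples; returns (labels, centers)."""
    # Assumption: seed the centers with the first k points (simple, but naive).
    centers = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest center.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
        # Stop once the assignment no longer changes.
        if new_labels == labels:
            break
        labels = new_labels
    return labels, centers
```

Calling `kmeans([(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)], 2)` on these made-up points places the first three points in one cluster and the last three in the other. Other initializations or clustering criteria may give different groupings on real data.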
Licensing:
The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.
Related Data and Programs:
SAMMON, a dataset directory which contains six sets of M-dimensional data for cluster analysis.
SPAETH, a FORTRAN90 library which clusters the sorts of data contained in this collection.
SPAETH2, a dataset directory which contains datasets for cluster analysis.
Reference:
- Helmuth Spaeth, Cluster Dissection and Analysis, Theory, FORTRAN Programs, Examples, Ellis Horwood, 1985, QA278 S68213.
- Helmuth Spaeth, Cluster Analysis Algorithms for Data Reduction and Classification of Objects, Ellis Horwood, 1980, QA278 S6813.
Data files:
- spaeth_01.txt, a sample data set of 37 2D points.
- spaeth_01.png, a PNG image of the data.
- spaeth_02.txt, a sample data set of 41 2D points.
- spaeth_02.png, a PNG image of the data.
- spaeth_03.txt, a sample data set of 44 2D points.
- spaeth_03.png, a PNG image of the data.
- spaeth_04.txt, a sample data set of 73 2D points.
- spaeth_04.png, a PNG image of the data.
- spaeth_05.txt, a sample data set of 55 2D points.
- spaeth_05.png, a PNG image of the data.
- spaeth_06.txt, a sample data set of 50 2D points.
- spaeth_06.png, a PNG image of the data.
- spaeth_07.txt, a sample data set of 52 2D points.
- spaeth_07.png, a PNG image of the data.
- spaeth_08.txt, a sample data set of 80 2D points.
- spaeth_08.png, a PNG image of the data.
- spaeth_09.txt, a sample data set of 122 sets of 24 values. The values are survey responses, and are always 1, 2, 3 or 4.
- spaeth_10.txt, a sample data set of 49 sets of 56 values. The values are either '0' or '1'.
- spaeth_11.txt, an 11 by 11 "distance" matrix between 11 objects. The "distances" are between 0 and 55.
- spaeth_12.txt, a 42 by 42 "distance" matrix between 42 objects. The "distances" are between 0 and 99.
- spaeth_13.txt, a 96 by 5 set of data, which actually represents a 96 by 1 column of response or dependent data, and a 96 by 4 set of stimulus or independent data.
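The description of spaeth_13.txt implies splitting each row into one response value and four independent values. The Python sketch below shows one way this might be done. This page does not document the file layout, so the sketch assumes whitespace-separated numeric rows, assumes that lines starting with '#' are comments, and assumes that the response is the first column; check all three against the actual file. The names `parse_table` and `split_response` are hypothetical helpers, and the sample values in the usage note are invented.

```python
def parse_table(text):
    """Parse whitespace-separated numeric rows into lists of floats."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Assumption: blank lines and '#' comment lines carry no data.
        if not line or line.startswith("#"):
            continue
        rows.append([float(value) for value in line.split()])
    return rows


def split_response(rows, response_col=0):
    """Split rows into (y, X): one response column and the remaining columns."""
    # Assumption: the response column index is 0; pass another index if not.
    y = [row[response_col] for row in rows]
    X = [row[:response_col] + row[response_col + 1:] for row in rows]
    return y, X
```

For example, `split_response(parse_table("# made-up rows\n1 2 3 4 5\n6 7 8 9 10\n"))` returns `([1.0, 6.0], [[2.0, 3.0, 4.0, 5.0], [7.0, 8.0, 9.0, 10.0]])`. For the real file, one would read its contents with `open("spaeth_13.txt").read()` and expect 96 rows of 5 values under these assumptions.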
Last revised on 03 October 2005.