HARTIGAN
Clustering Algorithm Datasets


HARTIGAN is a dataset directory which contains test data for clustering algorithms.

The data files are all text files, and have a common, simple format:

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Related Data and Programs:

The PCL dataset directory contains sample datasets for clustering, based on gene expression experiments.

The SPAETH dataset directory contains sample datasets for clustering.

The SPAETH2 dataset directory contains sample datasets for clustering.

Reference:

  1. John Hartigan,
    Clustering Algorithms,
    Wiley, 1975,
    LC: QA278.H36,
    ISBN: 0-471-35645-X.

Datasets:

You can go up one level to the DATASETS directory.


Last revised on 31 August 2005.