ISWR
Statistical Datasets


ISWR is a dataset directory which contains example datasets used for statistical analysis.

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Related Data and Programs:

CENSUS, a dataset directory which contains US census data;

DRAFT_LOTTERY, a dataset directory which contains the numbers assigned to each birthday, for the Selective Service System lotteries for 1970 through 1976.

HARTIGAN, a dataset directory which contains datasets for testing clustering algorithms;

MARTINEZ, a dataset directory which contains datasets for computational statistics, including cluster analysis;

MDS, a dataset directory which contains datasets for M-dimensional scaling;

PCL, a dataset directory which contains datasets from a gene expression experiment on Arabidopsis, which are candidates for data cluster analysis;

REGRESSION, a dataset directory which contains datasets for testing linear regression;

SGB, a dataset directory which contains files used as input data for demonstrations and tests of Donald Knuth's Stanford Graph Base.

SOKAL_ROHLF, a dataset directory which contains biological datasets considered by Sokal and Rohlf.

SPAETH, a dataset directory which contains datasets for cluster analysis;

SPAETH2, a dataset directory which contains datasets for cluster analysis;

TIME_SERIES, a data directory of examples of time series, which are simply records of the values of some quantity at a sequence of times.

TRIOLA, a dataset directory which contains datasets used for statistical analysis.

WORDS, a dataset directory which contains lists of words;

Reference:

  1. Peter Dalgaard,
    Introductory Statistics with R,
    Springer, 2008,
    ISBN13: 978-0-387-79053-4,
    LC: QA276.45.R3.D35.

Datasets:

The examples are available in CSV (Comma Separated Value) format:

You can go up one level to the DATASETS directory.


Last revised on 29 August 2011.