TRIOLA
Statistical Datasets


TRIOLA is a dataset directory which contains example datasets used for statistical analysis.

Related Data and Programs:

CENSUS, a dataset directory which contains US census data;

DRAFT_LOTTERY, a dataset directory which contains the numbers assigned to each birthday, for the Selective Service System lotteries for 1970 through 1976.

HARTIGAN, a dataset directory which contains datasets for testing clustering algorithms;

ISWR, a dataset directory which contains datasets used for statistical analysis.

MARTINEZ, a dataset directory which contains datasets for computational statistics, including cluster analysis;

MDS, a dataset directory which contains datasets for M-dimensional scaling;

PCL, a dataset directory which contains datasets from a gene expression experiment on Arabidopsis, which are candidates for data cluster analysis;

REGRESSION, a dataset directory which contains datasets for testing linear regression;

SGB, a dataset directory which contains files used as input data for demonstrations and tests of Donald Knuth's Stanford Graph Base.

SOKAL_ROHLF, a dataset directory which contains biological datasets considered by Sokal and Rohlf.

SPAETH, a dataset directory which contains datasets for cluster analysis;

SPAETH2, a dataset directory which contains datasets for cluster analysis;

STATS, a dataset directory which contains datasets for computational statistics;

TIME_SERIES, a data directory of examples of time series, which are simply records of the values of some quantity at a sequence of times.

WORDS, a dataset directory which contains lists of words;

Reference:

  1. Mario Triola,
    Elementary Statistics,
    Addison Wesley, 2009,
    ISBN13: 978-0321500243,
    LC: QA276.12.T76.

Datasets:

The examples are available in CSV (Comma Separated Value) or XLS (Microsoft EXCEL) format:

You can go up one level to the DATASETS directory.


Last revised on 14 October 2011.