TRIOLA
Statistical Datasets
TRIOLA
is a dataset directory which
contains example datasets used for statistical analysis.
Related Data and Programs:
CENSUS,
a dataset directory which
contains US census data;
DRAFT_LOTTERY,
a dataset directory which
contains the numbers assigned to each birthday, for the Selective Service System
lotteries for 1970 through 1976.
HARTIGAN,
a dataset directory which
contains datasets for testing clustering algorithms;
ISWR,
a dataset directory which
contains datasets used for statistical analysis.
MARTINEZ,
a dataset directory which
contains datasets for computational statistics,
including cluster analysis;
MDS,
a dataset directory which
contains datasets for multidimensional scaling;
PCL,
a dataset directory which
contains datasets from a gene expression experiment on Arabidopsis,
which are candidates for data cluster analysis;
REGRESSION,
a dataset directory which
contains datasets for testing linear regression;
SGB,
a dataset directory which
contains files used as input data for
demonstrations and tests of Donald Knuth's Stanford Graph Base.
SOKAL_ROHLF,
a dataset directory which
contains biological datasets considered by Sokal and Rohlf.
SPAETH,
a dataset directory which
contains datasets for cluster analysis;
SPAETH2,
a dataset directory which
contains datasets for cluster analysis;
STATS,
a dataset directory which
contains datasets for computational statistics;
TIME_SERIES,
a data directory of examples of time series,
which are simply records of the values of some quantity at
a sequence of times.
WORDS,
a dataset directory which
contains lists of words;
Reference:
-
Mario Triola,
Elementary Statistics,
Addison Wesley, 2009,
ISBN13: 978-0321500243,
LC: QA276.12.T76.
Datasets:
The examples are available in CSV (comma-separated values) or
XLS (Microsoft Excel) format:
-
bears.csv,
wild bears anesthetized,
samples: 54 bears.
-
bears.xls,
wild bears anesthetized,
samples: 54 bears.
-
bodytemp.csv,
body temperatures, in degrees Fahrenheit, of healthy adults,
samples: 105 temperatures.
-
bodytemp.xls,
body temperatures, in degrees Fahrenheit, of healthy adults,
samples: 105 temperatures.
-
bostrain.csv,
weekly rainfall in Boston,
sample: 52 weeks.
-
bostrain.xls,
weekly rainfall in Boston,
sample: 52 weeks.
-
cans.csv,
axial load of aluminum cans,
sample: 175 cans.
-
cans.xls,
axial load of aluminum cans,
sample: 175 cans.
-
cars20.csv,
properties of various car models,
sample: 20 car models.
-
cars20.xls,
properties of various car models,
sample: 20 car models.
-
cars32.csv,
properties of various car models, including
name, weight, length, braking distance, number of cylinders,
engine displacement, city and highway mileages, and
greenhouse gas emissions,
sample: 32 car models.
-
cars32.xls,
properties of various car models,
sample: 32 car models.
-
cereal.csv,
properties of popular breakfast cereals,
sample: 16 brands of cereal.
-
cereal.xls,
properties of popular breakfast cereals,
sample: 16 brands of cereal.
-
chmovie.csv,
alcohol and tobacco use in children's movies,
sample: 50 movies.
-
chmovie.xls,
alcohol and tobacco use in children's movies,
sample: 50 movies.
-
cigaret.csv,
cigarette tar and nicotine,
sample: 29 values.
-
cigaret.xls,
cigarette tar and nicotine,
sample: 29 values.
-
clancy.csv,
Tom Clancy, "The Bear and the Dragon",
sample: 12 chapters.
-
clancy.xls,
Tom Clancy, "The Bear and the Dragon",
sample: 12 chapters.
-
cola.csv,
weight and volume of cola,
samples: 36 bottles.
-
cola.xls,
weight and volume of cola,
samples: 36 bottles.
-
cotinine.csv,
passive and active smoke,
samples: 40.
-
cotinine.xls,
passive and active smoke,
samples: 40.
-
diamonds.csv,
properties of diamonds,
samples: 30 diamonds.
-
diamonds.xls,
properties of diamonds,
samples: 30 diamonds.
-
everglade.csv,
Everglades temperature, rainfall, conductivity,
samples: 31 days of measurements.
-
everglade.xls,
Everglades temperature, rainfall, conductivity,
samples: 31 days of measurements.
-
fhealth.csv,
female health exams,
samples: 40 exam results.
-
fhealth.xls,
female health exams,
samples: 40 exam results.
-
freshman_15.csv,
weight in kilograms and BMI in September and April for male and female freshmen,
samples: 67 students.
-
freshman_15.xls,
weight in kilograms and BMI in September and April for male and female freshmen,
samples: 67 students.
-
garbage.csv,
weights of discarded garbage for one week,
samples: 43.
-
garbage.xls,
weights of discarded garbage for one week,
samples: 43.
-
headcirc.csv,
head circumferences,
samples: 50 heads.
-
headcirc.xls,
head circumferences,
samples: 50 heads.
-
homerun.csv,
home run distances,
samples: 73 home runs.
-
homerun.xls,
home run distances,
samples: 73 home runs.
-
homes.csv,
homes sold in Dutchess County,
samples: 50 homes.
-
homes.xls,
homes sold in Dutchess County,
samples: 50 homes.
-
lotto.csv,
New York State lottery,
samples: 40.
-
lotto.xls,
New York State lottery,
samples: 40.
-
mandm.csv,
weights of M&M candies,
samples: 33.
-
mandm.xls,
weights of M&M candies,
samples: 33.
-
marathon.csv,
New York City marathon finishers,
samples: 150 runners.
-
marathon.xls,
New York City marathon finishers,
samples: 150 runners.
-
mhealth.csv,
male health exams,
samples: 40 health exam results.
-
mhealth.xls,
male health exams,
samples: 40 health exam results.
-
misc.csv,
miscellaneous data: Year, Dow Jones High, Car Sales, Traffic Deaths, Murders, Sunspots, Super Bowl Points,
samples: data for 21 years.
-
misc.xls,
miscellaneous data: Year, Dow Jones High, Car Sales, Traffic Deaths, Murders, Sunspots, Super Bowl Points,
samples: data for 21 years.
-
movies.csv,
movie data: title, year, rating, budget, gross, length, viewer rating,
sample: 36 movies.
-
movies.xls,
movie data: title, year, rating, budget, gross, length, viewer rating,
sample: 36 movies.
-
oldfaith.csv,
Old Faithful geyser: duration, interval, and height,
samples: 50 eruptions.
-
oldfaith.xls,
Old Faithful geyser: duration, interval, and height,
samples: 50 eruptions.
-
parentht.csv,
parent and child heights,
samples: 40 children and their parents.
-
parentht.xls,
parent and child heights,
samples: 40 children and their parents.
-
quarters.csv,
weights of quarters,
samples: 50 quarters.
-
quarters.xls,
weights of quarters,
samples: 50 quarters.
-
rowling.csv,
J K Rowling, "Harry Potter and the Sorcerer's Stone",
samples: 12 chapters.
-
rowling.xls,
J K Rowling, "Harry Potter and the Sorcerer's Stone",
samples: 12 chapters.
-
solitaire.csv,
solitaire results,
samples: 500 games.
-
solitaire.xls,
solitaire results,
samples: 500 games.
-
stowaway.csv,
ages of stowaways on the Queen Mary,
samples: 75 ocean crossings.
-
stowaway.xls,
ages of stowaways on the Queen Mary,
samples: 75 ocean crossings.
-
sugar.csv,
weights of Domino sugar packets,
samples: 70 sugar packets.
-
sugar.xls,
weights of Domino sugar packets,
samples: 70 sugar packets.
-
tolstoy.csv,
Leo Tolstoy's "War and Peace",
samples: 12 chapters.
-
tolstoy.xls,
Leo Tolstoy's "War and Peace",
samples: 12 chapters.
-
weather.csv,
forecast and actual temperatures,
samples: 31 dates.
-
weather.xls,
forecast and actual temperatures,
samples: 31 dates.
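As a rough illustration of how one of the CSV files listed above might be read, the following Python sketch loads a file and reports its row count, which can then be compared against the sample count given in the list. It assumes, without confirmation from this page, that each file starts with a single header row and stores one sample per line; the helper names load_csv and numeric_column are hypothetical, and bears.csv is assumed to have been downloaded to the current directory.

```python
import csv
from pathlib import Path


def load_csv(path):
    """Return (header, rows), assuming the first row of the file is a header."""
    with Path(path).open(newline="") as handle:
        reader = csv.reader(handle)
        # An empty file yields an empty header rather than raising.
        header = [name.strip() for name in next(reader, [])]
        # Skip rows that are empty or contain only whitespace.
        rows = [row for row in reader if any(cell.strip() for cell in row)]
    return header, rows


def numeric_column(rows, index):
    """Collect the entries of one column that parse as floats, skipping the rest."""
    values = []
    for row in rows:
        try:
            values.append(float(row[index]))
        except (ValueError, IndexError):
            continue
    return values


if __name__ == "__main__":
    path = Path("bears.csv")  # assumed local copy of a file from the list above
    if path.exists():
        header, rows = load_csv(path)
        # The list above gives 54 bears for this file; compare with len(rows).
        print(f"{path.name}: {len(rows)} rows, columns: {header}")
```

If the reported row count differs from the listed sample count, the header assumption is the first thing to check.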
Last revised on 14 October 2011.