SOKAL_ROHLF
Statistical Datasets
SOKAL_ROHLF
is a dataset directory which
contains some datasets used by Sokal and Rohlf.
Related Data and Programs:
CENSUS,
a dataset directory which
contains US census data;
DRAFT_LOTTERY,
a dataset directory which
contains the numbers assigned to each birthday, for the Selective Service System
lotteries for 1970 through 1976.
HARTIGAN,
a dataset directory which
contains datasets for testing clustering algorithms;
ISWR,
a dataset directory which
contains example datasets used for statistical analysis.
MARTINEZ,
a dataset directory which
contains datasets for computational statistics,
including cluster analysis;
MDS,
a dataset directory which
contains datasets for multidimensional scaling;
PCL,
a dataset directory which
contains datasets from a gene expression experiment on Arabidopsis,
which are candidates for data cluster analysis;
REGRESSION,
a dataset directory which
contains datasets for testing linear regression;
SGB,
a dataset directory which
contains files used as input data for
demonstrations and tests of Donald Knuth's Stanford GraphBase.
SPAETH,
a dataset directory which
contains datasets for cluster analysis;
SPAETH2,
a dataset directory which
contains datasets for cluster analysis;
TIME_SERIES,
a dataset directory which
contains examples of time series,
which are simply records of the values of some quantity at
a sequence of times.
TRIOLA,
a dataset directory which
contains datasets used for statistical analysis.
WORDS,
a dataset directory which
contains lists of words;
References:
-
Robert Sokal, James Rohlf,
Biometry: The Principles and Practice of Statistics in Biological Research,
MacMillan, 1995,
ISBN13: 9780716724117,
LC: QH323.5.S63.
-
Robert Sokal, James Rohlf,
Introduction to Biostatistics,
Dover, 2009,
ISBN13: 9780486469614,
LC: QH323.5.S633.
Datasets:
-
archibald.txt,
frequency of the sedge Carex flacca in 500 quadrats.
-
allee.txt,
rate of growth of Ameiurus melas in conditioned
and unconditioned well water.
-
allee_bowen.txt,
survival time in goldfish in colloidal silver suspension.
-
banta.txt,
average age of reproduction in Daphnia longispina.
-
blakeslee.txt,
length/width ratios for three samples of globe and three
samples of normal Jimson weed.
-
butterfat.txt,
butterfat percentages for 5 breeds of cattle, mature and 2-year-old,
10 samples.
-
carter_mitchell.txt,
rabbit temperature after rinderpest inoculation.
-
crossley.txt,
two samples of nymphs, measuring the length of the cheliceral base
in micrometers.
-
french.txt,
energy utilization in the pocket mouse.
-
gartler.txt,
milligrams of glycine per milligram of creatinine in the urine of 37 chimpanzees.
-
geissler.txt,
sex ratios in 6115 sibships of twelve in Saxony.
-
greenwood_yule.txt,
accidents in 5 weeks to 647 women working on high-explosive shells.
-
lee.txt,
melanoma cases over body regions of men and women.
-
leinert.txt,
S-PLP content of blood serum before and after ingestion of alcohol.
-
liu.txt,
blood neutrophil counts, divided by 1000, per microliter.
-
millis_seng1.txt,
birth weights of male Chinese in ounces.
-
millis_seng2.txt,
birth order and birth weight.
-
newman.txt,
lower face width for 15 girls.
-
olson_miller1.txt,
interorbital width of domestic pigeons.
-
olson_miller2.txt,
distance from narial opening to beak tip for 5 domestic pigeons, 20 observations.
-
park_williams.txt,
number of bacteria in 1 cc of milk from three cows at three periods.
-
purves.txt,
effect of different sugars on length of pea sections.
-
rohlf.txt,
oxygen consumption rates for two species of limpets.
-
sokal1.txt,
femur lengths of the aphid Pemphigus.
-
sokal2.txt,
fertility of eggs of the CP strain of Drosophila melanogaster.
-
sokal3.txt,
number of adult Drosophila emerging, two different medium formulations.
-
sokal_karten.txt,
mean dry weights of three genotypes of beetles.
-
sokal_rohlf01.txt,
butterfat percentages from 120 3-year-old Ayrshire cows.
-
sokal_rohlf02.txt,
frequency of infected insects in 2423 samples of 5, assuming 40% infection rate.
-
sokal_rohlf03.txt,
populations of wing lengths and milk yields.
-
sokal_rohlf04.txt,
manufactured data to be checked for normality.
-
sokal_rohlf05.txt,
length, in centimeters, of bass from a southern lake.
-
sokal_rohlf06.txt,
relative expected frequencies for samples of 17 animals under two hypotheses.
-
sokal_rohlf07.txt,
measurements of 5 individuals in each of 7 mouse litters.
-
sokal_rohlf08.txt,
plant height in centimeters in 4 plots.
-
sokal_rohlf09.txt,
thorax length for 4 aphids sampled from 28 galls.
-
student.txt,
yeast cells in 400 squares of a hemacytometer.
-
sullivan_sokal.txt,
mean developmental period for three strains of houseflies.
-
swanson.txt,
county, soil type, surface pH, subsoil pH.
-
thomas.txt,
width of scutum of tick larvae sampled from 4 different cottontail rabbits.
-
utida.txt,
Azuki bean weevils emerging from 112 Azuki beans.
-
wright.txt,
Guinea pig litter sizes for two strains.
-
young.txt,
age of striped bass caught in Hudson River.
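The datasets above are plain text files. This page does not describe their layout, so the following Python sketch is only a starting point for reading one. It assumes that each file holds whitespace-separated columns and that header or comment lines begin with "#". Both are assumptions to check against the actual file. The function name parse_dataset and the sample text are hypothetical, and the sample values are invented for illustration; they are not taken from any dataset listed here.

```python
from typing import List, Union

Value = Union[float, str]


def parse_dataset(text: str, comment: str = "#") -> List[List[Value]]:
    """Parse whitespace-separated rows, skipping blank and comment lines.

    ASSUMPTION: comment lines start with `comment` ("#" by default) and
    columns are separated by whitespace; verify this for each file.
    """
    rows: List[List[Value]] = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip empty lines and comment/header lines.
        if not stripped or stripped.startswith(comment):
            continue
        row: List[Value] = []
        for token in stripped.split():
            # Keep numeric tokens as floats and everything else as text.
            try:
                row.append(float(token))
            except ValueError:
                row.append(token)
        rows.append(row)
    return rows


if __name__ == "__main__":
    # Invented sample in the assumed layout, NOT real data from this directory.
    sample = """# hypothetical header line
# label  value
A  1.5
B  2
"""
    for row in parse_dataset(sample):
        print(row)
```

To read a real file, pass its contents to the function, for example parse_dataset(open("student.txt").read()), and adjust the comment marker or the splitting rule if the file turns out to be laid out differently.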
Last revised on 11 September 2011.