Datasets
-
adjacency,
a dataset directory which
contains adjacency matrices associated with an undirected graph.
-
alphabet_lowercase,
a dataset directory which
contains large images of the 26 lowercase alphabetic characters.
-
alphabet_uppercase,
a dataset directory which
contains large images of the 26 uppercase alphabetic characters.
-
bam,
a dataset directory which
???
-
beale_cipher,
a dataset directory which
contains the text of the three Beale cipher documents, which
are supposed to indicate the location of a hoard of gold and silver.
-
bin_packing,
a dataset directory which
contains examples of the bin packing problem, in which a number of
objects are to be packed in the minimum possible number of
uniform bins;
-
birthdays,
a dataset directory which
contains data related to birthdays, such as the birthdays of members
of hockey teams, and the number of babies born in the US on each
calendar day over an interval of several years.
-
boston_housing,
a dataset directory which
stores training and test data about housing prices in Boston.
This dataset is also available as a builtin dataset in keras.
-
burgers,
a dataset directory which
contains 40 solutions of the Burgers equation at equally
spaced times from 0 to 1, with values
at 41 equally spaced nodes in [0,1];
-
case1_flow,
a dataset directory which
lists 401 solutions of a flow problem in a channel;
-
cats,
a dataset directory which
contains jpg images of cats.
-
cavity_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in
a driven cavity;
-
ccs,
a data directory which
contains examples of sparse matrices stored as
Compressed Column Storage (CCS) files,
a three-file format;
-
census,
a dataset directory which
contains US census data;
-
chain_letters,
a dataset directory which
contains examples of a chain letter;
-
change_making,
a dataset directory which
contains test data for the change making problem;
-
cities,
a dataset directory which
contains sets of information about cities and the distances
between them;
-
clustering,
a dataset directory which
can be used with clustering algorithms;
-
color,
a dataset directory which
contains information about colors in terms of RGB values.
-
crs,
a dataset directory which
contains examples of sparse matrices stored in
Compressed Row Storage (CRS) format,
a three-file format;
-
csv,
a data directory which
contains examples of CSV files,
a flat file format of Comma Separated Values.
-
cvt,
a dataset directory which
contains examples of Centroidal Voronoi Tessellations;
-
cvtp,
a dataset directory which
contains examples of CVTP's, Centroidal Voronoi Tessellations
defined on a periodic domain, which is usually a rectangle
or hyperrectangle.
-
dates,
a dataset directory which
contains lists of dates in certain calendars.
-
dogs,
a dataset directory which
contains images of dogs.
-
draft_lottery,
a dataset directory which
contains the numbers assigned to each birthday, for the Selective
Service System lotteries for 1970 through 1976.
-
faces,
a dataset directory which
contains 10 photographs of each of 40 people, for use in
facial recognition experiments.
-
faces_angela_merkel,
a dataset directory which
contains images of Angela Merkel for facial recognition applications.
-
faces_arnold_schwarzenegger,
a dataset directory which
contains images of Arnold Schwarzenegger for facial recognition applications.
-
faces_emma_stone,
a dataset directory which
contains images of Emma Stone for facial recognition applications.
-
faces_matt_damon,
a dataset directory which
contains images of Matt Damon for facial recognition applications.
-
faces_michael_caine,
a dataset directory which
contains images of Michael Caine for facial recognition applications.
-
faces_sylvester_stallone,
a dataset directory which
contains images of Sylvester Stallone for facial recognition applications.
-
faces_taylor_swift,
a dataset directory which
contains images of Taylor Swift for facial recognition applications.
-
fasta,
a dataset directory which
contains examples of FASTA sequence data;
-
fastq,
a dataset directory which
contains examples of FASTQ sequence data;
-
faure,
a dataset directory which
contains examples of the Faure quasirandom sequence;
-
fingerprints,
a dataset directory which
contains a few images of fingerprints.
-
fna,
a dataset directory which
???.
-
ge,
a dataset directory which
contains matrices stored in General (GE) format;
-
generalized_assignment,
a dataset directory which
contains test data for the generalized assignment problem;
-
german,
a dataset directory which
contains some short texts in German;
-
gfd2,
a dataset directory which
???;
-
graffiti,
a dataset directory which
???;
-
graphics_examples,
a dataset directory which
contains examples of data used to illustrate or test various
graphics procedures for presenting and analyzing data.
-
grid,
a dataset directory which
???;
-
halton,
a dataset directory which
contains examples of the Halton quasirandom sequence;
-
hammersley,
a dataset directory which
contains examples of the Hammersley quasirandom sequence;
-
hartigan,
a dataset directory which
contains datasets for testing clustering algorithms;
-
hbsmc,
a dataset directory which
contains the Harwell Boeing Sparse Matrix Collection (HBSMC);
-
hex_grid,
a dataset directory which
???;
-
ihs,
a dataset directory which
contains examples of the Improved Distributed Hypercube
Sampling quasirandom sequence;
-
imagej,
a dataset directory which
contains image data suitable for use with the ImageJ program.
-
incidence,
a dataset directory which
contains incidence matrices associated with a directed graph.
-
inout_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in a region with
specified inflow and outflow;
-
inout_flow2,
a dataset directory which
contains more time steps of Navier-Stokes flow in a region with
specified inflow and outflow;
-
interpolation,
a dataset directory which
contains datasets to be interpolated.
-
iswr,
a dataset directory which
contains example datasets used for statistical analysis.
-
knapsack_01,
a dataset directory which
contains test data for the 0/1 knapsack problem;
-
knapsack_multiple,
a dataset directory which
contains test data for the multiple knapsack problem;
-
latin_center,
a dataset directory which
contains examples of the Latin Center Square quasirandom sequence;
-
latin_edge,
a dataset directory which
contains examples of the Latin Edge Square quasirandom sequence;
-
latin_random,
a dataset directory which
contains examples of the Latin Random Square quasirandom sequence;
-
lcvt,
a dataset directory which
contains examples of Latinized Centroidal Voronoi
Tessellations;
-
lcvtp,
a dataset directory which
contains examples of LCVTP's, that is, "Latinized"
Centroidal Voronoi Tessellations on a periodic domain;
-
lhs,
a dataset directory which
contains datasets related to Latin Hypercube Sampling;
-
lp,
a dataset directory which
contains datasets for linear programming, used for programs
such as CPLEX and GUROBI;
-
martinez,
a dataset directory which
contains datasets for computational statistics;
-
mds,
a dataset directory which
contains datasets for M-dimensional scaling;
-
mhd_control,
a dataset directory which
contains datasets for a magneto-hydrodyamics control problem.
-
mps,
a dataset directory which
contains datasets for linear programming;
-
mpsc,
a dataset directory which
contains compressed datasets for linear programming;
-
ngrams,
a dataset directory which
contains information about the observed frequency of "ngrams"
(particular sequences of n letters) in English text.
-
niederreiter2,
a dataset directory which
contains examples of the Niederreiter quasirandom sequence
using a base of 2;
-
oa,
a dataset directory which
contains datasets for orthogonal arrays;
-
partition_problem,
a dataset directory which
contains examples of the partition problem, in which a set of numbers
is given, and it is desired to break the set into two subsets with
equal sum.
-
pcl,
a dataset directory which
contains datasets from a gene expression experiment on Arabidopsis;
-
polygon,
a dataset directory which
contains examples of polygons;
-
population,
a dataset directory which
contains listings of populations.
-
presidents,
a dataset directory which
lists various facts about US presidents.
-
product_rule_gl,
a dataset directory which
contains M-dimensional quadrature rules formed as products
of 1D Gauss-Legendre rules.
-
product_rule_tanh_sinh,
a dataset directory which
???;
-
propack,
a dataset directory which
contains matrices in Harwell-Boeing format, used for testing
the SVD package propack();
-
quad_mesh,
a dataset directory which
contains examples of quad meshes.
-
quadrature_rules,
a dataset directory which
contains quadrature rules for 1D intervals,
2D rectangles or M-dimensional rectangular regions,
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_ccn,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a nested Clenshaw-Curtis rule.
-
quadrature_rules_chebyshev1,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Chebyshev type 1 rule.
-
quadrature_rules_chebyshev2,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Chebyshev type 2 rule.
-
quadrature_rules_clenshaw_curtis,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Clenshaw Curtis rule.
-
quadrature_rules_gegenbauer,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Gegenbauer rule.
-
quadrature_rules_gen_hermite,
a dataset directory which
contains quadrature rules for integration on an infinite interval,
using a generalized Gauss-Hermite rule.
-
quadrature_rules_gen_laguerre,
a dataset directory which
contains quadrature rules for integration on a semi-infinite interval,
using a generalized Gauss-Laguerre rule.
-
quadrature_rules_halton,
a dataset directory which
contains quadrature rules for M-dimensional unit cubes,
based on a Halton quasirandom sequence.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_hermite_physicist,
a dataset directory which
contains Gauss-Hermite quadrature rules, for integration
on the interval (-oo,+oo), with the "physicist" weight function
exp(-x*x).
-
quadrature_rules_hermite_probabilist,
a dataset directory which
contains Gauss-Hermite quadrature rules, for integration
on the interval (-oo,+oo), with the "probabilist" weight
function exp(-x*x/2).
-
quadrature_rules_hermite_unweighted,
a dataset directory which
contains Gauss-Hermite quadrature rules, for integration
on the interval (-oo,+oo), with no weight function.
-
quadrature_rules_jacobi,
a dataset directory which
contains Gauss-Jacobi quadrature rules for the interval [-1,+1]
with weight function (1-x)^ALPHA * (1+x)^BETA.
-
quadrature_rules_laguerre,
a dataset directory which
contains Gauss-Laguerre quadrature rules for integration on
the interval [A,+oo), with weight function exp(-x).
-
quadrature_rules_latin_center,
a dataset directory which
contains quadrature rules for M-dimensional unit cubes,
based on centered Latin hypercubes.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_legendre,
a dataset directory which
contains Gauss-Legendre quadrature rules for the interval [-1,+1].
-
quadrature_rules_patterson,
a dataset directory which
contains Gauss-Patterson quadrature rules for the interval [-1,+1].
-
quadrature_rules_pyramid,
a dataset directory which
contains quadrature rules for a pyramid with a square base.
-
quadrature_rules_tet,
a dataset directory which
contains quadrature rules for tetrahedrons,
stored as a file of abscissas, a file of weights,
and a file of vertices.
-
quadrature_rules_tri,
a dataset directory which
contains quadrature rules for triangles,
stored as a file of abscissas, a file of weights,
and a file of vertices.
-
quadrature_rules_uniform,
a dataset directory which
contains quadrature rules for M-dimensional unit cubes,
based on a uniform pseudorandom sequence.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_wedge,
a dataset directory which
contains quadrature rules for a wedge (triangle x line).
-
regression,
a dataset directory which
contains datasets for testing linear regression;
-
romero,
a dataset directory which
collects 12 sets of 2D Latin Square points that were used as
initial generators for a CVT computation.
-
sam,
a dataset directory which
???;
-
sammon,
a dataset directory which
contains examples of six kinds of M-dimensional datasets for cluster analysis.
-
sample_2d,
a dataset directory which
collects examples of sample point sets in the unit square.
-
sgb,
a dataset directory which
contains files used as input data for
demonstrations and tests of Donald Knuth's Stanford Graph Base.
-
sgmg,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on a mixture of 1D rules, and with a choice of exponential
and linear growth rates for the 1D rules.
-
sgmga,
a dataset directory which
contains SGMGA files (Sparse Grid Mixed Growth Anisotropic), that is,
M-dimensional Smolyak sparse grids
based on a mixture of 1D rules, and with a choice of exponential and linear
growth rates for the 1D rules and anisotropic weights for the dimensions.
-
sobol,
a dataset directory which
contains samples of the Sobol quasirandom sequence;
-
sokal_rohlf,
a dataset directory which
contains biological datasets considered by Sokal and Rohlf.
-
spaeth,
a dataset directory which
contains datasets for cluster analysis;
-
spaeth2,
a dataset directory which
contains datasets for cluster analysis;
-
sparse_grid_cce,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Clenshaw Curtis Exponential growth rule;
-
sparse_grid_ccl,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Clenshaw Curtis Linear growth rule;
-
sparse_grid_ccs,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Clenshaw Curti Slow growth rule;
-
sparse_grid_composite,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the composite midpoint rule;
-
sparse_grid_f2,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Fejer 2 Exponential growth rule;
-
sparse_grid_f2s,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Fejer 2 Slow growth rule;
-
sparse_grid_gle,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Gauss-Legendre Exponential growth rule;
-
sparse_grid_gll,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the 1D Gauss-Legendre Linear growth rule;
-
sparse_grid_glo,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the 1D Gauss-Legendre Linear (Odd) growth rule;
-
sparse_grid_gpe,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Gauss-Patterson Exponential growth rule;
-
sparse_grid_gps,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Gauss-Patterson Slow growth rule;
-
sparse_grid_hermite,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Gauss-Hermite rule;
-
sparse_grid_laguerre,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Gauss-Laguerre rule;
-
sparse_grid_mixed,
a dataset directory which
contains M-imensional Smolyak sparse grids
based on a mixture of 1D rules.
-
sparse_grid_ncc,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Newton Cotes Closed rule;
-
sparse_grid_nco,
a dataset directory which
contains M-dimensional Smolyak sparse grids
based on the Newton Cotes Open rule;
-
sphere_design_rule
is a dataset directory which
contains files defining point sets on the surface of the unit sphere,
known as "designs", which can be useful for estimating integrals
on the surface, among other uses.
-
sphere_grid,
a dataset directory which
contains grids of points, lines, triangles or quadrilaterals
on a sphere;
-
sphere_lebedev_rule,
a dataset directory which
contains sets of Lebedev points on a sphere which can be used for
quadrature rules of a known precision;
-
sphere_maximum_determinant,
a dataset directory which
contains files defining maximum determinant rules on the unit sphere,
which can be used for interpolation and quadrature;
-
square_hex_grid,
a dataset directory which
contains files defining hexagonal arrays of grid points
over the interior of a square in 2D.
-
st,
a dataset directory of examples of Sparse Triplet (ST) files,
a sparse matrix file format,
storing just (I,J,A(I,J)), and using zero-based indexing.
-
st1,
a dataset directory of examples of Sparse Triplet (ST1) files,
a sparse matrix file format,
storing just (I,J,A(I,J)), and using one-based indexing.
-
states,
a dataset directory which
contains some information about the individual American states.
-
stats,
a dataset directory which
contains some examples of statistical datasets.
-
subset_sum,
a dataset directory which
contains examples of the subset sum problem, in which a set of
numbers is given, and it is desired to find at least one subset
that sums to a given target value.
-
svdpack,
a dataset directory which
contains matrices in Harwell-Boeing format, used for testing
the singular value decomposition library svdpack();
-
symbols,
a dataset directory which
contains large images of numbers and symbols.
-
tcell_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in a T-cell;
-
test_approx,
a dataset directory which
contains sets of data (x,y) for which an approximating formula is desired.
-
test_con,
a dataset directory which
contains sequences of points that lie on M-dimensional curves defined by
sets of nonlinear equations;
-
tet_mesh_order4,
a dataset directory of examples of order 4 tetrahedral meshes.
-
tet_mesh_order10,
a dataset directory of examples of order 10 tetrahedral meshes.
-
tet_mesh_order20,
a dataset directory of examples of order 20 tetrahedral meshes.
-
tetrahedrons,
a dataset directory which
contains examples of tetrahedrons.
-
tetrahedron_samples,
a dataset directory which
contains examples of sets of sample points from tetrahedrons.
-
text,
a dataset directory which
contains some short texts in English, such as the Gettysburg Address;
-
time_series,
a data directory of examples of time series,
which are simply records of the values of some quantity at
a sequence of times.
-
timelines,
a data directory of examples of timelines,
that is, dates or durations or lifetimes meant to be displayed
in chronological order.
-
triangle_samples,
a dataset directory which
contains sets of sample points from triangles.
-
triangles,
a dataset directory which
contains examples of triangles.
-
triangulation_order3,
a dataset directory which
contains examples of order 3 triangulations,
a linear triangulation
of a set of 2D points, using a pair of files to list the node
coordinates and the 3 nodes that make up each triangle;
-
triangulation_order4,
a dataset directory which
contains examples of order 4 triangulations,
a triangulation
of a set of 2D points, using a pair of files to list the node
coordinates and the 4 nodes that define each triangle
(3 vertices and the centroid);
-
triangulation_order6,
a dataset directory which
contains examples of order 6 triangulations,
a quadratic triangulation
of a set of 2D points, using a pair of files to list the node
coordinates and the 6 nodes that make up each triangle; Six-node
triangles are used when a higher degree approximation is desired;
they may also be used as isoparametric elements that model curved
boundaries;
-
triola,
a dataset directory which
contains datasets used for statistical analysis.
-
tsp,
a dataset directory which
contains examples of the traveling salesperson problem.
-
uniform,
a dataset directory which
contains examples of a uniform pseudorandom sequence;
-
van_der_corput,
a dataset directory which
contains examples of one-dimensional van der Corput sequences,
for various bases;
-
words,
a dataset directory which
contains lists of words;
-
xls,
a data directory which
contains examples of XLS files,
used by the Microsoft Excel spreadsheet program.
Last revised on 02 May 2019.