double_c_data, a MATLAB code which creates, plots, or saves a double C dataset.
The data is confined to two separate regions, each having the shape of the letter "C". The two regions are close, and interlocking, but do not actually touch. This data represents a difficult challenge for algorithms that attempt to detect natural segmentations of data.
The information on this web page is distributed under the MIT license.
double_c_data is available in a MATLAB version and an Octave version and a Python version.
components, a MATLAB code which seeks the connected "nonzero" or "nonblack" components of an image or integer vector, array or 3D block.
kmeans, a MATLAB code which contains several different algorithms for the K-Means problem.
martinez, a dataset directory which contains datasets for computational statistics, including cluster analysis;
random_data, a MATLAB code which uses a random number generator (RNG) to sample points corresponding to various probability density functions (PDF), spatial dimensions, and geometries, including the annulus, circle, ellipse, ellipsoid, hypercube, hypersphere, simplex, tetrahedron and triangle.
ring_data , a MATLAB code which creates, plots, or saves data generated by sampling a number of concentric, possibly overlapping rings.
sammon_data, a MATLAB code which generates six sets of M-dimensional data for cluster analysis.