{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "kmeans.ipynb\n",
    "\n",
    "Discussion: This Jupyter notebook investigates clustering discrete data with kmeans.\n",
    "\n",
    "Licensing: This code is distributed under the GNU LGPL license.\n",
    "    \n",
    "Modified: 28 October 2016\n",
    "\n",
    "Author: John Burkardt, Lukas Bystricky"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {
    "collapsed": false
   },
   "outputs": [],
   "source": [
    "# Import necessary libraries and set plot option\n",
    "%matplotlib inline\n",
    "%config InlineBackend.figure_format = 'svg'\n",
    "import numpy as np\n",
    "import matplotlib.pyplot as plt\n",
    "import math\n",
    "import scipy.spatial as spatial"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# K Means #\n",
    "\n",
    "In this module we will investigate the problem of arranging N items of\n",
    "M-dimensional data into K groups or \"clusters\", in such a way that the \n",
    "items in each group are close.\n",
    "\n",
    "Each group will be associated with a special value called the \"mean\".\n",
    "We may expect this value to be the centroid of the group values, but\n",
    "while we are computing, sometimes this will not quite be the case.\n",
    "\n",
    "Our examples will use M=2 dimensions, but the method works for any M.\n",
    "\n",
    "We need to understand \n",
    "* how the \"energy\" of a group measures closeness;\n",
    "* the importance of the centroid/mean/generator, \n",
    "* how to randomly assign a data item to a cluster, \n",
    "* how to compute the energy of a cluster and of the collection of clusters, \n",
    "* how to assign items to the nearest generator or mean, \n",
    "* how to update the means, \n",
    "* how to carry out an iteration that repeatedly reduces the clustering energy."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": false
   },
   "source": [
    "# Point energy, cluster energy, and the centroid #\n",
    "\n",
    "Suppose we have a set of points G whose typical point is x.\n",
    "We will call G a \"group\" or a \"cluster\", and the idea is that\n",
    "all the points in G ought to be close or tightly clustered.\n",
    "\n",
    "In order to evaluate how tightly clustered a set of points is, we \n",
    "use the idea of \"energy\".  To define the energy, we have to include\n",
    "a special value we might call the base point, or mean, or center, or generator.\n",
    "Let's call it \"m\".  \n",
    "\n",
    "The point energy of x (with respect to m ) is\n",
    "  e(x,m) = ||x - m||^2\n",
    "that is, just the square of the Euclidean distance.\n",
    "Obviously, for a given x, e(x,m) is minimized by setting m=x (not exciting!)\n",
    "\n",
    "Since x is part of a set G, we can now also assign a cluster energy to G,\n",
    "  e(G,m) = sum (all x in g ) e(x,m)\n",
    "Now we said that m could be any point at all.  An interesting question \n",
    "becomes: if we can choose any value for m, is there a value that minimizes\n",
    "the cluster energy.  The answer to this is that we should choose m to be\n",
    "the average of the x values.  (You can show this by differentiating the\n",
    "expression for e(G,m) with respect to m).\n",
    "\n",
    "We would like to make a small test to verify that, for a given set of points x\n",
    "in a cluster G, the value of e(G,m) is minimized by the average (which we\n",
    "might also call the centroid or mean.)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "[[ 0.62022095  0.25780003]\n",
      " [ 0.85224345  0.32585312]\n",
      " [ 0.18210333  0.57784647]\n",
      " [ 0.35746139  0.59488945]\n",
      " [ 0.49054516  0.21448136]\n",
      " [ 0.47745961  0.9499567 ]\n",
      " [ 0.22000546  0.11850909]\n",
      " [ 0.3525047   0.56682692]\n",
      " [ 0.64253934  0.47827504]\n",
      " [ 0.00637021  0.93105693]]\n",
      "(array([ 0.,  0.]), 5.5659070553836907)\n",
      "(array([ 0.2,  0.7]), 2.1636324489619718)\n",
      "(array([ 0.62022095,  0.25780003]), 2.2796071996895089)\n",
      "(array([ 0.42014536,  0.50154951]), 1.2851666876061194)\n"
     ]
    }
   ],
   "source": [
    "## Minimizing the energy of a cluster\n",
    "#\n",
    "#  Create a 10x2 array G containing 10 random 2D points in the unit square.\n",
    "#\n",
    "#  Define a function cluster_energy ( G, m ) which returns the cluster\n",
    "#  energy of the points in G relative to the point m.\n",
    "#\n",
    "#  Define\n",
    "#    m1 = (0,0)\n",
    "#    m2 = (0.2,0.7)\n",
    "#    m3 = the first point in G = G[0,:]\n",
    "#    m4 = average of points in G\n",
    "#\n",
    "#  and evaluate the cluster energies e(G,m1), e(G,m2), e(G,m3), e(G,m4).\n",
    "#\n",
    "#  You should see that e(G,m4) is lower than the other values, and indeed,\n",
    "#  it will be lower than the cluster energy for any other choice of m.\n",
    "#\n",
    "n = 10\n",
    "G = np.random.rand ( n, 2 )\n",
    "print ( G )\n",
    "\n",
    "def cluster_energy ( G, m ):\n",
    "  energy = 0.0\n",
    "  rows = G.shape[0]\n",
    "  for row in range ( 0, rows ):\n",
    "    energy = energy + ( np.linalg.norm ( G[row,:] - m[:] ) ) ** 2\n",
    "  return energy\n",
    "\n",
    "m1 = np.array ( [ 0.0, 0.0 ] )\n",
    "m2 = np.array ( [ 0.2, 0.7 ] )\n",
    "m3 = G[0,:]\n",
    "m4 = np.sum ( G, axis = 0 ) / float ( n )\n",
    "\n",
    "print ( m1, cluster_energy ( G, m1 ) )\n",
    "print ( m2, cluster_energy ( G, m2 ) )\n",
    "print ( m3, cluster_energy ( G, m3 ) )\n",
    "print ( m4, cluster_energy ( G, m4 ) )"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[<matplotlib.lines.Line2D at 0x7f20e847cb50>]"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "data": {
      "image/svg+xml": [
       "<?xml version=\"1.0\" encoding=\"utf-8\" standalone=\"no\"?>\n",
       "<!DOCTYPE svg PUBLIC \"-//W3C//DTD SVG 1.1//EN\"\n",
       "  \"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd\">\n",
       "<!-- Created with matplotlib (http://matplotlib.org/) -->\n",
       "<svg height=\"256pt\" version=\"1.1\" viewBox=\"0 0 381 256\" width=\"381pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n",
       " <defs>\n",
       "  <style type=\"text/css\">\n",
       "*{stroke-linecap:butt;stroke-linejoin:round;stroke-miterlimit:100000;}\n",
       "  </style>\n",
       " </defs>\n",
       " <g id=\"figure_1\">\n",
       "  <g id=\"patch_1\">\n",
       "   <path d=\"M 0 256.117188 \n",
       "L 381.83125 256.117188 \n",
       "L 381.83125 0 \n",
       "L 0 0 \n",
       "L 0 256.117188 \n",
       "z\n",
       "\" style=\"fill:none;\"/>\n",
       "  </g>\n",
       "  <g id=\"axes_1\">\n",
       "   <g id=\"patch_2\">\n",
       "    <path d=\"M 30.2875 235.239062 \n",
       "L 365.0875 235.239062 \n",
       "L 365.0875 12.039062 \n",
       "L 30.2875 12.039062 \n",
       "z\n",
       "\" style=\"fill:#ffffff;\"/>\n",
       "   </g>\n",
       "   <g id=\"line2d_1\">\n",
       "    <defs>\n",
       "     <path d=\"M 0 -3 \n",
       "L -0.673542 -0.927051 \n",
       "L -2.85317 -0.927051 \n",
       "L -1.089814 0.354102 \n",
       "L -1.763356 2.427051 \n",
       "L -0 1.145898 \n",
       "L 1.763356 2.427051 \n",
       "L 1.089814 0.354102 \n",
       "L 2.85317 -0.927051 \n",
       "L 0.673542 -0.927051 \n",
       "z\n",
       "\" id=\"m7aeffd5faf\" style=\"stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\"/>\n",
       "    </defs>\n",
       "    <g clip-path=\"url(#pb7111ed48b)\">\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"41.4475\" xlink:href=\"#m7aeffd5faf\" y=\"161.3040625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"44.2375\" xlink:href=\"#m7aeffd5faf\" y=\"147.3540625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"58.1875\" xlink:href=\"#m7aeffd5faf\" y=\"152.9340625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"55.3975\" xlink:href=\"#m7aeffd5faf\" y=\"127.8240625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"66.5575\" xlink:href=\"#m7aeffd5faf\" y=\"166.8840625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"66.5575\" xlink:href=\"#m7aeffd5faf\" y=\"138.9840625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"63.7675\" xlink:href=\"#m7aeffd5faf\" y=\"112.4790625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"72.1375\" xlink:href=\"#m7aeffd5faf\" y=\"130.6140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"80.5075\" xlink:href=\"#m7aeffd5faf\" y=\"150.1440625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"83.2975\" xlink:href=\"#m7aeffd5faf\" y=\"144.5640625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"91.6675\" xlink:href=\"#m7aeffd5faf\" y=\"132.0090625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"105.6175\" xlink:href=\"#m7aeffd5faf\" y=\"134.7990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"108.4075\" xlink:href=\"#m7aeffd5faf\" y=\"129.2190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"97.2475\" xlink:href=\"#m7aeffd5faf\" y=\"154.3290625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"105.6175\" xlink:href=\"#m7aeffd5faf\" y=\"158.5140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"108.4075\" xlink:href=\"#m7aeffd5faf\" y=\"151.5390625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"113.9875\" xlink:href=\"#m7aeffd5faf\" y=\"162.6990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"116.7775\" xlink:href=\"#m7aeffd5faf\" y=\"151.5390625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"119.5675\" xlink:href=\"#m7aeffd5faf\" y=\"150.1440625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"130.7275\" xlink:href=\"#m7aeffd5faf\" y=\"134.7990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"108.4075\" xlink:href=\"#m7aeffd5faf\" y=\"30.1740625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"119.5675\" xlink:href=\"#m7aeffd5faf\" y=\"27.3840625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"127.9375\" xlink:href=\"#m7aeffd5faf\" y=\"21.8040625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"122.3575\" xlink:href=\"#m7aeffd5faf\" y=\"20.4090625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"136.3075\" xlink:href=\"#m7aeffd5faf\" y=\"24.5940625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"144.6775\" xlink:href=\"#m7aeffd5faf\" y=\"25.9890625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"136.3075\" xlink:href=\"#m7aeffd5faf\" y=\"32.9640625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"136.3075\" xlink:href=\"#m7aeffd5faf\" y=\"35.7540625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"119.5675\" xlink:href=\"#m7aeffd5faf\" y=\"35.7540625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"125.1475\" xlink:href=\"#m7aeffd5faf\" y=\"38.5440625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"153.0475\" xlink:href=\"#m7aeffd5faf\" y=\"17.6190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"153.0475\" xlink:href=\"#m7aeffd5faf\" y=\"27.3840625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"153.0475\" xlink:href=\"#m7aeffd5faf\" y=\"35.7540625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"158.6275\" xlink:href=\"#m7aeffd5faf\" y=\"37.1490625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"161.4175\" xlink:href=\"#m7aeffd5faf\" y=\"27.3840625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"166.9975\" xlink:href=\"#m7aeffd5faf\" y=\"23.1990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"169.7875\" xlink:href=\"#m7aeffd5faf\" y=\"37.1490625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"178.1575\" xlink:href=\"#m7aeffd5faf\" y=\"34.3590625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"175.3675\" xlink:href=\"#m7aeffd5faf\" y=\"23.1990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"183.7375\" xlink:href=\"#m7aeffd5faf\" y=\"19.0140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"180.9475\" xlink:href=\"#m7aeffd5faf\" y=\"62.2590625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"197.6875\" xlink:href=\"#m7aeffd5faf\" y=\"45.5190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"206.0575\" xlink:href=\"#m7aeffd5faf\" y=\"41.3340625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"270.2275\" xlink:href=\"#m7aeffd5faf\" y=\"51.0990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"267.4375\" xlink:href=\"#m7aeffd5faf\" y=\"74.8140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"267.4375\" xlink:href=\"#m7aeffd5faf\" y=\"101.3190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"247.9075\" xlink:href=\"#m7aeffd5faf\" y=\"104.1090625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"236.7475\" xlink:href=\"#m7aeffd5faf\" y=\"101.3190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"300.9175\" xlink:href=\"#m7aeffd5faf\" y=\"65.0490625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"303.7075\" xlink:href=\"#m7aeffd5faf\" y=\"73.4190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"303.7075\" xlink:href=\"#m7aeffd5faf\" y=\"62.2590625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"306.4975\" xlink:href=\"#m7aeffd5faf\" y=\"69.2340625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"306.4975\" xlink:href=\"#m7aeffd5faf\" y=\"56.6790625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"312.0775\" xlink:href=\"#m7aeffd5faf\" y=\"74.8140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"331.6075\" xlink:href=\"#m7aeffd5faf\" y=\"80.3940625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"337.1875\" xlink:href=\"#m7aeffd5faf\" y=\"80.3940625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"331.6075\" xlink:href=\"#m7aeffd5faf\" y=\"73.4190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"339.9775\" xlink:href=\"#m7aeffd5faf\" y=\"59.4690625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"351.1375\" xlink:href=\"#m7aeffd5faf\" y=\"72.0240625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"356.7175\" xlink:href=\"#m7aeffd5faf\" y=\"74.8140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"225.5875\" xlink:href=\"#m7aeffd5faf\" y=\"229.6590625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"245.1175\" xlink:href=\"#m7aeffd5faf\" y=\"218.4990625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"261.8575\" xlink:href=\"#m7aeffd5faf\" y=\"205.9440625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"200.4775\" xlink:href=\"#m7aeffd5faf\" y=\"214.3140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"222.7975\" xlink:href=\"#m7aeffd5faf\" y=\"214.3140625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"247.9075\" xlink:href=\"#m7aeffd5faf\" y=\"212.9190625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"214.4275\" xlink:href=\"#m7aeffd5faf\" y=\"210.1290625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"192.1075\" xlink:href=\"#m7aeffd5faf\" y=\"217.1040625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"208.8475\" xlink:href=\"#m7aeffd5faf\" y=\"207.3390625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"222.7975\" xlink:href=\"#m7aeffd5faf\" y=\"205.9440625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"214.4275\" xlink:href=\"#m7aeffd5faf\" y=\"203.1540625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"200.4775\" xlink:href=\"#m7aeffd5faf\" y=\"200.3640625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"242.3275\" xlink:href=\"#m7aeffd5faf\" y=\"197.5740625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"231.1675\" xlink:href=\"#m7aeffd5faf\" y=\"191.9940625\"/>\n",
       "     <use style=\"fill:#0000ff;stroke:#000000;stroke-linejoin:bevel;stroke-width:0.5;\" x=\"208.8475\" xlink:href=\"#m7aeffd5faf\" y=\"193.3890625\"/>\n",
       "    </g>\n",
       "   </g>\n",
       "   <g id=\"patch_3\">\n",
       "    <path d=\"M 30.2875 12.039062 \n",
       "L 365.0875 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_4\">\n",
       "    <path d=\"M 365.0875 235.239062 \n",
       "L 365.0875 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_5\">\n",
       "    <path d=\"M 30.2875 235.239062 \n",
       "L 365.0875 235.239062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_6\">\n",
       "    <path d=\"M 30.2875 235.239062 \n",
       "L 30.2875 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"matplotlib.axis_1\">\n",
       "    <g id=\"xtick_1\">\n",
       "     <g id=\"line2d_2\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 0 -4 \n",
       "\" id=\"mc3a28f2dac\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_3\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 0 4 \n",
       "\" id=\"m09346803ad\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_1\">\n",
       "      <!-- 0 -->\n",
       "      <defs>\n",
       "       <path d=\"M 31.78125 66.40625 \n",
       "Q 24.171875 66.40625 20.328125 58.90625 \n",
       "Q 16.5 51.421875 16.5 36.375 \n",
       "Q 16.5 21.390625 20.328125 13.890625 \n",
       "Q 24.171875 6.390625 31.78125 6.390625 \n",
       "Q 39.453125 6.390625 43.28125 13.890625 \n",
       "Q 47.125 21.390625 47.125 36.375 \n",
       "Q 47.125 51.421875 43.28125 58.90625 \n",
       "Q 39.453125 66.40625 31.78125 66.40625 \n",
       "M 31.78125 74.21875 \n",
       "Q 44.046875 74.21875 50.515625 64.515625 \n",
       "Q 56.984375 54.828125 56.984375 36.375 \n",
       "Q 56.984375 17.96875 50.515625 8.265625 \n",
       "Q 44.046875 -1.421875 31.78125 -1.421875 \n",
       "Q 19.53125 -1.421875 13.0625 8.265625 \n",
       "Q 6.59375 17.96875 6.59375 36.375 \n",
       "Q 6.59375 54.828125 13.0625 64.515625 \n",
       "Q 19.53125 74.21875 31.78125 74.21875 \n",
       "\" id=\"BitstreamVeraSans-Roman-30\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(27.10625 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_2\">\n",
       "     <g id=\"line2d_4\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"86.0875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_5\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"86.0875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_2\">\n",
       "      <!-- 20 -->\n",
       "      <defs>\n",
       "       <path d=\"M 19.1875 8.296875 \n",
       "L 53.609375 8.296875 \n",
       "L 53.609375 0 \n",
       "L 7.328125 0 \n",
       "L 7.328125 8.296875 \n",
       "Q 12.9375 14.109375 22.625 23.890625 \n",
       "Q 32.328125 33.6875 34.8125 36.53125 \n",
       "Q 39.546875 41.84375 41.421875 45.53125 \n",
       "Q 43.3125 49.21875 43.3125 52.78125 \n",
       "Q 43.3125 58.59375 39.234375 62.25 \n",
       "Q 35.15625 65.921875 28.609375 65.921875 \n",
       "Q 23.96875 65.921875 18.8125 64.3125 \n",
       "Q 13.671875 62.703125 7.8125 59.421875 \n",
       "L 7.8125 69.390625 \n",
       "Q 13.765625 71.78125 18.9375 73 \n",
       "Q 24.125 74.21875 28.421875 74.21875 \n",
       "Q 39.75 74.21875 46.484375 68.546875 \n",
       "Q 53.21875 62.890625 53.21875 53.421875 \n",
       "Q 53.21875 48.921875 51.53125 44.890625 \n",
       "Q 49.859375 40.875 45.40625 35.40625 \n",
       "Q 44.1875 33.984375 37.640625 27.21875 \n",
       "Q 31.109375 20.453125 19.1875 8.296875 \n",
       "\" id=\"BitstreamVeraSans-Roman-32\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(79.725 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_3\">\n",
       "     <g id=\"line2d_6\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"141.8875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_7\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"141.8875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_3\">\n",
       "      <!-- 40 -->\n",
       "      <defs>\n",
       "       <path d=\"M 37.796875 64.3125 \n",
       "L 12.890625 25.390625 \n",
       "L 37.796875 25.390625 \n",
       "z\n",
       "M 35.203125 72.90625 \n",
       "L 47.609375 72.90625 \n",
       "L 47.609375 25.390625 \n",
       "L 58.015625 25.390625 \n",
       "L 58.015625 17.1875 \n",
       "L 47.609375 17.1875 \n",
       "L 47.609375 0 \n",
       "L 37.796875 0 \n",
       "L 37.796875 17.1875 \n",
       "L 4.890625 17.1875 \n",
       "L 4.890625 26.703125 \n",
       "z\n",
       "\" id=\"BitstreamVeraSans-Roman-34\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(135.525 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-34\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_4\">\n",
       "     <g id=\"line2d_8\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"197.6875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_9\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"197.6875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_4\">\n",
       "      <!-- 60 -->\n",
       "      <defs>\n",
       "       <path d=\"M 33.015625 40.375 \n",
       "Q 26.375 40.375 22.484375 35.828125 \n",
       "Q 18.609375 31.296875 18.609375 23.390625 \n",
       "Q 18.609375 15.53125 22.484375 10.953125 \n",
       "Q 26.375 6.390625 33.015625 6.390625 \n",
       "Q 39.65625 6.390625 43.53125 10.953125 \n",
       "Q 47.40625 15.53125 47.40625 23.390625 \n",
       "Q 47.40625 31.296875 43.53125 35.828125 \n",
       "Q 39.65625 40.375 33.015625 40.375 \n",
       "M 52.59375 71.296875 \n",
       "L 52.59375 62.3125 \n",
       "Q 48.875 64.0625 45.09375 64.984375 \n",
       "Q 41.3125 65.921875 37.59375 65.921875 \n",
       "Q 27.828125 65.921875 22.671875 59.328125 \n",
       "Q 17.53125 52.734375 16.796875 39.40625 \n",
       "Q 19.671875 43.65625 24.015625 45.921875 \n",
       "Q 28.375 48.1875 33.59375 48.1875 \n",
       "Q 44.578125 48.1875 50.953125 41.515625 \n",
       "Q 57.328125 34.859375 57.328125 23.390625 \n",
       "Q 57.328125 12.15625 50.6875 5.359375 \n",
       "Q 44.046875 -1.421875 33.015625 -1.421875 \n",
       "Q 20.359375 -1.421875 13.671875 8.265625 \n",
       "Q 6.984375 17.96875 6.984375 36.375 \n",
       "Q 6.984375 53.65625 15.1875 63.9375 \n",
       "Q 23.390625 74.21875 37.203125 74.21875 \n",
       "Q 40.921875 74.21875 44.703125 73.484375 \n",
       "Q 48.484375 72.75 52.59375 71.296875 \n",
       "\" id=\"BitstreamVeraSans-Roman-36\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(191.325 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-36\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_5\">\n",
       "     <g id=\"line2d_10\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"253.4875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_11\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"253.4875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_5\">\n",
       "      <!-- 80 -->\n",
       "      <defs>\n",
       "       <path d=\"M 31.78125 34.625 \n",
       "Q 24.75 34.625 20.71875 30.859375 \n",
       "Q 16.703125 27.09375 16.703125 20.515625 \n",
       "Q 16.703125 13.921875 20.71875 10.15625 \n",
       "Q 24.75 6.390625 31.78125 6.390625 \n",
       "Q 38.8125 6.390625 42.859375 10.171875 \n",
       "Q 46.921875 13.96875 46.921875 20.515625 \n",
       "Q 46.921875 27.09375 42.890625 30.859375 \n",
       "Q 38.875 34.625 31.78125 34.625 \n",
       "M 21.921875 38.8125 \n",
       "Q 15.578125 40.375 12.03125 44.71875 \n",
       "Q 8.5 49.078125 8.5 55.328125 \n",
       "Q 8.5 64.0625 14.71875 69.140625 \n",
       "Q 20.953125 74.21875 31.78125 74.21875 \n",
       "Q 42.671875 74.21875 48.875 69.140625 \n",
       "Q 55.078125 64.0625 55.078125 55.328125 \n",
       "Q 55.078125 49.078125 51.53125 44.71875 \n",
       "Q 48 40.375 41.703125 38.8125 \n",
       "Q 48.828125 37.15625 52.796875 32.3125 \n",
       "Q 56.78125 27.484375 56.78125 20.515625 \n",
       "Q 56.78125 9.90625 50.3125 4.234375 \n",
       "Q 43.84375 -1.421875 31.78125 -1.421875 \n",
       "Q 19.734375 -1.421875 13.25 4.234375 \n",
       "Q 6.78125 9.90625 6.78125 20.515625 \n",
       "Q 6.78125 27.484375 10.78125 32.3125 \n",
       "Q 14.796875 37.15625 21.921875 38.8125 \n",
       "M 18.3125 54.390625 \n",
       "Q 18.3125 48.734375 21.84375 45.5625 \n",
       "Q 25.390625 42.390625 31.78125 42.390625 \n",
       "Q 38.140625 42.390625 41.71875 45.5625 \n",
       "Q 45.3125 48.734375 45.3125 54.390625 \n",
       "Q 45.3125 60.0625 41.71875 63.234375 \n",
       "Q 38.140625 66.40625 31.78125 66.40625 \n",
       "Q 25.390625 66.40625 21.84375 63.234375 \n",
       "Q 18.3125 60.0625 18.3125 54.390625 \n",
       "\" id=\"BitstreamVeraSans-Roman-38\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(247.125 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-38\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_6\">\n",
       "     <g id=\"line2d_12\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"309.2875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_13\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"309.2875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_6\">\n",
       "      <!-- 100 -->\n",
       "      <defs>\n",
       "       <path d=\"M 12.40625 8.296875 \n",
       "L 28.515625 8.296875 \n",
       "L 28.515625 63.921875 \n",
       "L 10.984375 60.40625 \n",
       "L 10.984375 69.390625 \n",
       "L 28.421875 72.90625 \n",
       "L 38.28125 72.90625 \n",
       "L 38.28125 8.296875 \n",
       "L 54.390625 8.296875 \n",
       "L 54.390625 0 \n",
       "L 12.40625 0 \n",
       "z\n",
       "\" id=\"BitstreamVeraSans-Roman-31\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(299.74375 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_7\">\n",
       "     <g id=\"line2d_14\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#mc3a28f2dac\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_15\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m09346803ad\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_7\">\n",
       "      <!-- 120 -->\n",
       "      <g transform=\"translate(355.54375 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "   </g>\n",
       "   <g id=\"matplotlib.axis_2\">\n",
       "    <g id=\"ytick_1\">\n",
       "     <g id=\"line2d_16\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 4 0 \n",
       "\" id=\"m94960ee81f\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_17\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L -4 0 \n",
       "\" id=\"m3f42e78902\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_8\">\n",
       "      <!-- 0 -->\n",
       "      <g transform=\"translate(19.925 237.9984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_2\">\n",
       "     <g id=\"line2d_18\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"207.3390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_19\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"207.3390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_9\">\n",
       "      <!-- 20 -->\n",
       "      <g transform=\"translate(13.5625 210.0984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_3\">\n",
       "     <g id=\"line2d_20\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"179.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_21\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"179.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_10\">\n",
       "      <!-- 40 -->\n",
       "      <g transform=\"translate(13.5625 182.1984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-34\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_4\">\n",
       "     <g id=\"line2d_22\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"151.5390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_23\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"151.5390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_11\">\n",
       "      <!-- 60 -->\n",
       "      <g transform=\"translate(13.5625 154.2984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-36\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_5\">\n",
       "     <g id=\"line2d_24\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"123.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_25\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"123.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_12\">\n",
       "      <!-- 80 -->\n",
       "      <g transform=\"translate(13.5625 126.3984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-38\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_6\">\n",
       "     <g id=\"line2d_26\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"95.7390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_27\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"95.7390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_13\">\n",
       "      <!-- 100 -->\n",
       "      <g transform=\"translate(7.2 98.4984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_7\">\n",
       "     <g id=\"line2d_28\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"67.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_29\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"67.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_14\">\n",
       "      <!-- 120 -->\n",
       "      <g transform=\"translate(7.2 70.5984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_8\">\n",
       "     <g id=\"line2d_30\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"39.9390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_31\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"39.9390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_15\">\n",
       "      <!-- 140 -->\n",
       "      <g transform=\"translate(7.2 42.6984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-34\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_9\">\n",
       "     <g id=\"line2d_32\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"30.2875\" xlink:href=\"#m94960ee81f\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_33\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"365.0875\" xlink:href=\"#m3f42e78902\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_16\">\n",
       "      <!-- 160 -->\n",
       "      <g transform=\"translate(7.2 14.7984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-36\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "   </g>\n",
       "  </g>\n",
       " </g>\n",
       " <defs>\n",
       "  <clipPath id=\"pb7111ed48b\">\n",
       "   <rect height=\"223.2\" width=\"334.8\" x=\"30.2875\" y=\"12.0390625\"/>\n",
       "  </clipPath>\n",
       " </defs>\n",
       "</svg>\n"
      ],
      "text/plain": [
       "<matplotlib.figure.Figure at 0x7f20e94ae4d0>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# Reading data #\n",
    "#\n",
    "#  For our first experiments, the items we are going to try to cluster \n",
    "#  will be the (x,y) coordinates of 75 points.\n",
    "#\n",
    "#  The data values are stored in the file \"ruspini.txt\".\n",
    "#\n",
    "#  Download the data from the web page:\n",
    "#\n",
    "#    http://people.sc.fsu.edu/~jburkardt/classes/urop_2016/ruspini.txt\n",
    "#\n",
    "#  Read the data from the file.\n",
    "#\n",
    "#  Rewrite your work so it is a function:\n",
    "#\n",
    "#    def ruspini_data ( ):\n",
    "#      ***\n",
    "#      return xy\n",
    "#\n",
    "def ruspini_data ( ):\n",
    "    \n",
    "  xy = np.zeros ( [ 75, 2 ] )\n",
    "  \n",
    "  input = open ( 'ruspini.txt', 'r' )\n",
    "  \n",
    "  i = 0\n",
    "  for line in input:\n",
    "    j = 0\n",
    "    for word in line.strip().split():\n",
    "      xy[i,j] = float ( word )\n",
    "      j = j + 1\n",
    "    i = i + 1\n",
    "\n",
    "  input.close ( )\n",
    "\n",
    "  return xy\n",
    "#\n",
    "#  Use the function.\n",
    "#\n",
    "#  Plot the data, and see that it naturally separates into\n",
    "#  a certain number of clusters.\n",
    "#\n",
    "xy = ruspini_data ( )\n",
    "plt.plot ( xy[:,0], xy[:,1], 'b*' )"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "\n",
      "   I  MOD1  MOD2 RANDOM1 RANDOM2:\n",
      "\n",
      "   0     0     0     2     0\n",
      "   1     1     1     2     2\n",
      "   2     2     2     0     2\n",
      "   3     0     0     1     0\n",
      "   4     1     1     0     1\n",
      "   5     2     2     1     0\n",
      "   6     0     0     2     2\n",
      "   7     1     1     1     2\n",
      "   8     2     2     1     1\n",
      "   9     0     0     1     1\n",
      "  10     1     1     2     0\n",
      "  11     2     2     0     0\n",
      "  12     0     0     2     0\n",
      "  13     1     1     1     2\n",
      "  14     2     2     1     1\n",
      "  15     0     0     1     1\n",
      "  16     1     1     1     0\n",
      "  17     2     2     2     1\n",
      "  18     0     0     0     0\n",
      "  19     1     1     2     0\n",
      "  20     2     2     1     2\n",
      "  21     0     0     0     0\n",
      "  22     1     1     1     2\n",
      "  23     2     2     2     1\n",
      "  24     0     0     2     1\n",
      "  25     1     1     0     2\n",
      "  26     2     2     0     1\n",
      "  27     0     0     2     1\n",
      "  28     1     1     1     0\n",
      "  29     2     2     0     2\n",
      "  30     0     0     0     0\n",
      "  31     1     1     2     2\n",
      "  32     2     2     1     1\n",
      "  33     0     0     2     1\n",
      "  34     1     1     2     0\n",
      "  35     2     2     0     0\n",
      "  36     0     0     1     0\n",
      "  37     1     1     0     1\n",
      "  38     2     2     2     2\n",
      "  39     0     0     2     1\n",
      "  40     1     1     1     2\n",
      "  41     2     2     2     0\n",
      "  42     0     0     1     1\n",
      "  43     1     1     0     0\n",
      "  44     2     2     2     1\n",
      "  45     0     0     0     0\n",
      "  46     1     1     1     2\n",
      "  47     2     2     0     2\n",
      "  48     0     0     0     0\n",
      "  49     1     1     1     0\n",
      "  50     2     2     0     0\n",
      "  51     0     0     2     1\n",
      "  52     1     1     0     2\n",
      "  53     2     2     2     2\n",
      "  54     0     0     2     1\n",
      "  55     1     1     0     0\n",
      "  56     2     2     2     2\n",
      "  57     0     0     2     2\n",
      "  58     1     1     0     0\n",
      "  59     2     2     2     2\n",
      "  60     0     0     1     2\n",
      "  61     1     1     0     1\n",
      "  62     2     2     0     2\n",
      "  63     0     0     1     2\n",
      "  64     1     1     0     0\n",
      "  65     2     2     2     2\n",
      "  66     0     0     1     1\n",
      "  67     1     1     0     2\n",
      "  68     2     2     0     0\n",
      "  69     0     0     0     1\n",
      "  70     1     1     1     1\n",
      "  71     2     2     2     0\n",
      "  72     0     0     0     0\n",
      "  73     1     1     2     0\n",
      "  74     2     2     1     0\n"
     ]
    }
   ],
   "source": [
    "# Initial Cluster Assignment\n",
    "#\n",
    "#  If xy represents our data values, then for each xy[i],\n",
    "#  we need an array c so that c[i] records the cluster \n",
    "#  that the points belongs to.\n",
    "#\n",
    "#  To start the K-means algorithm, we might as well just\n",
    "#  randomly assign points to clusters.\n",
    "#\n",
    "#  We don't want any cluster to be empty.  So the easiest\n",
    "#  thing to do is assign point 0 to cluster 0, point 1 to\n",
    "#  cluster 1, ..., point K-1 to cluster K-1, and then\n",
    "#  wrap around to point K is sent to cluster 0, point K+1\n",
    "#  to cluster 1 and so on, which is essentially using the\n",
    "#  mod function.\n",
    "#\n",
    "#  The other simple choice is to call NUMPY's random.randint()\n",
    "#  function to return a random list of integers between 0\n",
    "#  and k-1.  The advantage of this choice is that we will\n",
    "#  get a different initial configuration every time we call.\n",
    "#\n",
    "#  Write functions:\n",
    "#\n",
    "#    def initial_clusters_mod ( n, k ):\n",
    "#      ***\n",
    "#      return c\n",
    "#\n",
    "#    def initial_clusters_random ( n, k ):\n",
    "#      ***\n",
    "#      return c\n",
    "#\n",
    "#  For the case N = 20 and K = 3, call each function twice, \n",
    "#  print the results, and show that the second function gives\n",
    "#  different results each time.\n",
    "#\n",
    "def initial_clusters_mod ( n, k ):\n",
    "  c = np.zeros ( n, dtype = np.int32 )\n",
    "  for i in range ( 0, n ):\n",
    "    c[i] = ( i % k )\n",
    "  return c\n",
    "\n",
    "def initial_clusters_random ( n, k ):\n",
    "  c = np.random.randint ( 0, k, n )\n",
    "  return c\n",
    "\n",
    "xy = ruspini_data ( )\n",
    "n = 75\n",
    "k = 3\n",
    "\n",
    "c1 = initial_clusters_mod ( n, k )\n",
    "c2 = initial_clusters_mod ( n, k )\n",
    "c3 = initial_clusters_random ( n, k )\n",
    "c4 = initial_clusters_random ( n, k )\n",
    "\n",
    "print ( '' )\n",
    "print ( '   I  MOD1  MOD2 RANDOM1 RANDOM2:' )\n",
    "print ( '' )\n",
    "for i in range ( 0, n ):\n",
    "  print ( '%4d  %4d  %4d  %4d  %4d' % ( i, c1[i], c2[i], c3[i],c4[i] ) )  \n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "\n",
      "  Cluster   Pop  Energy\n",
      "\n",
      "        0    27  90289.037037\n",
      "        1    23  63374.782609\n",
      "        2    25  85914.320000\n",
      "\n",
      "  Total      75                        239578.139646\n"
     ]
    }
   ],
   "source": [
    "# Cluster statistics #\n",
    "#\n",
    "#  Assuming we have K clusters, and that the vector J=C[I] stores the\n",
    "#  cluster J to which point X[I] is assigned, we want to compute the \n",
    "#  following arrays for each cluster J\n",
    "#\n",
    "#  POP(J) = number of items in the cluster\n",
    "#  MN(J,2) = the mean or average of the items in the cluster\n",
    "#  E(J) = energy of the cluster\n",
    "#\n",
    "#  Use K = 3, and N = 30, and the random cluster assigment function\n",
    "#  initial_clusters_random(), compute these quantities.\n",
    "#\n",
    "#  When you like what you have written, rewrite it as a function:\n",
    "#\n",
    "#  def cluster_stats ( n, xy, c, k ):\n",
    "#    ***\n",
    "#    return pop, mn, e\n",
    "#\n",
    "#  so we can update these statistics as we change the cluster assignments.\n",
    "#\n",
    "def cluster_stats ( n, xy, c, k ):\n",
    "\n",
    "  pop = np.zeros ( k, dtype = np.int32 )\n",
    "  dim = xy.shape[1]\n",
    "  mn = np.zeros ( [ k, dim ] )\n",
    "  e = np.zeros ( k )\n",
    "\n",
    "  for i in range ( 0, n ):\n",
    "    j = c[i]\n",
    "    pop[j] = pop[j] + 1\n",
    "    mn[j,:] = mn[j,:] + xy[i,:]\n",
    "\n",
    "  for j in range ( 0, k ):\n",
    "    mn[j,:] = mn[j,:] / float ( max ( pop[j], 1 ) )\n",
    "\n",
    "  for i in range ( 0, n ):\n",
    "    j = c[i]\n",
    "    e[j] = e[j] + ( np.linalg.norm ( xy[i,:] - mn[j,:] ) ) ** 2\n",
    "\n",
    "  return pop, mn, e\n",
    "#\n",
    "#  Use the function.\n",
    "#\n",
    "n = 75\n",
    "k = 3\n",
    "xy = ruspini_data ( )\n",
    "c = initial_clusters_random ( n, k )\n",
    "pop, mn, e = cluster_stats ( n, xy, c, k )\n",
    "\n",
    "print ( '' )\n",
    "print ( '  Cluster   Pop  Energy')\n",
    "print ( '')\n",
    "\n",
    "for j in range ( 0, k ):\n",
    "  print ( '      %3d  %4d  %f' % ( j, pop[j], e[j] ) )\n",
    "\n",
    "print ( '' )\n",
    "print ( '  Total    %4d                        %f' % ( np.sum ( pop ), np.sum ( e ) ) )\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "\n",
      "Point  Current Nearest\n",
      "\n",
      "   0     1     0\n",
      "   1     0     2\n",
      "   2     0     1\n",
      "   3     2     2\n",
      "   4     0     0\n",
      "   5     0     2\n",
      "   6     0     0\n",
      "   7     0     1\n",
      "   8     1     1\n",
      "   9     0     0\n",
      "  10     0     0\n",
      "  11     1     1\n",
      "  12     2     2\n",
      "  13     0     0\n",
      "  14     1     1\n",
      "  15     0     0\n",
      "  16     0     2\n",
      "  17     0     0\n",
      "  18     2     0\n",
      "  19     0     0\n",
      "  20     2     2\n",
      "  21     1     2\n",
      "  22     0     0\n",
      "  23     0     2\n",
      "  24     1     1\n",
      "  25     0     2\n",
      "  26     1     0\n",
      "  27     1     0\n",
      "  28     1     1\n",
      "  29     1     0\n",
      "  30     0     0\n",
      "  31     2     0\n",
      "  32     1     1\n",
      "  33     1     2\n",
      "  34     0     0\n",
      "  35     1     1\n",
      "  36     1     1\n",
      "  37     1     2\n",
      "  38     2     2\n",
      "  39     2     1\n",
      "  40     0     0\n",
      "  41     1     0\n",
      "  42     1     2\n",
      "  43     1     1\n",
      "  44     1     2\n",
      "  45     2     1\n",
      "  46     1     1\n",
      "  47     1     1\n",
      "  48     2     2\n",
      "  49     2     0\n",
      "  50     1     0\n",
      "  51     2     2\n",
      "  52     2     0\n",
      "  53     1     2\n",
      "  54     1     1\n",
      "  55     1     2\n",
      "  56     0     2\n",
      "  57     1     2\n",
      "  58     1     0\n",
      "  59     2     2\n",
      "  60     1     0\n",
      "  61     2     0\n",
      "  62     1     0\n",
      "  63     0     1\n",
      "  64     1     2\n",
      "  65     2     1\n",
      "  66     2     2\n",
      "  67     0     0\n",
      "  68     1     2\n",
      "  69     0     1\n",
      "  70     2     1\n",
      "  71     0     0\n",
      "  72     1     1\n",
      "  73     0     2\n",
      "  74     2     1\n"
     ]
    }
   ],
   "source": [
    "#  Find Nearest Cluster\n",
    "#\n",
    "#  We want the data to be clustered, and we use the energy to measure this.  \n",
    "#  Energy depends on how close you are to the mean of your cluster.\n",
    "#  Suppose a data item is in the \"wrong\" cluster.  We will take this\n",
    "#  to mean that the data item is closer to the mean of some other cluster\n",
    "#  than it is to the mean of its current cluster.  What would happen if\n",
    "#  we transferred that item to the cluster of the closer mean?  The energy\n",
    "#  in the old cluster would go down, and the energy in the new cluster would\n",
    "#  go up.  But the energy of the old cluster will go down more than the \n",
    "#  energy of the new cluster will go up (because they are based on the distance\n",
    "#  to the cluster mean)  That means our total energy will go down, and that's good.\n",
    "# \n",
    "#  Write a function which looks like this:\n",
    "#\n",
    "#    def nearest_cluster ( n, i, xy, k, mn ):\n",
    "#      *****\n",
    "#      return j2\n",
    "#\n",
    "#  which has as input and output:\n",
    "#    N, the number of data items;\n",
    "#    I, the data item that we are interested in;\n",
    "#    XY, the list of all data item coordinates;\n",
    "#    K, the number of clusters;\n",
    "#    MN, the cluster means;\n",
    "#    J2, the index of the cluster mean that is closest to data item I.\n",
    "#\n",
    "def nearest_cluster ( n, i, xy, k, mn ):\n",
    "  d2 = np.inf\n",
    "  j2 = -1\n",
    "  for j in range ( 0, k ):\n",
    "    d = ( xy[i,0] - mn[j,0] ) ** 2 + ( xy[i,1] - mn[j,1] ) ** 2\n",
    "    if ( d < d2 ):\n",
    "      d2 = d\n",
    "      j2 = j\n",
    "  return j2\n",
    "#\n",
    "#  Use the function on an example.\n",
    "#\n",
    "n = 75\n",
    "k = 3\n",
    "xy = np.random.random ( [ n, 2 ] )\n",
    "c = initial_clusters_random ( n, k )\n",
    "pop, mn, e = cluster_stats ( n, xy, c, k )\n",
    "print ( '' )\n",
    "print ( 'Point  Current Nearest' )\n",
    "print ( '' )\n",
    "for i in range ( 0, n ):\n",
    "  j = c[i]\n",
    "  j2 = nearest_cluster ( n, i, xy, k, mn )\n",
    "  print ( '%4d  %4d  %4d' % ( i, j, j2 ) )"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "53 data items moved\n",
      "\n",
      " I  C(old) C(new)\n",
      "\n",
      " 0   1   2\n",
      " 1   1   2\n",
      " 2   2   2\n",
      " 3   2   2\n",
      " 4   0   2\n",
      " 5   1   2\n",
      " 6   1   2\n",
      " 7   0   2\n",
      " 8   2   2\n",
      " 9   1   2\n",
      "10   1   2\n",
      "11   1   2\n",
      "12   1   2\n",
      "13   3   2\n",
      "14   1   2\n",
      "15   0   2\n",
      "16   2   2\n",
      "17   0   2\n",
      "18   2   2\n",
      "19   1   2\n",
      "20   3   1\n",
      "21   1   1\n",
      "22   1   1\n",
      "23   1   1\n",
      "24   0   1\n",
      "25   2   1\n",
      "26   0   1\n",
      "27   1   1\n",
      "28   3   1\n",
      "29   2   1\n",
      "30   0   1\n",
      "31   0   1\n",
      "32   0   1\n",
      "33   2   1\n",
      "34   1   1\n",
      "35   1   1\n",
      "36   3   1\n",
      "37   1   1\n",
      "38   0   1\n",
      "39   3   1\n",
      "40   1   1\n",
      "41   3   3\n",
      "42   0   3\n",
      "43   2   3\n",
      "44   1   3\n",
      "45   0   3\n",
      "46   0   3\n",
      "47   3   3\n",
      "48   2   3\n",
      "49   3   3\n",
      "50   3   3\n",
      "51   0   3\n",
      "52   1   3\n",
      "53   3   3\n",
      "54   1   3\n",
      "55   1   3\n",
      "56   1   3\n",
      "57   3   3\n",
      "58   1   3\n",
      "59   2   3\n",
      "60   2   2\n",
      "61   3   2\n",
      "62   3   2\n",
      "63   2   2\n",
      "64   1   2\n",
      "65   0   2\n",
      "66   1   2\n",
      "67   3   2\n",
      "68   3   2\n",
      "69   0   2\n",
      "70   2   2\n",
      "71   0   2\n",
      "72   1   2\n",
      "73   0   2\n",
      "74   1   2\n"
     ]
    }
   ],
   "source": [
    "# Function to reassign data\n",
    "#\n",
    "#  Now we want to allow each data item to change its cluster.\n",
    "#  Write a function which, for a given data item X(I), computes the\n",
    "#  distance between X(I) and the mean MN(J) of each cluster.\n",
    "#  Assume X(I) is currently assigned to cluster J=C[I], but is\n",
    "#  closest to the mean of cluster J2.  Unless J=J2, move X(I)\n",
    "#  to the new cluster (that is, change the value of C[I])\n",
    "#\n",
    "#  Write a function\n",
    "#\n",
    "#    def move_data ( n, xy, c, k, mn ):\n",
    "#      ***\n",
    "#      return c, swaps\n",
    "#\n",
    "#  which returns the updated cluster information C, but also\n",
    "#  in SWAPS returns the number of points that changed their cluster.\n",
    "#\n",
    "def move_data ( n, xy, c, k, mn ):\n",
    "    \n",
    "  swaps = 0\n",
    "\n",
    "  for i in range ( 0, n ):\n",
    "    j = c[i]\n",
    "    j2 = nearest_cluster ( n, i, xy, k, mn  )\n",
    "    if ( j2 != j ):\n",
    "      swaps = swaps + 1\n",
    "      c[i] = j2\n",
    "\n",
    "  return c, swaps\n",
    "#\n",
    "#  Try the function on the ruspini example.\n",
    "#  The data is sorted and will break up into 4 groups naturally.\n",
    "#\n",
    "k = 4\n",
    "n = 75\n",
    "xy = ruspini_data ( )\n",
    "c = initial_clusters_random ( n, k )\n",
    "cold = c.copy ( )\n",
    "pop, mn, e = cluster_stats ( n, xy, c, k )\n",
    "\n",
    "c, swaps = move_data ( n, xy, c, k, mn )\n",
    "\n",
    "if ( swaps == 0 ):\n",
    "  print ( 'No data items moved!')\n",
    "else:\n",
    "  print ( '%d data items moved' % ( swaps ) )\n",
    "  print ( '' )\n",
    "  print ( ' I  C(old) C(new)' )\n",
    "  print ( '' )\n",
    "  for i in range ( 0, n ):\n",
    "    print ( '%2d  %2d  %2d' % ( i, cold[i], c[i] ) )\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "\n",
      "Trial 0\n",
      " 0  1672.907720\n",
      " 1  924.645848\n",
      " 2  685.233876\n",
      " 3  665.626837\n",
      " 4  661.556076\n",
      " 5  660.697151\n",
      " 6  660.449910\n",
      " 7  660.303590\n",
      " 8  660.242451\n",
      " 9  660.216640\n",
      "\n",
      "Trial 1\n",
      " 0  1672.125335\n",
      " 1  682.548801\n",
      " 2  664.353694\n",
      " 3  662.542911\n",
      " 4  661.994316\n",
      " 5  661.598716\n",
      " 6  661.174854\n",
      " 7  660.790794\n",
      " 8  660.447890\n",
      " 9  660.195312\n",
      "\n",
      "Trial 2\n",
      " 0  1673.050488\n",
      " 1  1080.100079\n",
      " 2  699.487326\n",
      " 3  665.575445\n",
      " 4  664.513566\n",
      " 5  664.285867\n",
      " 6  664.228903\n",
      " 7  664.197820\n",
      " 8  664.125623\n",
      " 9  664.043365\n"
     ]
    }
   ],
   "source": [
    "#  Now take 10000 random points, and try to cluster them into 3 groups.\n",
    "#\n",
    "#  Use an initial random clustering, and then recluster 10 times:\n",
    "#\n",
    "#  n = 10000\n",
    "#  k = 3\n",
    "#  xy = np.random.random ( [ n, 2 ] )\n",
    "#  c = initial_clusters_random ( n, k )\n",
    "#  for step in range ( 0, 10 ):\n",
    "#    get cluster stats\n",
    "#    move data\n",
    "#    print total energy\n",
    "#\n",
    "#  We said that the total energy should always go down \n",
    "#  (or at least never go up!).  Is this what you observe.\n",
    "#  \n",
    "#  What is the minimum energy you reach after 10 steps?\n",
    "#  Repeat the whole calculation two more times and note the\n",
    "#  minimum energy you reach after 10 steps.  Is there any difference?\n",
    "#  Much difference?\n",
    "#\n",
    "n = 10000\n",
    "k = 3\n",
    "xy = np.random.random ( [ n, 2 ] )\n",
    "\n",
    "for rep in range ( 0, 3 ):\n",
    "  print ( '' )\n",
    "  print ( 'Trial %d' % ( rep ) )\n",
    "  c = initial_clusters_random ( n, k )\n",
    "  for step in range ( 0, 10 ):\n",
    "    pop, mn, e = cluster_stats ( n, xy, c, k )\n",
    "    print ( '%2d  %f' % (step, np.sum ( e ) ) )\n",
    "    c, swaps = move_data ( n, xy, c, k, mn )\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 20,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "No point swaps!\n"
     ]
    },
    {
     "data": {
      "text/plain": [
       "[<matplotlib.lines.Line2D at 0x7f20e8045c90>]"
      ]
     },
     "execution_count": 20,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "data": {
      "image/svg+xml": [
       "<?xml version=\"1.0\" encoding=\"utf-8\" standalone=\"no\"?>\n",
       "<!DOCTYPE svg PUBLIC \"-//W3C//DTD SVG 1.1//EN\"\n",
       "  \"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd\">\n",
       "<!-- Created with matplotlib (http://matplotlib.org/) -->\n",
       "<svg height=\"256pt\" version=\"1.1\" viewBox=\"0 0 385 256\" width=\"385pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n",
       " <defs>\n",
       "  <style type=\"text/css\">\n",
       "*{stroke-linecap:butt;stroke-linejoin:round;stroke-miterlimit:100000;}\n",
       "  </style>\n",
       " </defs>\n",
       " <g id=\"figure_1\">\n",
       "  <g id=\"patch_1\">\n",
       "   <path d=\"M 0 256.117188 \n",
       "L 385.0125 256.117188 \n",
       "L 385.0125 0 \n",
       "L 0 0 \n",
       "L 0 256.117188 \n",
       "z\n",
       "\" style=\"fill:none;\"/>\n",
       "  </g>\n",
       "  <g id=\"axes_1\">\n",
       "   <g id=\"patch_2\">\n",
       "    <path d=\"M 36.65 235.239062 \n",
       "L 371.45 235.239062 \n",
       "L 371.45 12.039062 \n",
       "L 36.65 12.039062 \n",
       "z\n",
       "\" style=\"fill:#ffffff;\"/>\n",
       "   </g>\n",
       "   <g id=\"line2d_1\">\n",
       "    <path clip-path=\"url(#pd07ba94dd4)\" d=\"M 36.65 29.938214 \n",
       "L 47.81 192.946577 \n",
       "L 58.97 205.582429 \n",
       "L 70.13 209.208392 \n",
       "L 81.29 211.004134 \n",
       "L 92.45 211.871171 \n",
       "L 103.61 212.538347 \n",
       "L 114.77 213.120929 \n",
       "L 125.93 213.5459 \n",
       "L 137.09 213.822431 \n",
       "L 148.25 213.966988 \n",
       "L 159.41 214.037768 \n",
       "L 170.57 214.083073 \n",
       "L 181.73 214.119327 \n",
       "L 192.89 214.146338 \n",
       "L 204.05 214.161318 \n",
       "L 215.21 214.170039 \n",
       "L 226.37 214.174418 \n",
       "L 237.53 214.176682 \n",
       "L 248.69 214.17706 \n",
       "L 259.85 214.177203 \n",
       "L 271.01 214.177386 \n",
       "L 282.17 214.177593 \n",
       "L 293.33 214.177637 \n",
       "L 304.49 214.177694 \n",
       "L 315.65 214.177748 \n",
       "L 326.81 214.177848 \n",
       "L 337.97 214.177933 \n",
       "L 349.13 214.178012 \n",
       "L 360.29 214.178056 \n",
       "L 371.45 214.178067 \n",
       "\" style=\"fill:none;stroke:#0000ff;stroke-linecap:square;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_3\">\n",
       "    <path d=\"M 36.65 12.039062 \n",
       "L 371.45 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_4\">\n",
       "    <path d=\"M 371.45 235.239062 \n",
       "L 371.45 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_5\">\n",
       "    <path d=\"M 36.65 235.239062 \n",
       "L 371.45 235.239062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"patch_6\">\n",
       "    <path d=\"M 36.65 235.239062 \n",
       "L 36.65 12.039062 \n",
       "\" style=\"fill:none;stroke:#000000;stroke-linecap:square;stroke-linejoin:miter;\"/>\n",
       "   </g>\n",
       "   <g id=\"matplotlib.axis_1\">\n",
       "    <g id=\"xtick_1\">\n",
       "     <g id=\"line2d_2\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 0 -4 \n",
       "\" id=\"m26082b9a89\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_3\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 0 4 \n",
       "\" id=\"m613319be21\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_1\">\n",
       "      <!-- 0 -->\n",
       "      <defs>\n",
       "       <path d=\"M 31.78125 66.40625 \n",
       "Q 24.171875 66.40625 20.328125 58.90625 \n",
       "Q 16.5 51.421875 16.5 36.375 \n",
       "Q 16.5 21.390625 20.328125 13.890625 \n",
       "Q 24.171875 6.390625 31.78125 6.390625 \n",
       "Q 39.453125 6.390625 43.28125 13.890625 \n",
       "Q 47.125 21.390625 47.125 36.375 \n",
       "Q 47.125 51.421875 43.28125 58.90625 \n",
       "Q 39.453125 66.40625 31.78125 66.40625 \n",
       "M 31.78125 74.21875 \n",
       "Q 44.046875 74.21875 50.515625 64.515625 \n",
       "Q 56.984375 54.828125 56.984375 36.375 \n",
       "Q 56.984375 17.96875 50.515625 8.265625 \n",
       "Q 44.046875 -1.421875 31.78125 -1.421875 \n",
       "Q 19.53125 -1.421875 13.0625 8.265625 \n",
       "Q 6.59375 17.96875 6.59375 36.375 \n",
       "Q 6.59375 54.828125 13.0625 64.515625 \n",
       "Q 19.53125 74.21875 31.78125 74.21875 \n",
       "\" id=\"BitstreamVeraSans-Roman-30\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(33.46875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_2\">\n",
       "     <g id=\"line2d_4\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"92.45\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_5\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"92.45\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_2\">\n",
       "      <!-- 5 -->\n",
       "      <defs>\n",
       "       <path d=\"M 10.796875 72.90625 \n",
       "L 49.515625 72.90625 \n",
       "L 49.515625 64.59375 \n",
       "L 19.828125 64.59375 \n",
       "L 19.828125 46.734375 \n",
       "Q 21.96875 47.46875 24.109375 47.828125 \n",
       "Q 26.265625 48.1875 28.421875 48.1875 \n",
       "Q 40.625 48.1875 47.75 41.5 \n",
       "Q 54.890625 34.8125 54.890625 23.390625 \n",
       "Q 54.890625 11.625 47.5625 5.09375 \n",
       "Q 40.234375 -1.421875 26.90625 -1.421875 \n",
       "Q 22.3125 -1.421875 17.546875 -0.640625 \n",
       "Q 12.796875 0.140625 7.71875 1.703125 \n",
       "L 7.71875 11.625 \n",
       "Q 12.109375 9.234375 16.796875 8.0625 \n",
       "Q 21.484375 6.890625 26.703125 6.890625 \n",
       "Q 35.15625 6.890625 40.078125 11.328125 \n",
       "Q 45.015625 15.765625 45.015625 23.390625 \n",
       "Q 45.015625 31 40.078125 35.4375 \n",
       "Q 35.15625 39.890625 26.703125 39.890625 \n",
       "Q 22.75 39.890625 18.8125 39.015625 \n",
       "Q 14.890625 38.140625 10.796875 36.28125 \n",
       "z\n",
       "\" id=\"BitstreamVeraSans-Roman-35\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(89.26875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-35\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_3\">\n",
       "     <g id=\"line2d_6\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"148.25\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_7\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"148.25\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_3\">\n",
       "      <!-- 10 -->\n",
       "      <defs>\n",
       "       <path d=\"M 12.40625 8.296875 \n",
       "L 28.515625 8.296875 \n",
       "L 28.515625 63.921875 \n",
       "L 10.984375 60.40625 \n",
       "L 10.984375 69.390625 \n",
       "L 28.421875 72.90625 \n",
       "L 38.28125 72.90625 \n",
       "L 38.28125 8.296875 \n",
       "L 54.390625 8.296875 \n",
       "L 54.390625 0 \n",
       "L 12.40625 0 \n",
       "z\n",
       "\" id=\"BitstreamVeraSans-Roman-31\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(141.8875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_4\">\n",
       "     <g id=\"line2d_8\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"204.05\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_9\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"204.05\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_4\">\n",
       "      <!-- 15 -->\n",
       "      <g transform=\"translate(197.6875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-35\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_5\">\n",
       "     <g id=\"line2d_10\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"259.85\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_11\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"259.85\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_5\">\n",
       "      <!-- 20 -->\n",
       "      <defs>\n",
       "       <path d=\"M 19.1875 8.296875 \n",
       "L 53.609375 8.296875 \n",
       "L 53.609375 0 \n",
       "L 7.328125 0 \n",
       "L 7.328125 8.296875 \n",
       "Q 12.9375 14.109375 22.625 23.890625 \n",
       "Q 32.328125 33.6875 34.8125 36.53125 \n",
       "Q 39.546875 41.84375 41.421875 45.53125 \n",
       "Q 43.3125 49.21875 43.3125 52.78125 \n",
       "Q 43.3125 58.59375 39.234375 62.25 \n",
       "Q 35.15625 65.921875 28.609375 65.921875 \n",
       "Q 23.96875 65.921875 18.8125 64.3125 \n",
       "Q 13.671875 62.703125 7.8125 59.421875 \n",
       "L 7.8125 69.390625 \n",
       "Q 13.765625 71.78125 18.9375 73 \n",
       "Q 24.125 74.21875 28.421875 74.21875 \n",
       "Q 39.75 74.21875 46.484375 68.546875 \n",
       "Q 53.21875 62.890625 53.21875 53.421875 \n",
       "Q 53.21875 48.921875 51.53125 44.890625 \n",
       "Q 49.859375 40.875 45.40625 35.40625 \n",
       "Q 44.1875 33.984375 37.640625 27.21875 \n",
       "Q 31.109375 20.453125 19.1875 8.296875 \n",
       "\" id=\"BitstreamVeraSans-Roman-32\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(253.4875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_6\">\n",
       "     <g id=\"line2d_12\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"315.65\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_13\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"315.65\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_6\">\n",
       "      <!-- 25 -->\n",
       "      <g transform=\"translate(309.2875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-35\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"xtick_7\">\n",
       "     <g id=\"line2d_14\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#m26082b9a89\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_15\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#m613319be21\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_7\">\n",
       "      <!-- 30 -->\n",
       "      <defs>\n",
       "       <path d=\"M 40.578125 39.3125 \n",
       "Q 47.65625 37.796875 51.625 33 \n",
       "Q 55.609375 28.21875 55.609375 21.1875 \n",
       "Q 55.609375 10.40625 48.1875 4.484375 \n",
       "Q 40.765625 -1.421875 27.09375 -1.421875 \n",
       "Q 22.515625 -1.421875 17.65625 -0.515625 \n",
       "Q 12.796875 0.390625 7.625 2.203125 \n",
       "L 7.625 11.71875 \n",
       "Q 11.71875 9.328125 16.59375 8.109375 \n",
       "Q 21.484375 6.890625 26.8125 6.890625 \n",
       "Q 36.078125 6.890625 40.9375 10.546875 \n",
       "Q 45.796875 14.203125 45.796875 21.1875 \n",
       "Q 45.796875 27.640625 41.28125 31.265625 \n",
       "Q 36.765625 34.90625 28.71875 34.90625 \n",
       "L 20.21875 34.90625 \n",
       "L 20.21875 43.015625 \n",
       "L 29.109375 43.015625 \n",
       "Q 36.375 43.015625 40.234375 45.921875 \n",
       "Q 44.09375 48.828125 44.09375 54.296875 \n",
       "Q 44.09375 59.90625 40.109375 62.90625 \n",
       "Q 36.140625 65.921875 28.71875 65.921875 \n",
       "Q 24.65625 65.921875 20.015625 65.03125 \n",
       "Q 15.375 64.15625 9.8125 62.3125 \n",
       "L 9.8125 71.09375 \n",
       "Q 15.4375 72.65625 20.34375 73.4375 \n",
       "Q 25.25 74.21875 29.59375 74.21875 \n",
       "Q 40.828125 74.21875 47.359375 69.109375 \n",
       "Q 53.90625 64.015625 53.90625 55.328125 \n",
       "Q 53.90625 49.265625 50.4375 45.09375 \n",
       "Q 46.96875 40.921875 40.578125 39.3125 \n",
       "\" id=\"BitstreamVeraSans-Roman-33\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(365.0875 246.8375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-33\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "   </g>\n",
       "   <g id=\"matplotlib.axis_2\">\n",
       "    <g id=\"ytick_1\">\n",
       "     <g id=\"line2d_16\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L 4 0 \n",
       "\" id=\"m0282bb1dd7\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_17\">\n",
       "      <defs>\n",
       "       <path d=\"M 0 0 \n",
       "L -4 0 \n",
       "\" id=\"mfe05eeb551\" style=\"stroke:#000000;stroke-width:0.5;\"/>\n",
       "      </defs>\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"235.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_8\">\n",
       "      <!-- 0 -->\n",
       "      <g transform=\"translate(26.2875 237.9984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_2\">\n",
       "     <g id=\"line2d_18\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"210.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_19\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"210.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_9\">\n",
       "      <!-- 200 -->\n",
       "      <g transform=\"translate(13.5625 213.1984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_3\">\n",
       "     <g id=\"line2d_20\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"185.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_21\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"185.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_10\">\n",
       "      <!-- 400 -->\n",
       "      <defs>\n",
       "       <path d=\"M 37.796875 64.3125 \n",
       "L 12.890625 25.390625 \n",
       "L 37.796875 25.390625 \n",
       "z\n",
       "M 35.203125 72.90625 \n",
       "L 47.609375 72.90625 \n",
       "L 47.609375 25.390625 \n",
       "L 58.015625 25.390625 \n",
       "L 58.015625 17.1875 \n",
       "L 47.609375 17.1875 \n",
       "L 47.609375 0 \n",
       "L 37.796875 0 \n",
       "L 37.796875 17.1875 \n",
       "L 4.890625 17.1875 \n",
       "L 4.890625 26.703125 \n",
       "z\n",
       "\" id=\"BitstreamVeraSans-Roman-34\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(13.5625 188.3984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-34\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_4\">\n",
       "     <g id=\"line2d_22\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"160.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_23\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"160.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_11\">\n",
       "      <!-- 600 -->\n",
       "      <defs>\n",
       "       <path d=\"M 33.015625 40.375 \n",
       "Q 26.375 40.375 22.484375 35.828125 \n",
       "Q 18.609375 31.296875 18.609375 23.390625 \n",
       "Q 18.609375 15.53125 22.484375 10.953125 \n",
       "Q 26.375 6.390625 33.015625 6.390625 \n",
       "Q 39.65625 6.390625 43.53125 10.953125 \n",
       "Q 47.40625 15.53125 47.40625 23.390625 \n",
       "Q 47.40625 31.296875 43.53125 35.828125 \n",
       "Q 39.65625 40.375 33.015625 40.375 \n",
       "M 52.59375 71.296875 \n",
       "L 52.59375 62.3125 \n",
       "Q 48.875 64.0625 45.09375 64.984375 \n",
       "Q 41.3125 65.921875 37.59375 65.921875 \n",
       "Q 27.828125 65.921875 22.671875 59.328125 \n",
       "Q 17.53125 52.734375 16.796875 39.40625 \n",
       "Q 19.671875 43.65625 24.015625 45.921875 \n",
       "Q 28.375 48.1875 33.59375 48.1875 \n",
       "Q 44.578125 48.1875 50.953125 41.515625 \n",
       "Q 57.328125 34.859375 57.328125 23.390625 \n",
       "Q 57.328125 12.15625 50.6875 5.359375 \n",
       "Q 44.046875 -1.421875 33.015625 -1.421875 \n",
       "Q 20.359375 -1.421875 13.671875 8.265625 \n",
       "Q 6.984375 17.96875 6.984375 36.375 \n",
       "Q 6.984375 53.65625 15.1875 63.9375 \n",
       "Q 23.390625 74.21875 37.203125 74.21875 \n",
       "Q 40.921875 74.21875 44.703125 73.484375 \n",
       "Q 48.484375 72.75 52.59375 71.296875 \n",
       "\" id=\"BitstreamVeraSans-Roman-36\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(13.5625 163.5984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-36\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_5\">\n",
       "     <g id=\"line2d_24\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"136.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_25\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"136.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_12\">\n",
       "      <!-- 800 -->\n",
       "      <defs>\n",
       "       <path d=\"M 31.78125 34.625 \n",
       "Q 24.75 34.625 20.71875 30.859375 \n",
       "Q 16.703125 27.09375 16.703125 20.515625 \n",
       "Q 16.703125 13.921875 20.71875 10.15625 \n",
       "Q 24.75 6.390625 31.78125 6.390625 \n",
       "Q 38.8125 6.390625 42.859375 10.171875 \n",
       "Q 46.921875 13.96875 46.921875 20.515625 \n",
       "Q 46.921875 27.09375 42.890625 30.859375 \n",
       "Q 38.875 34.625 31.78125 34.625 \n",
       "M 21.921875 38.8125 \n",
       "Q 15.578125 40.375 12.03125 44.71875 \n",
       "Q 8.5 49.078125 8.5 55.328125 \n",
       "Q 8.5 64.0625 14.71875 69.140625 \n",
       "Q 20.953125 74.21875 31.78125 74.21875 \n",
       "Q 42.671875 74.21875 48.875 69.140625 \n",
       "Q 55.078125 64.0625 55.078125 55.328125 \n",
       "Q 55.078125 49.078125 51.53125 44.71875 \n",
       "Q 48 40.375 41.703125 38.8125 \n",
       "Q 48.828125 37.15625 52.796875 32.3125 \n",
       "Q 56.78125 27.484375 56.78125 20.515625 \n",
       "Q 56.78125 9.90625 50.3125 4.234375 \n",
       "Q 43.84375 -1.421875 31.78125 -1.421875 \n",
       "Q 19.734375 -1.421875 13.25 4.234375 \n",
       "Q 6.78125 9.90625 6.78125 20.515625 \n",
       "Q 6.78125 27.484375 10.78125 32.3125 \n",
       "Q 14.796875 37.15625 21.921875 38.8125 \n",
       "M 18.3125 54.390625 \n",
       "Q 18.3125 48.734375 21.84375 45.5625 \n",
       "Q 25.390625 42.390625 31.78125 42.390625 \n",
       "Q 38.140625 42.390625 41.71875 45.5625 \n",
       "Q 45.3125 48.734375 45.3125 54.390625 \n",
       "Q 45.3125 60.0625 41.71875 63.234375 \n",
       "Q 38.140625 66.40625 31.78125 66.40625 \n",
       "Q 25.390625 66.40625 21.84375 63.234375 \n",
       "Q 18.3125 60.0625 18.3125 54.390625 \n",
       "\" id=\"BitstreamVeraSans-Roman-38\"/>\n",
       "      </defs>\n",
       "      <g transform=\"translate(13.5625 138.7984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-38\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_6\">\n",
       "     <g id=\"line2d_26\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"111.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_27\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"111.2390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_13\">\n",
       "      <!-- 1000 -->\n",
       "      <g transform=\"translate(7.2 113.9984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"190.869140625\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_7\">\n",
       "     <g id=\"line2d_28\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"86.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_29\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"86.4390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_14\">\n",
       "      <!-- 1200 -->\n",
       "      <g transform=\"translate(7.2 89.1984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-32\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"190.869140625\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_8\">\n",
       "     <g id=\"line2d_30\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"61.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_31\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"61.6390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_15\">\n",
       "      <!-- 1400 -->\n",
       "      <g transform=\"translate(7.2 64.3984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-34\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"190.869140625\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_9\">\n",
       "     <g id=\"line2d_32\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"36.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_33\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"36.8390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_16\">\n",
       "      <!-- 1600 -->\n",
       "      <g transform=\"translate(7.2 39.5984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-36\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"190.869140625\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "    <g id=\"ytick_10\">\n",
       "     <g id=\"line2d_34\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"36.65\" xlink:href=\"#m0282bb1dd7\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"line2d_35\">\n",
       "      <g>\n",
       "       <use style=\"stroke:#000000;stroke-width:0.5;\" x=\"371.45\" xlink:href=\"#mfe05eeb551\" y=\"12.0390625\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "     <g id=\"text_17\">\n",
       "      <!-- 1800 -->\n",
       "      <g transform=\"translate(7.2 14.7984375)scale(0.1 -0.1)\">\n",
       "       <use xlink:href=\"#BitstreamVeraSans-Roman-31\"/>\n",
       "       <use x=\"63.623046875\" xlink:href=\"#BitstreamVeraSans-Roman-38\"/>\n",
       "       <use x=\"127.24609375\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "       <use x=\"190.869140625\" xlink:href=\"#BitstreamVeraSans-Roman-30\"/>\n",
       "      </g>\n",
       "     </g>\n",
       "    </g>\n",
       "   </g>\n",
       "  </g>\n",
       " </g>\n",
       " <defs>\n",
       "  <clipPath id=\"pd07ba94dd4\">\n",
       "   <rect height=\"223.2\" width=\"334.8\" x=\"36.65\" y=\"12.0390625\"/>\n",
       "  </clipPath>\n",
       " </defs>\n",
       "</svg>\n"
      ],
      "text/plain": [
       "<matplotlib.figure.Figure at 0x7f20e84b3b10>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "#  Look at a larger, random data set.\n",
    "#  Watch what happens to energy as the iterations proceed.\n",
    "#\n",
    "n = 10000\n",
    "k = 10\n",
    "xy = np.random.random ( [ n, 2 ] )\n",
    "c = initial_clusters_random ( n, k )\n",
    "ivec = []\n",
    "evec = []\n",
    "for i in range ( 0, 100 ):\n",
    "  pop, mn, e = cluster_stats ( n, xy, c, k )\n",
    "  ivec.append ( i )\n",
    "  evec.append ( sum ( e ) )\n",
    "  c, swaps = move_data ( n, xy, c, k, mn )\n",
    "  if ( swaps == 0 ):\n",
    "    print ( 'No point swaps!')\n",
    "    break\n",
    "    \n",
    "plt.plot ( ivec, evec )\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "collapsed": true
   },
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "anaconda-cloud": {},
  "kernelspec": {
   "display_name": "Python 2",
   "language": "python",
   "name": "python2"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 2
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython2",
   "version": "2.7.8"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 0
}