KNN实现图像分类

2018-10-23 | 阅读：次

数据集

http://www.cs.toronto.edu/~kriz/cifar.html

Baseline results

You can find some baseline replicable results on this dataset on the project page for cuda-convnet. These results were obtained with a convolutional neural network. Briefly, they are 18% test error without data augmentation and 11% with. Additionally, Jasper Snoek has a new paper in which he used Bayesian hyperparameter optimization to find nice settings of the weight decay and other hyperparameters, which allowed him to obtain a test error rate of 15% (without data augmentation) using the architecture of the net that got 18%.

您可以在cuda-convnet的项目页面上找到此数据集上的一些基线可复制结果。这些结果是用卷积神经网络获得的。简而言之，它们是18％的测试错误，没有数据增加，11％有。此外，Jasper Snoek有一篇新论文，他使用贝叶斯超参数优化来找到重量衰减和其他超参数的好设置，这使他能够使用网络的体系结构获得15％的测试错误率（无数据增加）得到了18％。

Dataset layout布局

Python / Matlab versions

I will describe the layout of the Python version of the dataset. The layout of the Matlab version is identical.

The archive contains the files data_batch_1, data_batch_2, …, data_batch_5, as well as test_batch

. Each of these files is a Python “pickled” object produced with cPickle 文件中的每一个都是使用cPickle生成的Python”pickled”对象

. Here is a python2 routine例程 which will open such a file and return a dictionary:将会打开这样一个文件，并且返回一个字典

def unpickle(file):
    import cPickle
    with open(file, 'rb') as fo:
        dict = cPickle.load(fo)
    return dict

And a python3 version:

def unpickle(file):
    import pickle
    with open(file, 'rb') as fo:
        dict = pickle.load(fo, encoding='bytes')
    return dict

Loaded in this way以这种方式加载, each of the batch files contains a dictionary with the following elements:

data – a 10000x3072 numpy array of uint8s. Each row of the array stores a 32x32 colour image. The first 1024 entries条目 contain the red channel values, the next 1024 the green, and the final 1024 the blue. The image is stored in row-major order图像以主顺序存储, so that the first 32 entries of the array are the red channel values of the first row of the image.数组的前32个条目是图像第一行的红色通道数
labels – a list of 10000 numbers in the range 0-9. The number at index i indicates the label of the ith image in the array data. 0-9范围内的10000个数字列表。索引i处的数字表示阵列数据中第i个图像的标签。

The dataset contains another file, called batches.meta

. It too contains a Python dictionary object. It has the following entries:

label_names – 十维元素列表-a 10-element list which gives meaningful names to the numeric labels in the labels array described above. For example, label_names[0] == “airplane”, label_names[1] == “automobile”, etc.

stone

Linxia Yao

KNN实现图像分类

数据集

Baseline results

Dataset layout布局

Python / Matlab versions