TensorFlow教程Softmax逻辑回归识别手写数字MNIST数据集

Meta ·

更新时间:2024-09-20

· 1353 次阅读

基于MNIST数据集的逻辑回归模型做十分类任务

没有隐含层的Softmax Regression只能直接从图像的像素点推断是哪个数字，而没有特征抽象的过程。多层神经网络依靠隐含层，则可以组合出高阶特征，比如横线、竖线、圆圈等，之后可以将这些高阶特征或者说组件再组合成数字，就能实现精准的匹配和分类。


import tensorflow as tf
import numpy as np
import input_data
print('Download and Extract MNIST dataset')
mnist = input_data.read_data_sets('data/', one_hot=True) # one_hot=True意思是编码格式为01编码
print("tpye of 'mnist' is %s" % (type(mnist)))
print("number of train data is %d" % (mnist.train.num_examples))
print("number of test data is %d" % (mnist.test.num_examples))
trainimg = mnist.train.images
trainlabel = mnist.train.labels
testimg = mnist.test.images
testlabel = mnist.test.labels
print("MNIST loaded")
"""
print("type of 'trainimg' is %s"    % (type(trainimg)))
print("type of 'trainlabel' is %s"  % (type(trainlabel)))
print("type of 'testimg' is %s"     % (type(testimg)))
print("type of 'testlabel' is %s"   % (type(testlabel)))
print("------------------------------------------------")
print("shape of 'trainimg' is %s"   % (trainimg.shape,))
print("shape of 'trainlabel' is %s" % (trainlabel.shape,))
print("shape of 'testimg' is %s"    % (testimg.shape,))
print("shape of 'testlabel' is %s"  % (testlabel.shape,))
"""
x = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.float32, [None, 10]) # None is for infinite
w = tf.Variable(tf.zeros([784, 10])) # 为了方便直接用0初始化，可以高斯初始化
b = tf.Variable(tf.zeros([10])) # 10分类的任务，10种label，所以只需要初始化10个b
pred = tf.nn.softmax(tf.matmul(x, w) + b) # 前向传播的预测值
cost = tf.reduce_mean(-tf.reduce_sum(y*tf.log(pred), reduction_indices=[1])) # 交叉熵损失函数
optm = tf.train.GradientDescentOptimizer(0.01).minimize(cost)
corr = tf.equal(tf.argmax(pred, 1), tf.argmax(y, 1)) # tf.equal()对比预测值的索引和真实label的索引是否一样，一样返回True，不一样返回False
accr = tf.reduce_mean(tf.cast(corr, tf.float32))
init = tf.global_variables_initializer() # 全局参数初始化器
training_epochs = 100 # 所有样本迭代100次
batch_size = 100 # 每进行一次迭代选择100个样本
display_step = 5
# SESSION
sess = tf.Session() # 定义一个Session
sess.run(init) # 在sess里run一下初始化操作
# MINI-BATCH LEARNING
for epoch in range(training_epochs): # 每一个epoch进行循环
    avg_cost = 0. # 刚开始损失值定义为0
    num_batch = int(mnist.train.num_examples/batch_size)
    for i in range(num_batch): # 每一个batch进行选择
        batch_xs, batch_ys = mnist.train.next_batch(batch_size) # 通过next_batch()就可以一个一个batch的拿数据，
        sess.run(optm, feed_dict={x: batch_xs, y: batch_ys}) # run一下用梯度下降进行求解，通过placeholder把x，y传进来
        avg_cost += sess.run(cost, feed_dict={x: batch_xs, y:batch_ys})/num_batch
    # DISPLAY
    if epoch % display_step == 0: # display_step之前定义为5，这里每5个epoch打印一下
        train_acc = sess.run(accr, feed_dict={x: batch_xs, y:batch_ys})
        test_acc = sess.run(accr, feed_dict={x: mnist.test.images, y: mnist.test.labels})
        print("Epoch: %03d/%03d cost: %.9f TRAIN ACCURACY: %.3f TEST ACCURACY: %.3f"
              % (epoch, training_epochs, avg_cost, train_acc, test_acc))
print("DONE")

迭代100次跑一下模型，最终，在测试集上可以达到92.2%的准确率，虽然还不错，但是还达不到实用的程度。手写数字的识别的主要应用场景是识别银行支票，如果准确率不够高，可能会引起严重的后果。


Epoch: 095/100 loss: 0.283259882 train_acc: 0.940 test_acc: 0.922

插一些知识点，关于tensorflow中一些函数的用法


sess = tf.InteractiveSession()
arr = np.array([[31, 23,  4, 24, 27, 34],
                [18,  3, 25,  0,  6, 35],
                [28, 14, 33, 22, 30,  8],
                [13, 30, 21, 19,  7,  9],
                [16,  1, 26, 32,  2, 29],
                [17, 12,  5, 11, 10, 15]])


在tensorflow中打印要用.eval()
tf.rank(arr).eval() # 打印矩阵arr的维度
tf.shape(arr).eval() # 打印矩阵arr的大小
tf.argmax(arr, 0).eval() # 打印最大值的索引，参数0为按列求索引，1为按行求索引

以上就是TensorFlow教程Softmax逻辑回归识别手写数字MNIST数据集的详细内容，更多关于Softmax逻辑回归MNIST数据集手写识别的资料请关注软件开发网其它相关文章！

回归 mnist softmax tensorflow

1024 个赞

需要登录后方可回复, 如果你还没有账号请注册新账号

探索PowerShell(一) 初识 PowerShell

Maleah 2021-05-23

828

css实现流程导航效果(三种方法)

Nimat 2021-01-25

942

使用go xorm来操作mysql的方法实例

Vala 2021-02-11

794

CSS未知高度垂直居中的实现

Karima 2020-11-22

556

Tensorflow2.1实现文本中情感分类实现解析

Rose 2022-11-20

1182

Tensorflow2.1完成对MPG回归预测详解

Querida 2022-11-20

696

Tensorflow2.4从头训练Word Embedding实现文本分类

Serafina 2023-01-06

1212

Tensorflow2.4搭建单层和多层Bi-LSTM模型

Kathy 2023-01-06

深度学习Tensorflow 2.4 完成迁移学习和模型微调

Tani 2023-01-06

1085

Python利用keras接口实现深度神经网络回归

Tina 2023-02-18

418

Python基于TensorFlow接口实现深度学习神经网络回归

Lark 2023-02-18

364

Matlab利用随机森林(RF)算法实现回归预测详解

Tesia 2023-02-18

326

Python实现随机森林回归与各自变量重要性分析与排序

Dulcea 2023-02-20

1012

tensorflow1.x和tensorflow2.x中的tensor转换为字符串的实现

Olivia 2023-02-25

1309

基于Matlab实现人工神经网络(ANN)回归的示例详解

Tallulah 2023-02-26

1622

tensorflow基于Anaconda环境搭建的方法步骤

Oria 2023-02-28

278

Anaconda中安装Tensorflow的过程

Psyche 2023-03-31

506

Python实现softmax反向传播的示例代码

Olivia 2023-04-10

286

使用Python、TensorFlow和Keras来进行垃圾分类的操作方法

Laila 2023-05-12

349

tensorflow之如何使用GPU而不是CPU问题

Ida 2023-05-13

644

我要提问

致谢

帮助他人，成就自己。

人生最大成功就是伸出热情而温暖的双手，尽自己所能去帮助身边的每一个人，只要无私的奉献，就会收获到美好的生活。

1024问感谢每一位朋友的帮助和支持。

软件开发网提供编程的基础软件技术培训教程,软件开发编程实例讲解Go,Node,HTML,CSS,Javascript,Python,Java,Ruby,C,PHP,MySQL等软件开发编程语言以及数据开发的基础知识，也提供大量的软件开发在线实例、从入门到精通就在1024问。

育儿网微养生全球行美食街育儿菜谱大全海南旅游女性养狗百科星座