tensorflow-gpu、CUDA、cuDNN版本对应

发表于 2021-06-10 更新于 2022-01-19 分类于实用配置

版本	Python 版本	编译器	构建工具	cuDNN	CUDA
tensorflow_gpu-2.4.0	3.6-3.8	MSVC 2019	Bazel 3.1.0	8.0	11.0
tensorflow_gpu-2.3.0	3.5-3.8	MSVC 2019	Bazel 3.1.0	7.6	10.1
tensorflow_gpu-2.2.0	3.5-3.8	MSVC 2019	Bazel 2.0.0	7.6	10.1
tensorflow_gpu-2.1.0	3.5-3.7	MSVC 2019	Bazel 0.27.1-0.29.1	7.6	10.1
tensorflow_gpu-2.0.0	3.5-3.7	MSVC 2017	Bazel 0.26.1	7.4	10
tensorflow_gpu-1.15.0	3.5-3.7	MSVC 2017	Bazel 0.26.1	7.4	10
tensorflow_gpu-1.14.0	3.5-3.7	MSVC 2017	Bazel 0.24.1-0.25.2	7.4	10
tensorflow_gpu-1.13.0	3.5-3.7	MSVC 2015 update 3	Bazel 0.19.0-0.21.0	7.4	10
tensorflow_gpu-1.12.0	3.5-3.6	MSVC 2015 update 3	Bazel 0.15.0	7.2	9.0
tensorflow_gpu-1.11.0	3.5-3.6	MSVC 2015 update 3	Bazel 0.15.0	7	9
tensorflow_gpu-1.10.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.9.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.8.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.7.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.6.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.5.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	7	9
tensorflow_gpu-1.4.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	6	8
tensorflow_gpu-1.3.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	6	8
tensorflow_gpu-1.2.0	3.5-3.6	MSVC 2015 update 3	Cmake v3.6.3	5.1	8
tensorflow_gpu-1.1.0	3.5	MSVC 2015 update 3	Cmake v3.6.3	5.1	8
tensorflow_gpu-1.0.0	3.5	MSVC 2015 update 3	Cmake v3.6.3	5.1	8

tensorflow-gpu单离线包下载地址清华镜像
 tensorflow-gpu单离线包下载地址豆瓣镜像
 N卡驱动下载地址
 CUDA下载地址
 cuDNN下载地址（需登录N卡官网）
安装11版本的CUDA可能存在缺失的dll合集（bug）访问码:2vmn

简单的KNN实现

发表于 2021-06-01 分类于机器学习

KNN简介：

k-Nearest Neighbor:kNN 即k近邻算法
分类问题：对新的样本，根据其k个最近邻的训练样本的类别，通过多数表决等方式进行预测。
回归问题：对新的样本，根据其k个最近邻的训练样本标签值的均值作为预测值。
优缺点：

k近邻模型具有非常高的容量，这使得它在训练样本数量较大时能获得较高的精度

计算成本很高

在训练集较小时，泛化能力很差，非常容易陷入过拟合

无法判断特征的重要性

数据集采用阿里巴巴天池项目中的“渔船轨迹数据”
本数据集为h5数据，运行代码目录结构如下

代码如下：

import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
import pickle

data = pd.read_hdf("data/hy_round1_train_20200102.h5")
print(data.shape)
print(data.columns)
# 校验是否存在NaN
# print(np.all(pd.notnull(data)))

# 载入数据集
data = np.array(data)
# 量化标签
# print(np.where(data[:,-1] == "刺网"))
data[np.where(data[:,-1] == "刺网"),-1] = 0
data[np.where(data[:,-1] == "围网"),-1] = 1
data[np.where(data[:,-1] == "拖网"),-1] = 2
x_train, x_test, y_train, y_test = train_test_split(data[:,:-2],data[:,-1],test_size=0.25,random_state=42)
# 标准化（归一化存在离群点影响弊端）
transfer = StandardScaler()
x_train = transfer.fit_transform(x_train)
x_test = transfer.fit_transform(x_test)
y_train = y_train.astype("int")
y_test = y_test.astype("int")
print(y_train)

# 模型训练
estimator = KNeighborsClassifier(n_neighbors=5)
estimator.fit(x_train,y_train)

# 保存模型
f = open("model/hy_knn_model.pickle","wb")
pickle.dump(estimator, f)

# 模型评估
# 预测值
y_predict = estimator.predict(x_test)
print("模型预测结果：\n", y_predict)
print("真实值对比结果：\n", y_test == y_predict)

# 准确率
score = estimator.score(x_test,y_test)
print("模型准确率：",score)

问题描述：

windows环境，python3，正确使用matplotlib绘图后存在的中文显示异常问题

解决方案：

1.找到当前python环境下如下文件

1	lib\\site-packages\\matplotlib\\mpl-data\\matplotlibrc

2.采用notepad++（文本编辑器）打开matplotlibrc

ctrl+f 搜索 font.family

根据定位寻找到”font.family“和”font.sans-serif“字段

取消注释或直接改成如下形式：

1 2	font.family: sans-serif font.sans-serif: SimHei,Bitstream Vera Sans, Lucida Grande, Verdana, Geneva, Lucid, Arial, Helvetica, Avant Garde, sans-serif

如果存在负号显示异常则可以在添加一行
axes.unicode_minus:False
保存文件，重新加载python环境

Redis安装踩坑

发表于 2021-03-21 更新于 2022-10-12 分类于解决方案

本章记录：Redis6.0.6安装于Ubuntu中遇到的坑

注：使用前记得apt-get update一下

报错1：

1 2	python@python:/usr/local/redis$ sudo make sudo: make: command not found

==解决方案：==

sudo apt-get install make

报错2：

make[3]: cc: Command not found
make[3]: *** [Makefile:201: net.o] Error 127
make[3]: Leaving directory '/usr/local/redis/deps/hiredis'
make[2]: *** [Makefile:50: hiredis] Error 2
make[2]: Leaving directory '/usr/local/redis/deps'
make[1]: [Makefile:264: persist-settings] Error 2 (ignored)
    CC adlist.o
/bin/sh: 1: cc: not found
make[1]: *** [Makefile:315: adlist.o] Error 127
make[1]: Leaving directory '/usr/local/redis/src'
make: *** [Makefile:6: all] Error 2

解决方案：

sudo apt-get install gcc

报错3：

cc: error: ../deps/hiredis/libhiredis.a: No such file or directory
cc: error: ../deps/lua/src/liblua.a: No such file or directory
make[1]: *** [Makefile:283: redis-server] Error 1
make[1]: Leaving directory '/usr/local/redis/src'
make: *** [Makefile:6: all] Error 2

解决方案：

cd deps/
make lua hiredis linenoise
cd ..
sudo make

报错4：

You need tcl 8.5 or newer in order to run the Redis test

解决方案：

tcl8.6.1-src.tar.gz

tar zxvf tcl8.6.1-src.tar.gz
cd tcl8.6.1/unix/
./configure
make
sudo make install

Leetcode-2. 两数相加

发表于 2020-09-24 分类于算法详解

LeetCode链表问题：

输入：(2 -> 4 -> 3) + (5 -> 6 -> 4) 两个已知链表对象ListNode l1, ListNode l2
输出：7 -> 0 -> 8
原因：342 + 465 = 807

1
2
3

graph LR
1((2)) --> 2((4)) --> 3((3))
4((5))--> 5((6)) --> 6((4))

1 2	graph LR 7((7)) --> 8((0)) --> 9((7))

备注：一开始是真没看懂题目。。。后来发现是反向存储的链表结构，又发现不知道ListNode这个类，，万事开头难，看着大家的方案题解，也琢磨了一份，并给出了详细的注释，算是留个笔记吧，继续努力！

/**
 * Definition for singly-linked list.
 * public class ListNode {
 *     int val;
 *     ListNode next;
 *     ListNode(int x) { val = x; }
 * }
 */
class Solution {
    public ListNode addTwoNumbers(ListNode l1, ListNode l2) {
        //记录结果，用于最后返回
        ListNode root = new ListNode();
        //存储过程中的游标
        ListNode cursor = root;
        //保存进位
        int carry = 0;
        
        //计算共同长度的部分
        while(l1 != null && l2 != null){
            int sumVal = l1.val + l2.val + carry;
            cursor.next = new ListNode(sumVal % 10);
            //计算进位
            carry = sumVal / 10;
 
            //下一节点
            l1 = l1.next;
            l2 = l2.next;
            cursor = cursor.next;
        }
 
        //根据剩余长度构建存储节点
        ListNode overplus = null;
        overplus = l1 != null ? l1 : l2;
 
        while(overplus != null){
            int sumVal = overplus.val + carry;
            cursor.next = new ListNode(sumVal % 10);
            //计算进位
            carry = sumVal / 10;
 
            //下一节点
            overplus = overplus.next;
            cursor = cursor.next;
        }
 
        //判断是否有进位残留
        if(carry != 0){
            cursor.next = new ListNode(carry);
        }
 
        return root.next;
    }
}

拉取github项目后Visual Studio 2017 报错system未定义识别符

发表于 2020-08-08 分类于解决方案

问题描述

我在实验室的电脑上的VS2017上编写好了相关代码，为了能够在自己电脑上也能同步编写，我配置好了github进行版本控制，采用推送和拉取进行项目同步，但是在我笔记本上拉取后程序却报出了未定义识别符的错误。

问题抽象

1.VS的集成编程环境

2.非本机构建的C++项目

3.程序本身没有问题，但拉取克隆到本机后出现了“未定义识别符”错误

解决方案

其实通过上面的问题抽象我已经能够推断问题原因了，出在了非本机项目的问题上，说明是环境不同所致，所以解决方案如下：

右键解决方案 >> 属性 >> 常规 >> Windows SDK 版本

换到本机安装的对应版本即可！

收工！

用keras搭建一个简单的一维卷积神经网络

发表于 2019-12-16 分类于机器学习

编程环境

python 3.6.8

tensorflow 1.12.3 离线包

matplotlib 3.1.2

numpy 1.17.4

数据集说明

我所采用的数据集，是我自己构建的一个网络流量数据集，借鉴了Wei Wang等人端到端的思想，

但是处理成的数据集却不同于他们的MNIST型数据集，而是采用的npy进行存储。

由于只是用于测试模型搭建，该数据集仅包含了一部分数据（Chat流量），

原数据来源于加拿大网络安全研究所的公开数据集(ISCX2016)

模型代码

训练模型部分：训练模型部分：

from tensorflow import keras
from tensorflow.python.keras.layers import Dense, Dropout, Activation, Flatten, Conv1D, MaxPool1D
import matplotlib.pyplot as plt
import numpy as np
import os
 
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'
 
# 数据集路径
dataset_path = 'dataset.npy'
# 接入softmax的全连接层维数
dense_num = 6
# 保存的模型文件路径
model_file = 'model/cnn_6traffic_model.h5'

# 数据集存储结构
# [training_images, training_labels, validation_images, validation_labels, testing_images, testing_labels]
data = np.load(dataset_path, allow_pickle=True)
 
x_train, y_train, x_test, y_test = np.array(data[0]), np.array(data[1]), np.array(data[2]), np.array(data[3])
print(x_train.shape, y_train.shape)
 
# 一维化
X_train = x_train.reshape(-1, 784, 1)
# print(X_train)
# X_train = X_train.astype('float32')
X_test = x_test.reshape(-1, 784, 1)
# X_test = X_test.astype('float32')
 
 
# 将像素值做归一化，也就是从0~255的取值压缩到0~1之间
# X_train /= 255
# X_test /= 255
 
# 构建模型
model = keras.models.Sequential()
 
# 卷积层1 + relu
# 25 卷积核的数量 即输出的维度
# 3 每个过滤器的长度
model.add(Conv1D(32, 3, activation='relu', input_shape=(784, 1), padding="same"))
# 池化层1
model.add(MaxPool1D(pool_size=3, strides=3))
 
# 卷积层2 + relu
model.add(Conv1D(64, 3, strides=1, activation='relu', padding='same'))
# 池化层2
model.add(MaxPool1D(pool_size=3, strides=3))
 
# 神经元随机失活
model.add(Dropout(0.25))
# 拉成一维数据
model.add(Flatten())
# 全连接层1
model.add(Dense(1024))
# 激活层
model.add(Activation('relu'))
 
# 随机失活
model.add(Dropout(0.4))
# 全连接层2
model.add(Dense(dense_num))
# Softmax评分
model.add(Activation('softmax'))
 
# 查看定义的模型
model.summary()
 
# 自定义优化器参数
# rmsprop = keras.optimizers.RMSprop(lr=0.001, rho=0.9, epsilon=1e-08, decay=0.0)
 
# lr表示学习速率
# decay是学习速率的衰减系数(每个epoch衰减一次)
# momentum表示动量项
# Nesterov的值是False或者True，表示使不使用Nesterov momentum
sgd = keras.optimizers.SGD(lr=0.01, decay=1e-4, momentum=0.9, nesterov=True)
 
model.compile(optimizer='sgd', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
 
# 训练
history = model.fit(X_train, y_train, epochs=10, batch_size=1000,
                    verbose=1, validation_data=[X_test, y_test])
 
model.save(model_file)
print(history.params)

注：神经网络初涉，有啥问题请直接指出，谢谢！有流量识别领域的小伙伴欢迎打扰！相互交流！

说明：鉴于很多人问我数据集的问题，但写这个文章时所用的仅有“Chat”的流量的数据集我已经删除了，所以我在这里提供了包含有我已处理好的六类网络流量的npy数据集，有需要的自取天翼云盘地址(访问码:hp8m)，鉴于之前的数据集是二分类的，但我提供的数据集的六个标签，所以代码中需要做出相应修改，我已将修改后的代码附上了。

线性回归的推导与简单的一元线性回归代码实现

发表于 2019-12-16 更新于 2022-10-19 分类于算法详解

直入正题，一元线性回归就是一次函数，即 $y=kx+b$

在线性函数中，$x$就是自变量，即模型需要输入的数据，$y$就是因变量，即我们需要预测的值

我们如何拟合一条符合数据变化趋势的曲线呢？这就要涉及误差值了，因为拟合出来的曲线能否代表数据特征，我们需要一个评判标准

曲线绝对无法完全拟合到每一个数据点上，但是我们能在这种拟合中寻找最优的那条曲线，评判依据就是误差值，什么是这里的误差值呢？

公式推导

在这里我们将误差设为$e$，$e=y-kx-b$鉴于误差存在正负的波动所以我们以$|e|$来代表误差的大小，要求最小的误差，即求$|e|$的最小值

在整个训练数据集中就求：$\sum_{n=i}^{n}|e|$的最小值，将公式进行推导，如下：
$$
{min}(\sum_{n=i}^n|e|)\Rightarrow{min}(\sum_{n=i}^n e^2)\Rightarrow{min}(\sum_{n=i}^n ({y_{i}-kx_i-b})^2)
$$
令S为：
$$
S =\sum_{n=i}^n ({y_{i}-kx_i-b})^2
$$
为求S的极值，需要对k，b求偏导，并使其等于0，最后求出k，b的值用于代码的编写：
$$
\begin{cases}
\frac{\partial S}{\partial k} = 2 \times \sum_{i=1}^n (y_i-kx_i-b)(-x_i)
\\\
\frac{\partial S}{\partial b} = 2 \times \sum_{i=1}^n (y_i-kx_i-b)(-1)
\end{cases}
$$

$$
\Rightarrow
\begin{cases}
b = \frac{\sum_{i=1}^n (y_i-kx_i)}{n} = \frac{\sum_{i=1}^n y_i-k \sum_{i=1}^{n} x_i}{n}
\\\
k = \frac{n\times\sum_{i=1}^n x_iy_i-\sum_{i=1}^n y_i\times\sum_{i=1}^n x_i}{n\times\sum_{i=1}^n x^2 - (\sum_{i=1}^n x_i)^2}
\end{cases}
$$

以下是代码部分，采用python语言编写：

生成测试数据

import numpy as np
import matplotlib.pyplot as plt

x = np.random.uniform(-10,10,size=100)
y = .4 * x + 3 + np.random.uniform(-1,1,size=100)

f = open('dataset/simulation_data_1d.txt','w')
for i in range(0,100):
    line = str(x[i]) + ',' + str(y[i])
    f.write(line)
    f.write('\n')

plt.plot(x,y,'b.')
plt.show()

编写算法部分

import matplotlib.pyplot as plt
import random

data = open('dataset/simulation_data_1d.txt', 'r').read()
# print(data.split('\n'))

# 数据读取
x = []
y = []
for item in data.split('\n'):
    if item != '':
        item = item.split(',')
        x.append(float(item[0]))
        y.append(float(item[1]))


# 数据集构建
def seperateData(x, y, test_scale):
    # 将x，y进行矩阵组合
    data = []
    for _x in x:
        i = x.index(_x)
        data.append([])
        if type(_x) == 'list':
            for feature in range(len(_x)):
                data[i].append(feature)
            data[i].append(y[i])
        else:
            data[i].append(_x)
            data[i].append(y[i])

    # 切割数据集
    train = []
    test = []
    random.seed(3)
    for i in range(len(data)):
        count = random.random()
        if count > test_scale:
            train.append(data[i])
        elif count < test_scale:
            test.append(data[i])
    return train, test


train, test = seperateData(x, y, 0.2)

# 处理成能够用于模型训练的数据结构
X_train = []
Y_train = []
X_test = []
Y_test = []
for item in train:
    X_train.append(item[:-1])
    Y_train.append(item[-1])
for item in test:
    X_test.append(item[:-1])
    Y_test.append(item[-1])

Xsum = 0.0
Ysum = 0.0
XY = 0.0
X2sum = 0.0
n = len(Y_train)
# 推导的算法结果部分的实现
for i in range(n):
    Xsum += X_train[i][0]
    Ysum += Y_train[i]
    XY += X_train[i][0] * Y_train[i]
    X2sum += X_train[i][0] ** 2
k = (n * XY - Xsum * Ysum) / (n * X2sum  - Xsum ** 2)
b = (Ysum - k * Xsum) / n
print("拟合的曲线：y =",k,"* x +",b)

plt.plot(X_train, Y_train, 'b.')
plt.plot(X_test, Y_test, 'r*')
plt.xlabel("Trian_Area")
plt.ylabel("Trian_Price")

# 绘制拟合的曲线
Y = []
for item in X_test:
    Y.append(k * item[0] + b)
plt.plot(X_test, Y, 'g-')

plt.show()

总结：我jio的还口以，如有错误欢迎指正

护眼色数值

发表于 2019-08-05 分类于实用配置

咳咳，今天写个记录，记录自用护眼色

十六进制颜色如下：

#CCE8CF

Lang's Blog

tensorflow-gpu、CUDA、cuDNN版本对应

简单的KNN实现

windows环境下matplotlib图片中中文显示异常

问题描述：

解决方案：

Redis安装踩坑

ubuntu换源笔记

1.备份源

2.打开源文件

bfsu源

清华源

3.更新源

Leetcode-2. 两数相加

拉取github项目后Visual Studio 2017 报错system未定义识别符

问题描述

问题抽象

解决方案

用keras搭建一个简单的一维卷积神经网络

编程环境

数据集说明

模型代码

线性回归的推导与简单的一元线性回归代码实现

公式推导

生成测试数据

编写算法部分

护眼色数值