LDA线性判别式-scikitlearn和numpy两种实现方法

Hea ·

更新时间:2024-09-21

· 645 次阅读


'''
Linear Discriminant Analysis (LDA) in manuer and scikit-learn
1. Calculate mean vectors of each class
2. Calculate within-class and between-class scatter matrices
3. Calculate eigenvalues and eigenvectors for
4. Keep the top k eigenvectors by ordering them by descending eigenvalues 
5. Use the top eigenvectors to project onto the new space
'''
# import the Iris dataset from scikit-learn
from sklearn.datasets import load_iris
# import our plotting module
import matplotlib.pyplot as plt
# load the Iris dataset
iris = load_iris()
# 创建X，y变量来表示特征和响应变量列。create X and y variables to hold features and response column
iris_X, iris_y = iris.data, iris.target
# calculate the mean for each class
mean_vectors = []
for cl in [0, 1, 2]:
	class_mean_vector = np.mean(iris_X[iris_y==cl], axis=0)
	mean_vectors.append(class_mean_vector)
	print( label_dict[cl], class_mean_vector)
# Calculating within-class and between-class scatter matrices
# Calculate within-class scatter matrix
S_W = np.zeros((4,4))
# for each flower
for cl,mv in zip([0, 1, 2], mean_vectors):
	# scatter matrix for every class, starts with all 0's
	class_sc_mat = np.zeros((4,4))
	# for each row that describes the specific flower
	for row in iris_X[iris_y == cl]:
	# make column vectors
		row, mv = row.reshape(4,1), mv.reshape(4,1)
	# this is a 4x4 matrix
		class_sc_mat += (row-mv).dot((row-mv).T)
	# sum class scatter matrices
	S_W += class_sc_mat
S_W
# calculate the between-class scatter matrix
# mean of entire dataset
overall_mean = np.mean(iris_X, axis=0).reshape(4,1)
# will eventually become between class scatter matrix
S_B = np.zeros((4,4))
for i,mean_vec in enumerate(mean_vectors):
	# number of flowers in each species
	n = iris_X[iris_y==i,:].shape[0]
	# make column vector for each specied
	mean_vec = mean_vec.reshape(4,1)
	S_B += n * (mean_vec - overall_mean).dot((mean_vec - overall_mean).T)
S_B
#Calculating eigenvalues and eigenvectors for SW-1SB
# calculate eigenvalues and eigenvectors of S−1W x SB
eig_vals, eig_vecs = np.linalg.eig(np.dot(np.linalg.inv(S_W), S_B)) 
eig_vecs = eig_vecs.real
eig_vals = eig_vals.real
for i in range(len(eig_vals)):
	eigvec_sc = eig_vecs[:,i]
	print ('Eigenvector {}: {}'.format(i+1, eigvec_sc))
	print ('Eigenvalue {:}: {}'.format(i+1, eig_vals[i]))
	print()
# keep the top two linear discriminants
linear_discriminants = eig_vecs.T[:2]
linear_discriminants
#explained variance ratios
eig_vals / eig_vals.sum()
# LDA projected data
lda_iris_projection = np.dot(iris_X, linear_discriminants.T)
# plot
label_dict = {i: k for i, k in enumerate(iris.target_names)}
def plot(X, y, title, x_label, y_label):
	ax = plt.subplot(111)
	for label,marker,color in zip(range(3),('^', 's', 'o'),('blue', 'red', 'green')):
		plt.scatter(x=X[:,0].real[y == label],y=X[:,1].real[y == label],color=color,alpha=0.5,label=label_dict[label])
		plt.xlabel(x_label)
		plt.ylabel(y_label)
	leg = plt.legend(loc='upper right', fancybox=True)
	leg.get_frame().set_alpha(0.5)
	plt.title(title)
	plt.show()
plot(lda_iris_projection, iris_y, "LDA Projection", "LDA1", "LDA2")
#How to use LDA in scikit-learn
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
# instantiate the LDA module
lda = LinearDiscriminantAnalysis(n_components=2)
# fit and transform our original iris data . LDA is supervised clssifier
X_lda_iris = lda.fit_transform(iris_X, iris_y)
lda.scalings_
lda.explained_variance_ratio_
'''scikit学习线性判别式对特征向量进行了标准化缩放，它们都是有效的特征向量。唯一的区别是投影数据的缩放'''
# plot the projected data
plot(X_lda_iris, iris_y, "LDA Projection", "LDA1", "LDA2")




作者：夜已.入深
                    
 
                

                            判别式
                            lda
                            NumPy
                            方法


           
    
    

            
                
                    
                
            
            
                
    
        
            需要 登录 后方可回复, 如果你还没有账号请 注册新账号
        
    
                
            
                
                    
                        相关文章

    
        
            Laravel 中使用简单的方法跟踪用户是否在线(推荐)
        
        
            Serwa
            2020-03-20
        
    
    
        874
    


    
        
            shell监控linux系统进程创建脚本分享
        
        
            Adalia
            2020-01-30
        
    
    
        735
    


    
        
            Ubuntu VMware出现提示No 3D support is available的解决方法
        
        
            Lacie
            2021-06-13
        
    
    
        773
    


    
        
            iis创建用户隔离模式FTP站点的方法
        
        
            Carol
            2021-02-27
        
    
    
        660
    


    
        
            docker网卡的IP地址修改方法总结
        
        
            Rae
            2023-07-22
        
    
    
        1847
    


    
        
            docker命令中必须加上sudo的问题解决方法
        
        
            Rhoda
            2023-07-22
        
    
    
        1038
    


    
        
            Elasticsearch/Kibana密码设置方法
        
        
            Hester
            2023-07-22
        
    
    
        1081
    


    
        
            docker查询日志并输出到文件的方法
        
        
            Grace
            2023-07-22
        
    
    
        1029
    


    
        
            docker容器/etc/hosts文件修改方法
        
        
            Vanna
            2023-07-22
        
    
    
        1279
    


    
        
            docker容器连接宿主机redis与mysql的配置方法
        
        
            Peony
            2023-07-22
        
    
    
        1975
    


    
        
            Docker镜像之不同服务器间迁移方法大全
        
        
            Dorothy
            2023-07-22
        
    
    
        1993
    


    
        
            docker容器使用内存大小限制方法
        
        
            Dulcea
            2023-07-22
        
    
    
        493
    


    
        
            在Linux中列出Systemd下所有正在运行的服务的方法指南
        
        
            Zandra
            2023-07-22
        
    
    
        507
    


    
        
            一文详解Python中多进程和进程池的使用方法
        
        
            Serafina
            2023-07-24
        
    
    
        338
    


    
        
            VMware克隆虚拟机并重新设置IP和主机名的实现方法
        
        
            Kathy
            2023-08-08
        
    
    
        194
    


    
        
            使用nginx.exe时闪退的原因和解决方法
        
        
            Olivia
            2023-08-08
        
    
    
        694
    


    
        
            阿里云服务IIS搭建Web网站外网无法访问的解决方法
        
        
            Elina
            2023-08-08
        
    
    
        897
    


    
        
            ssh报错nokeyalg的解决方法(关于低版本连接高版本ssh)
        
        
            Jacinthe
            2023-08-08
        
    
    
        339
    


    
        
            在没有Docker缓存的情况下构建镜像的方法分享
        
        
            Viridis
            2023-08-08
        
    
    
        1779
    


    
        
            docker-compose中启动镜像失败的几种解决方法
        
        
            Hana
            2023-08-08
        
    
    
        725


        
    
        
            我要提问
        
    
    
        
        
    
        致谢
        
            帮助他人，成就自己。
            人生最大成功就是伸出热情而温暖的双手，尽自己所能去帮助身边的每一个人，只要无私的奉献，就会收获到美好的生活。
            1024问感谢每一位朋友的帮助和支持。
            软件开发网提供编程的基础软件技术培训教程,软件开发编程实例讲解Go,Node,HTML,CSS,Javascript,Python,Java,Ruby,C,PHP,MySQL等软件开发编程语言以及数据开发的基础知识，也提供大量的软件开发在线实例、从入门到精通就在1024问。
        
    
    
        
            
    育儿网
    微养生
    全球行
    美食街
    育儿
    菜谱大全
    海南旅游
    女性
    养狗百科
    星座