Visualizing Machine Learning Models with scikit-plot

news2026/2/7 18:45:19


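scikit-learn (sklearn) is a widely used machine learning library for Python that covers the common classification, regression, and clustering algorithms. Once a model is trained, the usual next step is to visualize its results, which normally means building the plots by hand with Matplotlib.

scikit-plot is a library built on top of sklearn and Matplotlib. Its main purpose is to visualize trained models, and its API is small and easy to pick up.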

https://scikit-plot.readthedocs.io

pip install scikit-plot
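
The snippets in this article use X, y, X_train, X_test, y_train and y_test without defining them. Here is a minimal sketch of one way to prepare them. The breast cancer dataset, the 70/30 split and the random_state value are assumptions made for illustration, not choices stated in the article. A binary dataset is picked because, to my knowledge, plot_ks_statistic and plot_calibration_curve only accept binary targets.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

# Assumed dataset: binary classification, 569 samples, 30 numeric features
X, y = load_breast_cancer(return_X_y=True)

# Hold out 30% for evaluation; stratify keeps the class ratio in both parts
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

print(X_train.shape, X_test.shape)
```

Any other labeled dataset works the same way for the multiclass plots such as plot_confusion_matrix and plot_roc.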

Feature 1: Evaluation metric visualization

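scikitplot.metrics.plot_confusion_matrix plots the confusion matrix computed from the true labels and the model's predictions.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.ensemble import RandomForestClassifier

# X_train, y_train, X_test, y_test are assumed to come from an earlier train/test split
rf = RandomForestClassifier()
rf = rf.fit(X_train, y_train)
y_pred = rf.predict(X_test)

# normalize=True shows proportions instead of raw counts
skplt.metrics.plot_confusion_matrix(y_test, y_pred, normalize=True)
plt.show()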
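
To make the normalize=True option more concrete, here is a small sketch of the numbers behind such a plot, computed with sklearn directly. It assumes row normalization (each true class sums to 1), which is my understanding of what scikit-plot does; the toy labels are made up for illustration.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Toy labels, made up for illustration
y_true = [0, 0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 2, 2, 0, 2]

# Rows are true classes, columns are predicted classes
cm = confusion_matrix(y_true, y_pred)

# Divide each row by its total so every row sums to 1
cm_norm = cm / cm.sum(axis=1, keepdims=True)

print(cm)
print(np.round(cm_norm, 2))
```

Reading the normalized matrix row by row gives the per-class recall on the diagonal, which is usually easier to compare across imbalanced classes than raw counts.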

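scikitplot.metrics.plot_roc plots the ROC curve of every class from the true labels and the predicted probabilities.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.naive_bayes import GaussianNB

# X_train, y_train, X_test, y_test are assumed to come from an earlier train/test split
nb = GaussianNB()
nb = nb.fit(X_train, y_train)
y_probas = nb.predict_proba(X_test)  # shape: (n_samples, n_classes)

skplt.metrics.plot_roc(y_test, y_probas)
plt.show()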
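
A per-class ROC curve treats one class as positive and all others as negative. The sketch below computes the area under each of those curves with sklearn, which is roughly the number shown in the legend of such a plot. The iris dataset and the split parameters are assumptions chosen only to make the example runnable.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import auc, roc_curve
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Assumed dataset and split, for illustration only
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

nb = GaussianNB().fit(X_train, y_train)
y_probas = nb.predict_proba(X_test)

class_auc = {}
for k, label in enumerate(nb.classes_):
    # One-vs-rest: `label` is the positive class, everything else is negative
    fpr, tpr, _ = roc_curve(y_test == label, y_probas[:, k])
    class_auc[int(label)] = auc(fpr, tpr)

print(class_auc)
```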

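scikitplot.metrics.plot_ks_statistic generates the KS (Kolmogorov-Smirnov) statistic plot from the true labels and the scores or probabilities.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.linear_model import LogisticRegression

# X_train, y_train, X_test, y_test are assumed to come from an earlier split of a binary dataset
lr = LogisticRegression()
lr = lr.fit(X_train, y_train)
y_probas = lr.predict_proba(X_test)

skplt.metrics.plot_ks_statistic(y_test, y_probas)
plt.show()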
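
The KS statistic is the largest gap between the cumulative score distributions of the two classes. The following is a minimal NumPy sketch of that definition, not scikit-plot's own implementation; the helper name ks_statistic and the toy data are hypothetical. With the snippet above, the scores would be the positive-class column, for example y_probas[:, 1].

```python
import numpy as np

def ks_statistic(y_true, scores):
    """Return (max CDF gap, threshold where it occurs) for binary labels 0/1."""
    y_true = np.asarray(y_true)
    scores = np.asarray(scores, dtype=float)
    thresholds = np.unique(scores)  # sorted candidate thresholds
    pos = np.sort(scores[y_true == 1])
    neg = np.sort(scores[y_true == 0])
    # Empirical CDF of each class: share of samples with score <= threshold
    cdf_pos = np.searchsorted(pos, thresholds, side="right") / pos.size
    cdf_neg = np.searchsorted(neg, thresholds, side="right") / neg.size
    gaps = np.abs(cdf_pos - cdf_neg)
    best = int(np.argmax(gaps))  # first threshold with the largest gap
    return float(gaps[best]), float(thresholds[best])

# Toy data, made up for illustration
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.1, 0.2, 0.3, 0.6, 0.4, 0.7, 0.8, 0.9]

ks, threshold = ks_statistic(y_true, scores)
print(ks, threshold)  # 0.75 0.3
```

A value close to 1 means the scores separate the two classes well; a value close to 0 means the two distributions largely overlap.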

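scikitplot.metrics.plot_precision_recall generates the precision-recall (PR) curve from the true labels and the predicted probabilities.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.naive_bayes import GaussianNB

# X_train, y_train, X_test, y_test are assumed to come from an earlier train/test split
nb = GaussianNB()
nb.fit(X_train, y_train)
y_probas = nb.predict_proba(X_test)

skplt.metrics.plot_precision_recall(y_test, y_probas)
plt.show()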

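scikitplot.metrics.plot_silhouette runs a silhouette analysis on a clustering result.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.cluster import KMeans

# X is assumed to be the feature matrix to cluster
kmeans = KMeans(n_clusters=4, random_state=1)
cluster_labels = kmeans.fit_predict(X)

skplt.metrics.plot_silhouette(X, cluster_labels)
plt.show()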
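
A silhouette plot is often used to compare candidate cluster counts. As a rough numeric companion, the sketch below computes the mean silhouette score for several values of n_clusters with sklearn. The synthetic make_blobs data and the range 2 to 6 are assumptions for illustration.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic data with 4 generated blobs, assumed for illustration
X, _ = make_blobs(n_samples=300, centers=4, cluster_std=0.6, random_state=0)

scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=1).fit_predict(X)
    # Mean silhouette over all samples, in [-1, 1]; higher is better
    scores[k] = silhouette_score(X, labels)

best_k = max(scores, key=scores.get)
print(scores, best_k)
```

The mean score hides per-cluster detail, which is what the silhouette plot adds: thin or below-average blades point to clusters that are poorly separated.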

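scikitplot.metrics.plot_calibration_curve plots the calibration curves of one or more classifiers.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import LinearSVC

# X_train, y_train, X_test, y_test are assumed to come from an earlier split of a binary dataset
rf = RandomForestClassifier()
lr = LogisticRegression()
nb = GaussianNB()
svm = LinearSVC()
rf_probas = rf.fit(X_train, y_train).predict_proba(X_test)
lr_probas = lr.fit(X_train, y_train).predict_proba(X_test)
nb_probas = nb.fit(X_train, y_train).predict_proba(X_test)
# LinearSVC has no predict_proba, so its decision scores are used instead
svm_scores = svm.fit(X_train, y_train).decision_function(X_test)
probas_list = [rf_probas, lr_probas, nb_probas, svm_scores]
clf_names = ['Random Forest', 'Logistic Regression',
             'Gaussian Naive Bayes', 'Support Vector Machine']

skplt.metrics.plot_calibration_curve(y_test,
                                     probas_list,
                                     clf_names)
plt.show()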

Feature 2: Model visualization

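scikitplot.estimators.plot_learning_curve plots the training and cross-validation scores of an estimator for increasing numbers of training samples.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.ensemble import RandomForestClassifier

# X, y are assumed to be the full feature matrix and labels
rf = RandomForestClassifier()

skplt.estimators.plot_learning_curve(rf, X, y)
plt.show()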
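
The numbers behind a learning curve can also be inspected directly with sklearn's learning_curve, which refits the estimator on growing subsets and cross-validates each fit. This is a sketch under assumptions: the iris dataset, 5-fold cross-validation, the five training sizes and the small forest are illustrative choices, not settings taken from the article.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import learning_curve

# Assumed dataset and a small forest to keep the example fast
X, y = load_iris(return_X_y=True)
rf = RandomForestClassifier(n_estimators=20, random_state=0)

# shuffle=True because iris is sorted by class
train_sizes, train_scores, test_scores = learning_curve(
    rf, X, y, cv=5, train_sizes=np.linspace(0.2, 1.0, 5),
    shuffle=True, random_state=0,
)

# One row per training size, one column per CV fold; average over folds
for n, tr, te in zip(train_sizes, train_scores.mean(axis=1), test_scores.mean(axis=1)):
    print(f"{int(n):4d} samples  train={tr:.3f}  cv={te:.3f}")
```

A large, persistent gap between the training and cross-validation scores suggests overfitting, while two low curves that have converged suggest the model is too simple for the data.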

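scikitplot.estimators.plot_feature_importances plots the feature importances of a fitted estimator.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.ensemble import RandomForestClassifier

# X, y are assumed to be the iris data as returned by sklearn's load_iris
rf = RandomForestClassifier()
rf.fit(X, y)

# feature_names must follow the column order of X (sepal columns come first in load_iris)
skplt.estimators.plot_feature_importances(
    rf, feature_names=['sepal length', 'sepal width',
                       'petal length', 'petal width'])
plt.show()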

Feature 3: Clustering visualization

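scikitplot.cluster.plot_elbow_curve draws the elbow plot for a clusterer over a range of cluster counts.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.cluster import KMeans

kmeans = KMeans(random_state=1)

# X (the data to cluster) is a required argument; it is assumed to be defined
skplt.cluster.plot_elbow_curve(kmeans, X, cluster_ranges=range(1, 30))
plt.show()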
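
An elbow plot shows how the within-cluster sum of squared distances falls as the number of clusters grows. The sketch below computes those values with KMeans.inertia_ from sklearn; the synthetic make_blobs data and the range 1 to 8 are assumptions for illustration.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic data with 4 generated blobs, assumed for illustration
X, _ = make_blobs(n_samples=300, centers=4, cluster_std=0.6, random_state=0)

inertias = {}
for k in range(1, 9):
    km = KMeans(n_clusters=k, n_init=10, random_state=1).fit(X)
    # inertia_: sum of squared distances from each sample to its nearest centroid
    inertias[k] = km.inertia_

for k, sse in inertias.items():
    print(k, round(sse, 1))
```

The "elbow" is the cluster count after which adding more clusters gives only a small further drop; choosing it is a judgment call rather than an exact rule.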

Feature 4: Dimensionality reduction visualization

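scikitplot.decomposition.plot_pca_component_variance plots the explained variance ratios of the PCA components.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.decomposition import PCA

# X is assumed to be the feature matrix
pca = PCA(random_state=1)
pca.fit(X)

skplt.decomposition.plot_pca_component_variance(pca)
plt.show()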
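
The explained variance ratios can be read straight from a fitted PCA object, which helps when deciding how many components to keep. A minimal sketch follows; the iris dataset and the 95% target are assumptions for illustration.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

# Assumed dataset, for illustration only
X, _ = load_iris(return_X_y=True)
pca = PCA(random_state=1).fit(X)

# Share of total variance captured by each component, largest first
ratios = pca.explained_variance_ratio_
cumulative = np.cumsum(ratios)

# Smallest number of components whose cumulative share reaches the target
target = 0.95
n_needed = int(np.searchsorted(cumulative, target) + 1)

print(np.round(cumulative, 4), n_needed)
```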

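scikitplot.decomposition.plot_pca_2d_projection draws a scatter plot of the data projected onto the first two PCA components.

import matplotlib.pyplot as plt
import scikitplot as skplt
from sklearn.decomposition import PCA

# X, y are assumed to be the feature matrix and the labels used to color the points
pca = PCA(random_state=1)
pca.fit(X)

skplt.decomposition.plot_pca_2d_projection(pca, X, y)
plt.show()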
