Model Evaluation and Validation in Python

张建站

2026/5/31 14:48:48

10 min read

# Model evaluation and validation in Python
# Model evaluation is a key stage of the machine learning workflow.
# Cross-validation gives a more reliable estimate of generalization performance
# than a single train/test split.

# 1. Import libraries
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import (cross_val_score, StratifiedKFold, train_test_split)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (confusion_matrix, classification_report,
                             precision_score, recall_score, f1_score,
                             roc_curve, roc_auc_score)
from sklearn.ensemble import RandomForestClassifier

# 2. Load the data and hold out 30% as a test set
cancer = load_breast_cancer()
X, y = cancer.data, cancer.target
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# 3. Basic cross-validation
model = LogisticRegression(max_iter=5000, random_state=42)
cv_scores = cross_val_score(model, X_train, y_train, cv=5, scoring='accuracy')
print("5-fold cross-validation")
print(f"Score per fold: {cv_scores}")
print(f"Mean accuracy: {cv_scores.mean():.4f}")

# 4. Stratified cross-validation with StratifiedKFold
# For a classifier, cv=5 already stratifies by class but does not shuffle;
# an explicit StratifiedKFold adds shuffling with a fixed seed.
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
cv_strat = cross_val_score(model, X_train, y_train, cv=skf, scoring='accuracy')
print(f"\nStratifiedKFold mean accuracy: {cv_strat.mean():.4f}")

# 5. Several evaluation metrics
print("\nMultiple metrics (5-fold CV):")
for metric in ['accuracy', 'precision', 'recall', 'f1', 'roc_auc']:
    scores = cross_val_score(model, X_train, y_train, cv=5, scoring=metric)
    print(f"  {metric}: {scores.mean():.4f}")

# 6. Confusion matrix
# In this dataset label 0 is 'malignant' and label 1 is 'benign',
# so 'benign' is the positive class in the metrics below.
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
cm = confusion_matrix(y_test, y_pred)
print("\nConfusion matrix")
print("                 Predicted negative  Predicted positive")
print(f"Actual negative  TN={cm[0,0]:4d}  FP={cm[0,1]:4d}")
print(f"Actual positive  FN={cm[1,0]:4d}  TP={cm[1,1]:4d}")

# 7. Precision, recall, F1
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
print(f"\nPrecision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 score: {f1:.4f}")
print("\nFull classification report:")
print(classification_report(y_test, y_pred, target_names=cancer.target_names))

# 8. ROC curve and AUC
y_prob = model.predict_proba(X_test)[:, 1]
fpr, tpr, thresholds = roc_curve(y_test, y_prob)
auc_score = roc_auc_score(y_test, y_prob)
print("\nROC-AUC")
print(f"AUC: {auc_score:.4f}")

# 9. Comparing models
print("\nModel comparison (5-fold CV AUC):")
models = {
    'LR': LogisticRegression(max_iter=5000, random_state=42),
    'RF': RandomForestClassifier(n_estimators=100, random_state=42),
}
for name, m in models.items():
    scores = cross_val_score(m, X_train, y_train, cv=5, scoring='roc_auc')
    print(f"  {name}: {scores.mean():.4f}")

# 10. Choosing a validation strategy
# Large dataset: a simple train/test split is usually enough
# Small dataset: use cross-validation (K=5 or K=10)
# Imbalanced classes: use StratifiedKFold
# Time series: use TimeSeriesSplit
print(f"\nTest-set accuracy: {model.score(X_test, y_test):.4f}")
print(f"Cross-validation accuracy: {cv_scores.mean():.4f}")
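The strategy notes in step 10 name TimeSeriesSplit for time-ordered data but do not show it in use. Below is a minimal sketch of how it might look. The synthetic series `X_ts`/`y_ts` and the `Ridge` regressor are illustrative assumptions and are not part of the original script, which works only with the breast cancer dataset.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import TimeSeriesSplit, cross_val_score

# Hypothetical data: 200 time-ordered samples with a linear trend plus noise.
rng = np.random.default_rng(42)
X_ts = np.arange(200, dtype=float).reshape(-1, 1)
y_ts = 0.5 * X_ts.ravel() + rng.normal(scale=1.0, size=200)

tscv = TimeSeriesSplit(n_splits=5)

# Each fold trains only on earlier samples and validates on the block that follows,
# so no information from the future leaks into training.
for fold, (train_idx, val_idx) in enumerate(tscv.split(X_ts), start=1):
    print(f"fold {fold}: train 0..{train_idx.max()}, "
          f"validate {val_idx.min()}..{val_idx.max()}")

# The splitter is passed to cross_val_score like any other cv object.
ts_scores = cross_val_score(Ridge(alpha=1.0), X_ts, y_ts, cv=tscv, scoring='r2')
print(f"Mean R^2 over {tscv.get_n_splits()} folds: {ts_scores.mean():.4f}")
```

Unlike StratifiedKFold with shuffle=True, this splitter never shuffles: the training window grows with each fold and the validation block always comes after it.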
