交叉验证

why use Training set

  • 用于检查过拟合
  • 对模型在一个独立数据集的表现

How

分离训练集&测试集

1
2
3
4
from sklearn.model_selection import train_test_split

features_train, features_test, labels_train, labels_test = cross_validation.train_test_split(
iris.data, iris.target, test_size=0.4, random_state=0)