Kaggle

Original
05/13 14:22
Views: 45747
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier


train_data = pd.read_csv('/kaggle/input/titanic/train.csv')
train_data.head()   # preview the training set

test_data = pd.read_csv('/kaggle/input/titanic/test.csv')
test_data.head()    # preview the test set


features = ['Pclass', 'Sex', 'SibSp', 'Parch']
X = pd.get_dummies(train_data[features])    # X: training features (Sex is one-hot encoded)
y = train_data['Survived']  # y: class labels


model = RandomForestClassifier(n_estimators=100,
                               max_depth=5, random_state=1)
model.fit(X, y)  # fit the model on the training data


X_test = pd.get_dummies(test_data[features])    # test features, encoded the same way
predictions = model.predict(X_test)


output = pd.DataFrame({'PassengerId': test_data.PassengerId,
                       'Survived': predictions})
output.to_csv('my_submission.csv', index=False) # write the submission file


print('Your submission was successfully saved!')
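
The script above calls pd.get_dummies separately on the training and test frames. That works when both frames contain the same categories, but if a category were missing from one of them the two feature matrices would end up with different columns and model.predict would fail. A minimal sketch of one way to guard against that, using small made-up frames (the names train and test and their values are illustrative assumptions, not the Titanic data):

```python
import pandas as pd

# Hypothetical stand-ins for train.csv / test.csv; the test rows
# happen to contain no 'male' passengers.
train = pd.DataFrame({'Pclass': [1, 3, 2], 'Sex': ['male', 'female', 'male']})
test = pd.DataFrame({'Pclass': [3, 1], 'Sex': ['female', 'female']})

X = pd.get_dummies(train)
X_test = pd.get_dummies(test)

# Reindex the test matrix to the training columns. Categories that
# never appear in the test rows become all-zero columns, and the
# column order matches what the model was fitted on.
X_test = X_test.reindex(columns=X.columns, fill_value=0)

print(list(X_test.columns))
```

Reindexing against X.columns also drops any column that appears only in the test frame, which is usually what you want because the fitted model has never seen that feature.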
