首页
/
关于使用 Python 编写一个机器学习模型例如决策树或随机森林用于进行特征选择?
/
使用 Python 编写一个机器学习模型例如决策树或随机森林用于进行特征选择?

使用 Python 编写一个机器学习模型例如决策树或随机森林用于进行特征选择?

6个月前

import pandas as pd
import numpy as np
from sklearn.ensemble import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

# Load the dataset
data = pd.read_csv('features_selection_data.csv')

# Select features
features = data.iloc[:, 1:]

# Split the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(features, data['target'], test_size=0.2, random_state=42)

# Create decision tree and random forest models
model_tree = DecisionTreeClassifier()
model_rf = RandomForestClassifier()

# Train the models
model_tree.fit(X_train, y_train)
model_rf.fit(X_train, y_train)

# Evaluate the models on the test set
score_tree = model_tree.score(X_test, y_test)
score_rf = model_rf.score(X_test, y_test)

# Print the scores
print('Decision Tree Score:', score_tree)
print('Random Forest Score:', score_rf)

数据准备

假设您有一个名为 features_selection_data.csv 的 CSV 文件，其中包含一个特征和一个目标变量的数值数据。

代码解释