sklearn.multioutput.RegressorChain¶

class sklearn.multioutput.RegressorChain(base_estimator, *, order=None, cv=None, random_state=None)

[源码]

一种多标签模型，可将回归安排到一个链中。

每个模型都使用提供给模型的所有可用功能以及链中较早的模型的预测，按链指定的顺序进行预测。

在用户指南中阅读更多内容。

0.20版中的新功能。

参数	说明
base_estimator	estimator 建立分类器链的基础估计量。
order	array-like of shape (n_outputs,) or ‘random’, optional 默认情况下，顺序将由标签矩阵Y中的列顺序确定。 `order = [0, 1, 2, ..., Y.shape[1] - 1]` 可以通过提供整数列表来明确设置链的顺序。例如，对于长度为5的链： `order = [1, 3, 2, 4, 0]` 意味着链中的第一个模型将对Y矩阵中的第1列进行预测，第二个模型将对第3列进行预测，依此类推。如果顺序是“随机的”，将使用随机排序。
cv	int, cross-validation generator or an iterable, optional (default=None) 确定是对链中以前的估计量的结果使用交叉验证的预测还是真实的标签。如果cv为None，则在拟合时使用真实标签。否则，cv的可能输入为： - 整数，用于指定（分层）KFold中的折叠数， - CV分配器， - 可迭代的数据（训练，测试）拆分为索引数组。
random_state	int, RandomState instance or None, optional (default=None) 如果为`order='random'`，则确定链顺序的随机数生成。另外，它控制`base_estimator` 在每个链式迭代中每次给出的随机种子。因此，仅在`base_estimator` 暴露时使用`random_state`。为多个函数调用传递可重复输出的int值。请参阅词汇表。

属性	说明
estimators_	list base_estimator的克隆列表。
order_	list 标签在分类器链中的顺序。

另见

ClassifierChain

相当于分类
MultioutputRegressor
独立学习每个输出，而不是链接

方法	说明
`fit`(X, Y, **fit_params)	使模型拟合数据矩阵X和目标Y。
`get_params`([deep])	获取此估计量的参数。
`predict`(X)	使用ClassifierChain模型预测数据矩阵X。
`score`(X, y[, sample_weight])	返回预测的确定系数R ^ 2。
`set_params`(**params)	设置此估算器的参数。

__init__(base_estimator, *, order=None, cv=None, random_state=None)

[源码]

初始化self，请参阅help(type(self))以获得准确的说明。

fit(X, Y, **fit_params)

[源码]

使模型拟合数据矩阵X和目标Y。

参数	说明
X	{array-like, sparse matrix}, shape (n_samples, n_features) 输入数据。
Y	array-like, shape (n_samples, n_classes) 目标值。
**fit_params	dict of string -> object `fit`在回归链的每个步骤传递给方法的参数。

返回值	说明
self	object

get_params(deep=True)

[源码]

获取此估计量的参数。

参数	说明
deep	bool, default=True 如果为True，则将返回此估算器和作为估算器的所包含子对象的参数。

返回值	说明
params	mapping of string to any 参数名称映射到其值。

predict(X)

[源码]

使用ClassifierChain模型预测数据矩阵X。

参数	说明
X	{array-like, sparse matrix}, shape (n_samples, n_features) 输入数据。

返回值	说明
Y_pred	array-like, shape (n_samples, n_classes) 预测值。

score(X, y, sample_weight=None)

[源码]

返回预测的确定系数R ^ 2。

系数R ^ 2定义为（1- u / v），其中u是平方的残差和((y_true - y_pred) ** 2).sum()，而v是平方的总和 ((y_true - y_true.mean()) ** 2).sum(). 。可能的最高分是1.0，并且可能为负（因为该模型可能会更差）。不管输入特征如何，始终预测y的期望值的恒定模型将获得0.0的R ^ 2分数。

参数	说明
X	array-like of shape (n_samples, n_features) 测试样本。对于某些估计量，这可能是一个预先计算的核矩阵或泛型对象列表，shape=（n_samples，n_samples_fitted），其中n_samples_fitted是在拟合估计器时使用的样本数。
y	array-like of shape (n_samples,) or (n_samples, n_outputs) X的真实值。
sample_weight	array-like of shape (n_samples,), default=None 样本权重

返回值	说明
score	float self.predict(X) wrt. y.的R^2

注

调用score回归器时使用的R2得分multioutput='uniform_average'从0.23版开始使用，r2_score已与默认值保持一致。这会影响score所有多输出回归变量的方法（除外 MultiOutputRegressor）。

set_params(**params)

[源码]

设置此估算器的参数。

该方法适用于简单的估计器以及嵌套对象（例如 pipelines）。后者具有形式的参数， <component>__<parameter>以便可以更新嵌套对象的每个组件。

参数	说明
**params	dict 估算量参数。

返回值	说明
self	object 估算量实例。