sklearn.dummy.DummyRegressor¶

class sklearn.dummy.DummyRegressor(*, strategy='mean', constant=None, quantile=None)

[源码]

DummyRegressor是一种使用简单规则进行预测的回归工具。

这个回归变量作为与其他(真实)回归变量进行比较的简单基线非常有用。不要用它来解决真正的问题。

在用户指南中阅读更多内容。

新版本0.13。

参数	说明
strategy	str 用来产生预测的策略。 “mean”:总是预测训练集的均值 “中值”:总是预测训练集的中值 “分位数”:总是预测训练集的指定分位数，并提供分位数参数。 “常量”:总是预测用户提供的常量值。
constant	int or float or array-like of shape (n_outputs,) 由“常数”策略预测的显式常数。这个参数只对“常量”策略有用。
quantile	float in [0.0, 1.0] 使用“分位数”策略预测的分位数。分位数0.5对应中位数，最小值为0.0，最大值为1.0。

属性	说明
constant_	array, shape (1, n_outputs) 训练目标的平均值、中位数、分位数或用户给定的常数值。
n_outputs_	int, 数量的输出。

示例

>>> import numpy as np
>>> from sklearn.dummy import DummyRegressor
>>> X = np.array([1.0, 2.0, 3.0, 4.0])
>>> y = np.array([2.0, 3.0, 5.0, 10.0])
>>> dummy_regr = DummyRegressor(strategy="mean")
>>> dummy_regr.fit(X, y)
DummyRegressor()
>>> dummy_regr.predict(X)
array([5., 5., 5., 5.])
>>> dummy_regr.score(X, y)
0.0

Methods

方法	说明
`fit`(self, X, y[, sample_weight])	拟合随机回归。
`get_params`(self[, deep])	获取这个估计器的参数。
`predict`(self, X[, return_std])	对测试向量X进行分类。
`score`(self, X, y[, sample_weight])	返回预测的决定系数R^2。
`set_params`(self, **params)	设置这个估计器的参数。

__init__(self, *, strategy='mean', constant=None, quantile=None)

[源码]

初始化self. See 请参阅help(type(self))以获得准确的说明。

fit(self, X, y, sample_weight=None)

[源码]

拟合随机回归。

参数	说明
X	{array-like, object with finite length or shape} 训练数据，要求length = n_samples
y	array-like of shape (n_samples,) or (n_samples, n_outputs) 目标的价值。
sample_weight	array-like of shape (n_samples,), default=None 样本权重。

返回值	说明
self	object

get_params(self, deep=True)

[源码]

获取这个估计器的参数。

参数	说明
deep	bool, default=True 如果为真，将返回此估计器的参数以及包含的作为估计器的子对象。

返回值	说明
params	mapping of string to any 参数名称映射到它们的值。

predict(self, X, return_std=False)

[源码]

对测试向量X进行分类。

参数	说明
X	{array-like, object with finite length or shape} 训练数据，要求length = n_samples
return_std	boolean, optional 是否返回后验预测的标准差。这里都是0。新版本0.20。

返回值	说明
y	array-like of shape (n_samples,) or (n_samples, n_outputs) X的预测目标值。
y_std	array-like of shape (n_samples,) or (n_samples, n_outputs) 查询点预测分布的标准差。

score(self, X, y, sample_weight=None)

[源码]

返回预测的决定系数R^2。

定义系数R^2为(1 - u/v)，其中u为(y_true - y_pred) ** 2).sum()的残差平方和，v为(y_true - y_true.mean()) ** 2).sum()的平方和。最好的可能的分数是1.0，它可能是负的(因为模型可以任意地更糟)。常数模型总是预测y的期望值，而不考虑输入特征，得到的R^2得分为0.0。

参数	说明
X	{array-like, None} 使用shape = (n_samples, n_features)或None测试样本。对于某些估计器，这可能是一个预先计算的核矩阵，形状= (n_samples, n_samples_fitting]，其中n_samples_fitting是用于拟合估计器的样本数量。由于DummyRegressor的操作独立于采样的观察结果，所以通过测试样本的结果与通过真实测试样本的结果是一样的。
y	array-like of shape (n_samples,) or (n_samples, n_outputs) X的真值。
sample_weight	array-like of shape (n_samples,), default=None 样本权重。

返回值	说明
score	float self.predict(X) 与y的R²。

set_params(self, **params)

[源码]

设置这个估计器的参数。

该方法适用于简单估计量和嵌套对象。后者具有形式为<component>_<parameter>的参数，这样就让更新嵌套对象的每个组件成为了可能。

参数	说明
**params	dict 估计器参数。

返回值	说明
self	object 估计实例。

sklearn.dummy.DummyRegressor使用示例¶

泊松回归与非正常损失 ¶