sklearn.kernel_approximation.SkewedChi2Sampler¶

class sklearn.kernel_approximation.SkewedChi2Sampler(*, skewedness=1.0, n_components=100, random_state=None)

[源码]

通过Fourier变换的Monte Carlo逼近来近似“斜卡方（skewed chi-squared）”核的特征图。

阅读更多内容用户指南.

参数	说明
skewedness	float 内核的“偏度”参数。需要进行交叉验证。
n_components	int 每个原始特征的Monte Carlo样本数。等于计算的特征空间的维数。
random_state	int, RandomState instance or None, optional (default=None) 伪随机数发生器在拟合训练数据时控制随机权值和随机偏移的产生。通过多个函数调用传递可重复输出的int值。请参阅词汇表。

另见

AdditiveChi2Sampler

另一种近似卡平方核加性变量的方法。。

sklearn.metrics.pairwise.chi2_kernel

精确的卡平方核。

参考文献

1 See “Random Fourier Approximations for Skewed Multiplicative Histogram Kernels” by Fuxin Li, Catalin Ionescu and Cristian Sminchisescu.

实例

>>> from sklearn.kernel_approximation import SkewedChi2Sampler
>>> from sklearn.linear_model import SGDClassifier
>>> X = [[0, 0], [1, 1], [1, 0], [0, 1]]
>>> y = [0, 0, 1, 1]
>>> chi2_feature = SkewedChi2Sampler(skewedness=.01,
...                                  n_components=10,
...                                  random_state=0)
>>> X_features = chi2_feature.fit_transform(X, y)
>>> clf = SGDClassifier(max_iter=10, tol=1e-3)
>>> clf.fit(X_features, y)
SGDClassifier(max_iter=10)
>>> clf.score(X_features, y)
1.0

方法	说明
`fit`(self, X[, y])	拟合模型与X
`fit_transform`(self, X[, y])	拟合数据，然后进行转化。
`get_params`(self[, deep])	获取此估计量的参数。
`set_params`(self, **params)	设置此估计量的参数。
`transform`(self, X)	将近似特征映射应用于X。

__init__(self, *, skewedness=1.0, n_components=100, random_state=None)

[源码]

初始化self，请参阅help(type(self))以获得准确的说明

fit(self, X, y=None)

[源码]

用X拟合模型。

根据n_features对随机投影进行采样。

参数	说明
X	{array-like, sparse matrix, dataframe} of shape (n_samples, n_features)
y	ndarray of shape (n_samples,), default=None 目标值。
**fit_params	dict 其他拟合参数。

返回值	说明
X_new	ndarray array of shape (n_samples, n_features_new) 变换数组

get_params(self, deep=True)

[源码]

获取此估计量的参数。

参数	说明
deep	bool, default=True 如果为True，则将返回此估算量和作为估算量的所包含子对象的参数。

返回值	说明
params	mapping of string to any 映射到其值的参数名称。

set_params(self, **params)

[源码]

设置此估计量的参数。

该方法适用于简单的估计量以及嵌套对象（例如pipelines）。后者的参数格式为 __ ，以便可以更新嵌套对象的每个组件。

参数	说明
**params	dict 估计参数

返回值	说明
self	object 估计实例

transform(self, X)

[源码]

将近似特征映射应用于X。

参数	说明
x	array-like, shape (n_samples, n_features) 新数据，其中n_samples表示样本数，n_features表示特征数。X的所有值必须严格大于“-倾斜度（-skewedness）”。

返回值	说明
X_new	array-like, shape (n_samples, n_components)