ITMO_FS.filters.univariate.gini_index

ITMO_FS.filters.univariate.gini_index(X, y)

Gini index is a measure of statistical dispersion. Note: before counting gini index data is normalized with MinMaxScaler

Parameters:
  • X (numpy array, shape (n_samples, n_features)) – The input samples.
  • y (numpy array, shape (n_samples, )) – The classes for the samples.
Returns:

Return type:

Score for each feature as a numpy array, shape (n_features, )

See also

https()
//en.wikipedia.org/wiki/Gini_coefficient

Examples

import sklearn.datasets as datasets from ITMO_FS.filters.univariate import gini_index

X, y = datasets.make_classification(n_samples=200, n_features=7, shuffle=False) scores = gini_index(X, y) print(scores)

>>> import sklearn.datasets as datasets
>>> from ITMO_FS.filters.univariate import gini_index
>>> X, y = datasets.make_classification(n_samples=200, n_features=7, shuffle=False)
>>> scores = gini_index(X, y)
>>> print(scores)