featuretools.selection.
remove_highly_null_features
Removes columns from a feature matrix that have higher than a set threshold of null values.
feature_matrix (pd.DataFrame) – DataFrame whose columns are feature names and rows are instances.
pd.DataFrame
features (list[featuretools.FeatureBase] or list[str], optional) – List of features to select.
featuretools.FeatureBase
pct_null_threshold (float) – If the percentage of NaN values in an input feature exceeds this amount, that feature will be considered highly-null. Defaults to 0.95.
The feature matrix and the list of generated feature definitions. Matches dfs output. If no feature list is provided as input, the feature list will not be returned.
pd.DataFrame, list[FeatureBase]
FeatureBase