nlp_primitives.NumUniqueSeparators

class nlp_primitives.NumUniqueSeparators(separators=[' ', '.', ',', '!', '?', ';', '\n'])[source]

Calculates the number of unique separators.

Description:

Given a string and a list of separators, determine the number of unique separators in each string. If a string is null determined by pd.isnull return pd.NA.

Parameters

separators (list, optional) – a list of separator characters to count. [" ", ".", ",", "!", "?", ";", "\n"] is used by default.

Examples

>>> x = ["First. Line.", "This. is the second, line!", "notinlist@#$%^%&"]
>>> num_unique_separators = NumUniqueSeparators([".", ",", "!"])
>>> num_unique_separators(x).tolist()
[1, 3, 0]
__init__(separators=[' ', '.', ',', '!', '?', ';', '\n'])[source]

Methods

__init__([separators])

flatten_nested_input_types(input_types)

Flattens nested column schema inputs into a single list.

generate_name(base_feature_names)

generate_names(base_feature_names)

get_args_string()

get_arguments()

get_description(input_column_descriptions[, ...])

get_filepath(filename)

get_function()

Attributes

base_of

base_of_exclude

commutative

compatibility

Additional compatible libraries

default_value

Default value this feature returns if no data found.

description_template

input_types

woodwork.ColumnSchema types of inputs

max_stack_depth

name

Name of the primitive

number_output_features

Number of columns in feature matrix associated with this feature

return_type

ColumnSchema type of return

uses_calc_time

uses_full_dataframe