featuretools.primitives.TotalWordLength#
- class featuretools.primitives.TotalWordLength(do_not_count='[!"#$%&\'()*+,-./:;<=>?@[\\]^_`{|}~\n\t ]')[source]#
- Determines the total word length. - Description:
- Given list of strings, determine the total word length in each string. A word is defined as a series of any characters not separated by a delimiter. If a string is empty or NaN, return NaN. 
 - Parameters:
- delimiters_regex (str) – Delimiters as a regex string for splitting text into words. Defaults to whitespace characters. 
 - Examples - >>> x = ['This is a test file', 'This is second line', 'third line $1,000', None] >>> total_word_length = TotalWordLength() >>> total_word_length(x).tolist() [15.0, 16.0, 13.0, nan] - Methods - __init__([do_not_count])- flatten_nested_input_types(input_types)- Flattens nested column schema inputs into a single list. - generate_name(base_feature_names)- generate_names(base_feature_names)- get_args_string()- get_arguments()- get_description(input_column_descriptions[, ...])- get_filepath(filename)- get_function()- Attributes - base_of- base_of_exclude- commutative- default_value- Default value this feature returns if no data found. - description_template- input_types- woodwork.ColumnSchema types of inputs - max_stack_depth- name- Name of the primitive - number_output_features- Number of columns in feature matrix associated with this feature - return_type- ColumnSchema type of return - stack_on- stack_on_exclude- stack_on_self- uses_calc_time- uses_full_dataframe