nlp_primitives.
MeanCharactersPerWord
Determines the mean number of characters per word.
Given list of strings, determine the mean number of characters per word in each string. A word is defined as a series of any characters not separated by white space. Punctuation is removed before counting. If a string is empty or NaN, return NaN.
Examples
>>> x = ['This is a test file', 'This is second line', 'third line $1,000'] >>> mean_characters_per_word = MeanCharactersPerWord() >>> mean_characters_per_word(x).tolist() [3.0, 4.0, 5.0]
__init__
Initialize self. See help(type(self)) for accurate signature.
Methods
__init__()
Initialize self.
generate_name(base_feature_names)
generate_name
generate_names(base_feature_names)
generate_names
get_args_string()
get_args_string
get_arguments()
get_arguments
get_description(input_column_descriptions[, …])
get_description
get_filepath(filename)
get_filepath
get_function()
get_function
Attributes
base_of
base_of_exclude
commutative
compatibility
default_value
description_template
input_types
max_stack_depth
name
number_output_features
uses_calc_time
uses_full_entity