diggrtoolbox.standardize package
Submodules
diggrtoolbox.standardize.standardize module
diggrtoolbox.standardize.standardize.remove_bracketed_text(s)
Removes text in brackets from the string s.
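The following is a minimal sketch of what bracket removal of this kind can look like. It is not the diggrtoolbox implementation: the name `remove_bracketed_text_sketch` is hypothetical, and treating both `()` and `[]` as brackets, ignoring nesting, and collapsing leftover whitespace are all assumptions made for illustration.

```python
import re


def remove_bracketed_text_sketch(s):
    """Hypothetical stand-in, not the diggrtoolbox function."""
    # Drop every non-nested "(...)" or "[...]" group (assumed bracket types).
    stripped = re.sub(r"\([^()]*\)|\[[^\[\]]*\]", "", s)
    # Assumption: collapse the whitespace left behind by the removed groups.
    return re.sub(r"\s+", " ", stripped).strip()


print(remove_bracketed_text_sketch("Final Fantasy VII (PAL) [Platinum]"))
# prints: Final Fantasy VII
```

The real function may treat nested brackets or surrounding whitespace differently; check its source before relying on a particular behaviour.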
diggrtoolbox.standardize.standardize.std(s, lower=True, rm_punct=True, rm_bracket=True, rm_spaces=False, rm_strings=None)
Combined string standardization function.
:lower: convert to lower case
:rm_punct: remove punctuation
:rm_bracket: remove brackets () []
:rm_spaces: remove white spaces
:rm_strings: list of substrings to be removed from the string before comparison
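To show how the flags might combine, here is a hedged sketch that mirrors the documented signature. The function `std_sketch` is hypothetical. The order of the steps, the use of `string.punctuation` as the punctuation set, the reading of `rm_bracket` as removing the bracketed text along with the brackets, and the whitespace collapsing when `rm_spaces` is False are assumptions, not behaviour confirmed by the documentation.

```python
import re
import string


def std_sketch(s, lower=True, rm_punct=True, rm_bracket=True,
               rm_spaces=False, rm_strings=None):
    """Hypothetical stand-in for std(); step order is an assumption."""
    # Remove caller-supplied substrings first, while the case is untouched.
    for sub in rm_strings or []:
        s = s.replace(sub, "")
    if rm_bracket:
        # Assumption: bracketed text is removed together with the brackets.
        s = re.sub(r"\([^()]*\)|\[[^\[\]]*\]", "", s)
    if lower:
        s = s.lower()
    if rm_punct:
        # Assumption: ASCII punctuation only.
        s = s.translate(str.maketrans("", "", string.punctuation))
    if rm_spaces:
        s = re.sub(r"\s+", "", s)
    else:
        # Assumption: normalize runs of whitespace to single spaces.
        s = re.sub(r"\s+", " ", s).strip()
    return s


title = "The Legend of Zelda: Ocarina of Time (USA)"
print(std_sketch(title))                  # the legend of zelda ocarina of time
print(std_sketch(title, rm_spaces=True))  # thelegendofzeldaocarinaoftime
```

Normalizing two strings this way before comparing them makes matches insensitive to case, punctuation, and bracketed annotations such as region tags.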
Module contents
diggrtoolbox.standardize.remove_bracketed_text(s)
Removes text in brackets from the string s.
diggrtoolbox.standardize.std_url(url)
Standardizes URLs by removing the protocol and the final slash.
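A small sketch of this kind of URL standardization is given below. The name `std_url_sketch` is hypothetical, and the exact rules (any `scheme://` prefix counts as the protocol, and only a single trailing slash is removed) are assumptions rather than the documented behaviour of `std_url`.

```python
import re


def std_url_sketch(url):
    """Hypothetical stand-in for std_url(); rules are assumptions."""
    # Strip a leading scheme such as "http://" or "https://".
    url = re.sub(r"^[a-zA-Z][a-zA-Z0-9+.-]*://", "", url)
    # Strip one trailing slash, if present.
    if url.endswith("/"):
        url = url[:-1]
    return url


print(std_url_sketch("https://www.example.com/games/"))
# prints: www.example.com/games
```

With this normalization, `http://` and `https://` variants of the same address, with or without a trailing slash, compare as equal.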
diggrtoolbox.standardize.std(s, lower=True, rm_punct=True, rm_bracket=True, rm_spaces=False, rm_strings=None)
Combined string standardization function.
:lower: convert to lower case
:rm_punct: remove punctuation
:rm_bracket: remove brackets () []
:rm_spaces: remove white spaces
:rm_strings: list of substrings to be removed from the string before comparison