clean_text

clean_text(df_col)

Function that combines all text pre-processing tasks.

  • remove accented_chars
  • remove punctuation
  • remove repeating_letters
  • remove newline_chars
  • remove digits
  • remove non_char
  • remove url
  • remove email
  • remove consecutive_spaces
  • remove stopwords
Parameters:df_col – a single dataframe column <class ‘pandas.core.series.Series’>
Returns:a single dataframe column <class ‘pandas.core.series.Series’>