Stemming or Lemmatisation and tm library

Updated:

less than 1 minute read

Stemming is a process that removes affixes. Lemmatisation is the process of grouping inflected forms together as a single base form.

the differences between the two can be seen in the word clouds below, the one on the left is based on stemming technique, and on the right based on lemmatisation of the same corpus.

Comments