Removing extra white spaces, replacing symbols, emojis, etc. There's a lot you can do with your text.
Do more with less code. Cucco offers the means to simplify your work.
Removing English stop words is nice but removing them in more than 50+ languages is better, right?
Directly from your code or the command line. You will always find a way to normalize your text in a couple of steps.
Worried about broken updates? With a 100% test coverage we have your back. Find us on Codecov.
Do you want to test cucco but you don't want to bother installing it? Use our API.
Or you can simply choose not to add them and use the default normalizations. Your pick.
from cucco import Cucco
cucco = Cucco()
normalizations = [
'remove_extra_whitespaces',
('replace_punctuation', {'replacement': ' '})
]
print(cucco.normalize('Who let the cucco out?',
normalizations))
Who let the cucco out