Skip to content

Conversation

@mikolajb
Copy link

@mikolajb mikolajb commented Dec 3, 2014

New filters:

  • language detection using NLTK (supports danish, dutch, english, finnish, french, german, hungarian, italian, norwegian, portuguese, russian, spanish, swedish and turkish)
  • named entity extractor using ne_chunk method from NLTK
  • topic detection (basing on words statistics in movie_review and reuters corpus from NLTK)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant