Keyphrase extraction using Word2vec and PageRank
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
- spaCy
- NetworkX
- Word2vec vectors trained on Wikipedia data
- Stopwords file and topic file (both already included in the repository)
python SgRank.py
python SemRank.py
Noun chunks are extracted with spaCy and used as the nodes of a graph. PageRank is then run on that graph, with edge weights computed from syntactic heuristics, from semantic similarity, or from a combination of both.
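The ranking step can be sketched roughly as follows. This is a minimal illustration, not the code in SgRank.py or SemRank.py: it assumes the noun chunks have already been extracted and that each one has a vector (for example, an average of its Word2vec word vectors), and it uses only semantic similarity for the edge weights. The names `cosine` and `rank_phrases` and the toy vectors are hypothetical, and PageRank is written out as a plain power iteration so that the example runs without any models or extra packages.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_phrases(phrase_vectors, damping=0.85, iterations=50):
    """Rank phrases with weighted PageRank on a similarity graph.

    phrase_vectors maps each noun chunk to its vector (assumed input).
    Returns (phrase, score) pairs sorted from highest to lowest score.
    """
    phrases = list(phrase_vectors)
    n = len(phrases)
    if n == 0:
        return []

    # Undirected graph: the edge weight is the cosine similarity,
    # and pairs with non-positive similarity get no edge.
    weights = {p: {} for p in phrases}
    for a, b in combinations(phrases, 2):
        w = cosine(phrase_vectors[a], phrase_vectors[b])
        if w > 0:
            weights[a][b] = w
            weights[b][a] = w

    # Total outgoing weight per node, used to normalise each vote.
    strength = {p: sum(weights[p].values()) for p in phrases}

    # Power iteration. Isolated nodes only receive the teleport share;
    # their own score is not redistributed in this simplified version.
    scores = {p: 1.0 / n for p in phrases}
    for _ in range(iterations):
        scores = {
            p: (1 - damping) / n
            + damping * sum(scores[q] * w / strength[q]
                            for q, w in weights[p].items())
            for p in phrases
        }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy two-dimensional vectors standing in for Word2vec embeddings.
toy_vectors = {
    "neural network": [1.0, 0.1],
    "deep learning": [0.9, 0.2],
    "machine learning": [0.8, 0.3],
    "banana bread": [0.0, 1.0],
}
ranked = rank_phrases(toy_vectors)
for phrase, score in ranked:
    print(f"{score:.3f}  {phrase}")
```

In this toy input the off-topic phrase ends up last because it is only weakly connected to the other three. With NetworkX installed, the same weights could be loaded into a graph and ranked with `networkx.pagerank(G, alpha=0.85, weight="weight")` instead of the hand-written loop.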